Sanitize HTML before stripping tags to avoid empty results with certain Word formatting

Helmut Merz 2012-05-10 10:19:12 +02:00
parent 3beae86d4d
commit 0e060bc34e

@@ -79,6 +79,7 @@ def checkStyle(k):
 def stripAll(value):
+    value = sanitize(value)
     def collectText(tags):
         for tag in tags:
             if type(tag) is NavigableString:
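
The hunk above is cut off, and the commit does not say which Word constructs produced the empty results. As a rough illustration of the sanitize-then-strip order only, here is a minimal standalone sketch. It uses the standard library's `html.parser` instead of the parser the project uses, and `sanitize_sketch`, `strip_all_sketch`, and the choice of what to remove (HTML comments, including Word conditional comments, and `<style>` blocks) are assumptions made for this example, not the project's actual `sanitize` or `stripAll`.

```python
import re
from html.parser import HTMLParser

# Hypothetical cleanup rules, chosen for illustration only:
# drop HTML comments (Word conditional comments included) and <style> blocks.
_COMMENT = re.compile(r"<!--.*?-->", re.DOTALL)
_STYLE = re.compile(r"<style\b.*?</style>", re.DOTALL | re.IGNORECASE)


def sanitize_sketch(html):
    """Stand-in for the project's sanitize(); the real function may differ."""
    html = _COMMENT.sub("", html)
    return _STYLE.sub("", html)


class _TextCollector(HTMLParser):
    """Collects the text nodes of a document and ignores all tags."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def strip_all_sketch(html):
    """Sanitize first, then strip tags and normalize whitespace."""
    collector = _TextCollector()
    collector.feed(sanitize_sketch(html))
    collector.close()
    return " ".join("".join(collector.parts).split())


sample = (
    "<!--[if gte mso 9]><xml><w:WordDocument/></xml><![endif]-->"
    "<style>p.MsoNormal {margin:0cm}</style>"
    '<p class="MsoNormal">Hello <b>world</b></p>'
)
print(strip_all_sketch(sample))  # Hello world
```

Running the cleanup first means the text collector only ever sees the markup that is meant to be rendered, which is the ordering the commit introduces.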