[gen] Bugfix in the way to index text fields containing line breaks.
This commit is contained in:
parent
06039b300c
commit
e83f0f3815
|
@ -74,7 +74,7 @@ def splitIntoWords(text):
|
||||||
words). Words of a single char are ignored, excepted digits which are
|
words). Words of a single char are ignored, excepted digits which are
|
||||||
always kept. Duplicate words are removed (result is a set and not a
|
always kept. Duplicate words are removed (result is a set and not a
|
||||||
list).'''
|
list).'''
|
||||||
res = text.split(' ')
|
res = text.split()
|
||||||
# Remove tokens of a single char (excepted if this char is a digit).
|
# Remove tokens of a single char (excepted if this char is a digit).
|
||||||
i = len(res)-1
|
i = len(res)-1
|
||||||
while i > -1 :
|
while i > -1 :
|
||||||
|
|
Loading…
Reference in a new issue