[sword-devel] Search bug & New Arabic Bible, Not Shaped SVD Version

David Haslam dfhmch at googlemail.com
Mon Dec 10 15:17:53 MST 2012


Thanks DM, for the reminder.

The same problem arises even for English, once we include those modern
versions that make use of contractions such as:

"I'm"
"You've"
"He's"
"They're"
"We'd"
"She'll"
"Can't"

It's easy for humans to spot the fact that "m" "ve" "s" "re" "d" "ll" & "t"
are not whole words in and of themselves. Yet that's what would result from
stripping the apostrophes.
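
To make the point concrete, here is a minimal sketch of what apostrophe
stripping does to the examples above. The naive_tokens function is
hypothetical, written only for illustration. It is not the tokenizer that
SWORD or Lucene actually uses.

```python
# The contractions listed above.
CONTRACTIONS = ["I'm", "You've", "He's", "They're", "We'd", "She'll", "Can't"]

def naive_tokens(text):
    """Hypothetical naive tokenizer: turn apostrophes into spaces, then split."""
    return text.replace("'", " ").split()

# The last token of each contraction is the orphaned fragment.
fragments = [naive_tokens(word)[-1] for word in CONTRACTIONS]
print(fragments)  # ['m', 've', 's', 're', 'd', 'll', 't']
```

Each fragment would then be treated as a word in its own right.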

Anyone using a front-end that offers "whole words" as a search option
might end up with misleading results.
The Lucene search index would presumably include such suffixes as distinct
words.

A proper indexing method would classify each whole contraction as a "word".
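
As a rough sketch of that idea (an assumption on my part, not the existing
SWORD or Lucene behaviour), a tokenizer could treat an apostrophe between
letters as part of the word. The names WORD_RE, tokenize and
whole_word_match are hypothetical.

```python
import re

# A word is a run of letters, optionally followed by one or more groups of
# (apostrophe + letters). Both the straight apostrophe and the typographic
# one (U+2019) are accepted. [^\W\d_] matches any Unicode letter.
WORD_RE = re.compile(r"[^\W\d_]+(?:['\u2019][^\W\d_]+)*")

def tokenize(text):
    """Return lower-cased tokens, keeping each contraction as one token."""
    return [m.group(0).lower() for m in WORD_RE.finditer(text)]

def whole_word_match(query, text):
    """Hypothetical 'whole words' search: the query must equal a full token."""
    return query.lower() in tokenize(text)

sample = "They're here, and we'd said I'm ready."
print(tokenize(sample))
print(whole_word_match("re", sample))       # False: "re" is not a token
print(whole_word_match("they're", sample))  # True
```

With tokens built this way, a "whole words" search for "re" or "d" no longer
matches inside a contraction, while a search for the full contraction does.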

David





