[sword-devel] Soft hyphens?
DM Smith
dmsmith at crosswire.org
Sat Apr 1 06:48:19 MST 2017
SWORD uses Lucene’s StandardAnalyzer which in turn uses WhitespaceTokenizer. It doesn’t use WordDelimiterFilter. As such it doesn’t handle hyphenated words well, including soft hyphen.
In Him,
DM
> On Apr 1, 2017, at 8:56 AM, David Haslam <dfhmch at googlemail.com> wrote:
>
> Does SWORD search using Lucene ignore the presence of a soft hyphen in any
> word?
>
> i.e. If the user searches for 'violence' and the word in the text was
> 'violence' would it be found?
>
> NB. The second instance contains a soft hyphen \xAD between 'vio' and
> 'lence'.
>
> Best regards,
>
> David
>
>
>
> --
> View this message in context: http://sword-dev.350566.n4.nabble.com/Soft-hyphens-tp4657045.html
> Sent from the SWORD Dev mailing list archive at Nabble.com.
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
More information about the sword-devel
mailing list