[sword-devel] Soft hyphens?

DM Smith dmsmith at crosswire.org
Sat Apr 1 06:48:19 MST 2017


SWORD uses Lucene’s StandardAnalyzer which in turn uses WhitespaceTokenizer. It doesn’t use WordDelimiterFilter. As such it doesn’t handle hyphenated words well, including soft hyphen.

In Him,
	DM

> On Apr 1, 2017, at 8:56 AM, David Haslam <dfhmch at googlemail.com> wrote:
> 
> Does SWORD search using Lucene ignore the presence of a soft hyphen in any
> word?
> 
> i.e. If the user searches for 'violence' and the word in the text was
> 'vio­lence' would it be found?
> 
> NB. The second instance contains a soft hyphen \xAD between 'vio' and
> 'lence'.
> 
> Best regards,
> 
> David
> 
> 
> 
> --
> View this message in context: http://sword-dev.350566.n4.nabble.com/Soft-hyphens-tp4657045.html
> Sent from the SWORD Dev mailing list archive at Nabble.com.
> 
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page




More information about the sword-devel mailing list