[sword-devel] Searching for hyphenated words?

DM Smith dmsmith at crosswire.org
Sun Mar 3 06:25:53 MST 2013


Lucene is highly configurable, but out of the box StandardAnalyzer and SimpleAnalyzer do not do anything with this. To handle it requires adding a folding filter. I think they have one suitable for it, but it is not hard to write one to do it.

It may require creating ones own Analyzer, which is nothing more than a tokenizer and a chain of filters, to add in a filter.

This is something that JSword needs also and I haven't gotten around to work on it. I see it as part of a larger solution of handling accents which is another folding problem.

In Him,
	DM

On Mar 3, 2013, at 7:53 AM, Jonathan Morgan <jonmmorgan at gmail.com> wrote:

> Another possibly related normalisation problem which BPBible at least has an open issue about is Caesar vs. Cæsar.  Theoretically I guess you want either search to match both forms.  I don't know how Lucene etc. deals with this (if at all).
> 
> Jon
> 
> On Mon, Feb 25, 2013 at 2:48 AM, David Haslam <dfhmch at googlemail.com> wrote:
> In the KJV module, if you want to search for [say] the hyphenated name
> "Maher–shalal–hash–baz", you first have to be aware that this module uses
> the ndash in place of the hyphen.
> 
> btw.  It's not so easy to enter the ndash from a keyboard, and probably even
> harder in an Android tablet or mobile.
> 
> If you use ordinary hyphen/minus for the search key hyphen for this module,
> you don't find anything with "Exact phrase".
> If you use "Multi-word", you do find "Maher" highlighted in the found verse.
> (e.g. using Xiphos).
> 
> For modules in general, however, the user cannot usually know in advance
> whether hyphenated words use the ndash, the hyphen or something else.
> 
> Has anyone else looked into this aspect of the search feature?
> 
> David
> 
> 
> 
> 
> 
> --
> View this message in context: http://sword-dev.350566.n4.nabble.com/Searching-for-hyphenated-words-tp4652016.html
> Sent from the SWORD Dev mailing list archive at Nabble.com.
> 
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
> 
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.crosswire.org/pipermail/sword-devel/attachments/20130303/153c3ebb/attachment.html>


More information about the sword-devel mailing list