[sword-devel] Search bug & New Arabic Bible, Not Shaped SVD Version

David Haslam dfhmch at googlemail.com
Mon Dec 10 09:17:37 MST 2012


There are some languages in which the apostrophe is used as a letter of the
alphabet rather than as an item of punctuation.

e.g. Somali, in which the apostrophe represents the /Alef/.

See http://en.wikipedia.org/wiki/Somali_alphabet

I'm guessing that our Lucene indexing method generally strips out such
punctuation marks. If so, it would be a useful enhancement in SWORD to be
able to specify in the conf file that a particular punctuation mark should
be parsed as a letter, so that the search index would then include the words
containing this letter.
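
To illustrate the idea, here is a minimal sketch in Python. It is not SWORD
or Lucene code, only a plain regex tokenizer. The extra_letters parameter is
hypothetical and stands in for whatever a conf file entry might supply; I'm
not aware of any such conf option existing today. The Somali sample words
are there for illustration only.

```python
import re

def tokenize(text, extra_letters=""):
    """Split text into lowercase word tokens.

    Characters listed in extra_letters (a hypothetical setting, e.g. read
    from a module's conf file) are treated as letters, not as separators.
    """
    # \w covers Unicode letters, digits and underscore; the extra
    # characters are escaped and added to the same character class.
    pattern = re.compile(r"[\w" + re.escape(extra_letters) + r"]+")
    return [token.lower() for token in pattern.findall(text)]

sample = "Go'aan iyo lo'"

# Default behaviour: the apostrophe splits words apart.
default_tokens = tokenize(sample)

# With the apostrophe declared as a letter, the words stay whole.
somali_tokens = tokenize(sample, extra_letters="'")
```

With the default, "go'aan" is indexed as two fragments, so a search for the
whole word cannot match it. With the apostrophe declared as a letter, the
word reaches the index intact.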

David

PS. There is a related issue in the SomKQA module that I'm researching with
the providers of the source text. 
It's conceivable that all the single right quotation marks should really be
apostrophes.
Their presence in the text may well be an artifact of the original editing
environment.
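
If that turns out to be the case, a cleanup along the following lines might
do, sketched here in Python. It rests on my own assumption that only a
right single quotation mark (U+2019) with a letter on both sides is really
an apostrophe. Word-final cases cannot be told apart from closing quotes
this way and would still need checking by hand. The function name is made
up for this example.

```python
import re

RIGHT_SINGLE_QUOTE = "\u2019"
APOSTROPHE = "\u0027"

# Match U+2019 only when a word character precedes and follows it.
_INNER_QUOTE = re.compile(r"(?<=\w)" + RIGHT_SINGLE_QUOTE + r"(?=\w)")

def normalize_inner_apostrophes(text):
    """Replace word-internal U+2019 with U+0027, leaving the rest alone."""
    return _INNER_QUOTE.sub(APOSTROPHE, text)

fixed = normalize_inner_apostrophes("go\u2019aan")
quoted = normalize_inner_apostrophes("\u2018war\u2019 iyo")
```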



--
View this message in context: http://sword-dev.350566.n4.nabble.com/Re-Search-bug-New-Arabic-Bible-Not-Shaped-SVD-Version-tp4651330p4651383.html
Sent from the SWORD Dev mailing list archive at Nabble.com.


