[sword-devel] Search bug & New Arabic Bible, Not Shaped SVD Version
David Haslam
dfhmch at googlemail.com
Mon Dec 10 09:17:37 MST 2012
There are some languages in which the apostrophe is used as a letter of the
alphabet rather than as an item of punctuation.
e.g. Somali, in which the apostrophe represents the /Alef/.
See http://en.wikipedia.org/wiki/Somali_alphabet
I'm guessing that our Lucene indexing method generally strips out such
punctuation marks. If so, it would be a useful enhancement in SWORD to be able
to specify in the conf file that a particular punctuation mark should be parsed
as a letter, so that the search index would then include the words
containing this letter.
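
To illustrate the idea, here is a minimal Python sketch of a tokenizer that
can be told to treat extra characters as letters. It is not SWORD or Lucene
code: the function name tokenize and the parameter extra_letters are
hypothetical, standing in for whatever a conf file entry might supply.

```python
def tokenize(text, extra_letters=""):
    """Split text into lowercase words.

    Characters listed in extra_letters (hypothetical, e.g. read from a
    module's conf file) are treated as letters instead of separators.
    """
    words, current = [], []
    for ch in text:
        if ch.isalpha() or ch in extra_letters:
            current.append(ch.lower())
        elif current:
            # Any other character ends the current word.
            words.append("".join(current))
            current = []
    if current:
        words.append("".join(current))
    return words


sample = "su'aal iyo go'aan"  # Somali: "question and decision"

# Default behaviour: the apostrophe splits each word in two.
print(tokenize(sample))
# With the apostrophe declared as a letter, the words stay whole.
print(tokenize(sample, extra_letters="'"))
```

With the default settings a search for su'aal could never match, because the
index would only contain the fragments. A real implementation would also have
to decide what to do with apostrophes used as quotation marks at the start or
end of a word, which this sketch does not handle.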
David
PS. There is a related issue in the SomKQA module that I'm researching with
the providers of the source text.
It's conceivable that all the single right quotation marks should really be
apostrophes.
Their inclusion in the text may well be an artifact of the original
editing environment.
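
If that turns out to be the case, the clean-up could look something like the
Python sketch below. It assumes that only a right single quotation mark
(U+2019) with a letter on both sides should become an apostrophe (U+0027);
the function name normalize_apostrophes is hypothetical, and word-final
cases would still need checking by hand.

```python
import re

APOSTROPHE = "\u0027"

# U+2019 with a letter immediately before and after it.
# [^\W\d_] matches any Unicode letter.
_WORD_INTERNAL = re.compile(r"(?<=[^\W\d_])\u2019(?=[^\W\d_])")


def normalize_apostrophes(text):
    """Replace word-internal U+2019 with U+0027; leave other uses alone."""
    return _WORD_INTERNAL.sub(APOSTROPHE, text)


print(normalize_apostrophes("su\u2019aal"))
```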