<html>
<head>
<style><!--
.hmmessage P
{
margin:0px;
padding:0px
}
body.hmmessage
{
font-size: 12pt;
font-family:Calibri
}
--></style></head>
<body class='hmmessage'><div dir='ltr'>Sorry for choosing the wrong word <br>this wikipedia article talking about this topic <br>https://en.wikipedia.org/wiki/Arabic_diacritics<br><br>Thanks Chris for your reply about the filter, Actually I don't have any contact details for the developers of the frontends to report them this problem, hope someone in this list report them about all this discussion :)<br><br>So now we know the problem and the solution .<br><br><div><div id="SkyDrivePlaceholder"></div>> Date: Mon, 26 Nov 2012 01:05:16 -0800<br>> From: chrislit@crosswire.org<br>> To: sword-devel@crosswire.org<br>> Subject: Re: [sword-devel] Search bug & New Arabic Bible, Not Shaped SVD Version<br>> <br>> You're talking about vowels, not shaping. Shaping in Arabic changes the <br>> shape of the letter according to its context in the word (initial, <br>> medial, final, or isolated). I imagine unshaped Arabic would be very <br>> difficult to read. Arabic without vowel marks, on the other hand, is <br>> standard.<br>> <br>> I would have thought that the indexing would have been done without <br>> vowels or both with and without vowels. It should be easy to recover the <br>> vowel-less text for indexing by applying the UTF8ArabicPoints filter.<br>> <br>> --Chris<br>> <br>> On 11/25/2012 11:45 PM, pola ashraf wrote:<br>> > Using a comparison tool from ICU the two strings resulted in different<br>> > character numbers<br>> > Words to compare<br>> > يَسُوعَ<br>> > يسوع<br>> > Which is the Name of JESUS Christ in Arabic but one is shaped and the<br>> > other isn't<br>> ><br>> > Words converted to HEX Format<br>> > \u064a \u064e \u0633 \u064f \u0648 \u0639 \u064e<br>> > \u064a \u0633 \u0648 \u0639<br>> ><br>> > That's why search engines of some frontends doesn't come with any<br>> > results for not shaped words<br>> ><br>> > The suggestion is to make the index contain the shaped words plus the<br>> > same words without shaping<br>> ><br>> > Comparison Tool link https://ssl.icu-project.org/icu-bin/scompare<br>> ><br>> > Note: to clarify the meaning of shaping, shaping is the usage of<br>> > Characters like the following ( ٌ ُ ٍ َ ْ ً )<br>> > these special characters are shapes, and may change the whole word<br>> > meaning and help in correct reading, but as mentioned before, it make<br>> > reading harder and make problem with search functions<br>> ><br>> > Note: And Bible search normally without problems, but the desktop<br>> > programs like Xiphos and Bible Time have this problem<br>> ><br>> > Pola<br>> > ------------------------------------------------------------------------<br>> ><br>> > I think Arabic shapes add extra Unicode characters that's why the 2 same<br>> > words - i mentioned before - don't give the same results<br>> ><br>> > ------------------<br>> > Any Arabic search problem is unconnected to shaping.<br>> ><br>> > Modules are routinely created and stored in a normalised format, user<br>> > entries, e.g. for search ate equally normalised<br>> ><br>> ><br>> ><br>> > _______________________________________________<br>> > sword-devel mailing list: sword-devel@crosswire.org<br>> > http://www.crosswire.org/mailman/listinfo/sword-devel<br>> > Instructions to unsubscribe/change your settings at above page<br>> ><br>> <br>> <br>> _______________________________________________<br>> sword-devel mailing list: sword-devel@crosswire.org<br>> http://www.crosswire.org/mailman/listinfo/sword-devel<br>> Instructions to unsubscribe/change your settings at above page<br></div>                                            </div></body>
</html>