[sword-devel] Dotted and dotless I in Turkish & Azerbaijani, etc.
David Haslam
d.haslam at ukonline.co.uk
Sat Feb 20 02:58:44 MST 2010
Please first read this article.
http://en.wikipedia.org/wiki/Dotted_and_dotless_I
http://en.wikipedia.org/wiki/Dotted_and_dotless_I
Languages such as Turkish and Northern Azeri have both dotted and dotless
letter I in their Latin-based alphabets.
This has implications for letter case.
Such alphabets break the familiar case relationship between uppercase I and
lowercase i.
Instead they have as a upper and lower case pairs:
I and ı
İ and i
Questions:
Does sword_icu properly address this in terms of case folding?
How does each front-end application address these issues, e.g. in terms of
case-insensitive searches, etc?
cf. We already have two Turkish Bible modules, and work is about to start
on a Bible module for Northern Azeri.
Working on the Go Bible for the Azerbaijani translation is how I became
alerted to this issue.
David
--
View this message in context: http://n4.nabble.com/Dotted-and-dotless-I-in-Turkish-Azerbaijani-etc-tp1562704p1562704.html
Sent from the SWORD Dev mailing list archive at Nabble.com.
More information about the sword-devel
mailing list