[sword-devel] testing for diacritics
Peter Von Kaehne
refdoc at gmx.net
Fri Aug 28 08:29:53 MST 2015
That is an option, but I do not like it. Reason is that it requires continous maintenance - adding new diacritic characters to the strip filters to expand their range is an ongoing effort. This would mean two places need constant attention. I am trying as much as possible to take the human factor out of module making.
Peter
> Gesendet: Freitag, 28. August 2015 um 15:42 Uhr
> Von: "David Troidl" <DavidTroidl at aol.com>
> An: sword-devel at crosswire.org
> Betreff: Re: [sword-devel] testing for diacritics
>
> How about regular expressions:
>
> Modern Greek Accented
> [\u0370-\u0390 \u03AA-\u03B0 \u03CA-\u03D4]
>
> Polytonic Greek Accented
> [\u1F00-\u1FFE]
>
> Hebrew Vowel Points
> [\u05BB-\u05B0]
>
> Hebrew Cantillation
> [\u0591-\u05AE]
>
> I don't know about Arabic.
>
> Peace,
>
> David
>
> On 8/28/2015 4:21 AM, Peter von Kaehne wrote:
> > On Fri, 2015-08-28 at 01:27 +0200, Matěj Cepl wrote:
> >> iconv -f utf8 -t us-ascii//translit file.xml \
> >> |diff -u - file.xml
> > Thanks Matej,
> >
> > This would probably work on latin scripts with diacritics, but not on
> > the scripts I am interested in - Hebrew, Arabic derrived and Greek.
> >
> > Peter
> >
> > _______________________________________________
> > sword-devel mailing list: sword-devel at crosswire.org
> > http://www.crosswire.org/mailman/listinfo/sword-devel
> > Instructions to unsubscribe/change your settings at above page
>
>
> ---
> This email has been checked for viruses by Avast antivirus software.
> https://www.avast.com/antivirus
>
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
More information about the sword-devel
mailing list