[sword-devel] GlobalOptionFilter=UTF8GreekAccents and non-Greek modules
David Haslam
dfhmch at googlemail.com
Tue Feb 21 06:09:28 MST 2017
Further proof ... (especially for Peter)
As far as I know, Luther wasn't Greek.
A similar experiment with module GerLut1545 showed that all the umlauts are
removed by the UTF8GreekAccents filter.
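To illustrate the kind of effect involved, here is a minimal Python sketch of a generic accent-stripping step: decompose to NFD, then drop the combining marks. This is an assumption for illustration only, not the actual UTF8GreekAccents code in SWORD, and `strip_marks` is a hypothetical helper name. A step like this makes no distinction between Greek accents and German umlauts:

```python
import unicodedata


def strip_marks(text):
    """Hypothetical helper: remove all combining marks from text.

    Illustrative only; this is NOT the SWORD UTF8GreekAccents filter.
    """
    # NFD splits a precomposed letter into base letter + combining mark,
    # e.g. U+00E4 (a with diaeresis) becomes U+0061 followed by U+0308.
    decomposed = unicodedata.normalize("NFD", text)
    # combining() returns a non-zero class for combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(strip_marks("Äpfel über Öl"))
print(strip_marks("λόγος"))
```

Letters with no canonical decomposition, such as ß, pass through unchanged.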
diff B
S:/Export/GerLut1545/2014-01-17/GerLut1545.diatheke.character.frequency.txt
S:/Export/GerLut1545/Greek accents/GerLut1545.diatheke.character.frequency.txt
23c23
< 000041 A 10,838 LATIN CAPITAL LETTER A
---
> 000041 A 11,887 LATIN CAPITAL LETTER A
37c37
< 00004F O 5,758 LATIN CAPITAL LETTER O
---
> 00004F O 6,017 LATIN CAPITAL LETTER O
43c43
< 000055 U 9,646 LATIN CAPITAL LETTER U
---
> 000055 U 9,951 LATIN CAPITAL LETTER U
49c49
< 000061 a 194,446 LATIN SMALL LETTER A
---
> 000061 a 206,137 LATIN SMALL LETTER A
63c63
< 00006F o 81,022 LATIN SMALL LETTER O
---
> 00006F o 92,193 LATIN SMALL LETTER O
69c69
< 000075 u 135,404 LATIN SMALL LETTER U
---
> 000075 u 152,211 LATIN SMALL LETTER U
77,79d76
< 0000C4 Ä 1,049 LATIN CAPITAL LETTER A WITH DIAERESIS
< 0000D6 Ö 259 LATIN CAPITAL LETTER O WITH DIAERESIS
< 0000DC Ü 305 LATIN CAPITAL LETTER U WITH DIAERESIS
81,83d77
< 0000E4 ä 11,691 LATIN SMALL LETTER A WITH DIAERESIS
< 0000F6 ö 11,171 LATIN SMALL LETTER O WITH DIAERESIS
< 0000FC ü 16,807 LATIN SMALL LETTER U WITH DIAERESIS
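The counts are consistent with each umlaut being folded into its base letter rather than dropped: on the ">" side, every base letter gains exactly the count of its umlauted form from the "<" side. A quick Python check, with the numbers copied from the diff above (the variable and function names are only for illustration):

```python
# Counts copied from the diff above, one row per base letter:
# (base letter, count on the "<" side, count on the ">" side,
#  count of the umlauted form on the "<" side).
rows = [
    ("A", 10838, 11887, 1049),
    ("O", 5758, 6017, 259),
    ("U", 9646, 9951, 305),
    ("a", 194446, 206137, 11691),
    ("o", 81022, 92193, 11171),
    ("u", 135404, 152211, 16807),
]


def folded_exactly(rows):
    """True if every base letter gained exactly its umlaut count."""
    return all(before + umlaut == after for _, before, after, umlaut in rows)


for letter, before, after, umlaut in rows:
    # Print the gain of the base letter next to the umlaut count.
    print(letter, after - before, umlaut)

print(folded_exactly(rows))
```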
Best regards,
David
--
View this message in context: http://sword-dev.350566.n4.nabble.com/GlobalOptionFilter-UTF8GreekAccents-and-non-Greek-modules-tp4656719p4656741.html
Sent from the SWORD Dev mailing list archive at Nabble.com.