[sword-devel] GlobalOptionFilter=UTF8GreekAccents and non-Greek modules

David Haslam dfhmch at googlemail.com
Tue Feb 21 06:09:28 MST 2017


Further proof .... (specially for Peter) 

As far as I know, Luther wasn't Greek.

A similar experiment with module GerLut1545 showed that all the umlauts are
removed by the UTF8GreekAccents filter.

diff B
S:/Export/GerLut1545/2014-01-17/GerLut1545.diatheke.character.frequency.txt
S:/Export/GerLut1545/Greek
accents/GerLut1545.diatheke.character.frequency.txt
23c23
< 000041	A	10,838	LATIN CAPITAL LETTER A
---
> 000041	A	11,887	LATIN CAPITAL LETTER A
37c37
< 00004F	O	5,758	LATIN CAPITAL LETTER O
---
> 00004F	O	6,017	LATIN CAPITAL LETTER O
43c43
< 000055	U	9,646	LATIN CAPITAL LETTER U
---
> 000055	U	9,951	LATIN CAPITAL LETTER U
49c49
< 000061	a	194,446	LATIN SMALL LETTER A
---
> 000061	a	206,137	LATIN SMALL LETTER A
63c63
< 00006F	o	81,022	LATIN SMALL LETTER O
---
> 00006F	o	92,193	LATIN SMALL LETTER O
69c69
< 000075	u	135,404	LATIN SMALL LETTER U
---
> 000075	u	152,211	LATIN SMALL LETTER U
77,79d76
< 0000C4	Ä	1,049	LATIN CAPITAL LETTER A WITH DIAERESIS
< 0000D6	Ö	259	LATIN CAPITAL LETTER O WITH DIAERESIS
< 0000DC	Ü	305	LATIN CAPITAL LETTER U WITH DIAERESIS
81,83d77
< 0000E4	ä	11,691	LATIN SMALL LETTER A WITH DIAERESIS
< 0000F6	ö	11,171	LATIN SMALL LETTER O WITH DIAERESIS
< 0000FC	ü	16,807	LATIN SMALL LETTER U WITH DIAERESIS

Best regards,

David




--
View this message in context: http://sword-dev.350566.n4.nabble.com/GlobalOptionFilter-UTF8GreekAccents-and-non-Greek-modules-tp4656719p4656741.html
Sent from the SWORD Dev mailing list archive at Nabble.com.



More information about the sword-devel mailing list