[sword-devel] GlobalOptionFilter=UTF8GreekAccents and non-Greek modules

David Haslam dfhmch at googlemail.com
Tue Feb 28 05:34:53 MST 2017


Could take a while to do a full analysis on all the Greek modules with
accents.

The situation is further complicated by the fact that at least one Greek NT
module was made without automatically normalising the UTF-8 text to NFC, so
there are still a smattering of these separate diacritics next to certain
Greek letters:

U+0313	̓	1,220		COMBINING COMMA ABOVE
U+1FBD	᾽	1		GREEK KORONIS

This was a surprise to me!

It's the way module SBLGNT was made - presumably using osis2mod with its -N
option. SwordVersionDate=2013-07-18 (which was while Osk was still active in
CrossWire)

The sole instance of the standalone KORONIS U+1FBD occurs in this verse.

Matthew 10:33: ὅστις ⸂δ᾽ ἂν⸃ ἀρνήσηταί με ἔμπροσθεν τῶν ἀνθρώπων, ἀρνήσομαι
⸂κἀγὼ αὐτὸν⸃ ἔμπροσθεν τοῦ πατρός μου τοῦ ⸀ἐν οὐρανοῖς. 

It's just after the first small letter delta.
Whether this is correct rather than being δ̓ (small delta with U+0343
COMBINING GREEK KORONIS) is beyond my current knowledge.  I will ask Dr
David I-B about this instance.

btw. This KORONIS does get removed by the filter as one should expect.
Its inclusion here is merely because its presence was a suprise to me.

However, using the UTF8GreekAccents filter on module SBLGNT converts all
those 
U+0313	̓	1,220		COMBINING COMMA ABOVE
to 
U+02BC	ʼ	1,220		MODIFIER LETTER APOSTROPHE

which is another peculiarity that I was certainly not expecting.

Breaking down those 1220 instances, the Greek letters followed by U+0313
are:

00033	δ̓
00107	θ̓
00233	τ̓
00292	π̓
00213	λ̓
00132	φ̓
00061	ῤ
00149	ἰ

This exactly matches the corresponding counts with U+02BC for the filtered
file.

00033	δʼ
00107	θʼ
00233	τʼ
00292	πʼ
00213	λʼ
00132	φʼ
00061	ρʼ
00149	ιʼ

Well there we are, you see. One accent gets changed rather than stripped!

We need the combined skills of an engineer and NT Greek scholar to get to
the bottom of these things.

Best regards,

David





--
View this message in context: http://sword-dev.350566.n4.nabble.com/GlobalOptionFilter-UTF8GreekAccents-and-non-Greek-modules-tp4656719p4656843.html
Sent from the SWORD Dev mailing list archive at Nabble.com.



More information about the sword-devel mailing list