[sword-devel] GlobalOptionFilter=UTF8GreekAccents and non-Greek modules
David Haslam
dfhmch at googlemail.com
Tue Feb 28 05:34:53 MST 2017
Could take a while to do a full analysis on all the Greek modules with
accents.
The situation is further complicated by the fact that at least one Greek NT
module was made without automatically normalising the UTF-8 text to NFC, so
there are still a smattering of these separate diacritics next to certain
Greek letters:
U+0313 ̓ 1,220 COMBINING COMMA ABOVE
U+1FBD ᾽ 1 GREEK KORONIS
This was a surprise to me!
It's the way module SBLGNT was made - presumably using osis2mod with its -N
option. SwordVersionDate=2013-07-18 (which was while Osk was still active in
CrossWire)
The sole instance of the standalone KORONIS U+1FBD occurs in this verse.
Matthew 10:33: ὅστις ⸂δ᾽ ἂν⸃ ἀρνήσηταί με ἔμπροσθεν τῶν ἀνθρώπων, ἀρνήσομαι
⸂κἀγὼ αὐτὸν⸃ ἔμπροσθεν τοῦ πατρός μου τοῦ ⸀ἐν οὐρανοῖς.
It's just after the first small letter delta.
Whether this is correct rather than being δ̓ (small delta with U+0343
COMBINING GREEK KORONIS) is beyond my current knowledge. I will ask Dr
David I-B about this instance.
btw. This KORONIS does get removed by the filter as one should expect.
Its inclusion here is merely because its presence was a suprise to me.
However, using the UTF8GreekAccents filter on module SBLGNT converts all
those
U+0313 ̓ 1,220 COMBINING COMMA ABOVE
to
U+02BC ʼ 1,220 MODIFIER LETTER APOSTROPHE
which is another peculiarity that I was certainly not expecting.
Breaking down those 1220 instances, the Greek letters followed by U+0313
are:
00033 δ̓
00107 θ̓
00233 τ̓
00292 π̓
00213 λ̓
00132 φ̓
00061 ῤ
00149 ἰ
This exactly matches the corresponding counts with U+02BC for the filtered
file.
00033 δʼ
00107 θʼ
00233 τʼ
00292 πʼ
00213 λʼ
00132 φʼ
00061 ρʼ
00149 ιʼ
Well there we are, you see. One accent gets changed rather than stripped!
We need the combined skills of an engineer and NT Greek scholar to get to
the bottom of these things.
Best regards,
David
--
View this message in context: http://sword-dev.350566.n4.nabble.com/GlobalOptionFilter-UTF8GreekAccents-and-non-Greek-modules-tp4656719p4656843.html
Sent from the SWORD Dev mailing list archive at Nabble.com.
More information about the sword-devel
mailing list