[sword-devel] OSIS 2.0.1 modules available

Chris Little sword-devel@crosswire.org
Wed, 04 Feb 2004 08:22:03 -0600


Michael Paul Johnson wrote:
> American Standard Version http://eBible.org/asv/asvosis.zip
> God's Living Word http://eBible.org/glw/glwosis.zip
> Hebrew Names Version http://eBible.org/hnv/hnvosis.zip
> King James Version http://eBible.org/kjv/kjvosis.zip
> Melanesian Pidgin http://eBible.org/pdg/TokPisinOSIS.zip
> World English Bible http://eBible.org/web/webosis.zip

Looks good.  I saw just a few issues that need some correction.  The 
most important is that <verse> eID's need a value matching the preceding 
sID on another <verse> element.  I think this is the only issue that 
actually violates the spec.

Aside from that:
The book <div> elements should have an osisID attribute where you used 
scope.
The code for English is "en".  You can use "ENG" in the <language 
type="SIL"> element, however.  (This isn't yet clear from the manual, of 
course, but I expect the final version of the manual will cover this 
area adequately.)

Various other issues, like the format of the <identifier type="OSIS">, 
are in flux, and will probably be defined in OSIS 2.1 or the final 
manual.  (My current best guess at the value 
"Bible.en.Rainbow_Ministries.WEB.2004-01-22".)

> If you care to alter the <q> marker and quote marks to strictly comply 
> with the OSIS 2 documentation, then you face the following 
> difficulties:
> 
> 1. You MUST provide additional information outside of the OSIS 
> standard to the users of OSIS text that allows the punctuation to be 
> EXACTLY recreated as in the original text. The rules of this 
> recreation and the exact markers used are different for different 
> languages, different dialects, and even for different translations 
> within the same dialect. They aren't even the same for all of the 
> texts above. If you use the <q> marks in the KJV to generate red text, 
> that is OK, but if you generate quotation marks, you are changing the 
> text. The KJV has no quotation marks, nor does the ASV.

I was sympathetic with this position, since it really does make 
conversion from other formats easier, but using <q> is undeniably better.

It is true that different language, dialects, and translations use 
different standards of placing quotation marks.  However, there are also 
plenty of instances when the SAME translation demonstrates different 
standards of placing quotation marks, depending on locale, paragraphing, 
and contemporary standards.  This is part of why OSIS requires marking 
with <q> rather than typographic quotation marks.  (Another benefit is 
the potential for more richly tagged text, with speaker information.)

For example....  Neighboring Francophone and Anglophone regions may 
allow use of a single vernacular translation, but would still need to be 
printed in two editions, using French and English quotation mark 
standards, respectively.  Translations may be rendered as paragraph or 
in verse-per-line format, and will require different numbers of 
continuation quotation marks, depending on this rendering.  And 
quotation mark standards for the KJV will vary, depending on whether you 
are rendering a facsimile of the original, a modern UK edition, or a 
modern US edition.

> 2. If you scan a new Bible text that has correct quotation marks, you 
> probably won't be able to fully automate conversion from those marks 
> to <q> markup.
> 
> 3. If you fail in doing 1 or 2, above, you may be in violation of 
> copyright, trademark, and/or common law. Worse yet, you shift 
> responsibility before God from the translators to yourself for the 
> accurate transmission of His Word.

Copyright, trademark, common law, aren't involved, though contract law 
might be (depending on your contract).  Suggesting that you will somehow 
have "responsibility before God" (unless you're intentionally rendering 
incorrectly) would be pretty ridiculous and implies that every 
typesetter or translator who ever made a mistake while working on a 
Bible (probably all of them) will be held responsible for those acts.

> The OSIS spec should be changed to allow separation of quotation mark 
> generation markers from words of Jesus markers.

We probably won't ever see that, precisely because there already exists 
a way to express this.

There also probably won't ever be anything akin to a note start anchor, 
since it can already be expressed.  The first verse of the WEB reads:

<verse sID="Gen.1.1" osisID="Gen.1.1" />In the beginning <milestone 
type="x-noteStartAnchor" />God<note type="translation">After “God,” the 
Hebrew has the two letters “Aleph Tav” (the first and last letters of 
the Hebrew alphabet) as a grammatical marker.</note> created the heavens 
and the earth.<verse eID="Gen.1.1" />

and could instead be encoded with a <catchWord> to indicate the annotant 
of the <note>:

<verse sID="Gen.1.1" osisID="Gen.1.1" />In the beginning God<note 
type="translation"><catchWord>God</catchWord> After “God,” the Hebrew 
has the two letters “Aleph Tav” (the first and last letters of the 
Hebrew alphabet) as a grammatical marker.</note> created the heavens and 
the earth.<verse eID="Gen.1.1" />

or with an osisRef with a grain, to explicitly define the range of the 
annotant:

<verse sID="Gen.1.1" osisID="Gen.1.1" />In the beginning God<note 
osisRef="Gen.1.1@s[G]-Gen.1.1@s[d]" type="translation">After “God,” the 
Hebrew has the two letters “Aleph Tav” (the first and last letters of 
the Hebrew alphabet) as a grammatical marker.</note> created the heavens 
and the earth.<verse eID="Gen.1.1" />

or:

<verse sID="Gen.1.1" osisID="Gen.1.1" />In the beginning God<note 
osisRef="Gen.1.1@c[18]-Gen.1.1@c[20]" type="translation">After “God,” 
the Hebrew has the two letters “Aleph Tav” (the first and last letters 
of the Hebrew alphabet) as a grammatical marker.</note> created the 
heavens and the earth.<verse eID="Gen.1.1" />

(The first grain example uses the string matching grain, the second uses 
character counting.  These should be detailed in the manual.  If not... 
they will be in the final. :)

--Chris