[osis-editors] suggested corrections
Michael Paul Johnson
osis-editors@bibletechnologieswg.org
Wed, 17 Mar 2004 09:47:56 +1000
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
At 01:29 17-03-04, you wrote:
>Michael Paul Johnson wrote:
...
>> The first difficulty came in attempting to encode the required
>> xml:lang attribute on the osisText element. There is no 2-letter
>> code
>> for this language. Two-letter codes for languages are inadequate
>> for
>> Bible translation use, because there are over 6,000 living
>> languages
>> spoken today on this planet. Two-letter codes are limited to 676
>> combinations. Three letters could work, albeit a bit cryptically,
>> using SIL Ethnologue codes. Tok Pisin does have a perfectly good
>> three-letter Ethnologue code (PDG), which I tried to insert, and
>> that
>> caused a validation failure. Suggestion: make this attribute
>> OPTIONAL,
>> but require the header/work/language element, which does allow
>> language type "SIL".
>>
>Actually, it should validate if you use "x-" to preface the value you
>are using for xml:lang. I don't know why the W3C choose such a small
>list of language codes but they do allow IANA language codes and
>anything that is preceded by an "x-".
I now use xml:lang="x-SIL-PDG" in that spot.
>> The second difficulty came in encoding verse identifiers for verse
>> bridges. Because of word order constraints and translational
>> considerations, it is often better to translate more than one verse
>> as
>> a unit. A few less literal translations and paraphrases even in
>> English use verse bridges, too. In these cases, the verses are
>> marked
>> with a range. Allowing an osisID range in the same way as an
>> osisRef
>> range would solve this problem. OSIS is not usable for a broad
>> range
>> of Bible texts without a good way to encode verse bridges. I
>> recommend
>> using the same syntax for verse bridge OSIS IDs as for OSIS
>> references, but document that the meaning of this syntax for an
>> osisID
>> is for a verse bridge, and should not be used to identify passages
>> that really are translated verse by verse.
>>
>
>The value of an osisID can be a list of osisIDs. That is you can say:
><p
>osisID="John.1.1 John.1.2 John.1.3"> Note the whitespace separation
>of
>the IDs.
>
>Unlike the range solution you propose, I don't know of a requirement
>that the refereces be contiguous, although they are most likely to be
>so.
OK. This works well enough. Lists can be cumbersome, but most verse
bridges are short, anyway, and it is possible (although not highly
probable) that someone would want to drop a verse from the middle of a
bridge and put it in a footnote for textual criticism reasons.
>> One minor point I noticed while attempting to convert the World
>> English Bible and Hebrew Names Version to OSIS 2.0 was that
>> footnote
>> start tags had no equivalent in OSIS. These are the tags that make
>> it
>> easy to convert the text that a note refers to into a hyperlink (as
>> you will see in, for example, http://eBible.org/web/John.htm, which
>> was converted from web.gbf in http://eBible.org/web/webgbf.zip).
>> For
>> now, I threw in some milestone elements with x-startNoteAnchor
>> attributes at those points. The equivalent element in XSEM is the
>> "anchor" element.
>>
>
>Not sure I understand but OSIS has a note element, as well as the
>HTMP
><a> element. Oh, you mean how does the anchor get there to point to
>the
>note? If so, that is a matter of processing, not encoding.
>
>For example, I have a note in the text and I want to print the text
>to
>paper (yikes!). What gets rendered is a number in the text, usually
>superscripted and a matching number at the bottom of the page or end
>of
>the chapter. Neither of those is actually marked in the text.
>
>The reason for that is the question you raise, now I want to do an
>HTML
>document and I want a hyperlink to the note. In addition to doing the
>superscripted number, perhaps even highlighting the word where it is
>found, I automatically generate a hyperlink as well that points to
>the
>note.
But where should the hyperlink start?
>Think of the numbers you see on footnotes as being manual pointers
>and
>you will see what I am talking about.
I don't think you get my point, yet, so I'll try again by explaining
how GBF works, and how I process it, and then maybe you will see the
hole in the current draft of OSIS. In GBF, each footnote MUST have a
point marked where the footnote is inserted. At this point, the
footnote marker is NOT specified, but that is where one would go. The
text of the footnote is inserted within the element (between <RF> and
<Rf>, which if it were HTML would probably be <footnote> and
</footnote>. In GBF, each footnote MAY also have an indicator that
indicates the beginning, rather than the end, of what the footnote
pertains to (<RB>). If the footnote does not have this marker, then
the footnote itself should make that obvious. In processing for
display or conversion to another format, the mandatory footnote
elements are processed just as for OSIS when that is all there is. For
example, in print, a marker may be inserted in the text (such as a
superscript letter, number, or a symbol like asterisk or dagger), and
the text rendered at the end of the page. In HTML, a hyperlinked
marker may be used to invoke a pop-up, a link to a footnote in another
frame, or to an anchor indicating where the footnote is in the current
document, with a return link back to the footnote insertion point.
When the optional beginning marker (<RB>) is present, then much better
HTML is easier to render for footnotes, in that the hyperlink can
easily by applied to the word or phrase so marked, rather than to an
extra marker. You can't do that with OSIS, without resorting to an
"extra" milestone, like I did. I suppose that you could use the markup
within the footnote to extrapolate back to where the beginning marker
should be if it were there, but that is not worth the effort in my
opinion.
>Hope you are having a great day!
Yes, I am. :-)
>> Thank you for your work towards making OSIS a usable standard. It
>> isn't too far off...
You are welcome. I hope that it actually gets accepted widely as a
Scripture interchange & archiving format. Most of its shortcomings can
be overcome, and in those cases where they can't, conversions can be
made to and from more suitable formats.
May God bless you.
Michael
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (MingW32)
Comment: http://eBible.org/mpj/gpg.htm
iD8DBQFAV5IjRI/gxxfXR7sRAqwcAJ90FSUrvzQ6pVshdiTUH0ZgK2rjoACbB5x7
UuoYeH4OJENJq1tdI5PDkBY=
=xwrr
-----END PGP SIGNATURE-----