[osis-core] Thoughts and Questions on compare file
Patrick Durusau
Patrick.Durusau at sbl-site.org
Wed Oct 27 05:54:37 MST 2004
Jim,
Some thoughts and questions on the latest compare file.
At first blush I did not see anything that we can't handle in OSIS now
or with minor modification.
Decided to copy the osis-core list so we can get comments from others as
well.
Will be checking today on actually producing a more restricted schema,
that is one that takes out the x- capability and leaves only enumerated
values. I take it you are aware that your software interface does not
have to display the possibility of an x- value for attributes? That is
to say you could only display enumerated values and not leave the user
any mechanism for departing from the list?
Side note to osis-core on the restricted schema: What I am envisioning
is a very small schema that imports osis-core and redefines/restricts
attribute values to the enumerated lists. Since such an instance would
be a conforming OSIS document (the greater always includes the lesser) I
don't see any compatibility problems.
Specific comments follow:
Alluded_Text: Posting a note today to add this as enumerated value on seg.
Attribution: Should this be on lg or l? Thinking that it is more likely
on l as part of a lg. On the other hand, don't you have the same case
where you would not be using lg or l?
Book_ID: I am not sure what you are asking for here? Isn't this already
contained in the osisID?
Chapter_Head, Chapter_Label, Chapter_Number: Aren't these variants of
title? That is to say that a chapter has a title and these are
'additional' titles of some particular type?
My first reaction is to have:
Chapter_Head = title type="main"
Chapter_Label = title type="sub"
Chapter_Number = title type="sub"
Granted that osisTitles (for type on title) only enumerates
<xs:enumeration value="acrostic"/>
<xs:enumeration value="continued"/>
<xs:enumeration value="main"/>
<xs:enumeration value="parallel"/>
<xs:enumeration value="psalm"/>
<xs:enumeration value="sub"/>
Citation_Line1 (etc): Question, you list lg but I assume l?
BTW, we have otPassage on seg. So, would <l><seg>...</seg></l>, meet the
need here?
Err, actually just looked at the John the Baptist example and so you
would want something like otPassage on lg. Hmmm, then you could do the
line1, line2, etc. with XSLT. Actually would reduce the amount of markup
you would need since if I am in a lg type=otPassage, then lines 1, 2,
etc. fall out from the structure. I think that works for me if it works
for you.
Citation_Paragraph: Looks like a block quote that contains a paragraph,
which as you know can contain a reference element. I think this is what
we used to call <cit> in TEI, which had a <q> followed by a <ref> (don't
hold me to the names) element.
Not sure what would be different about using a block quote that contains
a reference element.
Citation_Reference: Why isn't this simply a reference with a type? That
is to say all references are citation_references in some sense of the
term. Since we have an element for marking all references, why not use
that and add a type if necessary?
Closing: What is the problem with closer here?
Congregational_Response: I will post this to the list. Suggestion for
attribute value? response? congregation? on lg.
Copyright_Statement: Covered under rights in the header. This is the
standard location in Dublin Core. Don't think we gain anything by adding
another potential location for the information.
Unless you mean to say that the copyright page as an artifact needs to
be encoded. I suspect we could add a type to div but to be honest, I
don't see the point. Just make it a div and if you need copyright
information, get it from the header.
Credits: Same here, I could counsel just paragraphs with the usual
sub-elements. Don't gain anything if you have properly prepared the
header which has the very long enumeration of roles for credit, etc. I
guess in part I don't see any reason to priviledge a poor retelling of
information already presented in a useful fashion in the header.
Not saying people should not enter it, but it is just an artifact of
printing and not something you will need to identify later for
processing. For those purposes, use the header information.
Doxology: Let me check on that one. I know we have discussed, probably
should be added.
Embedded_text (all entries): We have discussed, posting to list for
adding type to q.
Emphasis: ??? Sorry, why isn't this covered by hi?
Gloss: Hmmm, would require annotateRef so you could like to the word or
phrase being glossed. Suggest same places and content model as hi?
Hand: Perhaps it is just confusion with the use of 'hand' to mean in
transcription circles the scribal hand and not references to hand in the
text, but I don't see this as a different element. Fair enough that the
text says: 'in my own hand' but that does not seem to me to be a
separate element in the structure of the text. We should discuss this
one. I would suggest hi or seg at first blush.
Inscription_Paragraph: Why doesn't the inscription element work here?
Interlude: Selah is enumerated under osisLine. Suggest same for Interlude?
Intro (all): Suggest introduction type on div?
Line1-*: I assume from your notes you are handling these with XPath
expressions?
Name_of_God: enumerate types, I assume you don't have any to add to the
list I posted?
Paragraph_Continuation: As we discussed, this is handled automatically
in tree representations.
Parallel_Passage_Ref: Yes, handled by reference element
Quoted_Text: Why isn't this handled by q? As opposed to seg type =
quoted text?
Refrain: add type to lg?
Rights: In terms of accessable information, handled by Dublin Core
element in header. Is there some reason to duplicate here?
Section_Head_List: Do you mean a list within a list? If so, note that
list contains list and all lists have head.
See_in_Glossary: ?? The reference element is not empty. Reference is not
limited to simply being Bible references but can contain a reference to
another part of the work, such as a glossary, perhaps a map, etc.
So_Called: Actually these are examples of the mentioned element.
BTW, the example in the help file is incorrect. The last occurrence of
'sinners' is not a mentioned or so_called element. The quoted statement
of the Pharisee's were *using* the term. The preceeding uses are
examples of mentioned, that is the apeaker of those occurrences was not
*using* the term.
Speech_lines (all): I assume you are going to handle these with XPath?
Stanza_Break: add type to lg?
Title_Main and Title_Tertiary: I assume the type on title works for
main. Since sub titles should be inside of title, is there a need for
another type here?
Thinking <title type="main"> blah, blah <title type="sub">blah, blah
<title type="sub">blah, blah</title>(closes tertiary title) </title>
(closes secondary title) </title> closes main title, and allow you to do
uniform XPath expressions for all cases.
If you are going to use styles, etc., so no one sees the XML, suggest
the embedding method for more reliable XPath processing.
Untranslated_Word: ??? Sorry, you have me on that one and I could not
find an example in the help file. Wouldn't this be foreign?
Variant_Section_Paragraph/Head/Tail: Note sure what is being requested.
Look at rdg. By definition, rdg is a variant so everything inside is
about a variant. Perhaps if you could say a bit more about this one.
Verse_Number_Alternate: Hmmm, why not have more than one osisID, with
the alternative prefixed by a work? Display is of course up to the
application.
Words_of_Christ: I take it that the who attribute works for you?
Hope you are having a great day!
Patrick
--
Patrick Durusau
Director of Research and Development
Society of Biblical Literature
Patrick.Durusau at sbl-site.org
Chair, V1 - Text Processing: Office and Publishing Systems Interface
Co-Editor, ISO 13250, Topic Maps -- Reference Model
Topic Maps: Human, not artificial, intelligence at work!
More information about the osis-core
mailing list