[sword-devel] OSIS 2.0 and JSword

Kahunapule Michael P. Johnson Kahunapule at mpj.cx
Wed Aug 18 16:30:28 MST 2004


At 03:17 19-08-04, DM Smith wrote:
>Looking past a JSword 1.0 release, I was studying the OSIS 2.0 schema 
>and it looks like it may be tough to handle well. Specifically, there 
>are elements that can be either a marker or a container. With regard to 
>Bibles specifically a  verse may start smack dab in the middle of one of 
>these other elements. Or one of these elements may end in a verse. And 
>it might not be just one element that is split by a verse, it may be 
>several.
>...
>Does anyone know of a best practice for OSIS, or any other XML field?

I believe that the recommendation in the XSEM documentation is valid for OSIS as well. Basically, it gives lots of good reasons for making the natural poetry and prose structure of the document primary, and the chapter/verse structure secondary. It then goes on to say that for use in applications where verse priority is required (i. e. a Bible search engine), it would make sense to use XSLT to transform the XML to prioritize the chapters and verses as containers, and make the other elements milestones. This makes sense to me. The sword engine is entirely verse-oriented, as are most Bible search engines, but it is highly desirable to preserve poetry and prose formatting within each displayed verse range.

Of course, the details of what the chapter and verse priority version of OSIS, XSEM, or other similar XML standard isn't fully clear, but I would think that if you start with OSIS, you would want to keep it as close to OSIS as you can while still making every verse well-formed XML, and while making the other stuff into milestones. OSIS already allows verses to be containers, for example, and it has a milestone mechanism that can be extended to the containers for anything that can cross verse boundaries, such as paragraphs. Perhaps using the milestone element just after every verse open tag to put in redundant reminders of what kind of paragraph you are in would make sense; likewise for character styles, quotations being continued, etc.

Thoughts?




More information about the sword-devel mailing list