[sword-devel] OSIS quote marker attribute type="x-doNotGeneratePunctuation"

DM Smith dmsmith at crosswire.org
Sat Oct 16 09:05:18 MST 2010


Looks like our emails crossed in the ether.

On Oct 16, 2010, at 11:38 AM, David Haslam wrote:

> 
> My earlier conjecture was incorrect. 
> 
> The root causes are to do with the WEB module source text, 
> for the 680 places which are marked as Words of Jesus.
> 
> (a) attribute type="x-doNotGeneratePunctuation"
> 
> This is not documented in the OSIS standard and does not do anything useful.
> It may be removed entirely.

Yes.

> 
> (b) who="Jesus" 
> 
> When used at the start of a quotation marked by <q sID="..." who="Jesus" />, 
> the corresponding end of quotation marked by <q eID="..." /> must also
> include who="Jesus".
> These must be inserted in all 680 places.

I don't think the original source should have who="Jesus" on the <q eID="xxx"/> end marker. osis2mod should add it for the SWORD engine; if it doesn't, that is a shortcoming of osis2mod.

osis2mod uses the sID to match the eID, so each sID/eID pair has to be unique within a document.
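
As a stopgap, the matching could be done on the source before import. The following is only a rough sketch of the idea, not what osis2mod itself does: propagate_who is a hypothetical helper name, and it assumes the quotes are milestone-style <q .../> tags with double-quoted attributes. It uses regular expressions rather than an XML parser because an exported imp file is not one well-formed XML document.

```python
import re

# Milestone-style <q .../> tag; group 1 is the raw attribute text.
Q_TAG = re.compile(r'<q\b([^<>]*?)/>')
# name="value" attribute pairs (double-quoted values assumed).
ATTR = re.compile(r'([\w:-]+)="([^"]*)"')


def propagate_who(osis: str) -> str:
    """Copy who="..." from each <q sID=.../> onto the <q eID=.../> with the same ID.

    Hypothetical helper, for illustration only.
    """
    who_by_id = {}
    for m in Q_TAG.finditer(osis):
        attrs = dict(ATTR.findall(m.group(1)))
        if "sID" in attrs:
            # The sID/eID pairs must be unique, so a repeated sID is an error.
            if attrs["sID"] in who_by_id:
                raise ValueError(f"duplicate sID: {attrs['sID']}")
            who_by_id[attrs["sID"]] = attrs.get("who")

    def fix_end(m):
        attrs = dict(ATTR.findall(m.group(1)))
        who = who_by_id.get(attrs.get("eID"))
        # Leave start tags, unmatched end tags, and already-tagged ends alone.
        if "eID" not in attrs or who is None or "who" in attrs:
            return m.group(0)
        return f'<q{m.group(1).rstrip()} who="{who}"/>'

    return Q_TAG.sub(fix_end, osis)
```

Run over the whole source text, this leaves end markers without a matching who-bearing start marker untouched.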


> 
> (c) A workaround is needed for the curly quotation marks used in the WEB
> module, 
>     but only for the start and end of places marked as Words of Jesus.

If quotation marks are being used, only the start and end of the quotes need to be marked. If these are also marked as who="Jesus", osis2mod will do lots of transformations when they cross verse (and even chapter) boundaries, to ensure that a verse shown in isolation will still have "red-letter text".

There is another type of quote marker that you will probably encounter, the continuing quotation mark. I think this is used in the WEB at the start of paragraphs, within a quote. It will be marked with (if memory serves):
<milestone type="cQuote" marker="“"/>

> 
> (i) At the start of each passage, remove the left curly quotation mark,
>    and insert the appropriate attribute
>    marker="“" for double quotation mark, or marker="‘" for single quotation
> mark.
> 
> (ii) At the end of each passage, remove the right curly quotation mark,
>    and insert the appropriate attribute
>    marker="”" for double quotation mark, or marker="’" for single quotation
> mark.
> 
> (d) There were 13 remaining places which were special cases. Variously...
>      The end of quotation XML tag needed to be moved back to before the
> note tag.
>      The end of quotation XML tag needed to be moved back to before the
> end-paragraph tag.
>      An end quotation mark was missing in a few places.
>      A question mark had to be moved in John 7:36.
>    These special cases were few enough to be corrected manually.
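
Steps (i) and (ii) of point (c) above could be scripted roughly as follows. This is a sketch under stated assumptions, not the procedure actually used: move_quote_marks is a hypothetical helper name, and it assumes that the opening curly mark immediately follows the <q sID=.../> tag and the closing mark immediately precedes the matching <q eID=.../> tag. Only quotes whose start tag carries who="Jesus" are touched.

```python
import re

# <q ... sID="..." .../> immediately followed by an opening curly mark.
START = re.compile(r'<q\b([^<>]*?\bsID="([^"]+)"[^<>]*?)\s*/>([“‘])')
# A closing curly mark immediately followed by <q ... eID="..." .../>.
END = re.compile(r'([”’])<q\b([^<>]*?\beID="([^"]+)"[^<>]*?)\s*/>')


def move_quote_marks(osis: str) -> str:
    """Move literal curly quotes into marker="..." on Words-of-Jesus quotes.

    Hypothetical helper, for illustration only.
    """
    jesus_ids = set()

    def fix_start(m):
        attrs, sid, mark = m.group(1), m.group(2), m.group(3)
        if 'who="Jesus"' not in attrs or 'marker=' in attrs:
            return m.group(0)
        jesus_ids.add(sid)  # remember the ID so the matching end is fixed too
        return f'<q{attrs} marker="{mark}"/>'

    def fix_end(m):
        mark, attrs, eid = m.group(1), m.group(2), m.group(3)
        if eid not in jesus_ids or 'marker=' in attrs:
            return m.group(0)
        return f'<q{attrs} marker="{mark}"/>'

    # Starts first, so jesus_ids is complete before the ends are processed.
    osis = START.sub(fix_start, osis)
    return END.sub(fix_end, osis)
```

Anything that does not fit the assumed layout (such as the special cases in point (d)) is left unchanged and would still need manual correction.
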
> 
> I have performed all these corrections on the exported file WEB.raw.imp.txt 
> and remade the module using imp2vs - checking that it works with Xiphos for
> Windows.
> 
> I have demonstrated that the WEB module can be rectified by suitable changes
> to the OSIS markup.
> 
> The biggest surprise was the discovery in point (b) above, astounding
> because it is not documented.
> This requires further attention in the CrossWire wiki. 

It may need to be documented if osis2mod does not do it for you, but it should then be marked as a workaround for that shortcoming.
> 
> The question now is how to proceed for a properly engineered repeatable
> solution?
> 
> I am assuming that the WEB module was made from the file webosis.xml
> downloaded from ebible.org/web
> 
> Michael Johnson is moving towards maintaining all his Bible translations
> with USFM format as the master copy.  (email to me a few days ago).
> 
> I would therefore recommend that CrossWire uses his USFM downloads,
> and that we convert these to OSIS ourselves as a step towards module making.

I agree, and we should share usfm2osis.pl back with Paul (i.e. make him aware that it is what we used and where he can get it).

> 
> If we find that we need some further tweaks to usfm2osis.pl then so be it.
> 
> Best regards,
> 
> David Haslam
> Go Bible project leader
> CrossWire Bible Society
> 
> 
> 
> 
> 
> -- 
> View this message in context: http://sword-dev.350566.n4.nabble.com/OSIS-quote-marker-attribute-type-x-doNotGeneratePunctuation-tp2996890p2998371.html
> Sent from the SWORD Dev mailing list archive at Nabble.com.
> 
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page



