[sword-devel] Orphaned verse tags in SWORD modules

David Haslam dfhdfh at protonmail.com
Sat Feb 9 23:55:39 MST 2019


Eliminating orphaned verse tags can be done by tweaking the OSIS input file.

The workaround involves moving some verse eID milestones such that there are no other OSIS elements before the next verse sID milestone.

Except, of course, at a chapter boundary.

The fact that I enquired about XML whitespace in a separate email should not deflect anyone from thinking that this issue is anything to do with whitespace.

Whether this is partly due to anything in our software or wholly due to the conversion from USFM to OSIS is now the main focus, or should be.

I added an issue in adyeths/u2o just as he requested.

Best regards,

David

Sent from ProtonMail Mobile

On Sun, Feb 10, 2019 at 03:43, DM Smith <dmsmith at crosswire.org> wrote:

> osis2mod does not understand the CDATA construct <![CDATA[some stuff]]> nor does the SWORD parser. The OSIS schema (xsd) defines text as xs:string, not xs:normalizeString. So spaces, tabs and newlines are not to be collapsed or replaced. I think CDATA and #PCDATA is an SGML construct, not an XML one.
>
> However, osis2mod does convert newlines to spaces and collapses consecutive strings of spaces to a single space. The only newline in the module is at the end of the each stored entry in the module’s data file.
>
> DM
>
>> On Feb 9, 2019, at 5:45 PM, Greg Hellings <greg.hellings at gmail.com> wrote:
>>
>> Are we sure that this is a problem with the OSIS and not just an issue with the render filters? All consecutive strings of whitespace characters in XML cdata segments is supposed to be collapsed to a single space character upon processing. If we've got newlines showing up in places that a newline isn't indicated from markup, then it sounds like the issue is in the render filter, not the OSIS. But I'd have to know what OSIS construct you're referring to.
>>
>> --Greg
>>
>> On Sat, Feb 9, 2019 at 6:21 AM David Haslam <dfhdfh at protonmail.com> wrote:
>>
>>> Issue added:
>>>
>>> https://github.com/adyeths/u2o/issues/82
>>>
>>> Best regards,
>>>
>>> David
>>>
>>> Sent with [ProtonMail](https://protonmail.com/) Secure Email.
>>>
>>> ‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
>>> On Saturday, 9 February 2019 11:17, David Haslam <dfhdfh at protonmail.com> wrote:
>>>
>>>> Thanks Ryan,
>>>>
>>>> I will add an issue.
>>>>
>>>> Aside: Are we all certain that this isn’t due to a bug in the SWORD API ?
>>>>
>>>> David
>>>>
>>>> Sent from ProtonMail Mobile
>>>>
>>>> On Fri, Feb 8, 2019 at 22:36, Ryan V <adyeths at gmail.com> wrote:
>>>>
>>>>> A special utility isn't needed. This is a bug in the usfm to osis converter being used.
>>>>>
>>>>> If this happens with the output of u2o.py please submit a bug report so i can fix my converter.
>>>>>
>>>>> On Fri, Feb 8, 2019, 4:48 PM David Haslam <dfhdfh at protonmail.com wrote:
>>>>>
>>>>>> Friends,
>>>>>>
>>>>>> How to ensure that modules made from OSIS do not display orphaned verse tags (verse number followed by a new line before the paragraph starts) has been known and understood for years.
>>>>>>
>>>>>> Why is there not a simple Sword utility that fixes OSIS files for this one issue?
>>>>>>
>>>>>> We still see far too many module releases in which a Bible with paragraphed text is splattered with such orphaned verse tags.
>>>>>>
>>>>>> Best regards,
>>>>>>
>>>>>> David
>>>>>>
>>>>>> Sent with ProtonMail Secure Email.
>>>>>>
>>>>>> _______________________________________________
>>>>>> sword-devel mailing list: sword-devel at crosswire.org
>>>>>>
>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>>>>>
>>>>>> Instructions to unsubscribe/change your settings at above page
>>>
>>> _______________________________________________
>>> sword-devel mailing list: sword-devel at crosswire.org
>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>> Instructions to unsubscribe/change your settings at above page
>>
>> _______________________________________________
>> sword-devel mailing list: sword-devel at crosswire.org
>> http://www.crosswire.org/mailman/listinfo/sword-devel
>> Instructions to unsubscribe/change your settings at above page
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.crosswire.org/pipermail/sword-devel/attachments/20190210/3785365a/attachment-0001.html>


More information about the sword-devel mailing list