[sword-devel] Orphaned verse tags in SWORD modules
David Haslam
dfhdfh at protonmail.com
Sun Feb 10 01:45:42 MST 2019
https://github.com/adyeths/u2o/issues/82
Sent from ProtonMail Mobile
On Sun, Feb 10, 2019 at 06:55, David Haslam <dfhdfh at protonmail.com> wrote:
> Eliminating orphaned verse tags can be done by tweaking the OSIS input file.
>
> The workaround involves moving some verse eID milestones such that there are no other OSIS elements before the next verse sID milestone.
>
> Except, of course, at a chapter boundary.
>
> The fact that I enquired about XML whitespace in a separate email should not deflect anyone from thinking that this issue is anything to do with whitespace.
>
> Whether this is partly due to anything in our software or wholly due to the conversion from USFM to OSIS is now the main focus, or should be.
>
> I added an issue in adyeths/u2o just as he requested.
>
> Best regards,
>
> David
>
> Sent from ProtonMail Mobile
>
> On Sun, Feb 10, 2019 at 03:43, DM Smith <dmsmith at crosswire.org> wrote:
>
>> osis2mod does not understand the CDATA construct <![CDATA[some stuff]]> nor does the SWORD parser. The OSIS schema (xsd) defines text as xs:string, not xs:normalizeString. So spaces, tabs and newlines are not to be collapsed or replaced. I think CDATA and #PCDATA is an SGML construct, not an XML one.
>>
>> However, osis2mod does convert newlines to spaces and collapses consecutive strings of spaces to a single space. The only newline in the module is at the end of the each stored entry in the module’s data file.
>>
>> DM
>>
>>> On Feb 9, 2019, at 5:45 PM, Greg Hellings <greg.hellings at gmail.com> wrote:
>>>
>>> Are we sure that this is a problem with the OSIS and not just an issue with the render filters? All consecutive strings of whitespace characters in XML cdata segments is supposed to be collapsed to a single space character upon processing. If we've got newlines showing up in places that a newline isn't indicated from markup, then it sounds like the issue is in the render filter, not the OSIS. But I'd have to know what OSIS construct you're referring to.
>>>
>>> --Greg
>>>
>>> On Sat, Feb 9, 2019 at 6:21 AM David Haslam <dfhdfh at protonmail.com> wrote:
>>>
>>>> Issue added:
>>>>
>>>> https://github.com/adyeths/u2o/issues/82
>>>>
>>>> Best regards,
>>>>
>>>> David
>>>>
>>>> Sent with [ProtonMail](https://protonmail.com/) Secure Email.
>>>>
>>>> ‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
>>>> On Saturday, 9 February 2019 11:17, David Haslam <dfhdfh at protonmail.com> wrote:
>>>>
>>>>> Thanks Ryan,
>>>>>
>>>>> I will add an issue.
>>>>>
>>>>> Aside: Are we all certain that this isn’t due to a bug in the SWORD API ?
>>>>>
>>>>> David
>>>>>
>>>>> Sent from ProtonMail Mobile
>>>>>
>>>>> On Fri, Feb 8, 2019 at 22:36, Ryan V <adyeths at gmail.com> wrote:
>>>>>
>>>>>> A special utility isn't needed. This is a bug in the usfm to osis converter being used.
>>>>>>
>>>>>> If this happens with the output of u2o.py please submit a bug report so i can fix my converter.
>>>>>>
>>>>>> On Fri, Feb 8, 2019, 4:48 PM David Haslam <dfhdfh at protonmail.com wrote:
>>>>>>
>>>>>>> Friends,
>>>>>>>
>>>>>>> How to ensure that modules made from OSIS do not display orphaned verse tags (verse number followed by a new line before the paragraph starts) has been known and understood for years.
>>>>>>>
>>>>>>> Why is there not a simple Sword utility that fixes OSIS files for this one issue?
>>>>>>>
>>>>>>> We still see far too many module releases in which a Bible with paragraphed text is splattered with such orphaned verse tags.
>>>>>>>
>>>>>>> Best regards,
>>>>>>>
>>>>>>> David
>>>>>>>
>>>>>>> Sent with ProtonMail Secure Email.
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> sword-devel mailing list: sword-devel at crosswire.org
>>>>>>>
>>>>>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>>>>>>
>>>>>>> Instructions to unsubscribe/change your settings at above page
>>>>
>>>> _______________________________________________
>>>> sword-devel mailing list: sword-devel at crosswire.org
>>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>>> Instructions to unsubscribe/change your settings at above page
>>>
>>> _______________________________________________
>>> sword-devel mailing list: sword-devel at crosswire.org
>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>> Instructions to unsubscribe/change your settings at above page
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.crosswire.org/pipermail/sword-devel/attachments/20190210/452a968d/attachment.html>
More information about the sword-devel
mailing list