[sword-devel] Orphaned verse tags in SWORD modules
DM Smith
dmsmith at crosswire.org
Sat Feb 9 20:43:15 MST 2019
osis2mod does not understand the CDATA construct <![CDATA[some stuff]]> nor does the SWORD parser. The OSIS schema (xsd) defines text as xs:string, not xs:normalizeString. So spaces, tabs and newlines are not to be collapsed or replaced. I think CDATA and #PCDATA is an SGML construct, not an XML one.
However, osis2mod does convert newlines to spaces and collapses consecutive strings of spaces to a single space. The only newline in the module is at the end of the each stored entry in the module’s data file.
DM
> On Feb 9, 2019, at 5:45 PM, Greg Hellings <greg.hellings at gmail.com> wrote:
>
> Are we sure that this is a problem with the OSIS and not just an issue with the render filters? All consecutive strings of whitespace characters in XML cdata segments is supposed to be collapsed to a single space character upon processing. If we've got newlines showing up in places that a newline isn't indicated from markup, then it sounds like the issue is in the render filter, not the OSIS. But I'd have to know what OSIS construct you're referring to.
>
> --Greg
>
> On Sat, Feb 9, 2019 at 6:21 AM David Haslam <dfhdfh at protonmail.com <mailto:dfhdfh at protonmail.com>> wrote:
> Issue added:
>
> https://github.com/adyeths/u2o/issues/82 <https://github.com/adyeths/u2o/issues/82>
>
> Best regards,
>
> David
>
> Sent with ProtonMail <https://protonmail.com/> Secure Email.
>
> ‐‐‐‐‐‐‐ Original Message ‐‐‐‐‐‐‐
> On Saturday, 9 February 2019 11:17, David Haslam <dfhdfh at protonmail.com <mailto:dfhdfh at protonmail.com>> wrote:
>
>> Thanks Ryan,
>>
>> I will add an issue.
>>
>> Aside: Are we all certain that this isn’t due to a bug in the SWORD API ?
>>
>> David
>>
>> Sent from ProtonMail Mobile
>>
>>
>> On Fri, Feb 8, 2019 at 22:36, Ryan V <adyeths at gmail.com <mailto:adyeths at gmail.com>> wrote:
>>> A special utility isn't needed. This is a bug in the usfm to osis converter being used.
>>>
>>> If this happens with the output of u2o.py please submit a bug report so i can fix my converter.
>>>
>>> On Fri, Feb 8, 2019, 4:48 PM David Haslam <dfhdfh at protonmail.com <mailto:dfhdfh at protonmail.com> wrote:
>>> Friends,
>>>
>>> How to ensure that modules made from OSIS do not display orphaned verse tags (verse number followed by a new line before the paragraph starts) has been known and understood for years.
>>>
>>> Why is there not a simple Sword utility that fixes OSIS files for this one issue?
>>>
>>> We still see far too many module releases in which a Bible with paragraphed text is splattered with such orphaned verse tags.
>>>
>>>
>>> Best regards,
>>>
>>> David
>>>
>>> Sent with ProtonMail Secure Email.
>>>
>>>
>>>
>>> _______________________________________________
>>> sword-devel mailing list: sword-devel at crosswire.org <mailto:sword-devel at crosswire.org>
>>> http://www.crosswire.org/mailman/listinfo/sword-devel <http://www.crosswire.org/mailman/listinfo/sword-devel>
>>> Instructions to unsubscribe/change your settings at above page
>>
>>
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org <mailto:sword-devel at crosswire.org>
> http://www.crosswire.org/mailman/listinfo/sword-devel <http://www.crosswire.org/mailman/listinfo/sword-devel>
> Instructions to unsubscribe/change your settings at above page
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.crosswire.org/pipermail/sword-devel/attachments/20190209/18980150/attachment.html>
More information about the sword-devel
mailing list