[sword-devel] Questions about usfm2osis.pl
DM Smith
dmsmith at crosswire.org
Wed Oct 28 09:19:31 MST 2009
On Oct 28, 2009, at 12:04 PM, <johnduffy at cgcf.net> wrote:
> Hi Daniel,
>
> Yes, it works even using a relative path if only one file is
> specified but not the *.sfm.
Expansion of * is something that the os needs to do. I don't know if
or how Windows does it.
>
> It is interesting that an unrecognised character is seen at the
> start of the file, which appears as a square box in front of (or
> here on top of) the first backslash.
This sounds like a BOM (byte order mark). FEFF, I think. It can be
deleted if the file is utf-8.
> It also keeps the \id as in the extract below, but misses the book
> div and all book abbreviations in the verseIDs.
>
> From USFM2osis.pl for single book of Psalms in Windows:
> </header>
> \id PSA
> <title type="main">Leabhar na SALM</title>
> <div type="majorSection">
> <title>An Chéad Leabhar</title>
> <chapter sID=".1" osisID=".1"/>
> <div type="section">
> <title>Fíorshonas</title>
> <lg>
> <l level="1">
> <verse sID=".1.1" osisID=".1.1"/>
> Is aoibhinn don fhear nach...
>
> From USFM2osis.pl in Linux Psalms extract from whole Bible (Note no
> poetry markup was present when this was done, as well as combined
> titles):
> <div type="book" osisID="Ps">
> <title short="Leabhar na SALM"/>
> <div>
> <title type="main">Leabhar na SALM</title>
> <chapter sID="Ps.1" osisID="Ps.1" n="1"/>
> </div>
> <div type="section" canonical="true">
> <title level="1">An Chéad Leabhar -- Fíorshonas</title>
> <verse sID="Ps.1.1" osisID="Ps.1.1"/>
> <p> Is aoibhinn don fhear nach...
>
> In the process, I've seen on individual files that USFM2osis.pl does
> handle poetry as well as paragraph markup. If it handled the
> verseIDs correctly, I'd be tempted to do each Bible book separately
> and then try to put them all together into a single osis file.
>
> Alternatively, since I'm happy with the USFM content now that I've
> got both the paragraph and poetry markup in place, as well as
> headings separated out into different levels, do you think that
> you'd have time to compile it again in Linux?
>
> Thanks
>
> John Duffy
>
> When I have used usfm2osis.pl successfully it has always been in
> Linux.
> I'd be willing to bet that Chris knows the answer to this one, but the
> problem is not the USFM files but how usfm2osis.pl grabs the input
> from
> them. It is possible to process one file if you give an argument
> with an
> absolute path. However, if you use wildcards like *.sfm, it doesn't
> work. I don't know why. Again, I'm guessing Chris knows how to do
> this,
> but I've never figured it out. I've just used Linux.
>
> Daniel
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
>
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
More information about the sword-devel
mailing list