<!DOCTYPE html>
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p>Humans are much more flexible than machines, and I'm sure 100%
accuracy is not possible. However, I'm quite satisfied with the
performance I managed to achieve on the BSB text, and Fr Cyrille
was able to configure it to work to his satisfaction (as far as I
know) on the French-language commentary that I think was giving
Adyeths' script such trouble. I'm hoping the script will be useful
to people in its' current state, but I'd also like to improve it
to be as widely useful as is practical.</p>
<p>I'm interested in learning more about the challenges of adding
reference markup in Urdu. Do you have an OSIS source file in Urdu
and some (perhaps manually-constructed) reference markup I could
look at?<br>
</p>
<div class="moz-cite-prefix">On 29/8/23 19:29, David Haslam wrote:<br>
</div>
<blockquote type="cite"
cite="mid:7kSZldzxQQbvim9_KaWBLUrZLCDc7GavIRVVj83gJ44lpOZglD8ilslLWuluj5H7d51KTPb4ii7a4hVRtSUEgfD0Xt7FVaiDG7zqbf7eUWE=@protonmail.com">
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
<div><br>
</div>
<div dir="auto">Thanks Timothy,</div>
<div dir="auto"><br>
</div>
<div dir="auto">Adyeths withdrew his ‘orefs.py’ script because he
wasn’t satisfied that it could ever achieve 100% accurate
success.</div>
<div dir="auto"><br>
</div>
<div dir="auto">And I have observed that an even greater technical
challenge is to add OSIS reference markup for modules that use a
Right to Left script such as Urdu. Special Unicode formatting
characters are required. <caret></caret></div>
<div dir="auto"><br>
</div>
<div dir="auto">I made some real progress with the underlying
essential principles using a bespoke TextPipe filter several
years ago, but my development never saw the light of day for
various reasons outside CrossWire.</div>
<div dir="auto"><br>
</div>
<div dir="auto">Best regards,</div>
<div dir="auto"><br>
</div>
<div dir="auto">David</div>
<div><br>
</div>
<div><br>
</div>
On Tue, Aug 29, 2023 at 08:53, Timothy Allen <<a class=""
href="mailto:On Tue, Aug 29, 2023 at 08:53, Timothy Allen <<a href="
moz-do-not-send="true">thristian@gmail.com</a>> wrote:
<blockquote type="cite" class="protonmail_quote"> Since the Berean
Standard Bible translation[1] was released under a<br>
Creative Commons Zero licence earlier this year[2], including
the USFM<br>
sources and translation tables, I have been working on
converting it for<br>
use with the SWORD Project[3]. Adyeths' "u2o" script[4] produced
a very<br>
good initial conversion to OSIS XML, but there were additional
cleanups<br>
I wanted to do. Since I didn't want to do them by hand, I wrote
some<br>
scripts to do them for me, and I figured they might be useful
for other<br>
people too. Since they generally work by adding or removing OSIS
markup,<br>
I collectively refer to them as "OSIS Decorators". You can
browse the<br>
code here:<br>
<br>
<a class="moz-txt-link-freetext" href="https://gitlab.com/Screwtapello/osis-decorators">https://gitlab.com/Screwtapello/osis-decorators</a><br>
<br>
...or if you'd prefer, just download them all in ZIP file:<br>
<br>
<a class="moz-txt-link-freetext" href="https://gitlab.com/Screwtapello/osis-decorators/-/archive/main/osis-decorators.zip">https://gitlab.com/Screwtapello/osis-decorators/-/archive/main/osis-decorators.zip</a><br>
<br>
Currently, there's two scripts available, each with full
documentation<br>
and example configuration:<br>
<br>
* "linkify-references" recognises human-readable references like
"John<br>
3:16" or "Exodus 2:4, 7-9, and 17" and adds machine-readable
OSIS<br>
markup to make them clickable links; book names and typographic<br>
conventions are configurable so it should be useful for texts in<br>
languages other than English<br>
* "remove-footnote-backrefs" removes the reference from the
footnote<br>
text back to the verse it came from; very useful in printed
Bibles<br>
but a distraction in digital Bibles where you can just close the<br>
footnote popup to get back to where you were<br>
<br>
All the scripts are under the GPLv2+ licence (same as libSWORD)
and<br>
require only Python 3.6 or higher (no third-party libraries), so<br>
hopefully it won't be too difficult for people to make use of
them.<br>
<br>
If these tools sound useful to you, please try them out! I'd
love to<br>
hear about your experiences with them, or about anything you
think could<br>
be improved.<br>
<br>
Alternatively, if there's some existing repository of OSIS tools
that<br>
these could be contributed to, I'm open to that too.<br>
<br>
<br>
Timothy<br>
<br>
<br>
[1]: <a class="moz-txt-link-freetext" href="https://berean.bible/">https://berean.bible/</a><br>
<br>
[2]: <a class="moz-txt-link-freetext" href="https://berean.bible/licensing.htm">https://berean.bible/licensing.htm</a><br>
<br>
[3]: <a class="moz-txt-link-freetext" href="https://crosswire.org/sword/modules/ModInfo.jsp?modName=BSB">https://crosswire.org/sword/modules/ModInfo.jsp?modName=BSB</a><br>
<br>
[4]: <a class="moz-txt-link-freetext" href="https://github.com/adyeths/u2o/">https://github.com/adyeths/u2o/</a><br>
_______________________________________________<br>
sword-devel mailing list: <a class="moz-txt-link-abbreviated" href="mailto:sword-devel@crosswire.org">sword-devel@crosswire.org</a><br>
<a class="moz-txt-link-freetext" href="http://crosswire.org/mailman/listinfo/sword-devel">http://crosswire.org/mailman/listinfo/sword-devel</a><br>
Instructions to unsubscribe/change your settings at above page<br>
</blockquote>
<br>
<fieldset class="moz-mime-attachment-header"></fieldset>
<pre class="moz-quote-pre" wrap="">_______________________________________________
sword-devel mailing list: <a class="moz-txt-link-abbreviated" href="mailto:sword-devel@crosswire.org">sword-devel@crosswire.org</a>
<a class="moz-txt-link-freetext" href="http://crosswire.org/mailman/listinfo/sword-devel">http://crosswire.org/mailman/listinfo/sword-devel</a>
Instructions to unsubscribe/change your settings at above page
</pre>
</blockquote>
</body>
</html>