[sword-devel] Updating Clarke commentary to become readable

Karl Kleinpaste karl at charcoal.com
Sun Sep 24 14:33:47 MST 2006

"Greg Hellings" <greg.hellings at gmail.com> writes:
> That is an astounding script.  Amazingly done!  I haven't tested it, as I
> don't have Clarke's installed, but it seems that if the Sword lib is
> mishandling the & character and the <br /> tag, then the problem really lies
> within Sword and should be fixed there, ASAP.  Excellent sed-ing, though!

The problem with <br /> tags is specific to the Clarke commentary;
it's simply that whatever converted some original Clarke text into a
ThML Sword module left it with a metric blortful of unnecessary (and
interfering) <br /> tags.  The script strips out the excess in a
coherent fashion.  (Yes, that line noise is coherent.)

The `&' problem appears to be a bona fide Sword flaw.

I'm glad you like the script, but that's pretty minor sed use.
Consider: Syntactic detection of verse references using...

's/\(\(\([123][CJKPST]\|[AC-Z]\)[a-z]\+\.\?\)\? *[0-9]\+:[0-9]\+\([-,:]\( \?[0-9]\+\)\?\)*\)\([a-z]\|ff\.\)\?\([],.;:)=]\| [A-Za-z(=[]\)/<scripRef passage="\1">\1\6<\/scripRef>\7/g'

A little follow-on work is needed after that nightmare, to undo a few
false positives.

More information about the sword-devel mailing list