[jsword-devel] Error with Strongs tags in Russian RST bible
DM Smith
dmsmith at crosswire.org
Mon Feb 28 19:20:45 MST 2011
The problem seems to be the following:
RST produces:
<div>
<title type='x-gen'>
Colossians 1:1
</title>
<verse osisID='Col.1.1'>
<w lemma='strong:3972'>
Павел
</w>
,
<w lemma='strong:1223|strong:2307'>
волею
</w>
<w lemma='strong:2316'>
Божиею
</w>
<w lemma='strong:652'>
Апостол
</w>
<w lemma='strong:2424'>
Иисуса
</w>
<w lemma='strong:5547'>
Христа
</w>
,
<w lemma='strong:2532'>
и
</w>
<w lemma='strong:5095'>
Тимофей
</w>
<w lemma='strong:80'>
брат
</w>
,
</verse>
</div>
KJV produces:
<div>
<title type='x-gen'>
Colossians 1:1
</title>
<verse osisID='Col.1.1'>
<w lemma='strong:G3972' morph='robinson:N-NSM' src='1'>
Paul
</w>
,
<w lemma='strong:G652' morph='robinson:N-NSM' src='2'>
an apostle
</w>
<w lemma='strong:G2424' morph='robinson:N-GSM' src='3'>
of Jesus
</w>
<w lemma='strong:G5547' morph='robinson:N-GSM' src='4'>
Christ
</w>
<w lemma='strong:G1223' morph='robinson:PREP' src='5'>
by
</w>
<w lemma='strong:G2307' morph='robinson:N-GSN' src='6'>
the will
</w>
<w lemma='strong:G2316' morph='robinson:N-GSM' src='7'>
of God
</w>
,
<w lemma='strong:G2532' morph='robinson:CONJ' src='8'>
and
</w>
<w lemma='strong:G5095' morph='robinson:N-NSM' src='9'>
Timotheus
</w>
<transChange type='added'>
our
</transChange>
<w lemma='strong:G3588 strong:G80' morph='robinson:T-NSM robinson:N-NSM' src='10 11'>
brother
</w>
,
</verse>
</div>
There are 2 significant differences in the RST:
First, it does not prefix the Strong's numbers with G or H. Second, it separates two consecutive numbers with '|' rather than ' '.
The first problem has to be fixed in ..../GBFFilter.java, because the numbers overlap and there is otherwise no way of telling whether a value is Greek (G) or Hebrew (H). The second can be fixed either in simple.xsl (or whatever stylesheet you are using) or in GBFFilter. Since OSIS specifies a space, GBFFilter should be changed. That also fixes it once for all frontends.
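To make the target concrete, here is a rough sketch of the conversion I have in mind. It is not the actual GBFFilter code: the class and method names are made up for illustration, and it assumes the raw GBF carries Strong's numbers as tags of the form <WG3972> (Greek), and by analogy <WH...> for Hebrew. It collects consecutive tags into a single lemma value, keeping the G/H letter and separating entries with a space as OSIS expects.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper for illustration only; not part of JSword. */
final class StrongsLemmaSketch {
    // Assumed GBF form: <WG3972> for Greek, <WH430> for Hebrew.
    private static final Pattern GBF_STRONGS = Pattern.compile("<W([GH])(\\d+)>");

    private StrongsLemmaSketch() {
    }

    /**
     * Builds an OSIS lemma attribute value from a run of GBF Strong's tags.
     * The G/H letter is kept and entries are joined with a single space.
     */
    static String toOsisLemma(String gbfTags) {
        List<String> lemmas = new ArrayList<>();
        Matcher m = GBF_STRONGS.matcher(gbfTags);
        while (m.find()) {
            // group(1) is the language letter, group(2) the number.
            lemmas.add("strong:" + m.group(1) + m.group(2));
        }
        return String.join(" ", lemmas);
    }
}
```

With this, StrongsLemmaSketch.toOsisLemma("<WG1223><WG2307>") yields "strong:G1223 strong:G2307", which matches the form the KJV module produces.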
In Him,
DM
On Feb 28, 2011, at 4:03 PM, Martin Denham wrote:
> And Bible throws an error when attempting to parse the xml from the Russian RST module. It appears to me that the Strongs tags do not conform to the correct xml format.
>
> I printed out the OSIS xml for the beginning of Colossians and got (ignore the question marks which are just Russian characters):
> <verse osisID='Col.1.1'/>?????<WG3972>, ?????<WG1223><WG2307> ??????<WG2316>...
>
> which isn't valid XML because the Strongs tags are all unmatched opening tags. They are not valid OSIS tags either, but then the RST is originally in GBF format, not OSIS.
>
> When I select RST in BibleDesktop and turn on Strongs Numbers it displays the numbers but they are not shown as Strongs links but large red tags.
>
> Any idea what might be the problem here - I thought it could possibly be an error in the module or in JSword GBF conversion to OSIS?
>
> This error also affects the RusVZh module. Both modules are GBF format.
>
> Thanks
> Martin