<br><div class="gmail_quote">On Thu, Apr 23, 2009 at 3:42 PM, Jonathan Marsden <span dir="ltr"><<a href="mailto:jmarsden@fastmail.fm">jmarsden@fastmail.fm</a>></span> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Earlier, in response to Ben, I wrote:<br>
<br>
> If I have time later tonight, once I have a sword 1.6.0RC2 package out<br>
> for front end developers to test with, I'll see about creating the 198<br>
> item single locale test and (if I succeed!) I'll contribute it to the<br>
> SWORD codebase, just to demonstrate that creating such a test really is<br>
> in fact doable :)<br>
<br>
Here is abbreviations.sh. I got an initial set of abbreviations from<br>
the OSIS User Manual, by cut and paste, and then added (usually) two<br>
more per book, that I just made up. </blockquote><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><br>
In the process of creating and testing it under 1.6.0RC2 I found a<br>
surprises. It may really be a bug, demonstrating to Ben and the team<br>
the value of this kind of very simple regression test script... or it<br>
may just reflect my lack of understanding of the intent and workings of<br>
the SWORD library.</blockquote><div></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><br>
<br>
parsekey 1Ki does not display I Kings, but I Samuel. Likewise for 2Ki<br>
and II Samuel. Similar behavior is seen for 1Kin, 2Kin, 1King and<br>
2King. Intuitively, this feels incorrect to me. As some mild degree of<br>
justification for this, I offer:<br>
<br>
<a href="http://www.biblegateway.com/passage/?search=1Ki" target="_blank">http://www.biblegateway.com/passage/?search=1Ki</a><br>
<br>
which does what I would expect -- it displays I Kings.</blockquote><div>This is another problem from adding additional ones and needs to be fixed (specifically, 1 Kingdoms = 1 Samuel). My testing script was only checking against new books, so it didn't pick this up...<br>
</div><div> </div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
(A test tool that compares the results of lookups on <a href="http://biblegateway.com" target="_blank">biblegateway.com</a><br>
and using SWORD might be an interesting thing for someone to create, BTW!).<br>
<br>
Observations: There are more abbreviations which I would like to work,<br>
but which don't seem to, such as Gn for Genesis, SoS for Song of<br>
Solomon, but I have no real idea if these are "legitimate"<br>
abbreviations. I do not claim to be a domain expert on abbreviations :)<br>
I am fairly confident that 1Ki *is* an abbreviation for I Kings that I<br>
have seen and used before, not just on <a href="http://biblegateway.com" target="_blank">biblegateway.com</a>.<br>
</blockquote><div>It seems lots of custom ones were dropped during the transition that used to be there. BPBible has its own english conf file (<a href="http://code.google.com/p/bpbible/source/browse/trunk/locales/locales.d/SWORD_1512/bpbible.conf">http://code.google.com/p/bpbible/source/browse/trunk/locales/locales.d/SWORD_1512/bpbible.conf</a>), so many of the problems others have seen in other frontends do not apply to it. "I C" (and "II C") is another error case which should be handled, but this one looks a little more complicated as the data clearly points it to 1 Corinthians - but it never gets there...<br>
<br></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Here is the (very simple!) script. I've assigned copyright to<br>
CrossWire. It's a trivial for loop plus a list of abbreviations. I can<br>
create a Jira item for it, and attach the script and a .good file for<br>
it, if that would be preferred.</blockquote><div>Another thing that could be done is to create all possible abbreviations from the abbreviations file... Anyway, this is a good addition. You have Ez for Ezekiel and Ezra (but currently it is Ezekiel) and Ze for Zephaniah and Zechariah (currently Zechariah). Also Phi for Philemon - it goes to Philippians currently. I've added these to verseparsing.sh with the conflicting ones removed. It doesn't pass currently, due to the kings problem.<br>
<br>God Bless, <br>Ben<br></div></div><br>