[sword-devel] can't do lucene Hebrew searches in KJV

Jonathan Morgan jonmmorgan at gmail.com
Thu Jan 20 06:36:05 MST 2011


Hi Nic,

I think it's simply how the module is encoded.  OT has H0xxxx, NT has Gxxxx,
and presumably the NASB is different.  Picking some text at random from the
OT:

<w lemma="strong:H03123" wn="001">But the dove</w> <w lemma="strong:H04672"
morph="strongMorph:TH8804" wn="002">found</w> <w lemma="strong:H04494"
wn="003">no rest</w> <w lemma="strong:H03709" wn="004">for the sole</w> <w
lemma="strong:H07272" wn="005">of her foot</w>

and the NT:

<w lemma="strong:G3588" morph="robinson:T-ASF" src="10"></w><w
lemma="strong:G2532" morph="robinson:CONJ" src="1">And</w> <w
lemma="strong:G1684" morph="robinson:V-2AAP-NSM" src="2">he entered</w> <w
lemma="strong:G1519" morph="robinson:PREP" src="3">into</w> <w
lemma="strong:G3588 strong:G4143" morph="robinson:T-ASN robinson:N-ASN"
src="4 5">a ship</w>, <w lemma="strong:G1276" morph="robinson:V-AAI-3S"
src="6">and passed over</w>, <w lemma="strong:G2532" morph="robinson:CONJ"
src="7">and</w> <w lemma="strong:G2064" morph="robinson:V-2AAI-3S"
src="8">came</w> <w lemma="strong:G1519" morph="robinson:PREP"
src="9">into</w> <w lemma="strong:G2398" morph="robinson:A-ASF" src="11">his
own</w> <w lemma="strong:G4172" morph="robinson:N-ASF"
src="12">city</w>.<milestone resp="ps 2003-12-01-08:58"
type="x-strongsMarkup"/>

BPBible actually returns search results for both "strongs:H5062" and
"strongs:H05062".  Presumably there is some magic inside the search to
normalise the search terms, but I can't find it at the moment.

Of course, it would be preferable if users didn't have to type arcane
strings like "lemma:H05062" in in the first place.  I know I got a comment
about BPBible 0.5 beta 1 that it would be good to be able to search for
multiple Greek/Hebrew words used close to each other without having to find
each number and enter it in.

Jon

On Fri, Jan 21, 2011 at 12:10 AM, Nic Carter <niccarter at mac.com> wrote:

> Try "lemma:H05062" in the KJV :)
>
> There seem to be inconsistencies between modules and how strong's numbers
> aree done? I'm about to redo search in PS and would love to understand why
> the KJV needs a '0' at the start of Hebrew numbers but other modules
> don't...?
>
> On 21/01/2011, at 0:05, Karl Kleinpaste <karl at kleinpaste.org> wrote:
>
> > Someone came into #xiphos this morning to ask a few questions, one of
> > which was how to search for a Hebrew Strong's reference.  His example is
> > H5062, "smite."
> >
> > We can't get any search results from KJV for Hebrew at all.
> >
> > I'm now experimenting with sword/examples/cmdline/search, having changed
> > char SEARCH_TYPE=-4;
> > to get lucene searches, and it's consistently wrong: No lemma:Hxxxx
> > search ever returns anything, but lemma:Gxxxx always works just fine.
> >
> > Identical searches work just fine in e.g. beta NASB.
> >
> > Hm, continuing to experiment before sending this...  lemma:Hxxxx
> > searches work in RST, but not GerLut or FreLSG.
> >
> > Now Greg Hellings (also in #xiphos) says even BibleTime can't
> > successfully do such searches, and BT has its own re-implemented lucene
> > search.
> >
> > Can others experiment a bit and tell me if I've been smoking
> > particularly bad crack?
> >
> > _______________________________________________
> > sword-devel mailing list: sword-devel at crosswire.org
> > http://www.crosswire.org/mailman/listinfo/sword-devel
> > Instructions to unsubscribe/change your settings at above page
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.crosswire.org/pipermail/sword-devel/attachments/20110121/97e01ddc/attachment-0001.html>


More information about the sword-devel mailing list