[jsword-devel] Lucene Indexes

Mullins, Steven Steven.Mullins at dmme.virginia.gov
Mon May 19 05:41:55 MST 2008


Here is a simple explanation of what a stem is from SIL: http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsAStem.htm
And a lexical form:
http://www.sil.org/linguistics/GlossaryOfLinguisticTerms/WhatIsALexicalForm.htm

A small correction to the reply below is that the MorphGNT does not have the 
"stem" in the tag, but does have the lexical form.  They do differ.  The
in Koine Greek, the Lexical form for a verb is not the infinitive, it is
the first person active indicative.  Nouns are usually in the nominative.
So in MorphGNT we do not really have a stem, but a true word form.

The beauty of the MorphGNT module is that the analysis is already done!
for every inflected word, you have tagged the lexical form (to search by)
and the morpological tag (to narrow a search).  So for example if I wanted
to to search for all verses with "believe" in the first person active
indicative with "Jesus" as a direct object, I could, if only I had the 
lexical form and morph tags in lucene working.

Just my 2-cents.

Steve


-----Original Message-----
From: Daniel Owens [mailto:dhowens at pmbx.net]
Sent: Mon, May 19, 2008 12:26 AM
To: J-Sword Developers Mailing List
Subject: Re: [jsword-devel] Lucene Indexes


Word stems are the dictionary form of the word (also called lexeme), if I understand what DM is getting at. For example, in English, the word "run" is the like the word stem, but when you put it in past tense you get "ran." Greek and Hebrew actually change the beginning and ending of the word to represent this, so searching for a word can be hard unless you can search by word stem. Since the MorphGNT module has word stems tagged, you could do quite interesting searches using that feature. For example, you could search for every place the word "run" appears in any form (in Greek), whether it is past, present, future, or whatever. That way you can do a word study on the word "run".

Daniel

Tonny Kohar wrote: 
Hi,

On Sun, May 18, 2008 at 9:27 AM, DM Smith <dmsmith555 at yahoo.com> wrote:
  
I've been working on fixing a bug with the indexing and searching
Strong's numbers.
While I was at it I enable the work that Sijo did to add the ability
to search by word stems.

To take advantage of these changes, get the latest build sometime
tomorrow and drop and re-add the index for a Bible.
    

Interesting enhancement and bug fixing. If you do not mind could you
elaborate in a bite more detail regarding word stems? what is word
stems ?

Thanks
Tonny Kohar
  


-- 
PMBX license 1502 


More information about the jsword-devel mailing list