[sword-devel] lucene indexing failing on some modules
DM Smith
dmsmith555 at yahoo.com
Fri May 9 13:57:08 MST 2008
Karl Kleinpaste wrote:
> I have found that indexing is not working on some modules, including
> some of my home-grown ones, but also a few Crosswire-distributed ones.
> Case in point, BosworthToller. This was first noticed from integrated
> support in GS, but use of mkfastmod on its own shows similar problems.
> In 15min wall clock time, mkfastmod grew to 126M and used 13:20 CPU
> without completing, or generating anything in the lucene subdir in the
> module's tree.
The code is in .../modules/swmodule.cpp.
The way indexing works is that:
It creates the location that the index should write to.
Then it uses a RAMDirectory to write the index to.
Then there is an ugly loop that iterates over the entries in a module,
getting their keys and contents, inverting them into the index.
Perhaps of interest is that the first MAX_CONV_SIZE (i.e. 2047) of the
content is used.
Once it is done, it opens an index on disk and writes the RAMDirectory
to it.
It would be a simple matter to add a diagnostic to the loop to have it
spit out the key.
Just a thought:
I would think that if one could run mod2imp successfully on the module
that the indexer would work.
In Him,
DM
More information about the sword-devel
mailing list