[sword-devel] Windows beta 1.5.8pre1
DM Smith
dmsmith555 at yahoo.com
Wed Feb 9 05:55:40 MST 2005
JSword is currently using Lucene to index the modules. We are planning
to pre-compute the indexes and make them available for download as
well. In the jsword-devel archives you will find threads on issues
related to making the Lucene index available for download.
From what I remember, here are a few:
1) Indexes will need to be versioned to track
a) module changes -- If a module changes, then the index will
most likely be off. I say most likely because there are scenarios under
which it will still be correct. I think the risk of having a bad index
because it was not rebuilt is so high that the practice will be to
rebuild when the module changes.
b) application changes -- Lucene gives a lot of flexibility as
to how an index is built and what is searchable, retrievable or stored
in it. Any change in that strategy probably will require a new index.
c) Lucene library changes -- An index may be upward
compatible, but not backward compatible. Because of this risk, I think
it will be necessary to build the indexes with each version of the
library that is currently being used. This will probably require a
policy of building for the previous version, the current version and
the next version, as used by Sword. (Or something like that)
2) The index built for Sword is not usable by JSword and vice versa. At
this point we are not sure why this is. It could be that the C and Java
versions of Lucene don't build the same index for the same strategy, or
it may be that Sword and JSword use different strategies in building the
index. In the earlier JSword threads, Sword was not considering having
downloadable indexes. Since it is now being discussed, perhaps we should
collaborate on looking at and perhaps solving this issue.
3) The building of indexes is compute intensive. Building them on the
CrossWire server will need to be done in a manner that does not disrupt
the services that CrossWire provides.
4) We cannot presume our users have significant computing power. At
the least, we have users with old laptops who want to use JSword. They
may be limited by CPU MHz, RAM and disk.
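
The three versioning concerns in item 1 could be captured in a small
stamp stored next to each downloadable index. What follows is only a
sketch: the class name IndexStamp, its fields, the file naming scheme
and the version strings are hypothetical and are not taken from JSword
or Sword. It treats any mismatch as a reason to rebuild, which is
stricter than necessary, since some indexes may remain usable after a
change.

```java
import java.util.Objects;

/** Hypothetical stamp recording what a pre-computed index was built from. */
final class IndexStamp {
    final String moduleVersion;  // version of the module text that was indexed
    final int strategyVersion;   // bumped by the application when the indexing strategy changes
    final String luceneVersion;  // version of the Lucene library that wrote the index

    IndexStamp(String moduleVersion, int strategyVersion, String luceneVersion) {
        this.moduleVersion = Objects.requireNonNull(moduleVersion);
        this.strategyVersion = strategyVersion;
        this.luceneVersion = Objects.requireNonNull(luceneVersion);
    }

    /** True only when module, strategy and library versions all agree. */
    boolean matches(IndexStamp expected) {
        return moduleVersion.equals(expected.moduleVersion)
                && strategyVersion == expected.strategyVersion
                && luceneVersion.equals(expected.luceneVersion);
    }

    /** Builds a download name that encodes all three versions (assumed scheme). */
    String fileName(String moduleName) {
        return moduleName + "-" + moduleVersion
                + "-s" + strategyVersion
                + "-lucene" + luceneVersion + ".zip";
    }
}
```

A client would compare the stamp of a downloaded index against the
stamp it expects for its installed module and library, and fall back to
building locally when matches() returns false.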
Chris Little wrote:
> Lynn Allan wrote:
>
... stuff deleted
>
>> I tried the "Create Index" for searching the ACV. It is still grinding
>> and taking a loooooong time ... almost seems locked up, but I'll wait
>> a while longer. Every minute or so, another block appears in the
>> "Progress Indicator" so this could take a while. Seems a little more
>> than half done after 13 minutes on an older 933 mhz PIII ... done @ 25
>> minutes. Is this expected behavior?
>
>
> Yeah, for me, on a 1.5GHz P4-M, it took 20min to index the NASB. The
> ACV just took me 18min. If we can't get the time down to a reasonable
> level, we may post indexes for download (the NASB index is about 6.7mb
> or 2.5mb zipped).
>