[jsword-devel] Fwd: Search Index Downloading
Troy A. Griffitts
scribe at crosswire.org
Tue Oct 12 10:25:37 MST 2004
Hey guys,
I'd like to do some experiments to see if CLucene and Java Lucene
indices are binary compatible.
I also like the idea of a subdirectory under idx for keeping
different kinds of indices. I might suggest even one more level under L1,
if you are planning for version changes of your index structure.
e.g. C++ SWORD supports a pluggable index architecture, and we are
hoping to write some cool indexers for morphologically declined searches,
etc. We could keep pre-generated index sets under different
subdirectories under idx for each plugin.
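As a rough sketch only of the layout discussed here (idx, then an index
format such as L1, then one more level for the index structure version,
then the zipped index for a book), the download path could be assembled
as below. The class name IndexPaths, the method indexPath, the version
label "v1" and the book name "KJV" are all made up for illustration;
nothing in SWORD or JSword is assumed to look like this.

```java
import java.util.Objects;

/** Hypothetical helper; names and layout are illustrative assumptions only. */
final class IndexPaths {
    private IndexPaths() {}

    /**
     * Builds moduleRoot/idx/format/version/book.zip.
     * format is the index kind (e.g. "L1" or "C1"), version is the extra
     * level suggested for index structure changes (label is an assumption).
     */
    static String indexPath(String moduleRoot, String format, String version, String book) {
        Objects.requireNonNull(moduleRoot, "moduleRoot");
        Objects.requireNonNull(format, "format");
        Objects.requireNonNull(version, "version");
        Objects.requireNonNull(book, "book");
        // Drop a single trailing slash so the joined path has no "//".
        String root = moduleRoot.endsWith("/")
                ? moduleRoot.substring(0, moduleRoot.length() - 1)
                : moduleRoot;
        return String.join("/", root, "idx", format, version, book + ".zip");
    }

    public static void main(String[] args) {
        // "v1" and "KJV" are sample values, not taken from the thread.
        System.out.println(indexPath("/pub/sword/raw", "L1", "v1", "KJV"));
    }
}
```

A client would then only need to remember one root path per module site
and could derive the index location from the format and version it
understands.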
On the downside, we release updated modules on a regular basis--
some modules more 'regular' than others. Keeping the indices up to date
for each module should not be the module creator's responsibility. I
wouldn't expect our current maintainers to run a number of different
indexers every time they release a new module, unless the process was
nearly completely automated to handle ALL types of indexing.
Up until this consideration, we have always taken the approach
of generating anything needed for a plugin on demand on the end user's
system, which is always the least-maintenance option for us :)
-Troy.
On Mon, 11 Oct 2004, Joe Walker wrote:
> Getting Reply and ReplyAll confused again ...
>
> ---------- Forwarded message ----------
> From: Joe Walker <joseph.walker at gmail.com>
> Date: Mon, 11 Oct 2004 08:12:18 +0100
> Subject: Re: Search Index Downloading
> To: "Troy A. Griffitts" <scribe at crosswire.org>
>
> How about we use /pub/sword/raw/idx/L1/[book].zip then?
> If Java Lucene indexes and CLucene indexes are compatible then it
> won't be proprietary to JSword. If they are not compatible, or if you
> want to use different options in creating the index then you can use
> /pub/sword/raw/idx/C1/[book].zip or something.
>
> Joe.
>
>
>
> On Sun, 10 Oct 2004 22:12:00 -0700, Troy A. Griffitts
> <scribe at crosswire.org> wrote:
>> Hey Joe,
>> That's fine. Let me know if there is anything I need to do for you.
>> Don't we have a /pub/jsword directory for your stuff? I understand what
>> you mean by having the same base directory for modules (which would be
>> /pub/sword/raw for our server, so maybe /pub/sword/raw/idx), but this
>> isn't a sword module data structure. This is jsword's proprietary (in
>> the sense of not publicly declared by sword) data. It would be nice to
>> unify a common index format for sword modules.
>>
>> Does it really take Lucene 5+ minutes to generate? That's a bummer.
>> You would think it wouldn't take much longer than a single non-indexed
>> search through the Bible.
>>
>> To belatedly answer your question on sword-devel, I honestly have no
>> idea if CLucene indices are binary compatible with their Java Lucene
>> counterparts.
>>
>> -Troy.
>>
>>
>>
>>
>> Joe Walker wrote:
>>> Hi Troy,
>>>
>>> I'd like to allow users of Bible Desktop to download search indexes
>>> because they take about 5 minutes to generate. A search index is between
>>> 2 and 3 MB per book, so it ought not to take up too much space.
>>>
>>> Ideally we would use an FTP directory on crosswire something like:
>>> - /pub/sword/search/jsword/L1/[book].zip
>>>
>>> It starts with /pub/sword so that if the beta modules site (or other
>>> download sites) comes online we can just remember one root path per
>>> module site. The search/jsword bit would keep our stuff from getting
>>> in anyone else's way. L1 is simply a version number so we can update
>>> the index format without huge turmoil.
>>>
>>> Is that OK?
>>> Thanks,
>>>
>>> Joe.
>>
>>