[sword-devel] sword-devel Digest, Vol 32, Issue 12

Greg Hellings greg.hellings at gmail.com
Tue Nov 7 19:41:53 MST 2006


XSLT is not terribly slow for processing whole Bibles.  Even with
upwards of 10 MB of Bible text, with formatting and everything else
thrown together into a mess that I was trying to sort out nicely into
OSIS or ThML, XSLT was decently fast and generic.  It has been quite
useful in my work in this area.
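
For a sense of what the "simple path statement" DM mentions below can
look like, here is a minimal sketch using Python's standard library.
The sample document, the find_verse helper, and the container-style
<verse> markup are assumptions made for illustration; real OSIS
modules often mark verses with milestone elements instead, which this
lookup would not handle.

```python
import xml.etree.ElementTree as ET

# Namespace used by OSIS 2.x documents.
OSIS_NS = {"osis": "http://www.bibletechnologies.net/2003/OSIS/namespace"}

# Tiny hand-written sample (an assumption, not a real module).
SAMPLE = """<osis xmlns="http://www.bibletechnologies.net/2003/OSIS/namespace">
  <osisText osisIDWork="Sample" xml:lang="en">
    <div type="book" osisID="Gen">
      <chapter osisID="Gen.1">
        <verse osisID="Gen.1.1">In the beginning, God created the heavens and the earth.</verse>
        <verse osisID="Gen.1.2">And the earth was shapeless and void...</verse>
      </chapter>
    </div>
  </osisText>
</osis>"""


def find_verse(root, osis_id):
    """Return the text of the verse with the given osisID, or None."""
    # ElementTree supports a limited XPath subset, including
    # descendant search and attribute-equality predicates.
    node = root.find(f".//osis:verse[@osisID='{osis_id}']", OSIS_NS)
    if node is None:
        return None
    return "".join(node.itertext()).strip()


root = ET.fromstring(SAMPLE)
verse_text = find_verse(root, "Gen.1.2")
verse_ids = [v.get("osisID") for v in root.iterfind(".//osis:verse", OSIS_NS)]
print(verse_text)
print(verse_ids)
```

ElementTree only covers a subset of XPath, so anything beyond simple
paths and attribute predicates would need a fuller XPath or XSLT
engine.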

--Greg

On 11/7/06, DM Smith <dmsmith555 at yahoo.com> wrote:
> Bill,
>
> XPath is an integral part of XSLT. I have used XSLT to process entire
> Bibles, such as the KJV. Depending on what you want to do, it can be very
> fast or very slow. If you are using simple path statements, it is likely to
> be very fast.
>
> In Him,
>  DM
>
>
>
> On Nov 7, 2006, at 4:26 PM, Bill Burton wrote:
>
> Hello,
>
> It looks like Xindice doesn't work well with documents approaching or over
> 5 MB.  I would imagine uncompressed OSIS Bible text would be at least that
> large.  Also, I wonder how well XPath would work with the OSIS
> format.
>
> -Bill
>
> On 11/7/06, Martin Gruner <mg.pub at gmx.net> wrote:
> > This sounds very interesting. Is there something similar for C++?
> >
> > mg
> >
> > On Tuesday, 7 November 2006 20:24, Yiguang Hu wrote:
> > > If you are considering a repository other than a SWORD
> > > module, how about Xindice? It stores XML directly and
> > > you can access the data using XPath. Granted, Xindice is
> > > young too.
> > >
> > > http://xml.apache.org/xindice/
> > >
> > > --- sword-devel-request at crosswire.org wrote:
> > > > Message: 1
> > > > Date: Tue, 7 Nov 2006 12:27:57 -0600 (CST)
> > > > From: "Gabriel M. Beddingfield" <gabriel at teuton.org>
> > > > Subject: Re: [sword-devel] modules to relational database
> > > > To: "SWORD Developers' Collaboration Forum" <sword-devel at crosswire.org>
> > > > Message-ID: <1539.65.197.90.190.1162924077.squirrel at www.teuton.org>
> > > > Content-Type: text/plain;charset=iso-8859-1
> > > >
> > > > > You can split up any XML document at its nodes like <osisID>, <div> etc.
> > > > > into rows of a database that also holds the rendering context for each
> > > > > of these nodes, and also information about the tree structure of the xml
> > > > > document (parent-children etc.). How deep you want to split it beyond
> > > > > osisID depends on what you want to do.
> > > >
> > > > From what I've seen, a lot (maybe most) of the "serious" Bible texts have
> > > > structured their documents like you would a book, and then littered it
> > > > with "milestones" to mark where chapters and verses begin.  E.g.
> > > >
> > > > <!-- my apologies for butchering ThML -->
> > > > <p>
> > > >   <ScripRef verse='Gen 1:1' />In the beginning, God created
> > > >   the heavens and the earth.  <ScripRef verse='Gen 1:2' />
> > > >   And the earth was shapeless and void...
> > > > </p>
> > > >
> > > > On the one hand, you can store the XML document in a relational database
> > > > as an XML document... preserving each tag, location, attributes, etc.  On
> > > > the other hand, I would expect someone (like me or the OP) to try and
> > > > divide things up by Book/Chapter/Verse:
> > > >
> > > > +------+----+-------+---------------------------------------+
> > > > | Book | Ch | Verse | Text                                  |
> > > > +------+----+-------+---------------------------------------+
> > > > | Gen  | 1  |   1   | In the beginning, God created the     |
> > > > |      |    |       | heavens and the earth.                |
> > > > +------+----+-------+---------------------------------------+
> > > > | Gen  | 1  |   2   | And the earth was shapeless and       |
> > > > |      |    |       | void...                               |
> > > > +------+----+-------+---------------------------------------+
> > >
> > > > To be clear on what I'm getting at... IMHO, I just don't think there's a
> > > > large advantage to converting a module to an RDBMS.  If you preserve the
> > > > original document (OSIS, ThML, etc.)... why not just leave it in OSIS,
> > > > ThML, etc.?  If you redo the schema for the RDBMS, I think you'll end up
> > > > with a lot of headaches that the XML/SGML schemas solve well.  I can be
> > > > convinced otherwise (after all, I really do love RDBMSs)... but this is
> > > > how I see it.
> > > >
> > > >
> > > > --
> > > >                G a b r i e l   M   B e d d i n g f i e l d
> >
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
>
>
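
To make the Book/Chapter/Verse split in Gabriel's quoted message
concrete, here is a rough sketch in Python of turning his milestone
sample into rows and loading them into SQLite.  The function name
milestones_to_rows, the table name verses, and the column names are
made up for illustration.  It assumes the text of each verse is plain
text directly following its milestone, with no nested inline markup,
which is exactly the kind of simplification that loses what the XML
schemas preserve.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Gabriel's (admittedly non-standard) ThML-like sample, tidied up.
SAMPLE = """<p>
  <ScripRef verse='Gen 1:1' />In the beginning, God created
  the heavens and the earth.  <ScripRef verse='Gen 1:2' />
  And the earth was shapeless and void...
</p>"""


def milestones_to_rows(xml_text):
    """Split milestone-marked text into (book, chapter, verse, text) rows."""
    root = ET.fromstring(xml_text)
    rows = []
    for marker in root.iter("ScripRef"):
        # The reference looks like "Gen 1:1": book, then chapter:verse.
        book, ref = marker.get("verse").split()
        chapter, verse = ref.split(":")
        # The text after an empty element is stored as its "tail".
        # Collapse line breaks and runs of spaces into single spaces.
        text = " ".join((marker.tail or "").split())
        rows.append((book, int(chapter), int(verse), text))
    return rows


rows = milestones_to_rows(SAMPLE)

# Load the rows into an in-memory database and look one verse up.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE verses (book TEXT, chapter INTEGER, verse INTEGER, text TEXT)"
)
conn.executemany("INSERT INTO verses VALUES (?, ?, ?, ?)", rows)
found = conn.execute(
    "SELECT text FROM verses WHERE book = ? AND chapter = ? AND verse = ?",
    ("Gen", 1, 2),
).fetchone()[0]
print(rows)
print(found)
```

Paragraph boundaries, and any markup that spans verses, are dropped by
this approach, which is the trade-off Gabriel describes.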


