[sword-devel] Chinese Strongs markup
Daniel Owens
dhowens at pmbx.net
Wed Apr 25 07:34:51 MST 2012
David,
This issue crops up in the WHM module because prefixed and suffixed
morphemes are parsed separately. Here is an example: <w
lemma="strong:lemma1 strong:lemma2" morph="whmmorph:morph1
whmmorph:morph2" src="1 2">וַיְהִ֣י</w>. The waw conjunction is parsed
separately from the verb. I do not think that @src is essential, but it
might be useful information some day.
Daniel
On 04/25/2012 08:49 AM, Troy A. Griffitts wrote:
> David,
>
> I'm sorry no (including me) has responded to you on this yet.
>
> We used to use '|' to separate lists in osis in the first rev or so
> (or at least we (CrossWire) used them, until it was pointed out that
> lists in XML (at least in TEI) were delineated with spaces, so the
> lemma and morph attributes on <w> elements are space separated lists.
> fine, but this doesn't solve your problem. With '|', I would have
> encoded your example like this:
>
> > eg: Gen.3.8: <w lemma="strong:H8085 strong:H853 strong:H3068"
> > morph="StrongsMorph:H8799">[hear]</w>
>
> <w lemma="strong:H8085|strong:H853|strong:H3068"
> morph="StrongsMorph:H8799||">[hear]</w>
>
> I suppose you could still do that with 2 spaces instead of of the '|',
> but this isn't that it's not simply extraneous spaces in your
> attribute value.
>
> Anyway, the morph list should mirror the lemma list. How you do that?
> Well, maybe:
>
> <w lemma="strong:H8085 strong:H853 strong:H3068"
> morph="StrongsMorph:H8799 StrongsMorph:H00 StrongsMorph:H00">[hear]</w>
>
> (your email to jsword reminding me of this old question from you and
> sparked a possible solution)
>
> Maybe someone else has a better suggestion,
>
> Troy
>
>
> On 04/12/2012 10:16 AM, David Instone-Brewer wrote:
>> I'm starting work on correcting some errors in the Chinese Strongs
>> markup,
>> but I want to step carefully cos I'm new to this.
>> I couldn't find a Wiki page outlining what the rules are for Strongs
>> markup.
>> There's a note at
>> http://crosswire.org/wiki/Modules_in_the_beta_repository/ChiUns
>> pointing out one problem, but there are many, though it is basically a
>> very good markup.
>> It appears to be based on the free KJV markup, though it also notes
>> every occurrence of the Hebrew article /eth
>> /This isn't very useful, but it shows that they were keen to include
>> everything and that they did a lot of work on it.
>>
>> In order to make sure they include every Hebrew word, they often
>> combined words in one entry,
>> eg: Gen.1.2: <w lemma="strong:H5921 strong:H6440">[on the surface
>> of]</w>
>>
>> Sometimes this becomes complicated when one or more of the words is a
>> verb,
>> eg: Gen.2.2: <w lemma="strong:H4480 strong:H3605 strong:H834
>> strong:H6213" morph="StrongsMorph:H8804">[rested him from all the]</w>
>> - the numbers represent the words for "from" "all" "/eth/" and "working"
>> - in this case the morphology goes with the last lemma (H6213) but this
>> isn't always the case.
>> eg: Gen.3.8: <w lemma="strong:H8085 strong:H853 strong:H3068"
>> morph="StrongsMorph:H8799">[hear]</w>
>> - the numbers represent: "hear" "/eth" /"LORD" and "Qal Imperfect"
>> -in this case the morphology relates to the first of the three lemmas
>> listed.
>>
>> I can sort this out, but I'd like to know how to record it
>> - ie how can I indicate in correct coding that a morphology links with a
>> particular lemma when more than one lemma is translated by a single
>> word?
>>
>> I notice that the same thing happens in the KJV strongs (though less
>> often),
>> eg Gen.6.3 <w lemma="strong:H07683 strong:H01571"
>> morph="strongMorph:TH8800 strongMorph:TH8677">for that he also</w>
>>
>> Is it OK to mark such occurrences as follows?
>> Gen.6.3 <w lemma="strong:H07683" morph="strongMorph:TH8800"
>> lemma="strong:H01571" morph="strongMorph:TH8677">for that he also</w>
>> This isn't good XML, but perhaps it is allowable?
>> Or is there some other way to indicate which lemma the morphology
>> belongs to?
>>
>> David IB
>> /// Dr David Instone-Brewer
>> dib Senior Research Fellow in Rabbinics and the New Testament
>> ^ Tyndale House, 36 Selwyn Gardens, Cambridge, CB3 9BA, UK
>> \=/ Rabbinics at Tyndale.cam.ac.uk www.TyndaleHouse.com
>> <http://www.tyndalehouse.com/>
>>
>>
>>
>> _______________________________________________
>> sword-devel mailing list: sword-devel at crosswire.org
>> http://www.crosswire.org/mailman/listinfo/sword-devel
>> Instructions to unsubscribe/change your settings at above page
>
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
>
More information about the sword-devel
mailing list