[osis-core] 1.9.7 attached!
Patrick Durusau
osis-core@bibletechnologieswg.org
Sat, 18 Oct 2003 04:56:52 -0400
Todd,
The regex is probably not correct. Written after I was very tired. Up
too early now but after some more sleep I will do it again and this time
follow my usual course of checking it in my regex checking XML file.
Sorry 'bout that!
Hope you are having a great day!
Patrick
Todd Tillinghast wrote:
> Patrick,
>
> Looking at the schema and trying out an example, I think the new regex
> for osisGenType is not quite right.
>
> I was expecting the following to be valid: <w lemma="x:34 x:554">xyz</w>
> did I miss something?
>
> Todd
>
>
>>Greetings!
>>
>>osisCore.1.9.7 is attached!
>>
>>xml:lang:
>>
>>Now required on osisTextCT.
>>
>>XML Spec says the following:
>>
>>
>>>[IETF RFC 1766] tags are constructed from two-letter language codes
>>
> as
>
>>defined by [ISO 639], from two-letter country codes as defined by [ISO
>>3166], or from language identifiers registered with the Internet
>
> Assigned
>
>>Numbers Authority [IANA-LANGCODES]. It is expected that the successor
>
> to
>
>>[IETF RFC 1766] will introduce three-letter language codes for
>
> languages
>
>>not presently covered by [ISO 639].
>>
>>This means you get three bites:
>>
>>1. ISO 639 - http://sunsite.berkeley.edu/amher/iso_639.html
>>
>>2. RFC 1766 - http://www.ietf.org/rfc/rfc1766.txt
>>
>>3. User-defined language code (must begin with "x-"
>>
>>Note that attempting to add it to language (as an attribute) produces
>
> a
>
>>fist full of errors.
>>
>>Since xml:lang should be sufficient, deleted all the osisLanguage
>
> stuff
>
>>that we added for ISO 639 and the like. If need more, use IETF or
>
> x-/X-.
>
>>(does that work for everybody? I somehow screwed up the syntax and had
>>to redo the edits with periodic validation. Really tired so we can
>>revisit tomorrow if need be.)
>>
>>osisGenType:
>>
>>Used on subject, osisIdentifier, POS, lemma and morph.
>>
>>Structure: prefix refers to work, followed by any combination of
>>letters, numbers, plus _ (underscore), - (hyphen), . (period), or :
>>(colon). (NameChar)
>>
>>Regex: <xs:pattern
>>value="(((\p{L}|\p{N}|_)+)((\.(\p{L}|\p{N}|_)+)*)+:)(\c)*"/>
>>
>>Other mods:
>>
>>Added base attribute to osisLanguageUsage.
>>
>>
>>Added default value to cell role to be data (can be label or data)
>>
>>
>>Added encoder as note type (needs note in users manual to suppress
>
> when
>
>>producing a printed or electronic version for viewing)
>>
>>Well, virus scan about to run so shutting down for the night!
>>
>>Thanks guys!
>>
>>Will poll the congregation again tomorrow!
>>
>>Hope everyone is having a great evening!
>>
>>Patrick
>>
>>
>>
>>
>>--
>>Patrick Durusau
>>Director of Research and Development
>>Society of Biblical Literature
>>Patrick.Durusau@sbl-site.org
>>Chair, V1 - Text Processing: Office and Publishing Systems Interface
>>Co-Editor, ISO 13250, Topic Maps -- Reference Model
>>
>>Topic Maps: Human, not artificial, intelligence at work!
>
>
> _______________________________________________
> osis-core mailing list
> osis-core@bibletechnologieswg.org
> http://www.bibletechnologieswg.org/mailman/listinfo/osis-core
>
--
Patrick Durusau
Director of Research and Development
Society of Biblical Literature
Patrick.Durusau@sbl-site.org
Chair, V1 - Text Processing: Office and Publishing Systems Interface
Co-Editor, ISO 13250, Topic Maps -- Reference Model
Topic Maps: Human, not artificial, intelligence at work!