[osis-core] Regex News!
Patrick Durusau
osis-core@bibletechnologieswg.org
Wed, 11 Feb 2004 15:29:06 -0500
Chris,
Should not be a problem since non-breaking space is by definition not
XML 1.0 whitespace.
From the spec:
http://www.w3.org/TR/2004/REC-xml-20040204/#NT-S
S ::= (#x20 | #x9 | #xD | #xA)+
Hope you are having a great day!
Patrick
Chris Little wrote:
> Patrick,
>
> Sounds okay, but I'll go ahead and play the devil's advocate (read: look
> at things from Troy's position)...
>
> I think one of Troy's desires was to be able to encode _anything_ as a
> valid osisGenRef, and I presume this change is partly/mostly intended to
> placate him. Since spaces are part of some morphology codes, how should
> he encode those? Non-breaking space?
>
> If so, Troy, does that work for you?
>
> --Chris
>
> Patrick Durusau wrote:
>
>> Greetings!
>>
>> The new addition to the regexes reads as follows:
>>
>> |(\\[^\s])
>>
>> This means that any single character (excluding space, including all
>> Unicode (which means PUA) can be used in any of the OSIS regex
>> expressions.
>>
>> Must be preceded by a "\" if the character is one of the ones we have
>> reserved for use in ID or REF syntax.
>>
>> Those characters are: ".", ":", "!", "[", "]", "@", "-" and "\".
>>
>> This is only for the portion following the prefix, which is terminated
>> by a ":".
>>
>> All applications are required to recognize the "\" as an escape
>> character applying to the single character that follows it.
>>
>> Hope everyone is having a great day!
>>
>> Patrick
>>
>
> _______________________________________________
> osis-core mailing list
> osis-core@bibletechnologieswg.org
> http://www.bibletechnologieswg.org/mailman/listinfo/osis-core
>
--
Patrick Durusau
Director of Research and Development
Society of Biblical Literature
Patrick.Durusau@sbl-site.org
Chair, V1 - Text Processing: Office and Publishing Systems Interface
Co-Editor, ISO 13250, Topic Maps -- Reference Model
Topic Maps: Human, not artificial, intelligence at work!