[osis-core] OSIS work regex
Patrick Durusau
osis-core@bibletechnologieswg.org
Wed, 14 Aug 2002 11:16:44 -0400
Steve,
Yes, not a hyphen since it is the range delimiter.
Patrick
Steve DeRose wrote:
> At 09:30 AM -0400 08/14/02, Patrick Durusau wrote:
>
>> Harry,
>>
>> Harry Plantinga wrote:
>>
>>> If schema RegExps behave as they do in Perl, the ? is superfluous.
>>> Perhaps
>>>
>>> [\L\N][\.\L\N]*
>>
>> Unfortunately, XML Schemas are deliberately inconsistent with
>> customary regex expressions. Well, the "?" operator is familiar from
>> SGML DTD syntax but the use of "^" as negation, for example, is
>> contrary to its usual role as an anchor (the reasoning being that in
>> XML Schemas, matches are always anchored at the start of the line. no
>> sure why that justifies being inconsistent but there you have it)
>>
>>>
>>> The underscore character (_) is pretty commonly used in names and
>>> may be
>>> present in documents converted to OSIS. I can't see that it would do
>>> any
>>> harm. Could it be included? Perhaps
>>> [\L\N_][\.\L\N_]*
>>>
>> Actually I think:
>>
>> (\p{L}\p{N}_)((\.(\p{L}\p{N}_)*)
>>
>> (untested)
>>
>> Steve: comments on adding the underscore?
>>
>
> I'm fine with adding underscore; presumably not hyphen, though, since
> it's our range delimiter, yes?
>
>
>
>
--
Patrick Durusau
Director of Research and Development
Society of Biblical Literature
pdurusau@emory.edu