[osis-core] Proposed Regex Change (ALL)

Patrick Durusau osis-core@bibletechnologieswg.org
Tue, 10 Feb 2004 16:56:00 -0500


Greetings!

Steve and I are laboring away on various OSIS issues and are thinking 
about the following change to the regexes (ALL):

1. All prefixes remain as valid XML names, i.e., no spaces, colons or 
hyphens.

2. Escape character defined as "\" + one character, the one character 
can be anything other than XML 1.0 Whitespace. Note that the backslash 
and character must occur as a pair. Backslash is otherwise prohibited in 
the regex.

This allows for osisIDs, osisRefs and things governed by osisGen to 
operate as valid tokens, even if they contain otherwise prohibited 
characters.

The addition to the regex will be an or group that looks like:

(\\([^\#x20\t\n\r]))

in other words, a literal backslash followed by any non-XML 1.0 
whitespace character.

Scheduled for inclusion in 2.1.

Comments?

Hope you are having a great day!

Patrick

-- 
Patrick Durusau
Director of Research and Development
Society of Biblical Literature
Patrick.Durusau@sbl-site.org
Chair, V1 - Text Processing: Office and Publishing Systems Interface
Co-Editor, ISO 13250, Topic Maps -- Reference Model

Topic Maps: Human, not artificial, intelligence at work!