<p dir="ltr">It should be possible to escape any such characters with an XML entity, no?</p>
<div class="gmail_quote">On Dec 12, 2014 7:44 AM, "DM Smith" <<a href="mailto:dmsmith@crosswire.org">dmsmith@crosswire.org</a>> wrote:<br type="attribution"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><br>
> On Dec 12, 2014, at 8:26 AM, Peter Von Kaehne <<a href="mailto:refdoc@gmx.net">refdoc@gmx.net</a>> wrote:<br>
><br>
> Gesendet: Freitag, 12. Dezember 2014 um 13:16 Uhr<br>
> Von: "Troy A. Griffitts" <<a href="mailto:scribe@crosswire.org">scribe@crosswire.org</a>><br>
><br>
>> Not sure, but I thought we used optional prefixes to specify the kind of gloss if there are multiple, e.g., > gloss="en_US:18&nbsp;wheeler en_UK:articulated&nbsp;lorry"<br>
><br>
> Should there be an option to escape colons?<br>
<br>
IMHO:<br>
Yes.<br>
<br>
The definition of gloss in the schema is xs:string, not osisGenRegex.<br>
The former places no semantic on the content an allows for an empty string.<br>
<br>
If gloss should have a semantic, then it should be changed in the OSIS spec.<br>
<br>
The latter is used by lemma and morph and is specified as:<br>
((((\p{L}|\p{N}|_)+)(\.(\p{L}|\p{N}|_))*:)?([^:\s])+)<br>
which basically is work:value.<br>
If I read this right it does not allow for : to be escaped. I know we allow lemma=“x:a y:b” but I don’t see that this allows for the pattern to be repeated, separated by spaces.<br>
<br>
The pattern would need to change ([^:\s])+ to (\\:|[^:\s])+ [ not tested ]<br>
<br>
In His Service,<br>
DM<br>
_______________________________________________<br>
sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org">sword-devel@crosswire.org</a><br>
<a href="http://www.crosswire.org/mailman/listinfo/sword-devel" target="_blank">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br>
Instructions to unsubscribe/change your settings at above page</blockquote></div>