[osis-core] Use of "+" in grain of a reference.
Steve DeRose
osis-core@bibletechnologieswg.org
Wed, 19 Jun 2002 18:19:58 -0400
At 01:49 PM -0400 06/14/02, Patrick Durusau wrote:
>Todd,
>
>The general structure of the grain for character references is:
>
>grain ::= 'char:' offset length? content
>offset ::= integer
>length ::= '+' integer
>content ::= '(' [^0]+ ')'
>integer ::= [0-9]+
>
>The '+' sign is just a separator character. (Steve, didn't XPointer
>use a comma?)
We used a comma since that's the general parameter syntax in the language.
The reason for the content and length is that the content piece need
not be the whole thing linked -- for example, you might want to link
to a very long or very short portion, but use a moderate size content
string for the checking function (which is mainly what the content is
there for -- checking and reattachment).
>
>Look at Steve's post on 9 May 2002, subject line Pointer syntax.
>
>I have discussed this with Steve and the intent is to be able to point to:
>
>1. a particular location, i.e., just the offset
We need to state explicitly what 'just the offset' means. I recommend
that it mean the point immediately preceding the nth character -- and
that this be defined as the same as a length of 0. No negative
offsets or lengths allowed, and the first text character within the
unit that the (pre-grain) reference identifies, is counted as
character number 1.
>
>2. a particular range of characters, offset+length
>
>3. a particular range of characters, offset+lenth and then to a
>particular word in that range (at least that is how I read it)
I was thinking of the (string) part as merely a check -- that's what
you should find at the stated offset, and if you don't, you can issue
a warning (and maybe offer to reattach to the nearest place that
string *does* occur).
>
>(Don't know that I want to complicate the regex any more but I
>suspect that the text string would be more useful if it were match
>on this ref, then find this string and not have string as part of
>offset and length. If I know that much detail, why do I need the
>string? (Steve, can you help out here?)
>
>Patrick
>
>
>Todd Tillinghast wrote:
>
>>And I did not use the "+" behavior? It seems to indicate a range or
>>characters rather than a point. When referring to a segment of a verse
>>it would seem the only appropriate value following the "+" would be the
>>number of characters in the segment not the number of characters in the
>>first word in the segment.
>>Have I missed the boat on the "+"? Is it to help parse the word in
>>parentheses? If this is the purpose
>>would it be more "third normal" to just count the characters of the word
>>in parentheses?
>>
>>Is there a reason to not remove the "+"?
>>
>>Todd
>>
>
>--
>Patrick Durusau
>Director of Research and Development
>Society of Biblical Literature
>pdurusau@emory.edu
--
Steve DeRose -- http://www.stg.brown.edu/~sjd
Chair, Bible Technologies Group -- http://www.bibletechnologies.net
Email: sderose@speakeasy.net
Backup email: sderose@mac.com, sjd@stg.brown.edu