<html><head><meta http-equiv="content-type" content="text/html; charset=utf-8"></head><body style="overflow-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;">David, I read your Grok 3 analysis.<div><br></div><div><div>What is the impact of not having this change? What is the impact of making the change? Is it merely presentation of is there an issue with searching too?</div><div><br></div><div>I’ve also been reading <a href="https://corp.unicode.org/pipermail/unicode/2019-January/007563.html">https://corp.unicode.org/pipermail/unicode/2019-January/007563.html</a> which was referenced in a prior recent thread on U+2019 in Ancient Greek. This is long and worth reading to understand how it might impact SWORD. The thread is initiated by James Tauber.<div><br></div><div>TL;DR:</div><div>U+2019 (and in older texts U+0027) in Ancient Greek was never used for quotations and is only used for elision. It is considered the recommended character for elisions.</div><div>The Unicode rules (when the thread was written in January 2019) of TR29 have that U+2019 is a word break when at the front or end of a word, but not within a word. It is not simply punctuation. These rules are not language aware.</div><div>There is no zero width character in Unicode to join words.</div><div>It is impossible for TR29 to distinguish between U+2019 used as a quotation mark and as an elision.</div><div>There is no other character that is an appropriate replacement for U+2019.</div><div><br></div><div>I haven’t yet looked at Unicode TR30 regarding folding rules as it pertains to this.<br><div><br></div><div>In Him,</div><div><span class="Apple-tab-span" style="white-space:pre"> </span>DM</div><div><br></div><div><div><br><blockquote type="cite"><div>On Mar 17, 2025, at 8:46 AM, David Haslam <dfhdfh@protonmail.com> wrote:</div><br class="Apple-interchange-newline"><div><div style="font-family: Arial, sans-serif; font-size: 14px;">Dear SWORD developers,</div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div><div style="font-family: Arial, sans-serif; font-size: 14px;">I asked about this topic several years ago, and I'm no longer convinced by what we were told back then.<br>
<br>
After doing further research, it's my understanding that <b>U+2019 RIGHT SINGLE QUOTATION MARK</b> ought <b><u>not</u></b> to be hidden by this SWORD filter.<br>
<br></div><div style="font-family: Arial, sans-serif; font-size: 14px;"><ol data-editing-info="{"orderedStyleType":1,"unorderedStyleType":1}" style="margin-top: 0px; margin-bottom: 0px;" data-listchain="__List_Chain_93"><li style="list-style-type: "1. ";"><span>
This codepoint is <u>not</u> a diacritic that modifies the previous Greek letter. In other words, it's <b><u>not</u></b> a Greek accent.<br></span></li><li style="list-style-type: "2. ";"><span>This codepoint has the Unicode properties of a <b>punctuation mark</b>.</span></li><li style="list-style-type: "3. ";"><span>In Ancient Greek text, it's used to mark an <b>elision</b>, where the final vowel of a word is omitted when the next word begins with a vowel.</span></li></ol></div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div><div style="font-family: Arial, sans-serif; font-size: 14px;">
To view my research, conducted with the help of <b>Grok 3</b>, please visit the following link.</div><div style="font-family: Arial, sans-serif; font-size: 14px;"><ul data-editing-info="{"orderedStyleType":1,"unorderedStyleType":1}" style="margin-top: 0px; margin-bottom: 0px;"><li style="list-style-type: disc;"><span><a target="_blank" rel="noreferrer nofollow noopener" href="https://grok.com/share/bGVnYWN5_43ff1922-3876-4d9a-9e42-6ae940007fd0">https://grok.com/share/bGVnYWN5_43ff1922-3876-4d9a-9e42-6ae940007fd0</a></span><br></li></ul></div><div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div><div style="font-family: Arial, sans-serif; font-size: 14px;">
I therefore recommend that SWORD developers revisit the specification for this filter, and update it so that <b>U+2019</b> is <u>never</u> hidden.<br>
<br>
<div class="protonmail_signature_block" style="font-family: Arial, sans-serif; font-size: 14px;">
<div class="protonmail_signature_block-user">
Best regards,<br><br>David
</div>
<div style="font-family: Arial, sans-serif; font-size: 14px;"><br></div>
<div class="protonmail_signature_block-proton">
Sent with <a target="_blank" href="https://pr.tn/ref/SWXT9A5YZ67G">Proton Mail</a> secure email.
</div>
</div>
</div>_______________________________________________<br>sword-devel mailing list: sword-devel@crosswire.org<br>http://crosswire.org/mailman/listinfo/sword-devel<br>Instructions to unsubscribe/change your settings at above page<br></div></blockquote></div><br></div></div></div></div></body></html>