<html><head><meta http-equiv="content-type" content="text/html; charset=utf-8"></head><body dir="auto"><div>Does setting CLANG (or whatever it is) in the env help? In unix you have to tell the program what charset you are using. <br><br>Cent from my fone so theer mite be tipos. ;)</div><div><br>On Mar 6, 2017, at 7:52 PM, Karl Kleinpaste <<a href="mailto:karl@kleinpaste.org">karl@kleinpaste.org</a>> wrote:<br><br></div><blockquote type="cite"><div>
<meta content="text/html; charset=windows-1252" http-equiv="Content-Type">
<div class="moz-cite-prefix">On 03/06/2017 05:25 PM, Greg Hellings
wrote:<br>
</div>
<blockquote cite="mid:CAHxvOV+bn4WpfAmUPQmarZZz-y3tu7tqLyPMOLA2h4t+OdFo8w@mail.gmail.com" type="cite">being off by 2 would seem strange to me</blockquote>
<font face="FreeSerif">I don't understand this question at all.<br>
<br>
0xE2 = 226 = 0342<br>
0x80 = 128 = 0200<br>
0x93 = 147 = 0223<br>
<br>
There's no off-by error at all.<br>
<br>
"od" is the "octal dump" tool; given -c, it tries to dump
characters, but outside 7-bit ASCII, it still dumps octal.<br>
<br>
For those familiar with dc(1), this will make sense<br>
$ dc<br>
8o<br>
226p<br>
342<br>
128p<br>
200<br>
147p<br>
223<br>
16i<br>
0XE2p<br>
342<br>
0X80p<br>
200<br>
0X93p<br>
223<br>
<br>
The interesting questions are why C++11 regex can't find <i>en
dash</i>, and why non-C++11 regex doesn't understand multibyte.<br>
</font>
</div></blockquote><blockquote type="cite"><div><span>_______________________________________________</span><br><span>sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org">sword-devel@crosswire.org</a></span><br><span><a href="http://www.crosswire.org/mailman/listinfo/sword-devel">http://www.crosswire.org/mailman/listinfo/sword-devel</a></span><br><span>Instructions to unsubscribe/change your settings at above page</span></div></blockquote></body></html>