<html><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">Robin,<div><br class="webkit-block-placeholder"></div><div>There are a few other issues relating to getting this to work.</div><div><br class="webkit-block-placeholder"></div><div>Unicode allows for a decorated character to be a single code point, called a composed character, or multiple code points for the letter followed by it's decorations.</div><div><br class="webkit-block-placeholder"></div><div>These are called NFC and NFD, respectively. There are two other ways to represent unicode characters called NFKC and NFKD. For a good description see: <a href="http://unicode.org/reports/tr15/">http://unicode.org/reports/tr15/</a> and <a href="http://unicode.org/faq/normalization.html">http://unicode.org/faq/normalization.html</a></div><div><br class="webkit-block-placeholder"></div><div>At CrossWire, we have settled on NFC. This appears to be the recommendation of the w3c. See: <a href="http://www.crosswire.org/pipermail/sword-devel/2007-September/025896.html">http://www.crosswire.org/pipermail/sword-devel/2007-September/025896.html</a></div><div><br class="webkit-block-placeholder"></div><div>At this time it is the module encoder's responsibility to encode the module correctly. Later osis2mod (and perhaps some of the other filters) will be changed to force the text to NFC.</div><div><br class="webkit-block-placeholder"></div><div>Basically, you need to first run the text through a filter to that does Canonical Decomposition and then through one that does Canonical Composition. (The Sword filter utf8nfc.cpp does this)</div><div><br class="webkit-block-placeholder"></div><div>Once that is done, make the module as you have always done.</div><div><br class="webkit-block-placeholder"></div><div>The next step is to ensure that you have a font that can handle the text. On Windows, I believe Arial should work. However, SIL has a bunch of open source fonts which are excellent. See: <a href="http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&item_id=silfontlist">http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&item_id=silfontlist</a></div><div><br class="webkit-block-placeholder"></div><div>If it doesn't work in BibleCS, try BibleDesktop, FireFox and IE. (you should be able to open the dat file if the module is not compressed)</div><div><br></div><div>Hope that helps.</div><div><br class="webkit-block-placeholder"></div><div>In His Service,</div><div><span class="Apple-tab-span" style="white-space:pre">        </span>DM</div><div><br class="webkit-block-placeholder"></div><div><div><div>On Jan 19, 2008, at 11:47 PM, <a href="mailto:RLRANDALLX@aol.com">RLRANDALLX@aol.com</a> wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite"> <div id="role_body" style="FONT-SIZE: 10pt; COLOR: #000000; FONT-FAMILY: Arial" bottommargin="7" leftmargin="7" topmargin="7" rightmargin="7"><font id="role_document" face="Arial" color="#000000" size="2"> <div> </div> <div>DM, Sabastien,</div> <div> </div> <div>Thanks for your references on encoding. I have read about encodings and Now I need a practical example of making it thru the module process. Let's say I have an alpha with an accent (I use PSPad to get the codes in right) and I have this in XXX.imp. When I bring this into NotePad with UTF-8 as encoding type (Format is also Greek script) it looks just fine. Then I run it thru "imp2ld XXX.imp XXX 2" to get XXX.dat and XXX.idx. No errors, no problems. The XXX.conf file has Encoding=UTF-8. But when I fire up BibleCS and look at XXX in the LD pane I see a box where I am expecting an accented alpha. Unfortunately I know of no accented Greek text that I can reverse engineer to see where I am going wrong. Without clear answers at this point</div> <div>I have resigned to include only unaccented Greek text. If there are better tools out there to ensure I am on the right track please let me know.</div> <div> </div> <div>In His Grace,</div> <div>Robin</div> <div> </div> <div>>On Jan 18, 2008, at 5:24 AM, Sebastien Koechlin wrote:<br><br>>> On Thu, Jan 17, 2008 at 11:58:10PM -0500, <a href="mailto:RLRANDALLX@aol.com">RLRANDALLX@aol.com</a> wrote:<br>>>>>> I'm trying to display Unicode Greek in RawLD ThML with 1.5.9 <br>>>>>> BibleCS.<br>>>>>> Does anyone know what the .conf file should look like?<br>>>>>> "Encoding=Unicode or "Encoding=UNICODE" does NOT work. I just <br>>>>>> get open<br>>>>>> squares where the letters should have accents.<br>>>><br>>>>> Should be UTF-8, "unicode" is usually for internal representation <br>>>>> only<br>>>>> and "unicode" in itself is ambiguous.<br>>><br>>> Unicode is not an encoding.<br>>><br>>> As encoding is a common source of problems, I tried to write a small <br>>> text<br>>> about it. As english is not my native language, someone should <br>>> probably<br>>> correct it.<br>>><br>>> <a href="http://www.crosswire.org/wiki/index.php/Encoding">http://www.crosswire.org/wiki/index.php/Encoding</a><br>><br>>I've added links to your excellent page from <a href="http://www.crosswire.org/wiki/index.php/DevTools:Modules">http://www.crosswire.org/wiki/index.php/DevTools:Modules</a> <br>> both in the section on Encoding and in Related Links.<br></div> <div><font lang="0" face="Arial" size="2" family="SANSSERIF" ptsize="10">E-mail: </font><font lang="0" face="Arial" color="#ff0000" size="2" family="SANSSERIF" ptsize="10"><a href="mailto:RLRandallX@aol.com">RLRandallX@aol.com</a><br></font></div></font><br><br><br><div><font style="color: black; font: normal 10pt ARIAL, SAN-SERIF;"><hr style="MARGIN-TOP: 10px">Start the year off right. <a title="http://body.aol.com/fitness/winter-exercise?NCID=aolcmp00300000002489" href="http://body.aol.com/fitness/winter-exercise?NCID=aolcmp00300000002489" target="_blank">Easy ways to stay in shape</a> in the new year. </font></div></div> _______________________________________________<br>sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org">sword-devel@crosswire.org</a><br><a href="http://www.crosswire.org/mailman/listinfo/sword-devel">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br>Instructions to unsubscribe/change your settings at above page</blockquote></div><br></div></body></html>