<div dir="ltr"><div><div>Disregard about the module, I found it in a different section of the module manager. So I have the ERtr_en module now, but as far as I can figure in Xephos it's useless. Turkish is an agglutinated language and almost no words in an actual text like the Bible appear in their root or stem form as found in a dictionary. Ergo no words (except a handful of conjunctions, numbers, etc. that sometimes have no suffixes) you click on to look up in the dictionary even have a chance of coming up with an actual meaning. Even if you know how to parse words and type the stems into the dictionary lookup bar, it rarely has them and throws the closest match (Alphabetical? Levenshtein distance?) which is less than useless.<br><br></div>Unless I'm missing something, it might be just as well to disable the module as insult anybody that tries to use it with data this useless. Am I missing something? Is there a use-case that makes it worth trying to cleanup the character set issue? I'll still look into it if you say it's worth some time to do.<br><br></div><div>Caleb<br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Jan 5, 2016 at 8:39 PM, Caleb Maclennan <span dir="ltr"><<a href="mailto:caleb@alerque.com" target="_blank">caleb@alerque.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div><div>DM,<br><br></div>Honestly I'm willing to put some effort into this if it will be beneficial to anybody using Turkish scriptures, but the Wayback Machine link you provided is not encouraging. Not only is the encoding garbage, but the data itself is rife with mistakes.Not a full minute of skimming it and I found several misspelled Turkish words (not just wrong encoding, actual misspellings) and outright bogus definitions. It's a very low quality data set. Is what's an the page representative of what is going to come out even if I dive down an archaic Windows rabbit hole and manage to surface with a properly encoded list? Is such a dictionary really helping anybody? It doesn't seem to have much in the way of Biblical/theological terminology anyway. Is this just for looking up word definitions in while reading a text or does it serve some purpose for cross referencing translations?<br><br></div><div>I have a copy of Xiphos handy, but for some reason Turkish isn't showing up in the dictionary modules available for download. Is this not in the default CrossWire repo?<span class="HOEnZb"><font color="#888888"><br></font></span></div><span class="HOEnZb"><font color="#888888"><div><br></div><div>Caleb<br></div></font></span></div><div class="HOEnZb"><div class="h5"><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Jan 5, 2016 at 8:11 PM, DM Smith <span dir="ltr"><<a href="mailto:dmsmith@crosswire.org" target="_blank">dmsmith@crosswire.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word">Thanks Caleb,<div><br></div><div>I’m working on JSword which is the Java version of the SWORD engine. As such I run all the modules I can get my hands on through a process that reads all of each module reporting what it cannot handle. It was that effort that made me look closer at the module. Either the problem was in JSword or it was in the module.</div><div><br></div><div>With Peter, David and your input, we can safely say that it is the module’s problem.</div><div><br></div><div>Most front-ends don’t display the module as a list (i.e. browse the contents). Bible Desktop does. Most front-ends allow you to select a word and look it up in a dictionary. The Glossary modules allow you to look up a word in one language and display it in another. Bible Desktop doesn’t.</div><div><br></div><div>If you let us know which front-end you use, we can explain how to download the module for it and how to use it in that program.</div><div><br></div><div>The SWORD utility mod2imp will dump a module’s content in imp format. Since this module is a RawLD module, the *dat file is readable. In your modules folder it would be: modules/lexdict/rawld/glossaries/ertr_en/ertr_en.dat. The ertr_en.idx file is not readable as it is in a proprietary binary format.</div><div><br></div><div>While it certainly is possible to take the dump from mod2imp, edit it and rebuild the module, we prefer not to do that. What is best is to get the source again and create a module from it. And if the source was not the original location, it is best to identify the original and get it from there. In the case of our source, we got it from:</div><div><a href="http://www.wordgumbo.com/al/tur/ertureng.htm" target="_blank">http://www.wordgumbo.com/al/tur/ertureng.htm</a></div><div>Currently this site is down, so I found it via the Internet Wayback Machine:</div><div><a href="https://web.archive.org/web/20131124010613/http://www.wordgumbo.com/al/tur/ertureng.htm" target="_blank">https://web.archive.org/web/20131124010613/http://www.wordgumbo.com/al/tur/ertureng.htm</a></div><div><br></div><div>I noted that WordGumbo sourced the files from Ergane. That is the originator of the data and it can be found here:</div><div><a href="http://download.travlang.com//" target="_blank">http://download.travlang.com//</a></div><div><br></div><div>Ergane is software that runs under Windows only. It doesn’t run under Windows 10 (64-bit). I haven’t tried Windows 7 (64-bit). The software requires various zips to be installed to be useful. I downloaded one of the zip files and it contained an MDB file, which I’m pretty sure is a Windows database file, perhaps Access. From the website’s description of the program:</div><div><br></div><div><blockquote type="cite"><span style="background-color:rgb(255,255,255)">Ergane is a </span><a href="http://users.nccs.gov/~rickyk/scicomp/" target="_blank">multilingual</a><span style="background-color:rgb(255,255,255)"> translation dictionary for Windows that uses the artificial language Esperanto to translate words and short expressions from one natural language to another. Ergane is a product of </span><a href="http://www.majstro.com/Bedrijf/contact_eng.html" target="_blank">Majstro Aplikaĵoj</a><span style="background-color:rgb(255,255,255)">.</span></blockquote></div><div><br></div><div>and</div><div><blockquote type="cite"><p>You won't need a <a href="https://cisonline.bu.edu/master-of-science-in-computer-information-systems/" target="_blank">masters in computer science</a> to download Ergane ,but make sure you do have Windows. </p><p>Windows 95 or higher.</p></blockquote></div><div>Ideally, the output of the program for the Turkish to English needs to be obtained from it, converted into UTF-8, if it isn’t and a module source file created for it. Proof-reading is invaluable.</div><div><br></div><div>Let us know what you are willing to do.</div><div><br></div><div>In Him,</div><div><span style="white-space:pre-wrap">        </span>DM</div><div><div><div><br><div><blockquote type="cite"><div>On Jan 5, 2016, at 12:28 PM, Caleb Maclennan <<a href="mailto:caleb@alerque.com" target="_blank">caleb@alerque.com</a>> wrote:</div><br><div><div dir="ltr"><div><div><div>Hey DM,<br><br></div>I am fluent in Turkish and can help out here. That being said I'm a little confused what you're into here. Can you point me at where to see the source files for this in context and where it comes out in an app?<br><br>It looks from the bits you pasted like a file somewhere along the line got read and interpreted with the wrong code-page. Of the text you pasted, all of it is wrong, but it is all off with a 1-to-1 character transpose that could make it right. All the "O"s are "İ" and all the "1"s are "I" in the dictionary list for example.<br></div><br></div>Caleb<br></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Jan 5, 2016 at 4:56 PM, DM Smith <span dir="ltr"><<a href="mailto:dmsmith@crosswire.org" target="_blank">dmsmith@crosswire.org</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word">Does anyone know Turkish that can help figure out a problem I am having?<div><br></div><div>Background: In ASCII the first 32 characters (00 to 1F) are control characters and most are not valid for XML, but are valid for UTF-8.</div><div><br></div><div>In one of our modules, ERtr_en, I am seeing data such as:</div><div>For the 26th entry, the entry looks like</div><div><span style="font-family:Monaco;font-size:11px"><br></span></div><div><span style="font-family:Monaco;font-size:11px">AUSTOS</span><span style="font-family:Monaco;font-size:11px;white-space:pre-wrap">        </span><span style="font-family:Monaco;font-size:11px">1. August<br /></span></div><div><br></div><div>However, the key AUSTOS has a non-printable between A and U of the control character with the hex value 1F:</div><div>‘A’ ‘1F’ ‘U’ ’S’ ’T’ ‘O’ ’S’</div><div><br></div><div>What is the correct value?</div><div><br></div><div>Note: There are hundreds of such problems in this module. And I’m seeing such non-printables in many other modules from the same source (<a href="http://wordgumbo.com/" target="_blank">wordgumbo.com</a>).</div><div><br></div><div>For those that are interested, here are the first entries in the dictionary, none of which see right to me (ran a few of the definitions through google translate):</div><div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">index<span style="white-space:pre-wrap">        </span>offset<span style="white-space:pre-wrap">        </span>size<span style="white-space:pre-wrap">        </span>key<span style="white-space:pre-wrap">        </span>value</div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">0<span style="white-space:pre-wrap">        </span>33132<span style="white-space:pre-wrap">        </span>22<span style="white-space:pre-wrap">        </span>0NCIL<span style="white-space:pre-wrap">        </span>1. Bible<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">1<span style="white-space:pre-wrap">        </span>33156<span style="white-space:pre-wrap">        </span>72<span style="white-space:pre-wrap">        </span>0NGILIZ<span style="white-space:pre-wrap">        </span>1. English<br />2. Englishman; Sassenach...</div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">2<span style="white-space:pre-wrap">        </span>33260<span style="white-space:pre-wrap">        </span>32<span style="white-space:pre-wrap">        </span>0NGILIZ KAM1_1<span style="white-space:pre-wrap">        </span>1. bamboo<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">3<span style="white-space:pre-wrap">        </span>33230<span style="white-space:pre-wrap">        </span>28<span style="white-space:pre-wrap">        </span>0NGILIZCE<span style="white-space:pre-wrap">        </span>1. English<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">4<span style="white-space:pre-wrap">        </span>33294<span style="white-space:pre-wrap">        </span>44<span style="white-space:pre-wrap">        </span>0NGILTERE<span style="white-space:pre-wrap">        </span>1. England<br />2. England<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">5<span style="white-space:pre-wrap">        </span>33340<span style="white-space:pre-wrap">        </span>28<span style="white-space:pre-wrap">        </span>0RAN<span style="white-space:pre-wrap">        </span>1. Iran; Persia<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">6<span style="white-space:pre-wrap">        </span>33370<span style="white-space:pre-wrap">        </span>25<span style="white-space:pre-wrap">        </span>0RANL1<span style="white-space:pre-wrap">        </span>1. Iranian<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">7<span style="white-space:pre-wrap">        </span>33397<span style="white-space:pre-wrap">        </span>26<span style="white-space:pre-wrap">        </span>0RLANDA<span style="white-space:pre-wrap">        </span>1. Ireland<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">8<span style="white-space:pre-wrap">        </span>33425<span style="white-space:pre-wrap">        </span>43<span style="white-space:pre-wrap">        </span>0RLANDAL1<span style="white-space:pre-wrap">        </span>1. Irish<br />2. Irishman<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">9<span style="white-space:pre-wrap">        </span>33470<span style="white-space:pre-wrap">        </span>21<span style="white-space:pre-wrap">        </span>0SA<span style="white-space:pre-wrap">        </span>1. Christ<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">10<span style="white-space:pre-wrap">        </span>33493<span style="white-space:pre-wrap">        </span>22<span style="white-space:pre-wrap">        </span>0SLAM<span style="white-space:pre-wrap">        </span>1. Islam<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">11<span style="white-space:pre-wrap">        </span>33517<span style="white-space:pre-wrap">        </span>24<span style="white-space:pre-wrap">        </span>0SPANYA<span style="white-space:pre-wrap">        </span>1. Spain<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">12<span style="white-space:pre-wrap">        </span>33543<span style="white-space:pre-wrap">        </span>28<span style="white-space:pre-wrap">        </span>0SPANYOL<span style="white-space:pre-wrap">        </span>1. Spaniard<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">13<span style="white-space:pre-wrap">        </span>33573<span style="white-space:pre-wrap">        </span>39<span style="white-space:pre-wrap">        </span>0SRAIL<span style="white-space:pre-wrap">        </span>1. Israel<br />2. Israel<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">14<span style="white-space:pre-wrap">        </span>33614<span style="white-space:pre-wrap">        </span>28<span style="white-space:pre-wrap">        </span>0STANBUL<span style="white-space:pre-wrap">        </span>1. Istanbul<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">15<span style="white-space:pre-wrap">        </span>33644<span style="white-space:pre-wrap">        </span>24<span style="white-space:pre-wrap">        </span>0SVEÇ<span style="white-space:pre-wrap">        </span>1. Sweden<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">16<span style="white-space:pre-wrap">        </span>33670<span style="white-space:pre-wrap">        </span>41<span style="white-space:pre-wrap">        </span>0SVEÇLI<span style="white-space:pre-wrap">        </span>1. Swedish<br />2. Swede<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">17<span style="white-space:pre-wrap">        </span>33713<span style="white-space:pre-wrap">        </span>31<span style="white-space:pre-wrap">        </span>0SVIÇRE<span style="white-space:pre-wrap">        </span>1. Switzerland<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">18<span style="white-space:pre-wrap">        </span>33746<span style="white-space:pre-wrap">        </span>41<span style="white-space:pre-wrap">        </span>0SVIÇRELI<span style="white-space:pre-wrap">        </span>1. Swiss<br />2. Swiss<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">19<span style="white-space:pre-wrap">        </span>33789<span style="white-space:pre-wrap">        </span>23<span style="white-space:pre-wrap">        </span>0TALYA<span style="white-space:pre-wrap">        </span>1. Italy<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">20<span style="white-space:pre-wrap">        </span>33814<span style="white-space:pre-wrap">        </span>42<span style="white-space:pre-wrap">        </span>0TALYAN<span style="white-space:pre-wrap">        </span>1. Italian<br />2. Italian<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">21<span style="white-space:pre-wrap">        </span>33858<span style="white-space:pre-wrap">        </span>44<span style="white-space:pre-wrap">        </span>0TALYANCA<span style="white-space:pre-wrap">        </span>1. Italian<br />2. Italian<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">22<span style="white-space:pre-wrap">        </span>33904<span style="white-space:pre-wrap">        </span>26<span style="white-space:pre-wrap">        </span>0ZLANDA<span style="white-space:pre-wrap">        </span>1. Iceland<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">23<span style="white-space:pre-wrap">        </span>33086<span style="white-space:pre-wrap">        </span>20<span style="white-space:pre-wrap">        </span>1L1K<span style="white-space:pre-wrap">        </span>1. warm<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">24<span style="white-space:pre-wrap">        </span>33108<span style="white-space:pre-wrap">        </span>22<span style="white-space:pre-wrap">        </span>1RMAK<span style="white-space:pre-wrap">        </span>1. river<br /></div><div style="margin:0px;font-size:11px;line-height:normal;font-family:Monaco">25<span style="white-space:pre-wrap">        </span>7062<span style="white-space:pre-wrap">        </span>25<span style="white-space:pre-wrap">        </span>AUSTOS<span style="white-space:pre-wrap">        </span>1. August<br /></div></div><div><br></div><div><br></div><div>Thanks in advance!</div><div><br></div><div>In Him,</div><div><span style="white-space:pre-wrap">        </span>DM Smith</div><div><br></div></div><br>_______________________________________________<br>
sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org" target="_blank">sword-devel@crosswire.org</a><br>
<a href="http://www.crosswire.org/mailman/listinfo/sword-devel" rel="noreferrer" target="_blank">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br>
Instructions to unsubscribe/change your settings at above page<br></blockquote></div><br></div>
_______________________________________________<br>sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org" target="_blank">sword-devel@crosswire.org</a><br><a href="http://www.crosswire.org/mailman/listinfo/sword-devel" target="_blank">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br>Instructions to unsubscribe/change your settings at above page</div></blockquote></div><br></div></div></div></div><br>_______________________________________________<br>
sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org" target="_blank">sword-devel@crosswire.org</a><br>
<a href="http://www.crosswire.org/mailman/listinfo/sword-devel" rel="noreferrer" target="_blank">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br>
Instructions to unsubscribe/change your settings at above page<br></blockquote></div><br></div>
</div></div></blockquote></div><br></div>