[sword-devel] utf lexicons + bug

Christian Renz sword-devel@crosswire.org
Tue, 25 Jun 2002 05:12:22 +0800


--Q68bSM7Ycu6FN28Q
Content-Type: text/plain; charset=us-ascii; format=flowed
Content-Disposition: inline

>Entries appear repeatedly. Can you confirm this? I shall attach a debug

I had similar problems with the windows version of the Sword
project. Also, especially in the area of the number entries you
mentioned ("10 MILLION" etc.), there are weird jumps when you try to
scroll using the cursor keys -- after scrolling forward n keys, the
dictionary will just jump back to a certain entry.

As I mentioned, I am also having problems with imp2ld. I managed to
compile a cygwin version today (the compress functions seem to be
missing in MingW) from the current release, so I feel confident
reporting the bugs. One of them is related to your problem.

Please find enclosed two modules, abc_imp2ld and abc_addld. abc_imp2ld
was created by imp2ld from

---- abc.imp ----
$$$a
The first letter of the latin alphabet.
$$$b
Being the second letter of the latin alphabet, it can be found just after a, but before c.
$$$c
This is the third letter of the latin alphabet and the end of this silly dictionary.
---- abc.imp ----

abc_addld was created by calling addld manually (same entries). As you
can see, this dictionary is only a minor improvement over the "home town
chicken" ;-).

The generated .dat files are identical. The generated .idx files,
however, differ. Here's the problems I encountered:

1. imp2ld crashes when given the name of a nonexistant input
   file. Talk about "nice error messages"...

2. When displaying abc_imp2ld in sword, sword (windows) displays a lot
   of "A" entries (no B or C). Accessing any of them will result in a
   write (!) access violation. sword survives, however.

3. When displaying abc_addld in sword, sword displays the entries "A A
   B C" (!). Accessing the entries works, though, but clicking on the
   first A entry will cause sword to jump to the second one.

   Interestingly enough, in the CEDICT the A entry will appear twice
   as well. (only checked up to beta-G. Up to beta-G this was the first
   entry. beta-H changed the display ordering).

4. mod2zmod can't handle abc_imp2ld as well -- endless loop. More
   about that later when I have more time to test. (This was the
   mod2zmod from the alpha website, my mod2zmod cygwin-compiled
   executable performs only endless loops of doing nothing).

Sorry for distributing the attachment to everybody, but I felt it's
not too much of a burden, since it's only 1.5K.

My guess is that the index created by imp2ld is corrupt, and that
there is a bug in the sword library lexdict handling. Can't
investigate further right now, though.

Greetings and blessings,
   Christian

-- 
crenz@web42.com - http://www.web42.com/crenz/ - http://www.web42.com/

"It's so much easier to pray for a bore than to go and see one."  -- C.S. Lewis

--Q68bSM7Ycu6FN28Q
Content-Type: application/x-zip-compressed
Content-Disposition: attachment; filename="abc.zip"
Content-Transfer-Encoding: base64

UEsDBAoAAAAAAIIm2SwAAAAAAAAAAAAAAAAEAAAAYWJjL1BLAwQUAAAACABMttgswMDk2YEA
AADnAAAACwAAAGFiYy9hYmMuaW1wfU7LDcMgDL0zxTtwjDJEZ+gChpjiCkEF5pDtaxggkiVL
7++9J/fOjCR9KAqrckdLUMMKqVRQ+WUKrKfz3gf3YqmfTQ+OrV6PngOiiFQRrKFNU3+n1VBa
DjoQphqVWmfEnR9tjAzYrTDN0p8LQJa5YLa/FWYdUsqNS6JKq9Tv0/0BUEsDBAoAAAAAAI0m
2SwAAAAAAAAAAAAAAAALAAAAYWJjL21vZHMuZC9QSwMEFAAAAAgAYibZLBI7XJtDAAAAdAAA
ABMAAABhYmMvbW9kcy5kL2FiYy5jb25mi05MSo7PzC0wykmJ5eVySSxJDEgsybDV08/NTynN
SS3WR8jzcvnmp7gUldkGJZb7uPBy8XJFgyQTU1Lw6QVLo2oFAFBLAwQKAAAAAACDJtksAAAA
AAAAAAAAAAAADAAAAGFiYy9tb2R1bGVzL1BLAwQUAAAACADMtdgsrPdbPIEAAADnAAAAGQAA
AGFiYy9tb2R1bGVzL2FiY19hZGRsZC5kYXR9TtsJwzAM/A9khxsgdIe2K3QB2ZFjFWMXW/7I
9pUyQOBAcE891+WTGUn6UBRW5Y6WoMYVUqmg8ssUWB/LurwMLPW45MGx1f02s0EUkSqCLbRp
7u+0GUqeoA1hqkmpdUb0/rc/IwMGL9Ms/X4AZJ1Os93LYdEhpZzYJaq0Sv305j9QSwMEFAAA
AAgAzLXYLK1i4zcQAAAAGAAAABkAAABhYmMvbW9kdWxlcy9hYmNfYWRkbGQuaWR4Y2CAAW0G
XSAZx9ALJCMYAFBLAwQUAAAACABPttgsrPdbPIEAAADnAAAAGgAAAGFiYy9tb2R1bGVzL2Fi
Y19pbXAybGQuZGF0fU7bCcMwDPwPZIcbIHSHtit0AdmRYxVjF1v+yPaVMkDgQHBPPdflkxlJ
+lAUVuWOlqDGFVKpoPLLFFgfy7q8DCz1uOTBsdX9NrNBFJEqgi20ae7vtBlKnqANYapJqXVG
9P63PyMDBi/TLP1+AGSdTrPdy2HRIaWc2CWqtEr99OY/UEsDBBQAAAAIAE+22CyumTw8EgAA
ACAAAAAaAAAAYWJjL21vZHVsZXMvYWJjX2ltcDJsZC5pZHhjYEAAbSDWBeI4IO4F4gggBgBQ
SwECFAsKAAAAAACCJtksAAAAAAAAAAAAAAAABAAAAAAAAAAAABAAAAAAAAAAYWJjL1BLAQIU
CxQAAAAIAEy22CzAwOTZgQAAAOcAAAALAAAAAAAAAAEAIAAAACIAAABhYmMvYWJjLmltcFBL
AQIUCwoAAAAAAI0m2SwAAAAAAAAAAAAAAAALAAAAAAAAAAAAEAAAAMwAAABhYmMvbW9kcy5k
L1BLAQIUCxQAAAAIAGIm2SwSO1ybQwAAAHQAAAATAAAAAAAAAAEAIAAAAPUAAABhYmMvbW9k
cy5kL2FiYy5jb25mUEsBAhQLCgAAAAAAgybZLAAAAAAAAAAAAAAAAAwAAAAAAAAAAAAQAAAA
aQEAAGFiYy9tb2R1bGVzL1BLAQIUCxQAAAAIAMy12Cys91s8gQAAAOcAAAAZAAAAAAAAAAEA
IAAAAJMBAABhYmMvbW9kdWxlcy9hYmNfYWRkbGQuZGF0UEsBAhQLFAAAAAgAzLXYLK1i4zcQ
AAAAGAAAABkAAAAAAAAAAAAgAAAASwIAAGFiYy9tb2R1bGVzL2FiY19hZGRsZC5pZHhQSwEC
FAsUAAAACABPttgsrPdbPIEAAADnAAAAGgAAAAAAAAABACAAAACSAgAAYWJjL21vZHVsZXMv
YWJjX2ltcDJsZC5kYXRQSwECFAsUAAAACABPttgsrpk8PBIAAAAgAAAAGgAAAAAAAAAAACAA
AABLAwAAYWJjL21vZHVsZXMvYWJjX2ltcDJsZC5pZHhQSwUGAAAAAAkACQA9AgAAlQMAAAAA

--Q68bSM7Ycu6FN28Q--