[sword-devel] module making problem - U_INVALID_CHAR_FOUND
DM Smith
dmsmith555 at gmail.com
Wed Apr 13 10:21:12 MST 2005
I found that there are 302 lines with a non ISO-8859-1 character with a
hex value of 96, which on the MS cp1252 code page is an EN DASH. I
suspect that you have your standard code page set to something other
than cp1252 and that imp2ld is using that. But I don't know how that
would be done or how to tell imp2ld to use cp1252.
If you have perl installed the following will replace all en dash with
two hyphens.
perl -pe 's/\226/--/g' philpot.imp >philpot.imp2
Don A. Elbourne Jr. wrote:
> I tried
>
> imp2ld philpot.imp
> imp2ld philpot.imp 4
> imp2ld philpot.imp 2
> imp2ld philpot.imp Philpot
> imp2ld philpot.imp Philpot 4
> imp2ld philpot.imp Philpot 2
>
> I get dozens of: from: U_INVALID_CHAR_FOUND every time
>
> I tried removing all the "-" all together and still get the error.
>
> If you are not getting the errors with the same file, then I must have
> something going wrong on my system. Where are these drivers? Maybe I
> have some old ones or something.
>
>
>
>
> Chris Little wrote:
>
>> Could you post the commends you issued (both to imp2ld and to
>> mod2imp) and the .conf file you created for the module?
>>
>> I tried it myself and it worked fine roundtrip with the rawLD4
>> driver. The zLD driver looks like it has a bug since it quit
>> exporting at 12.25.
>>
>> I have a couple of other recommendations for the markup though:
>> Convert the sequence "--" to an actual emdash character, when it
>> appears. The encoding is inconsistent in this matter, since some
>> emdashes do appear in the text.
>> Pick any markup format other than plaintext. imp2ld doesn't seem to
>> be respecting linebreaks, and the frontends wouldn't necessarily do
>> so either. If you want to guarantee a linebreak toss some <p>'s in
>> there and change the .conf accordingly. This will also permit markup
>> of scripture references.
>>
>> --Chris
>>
>> Don A. Elbourne Jr. wrote:
>>
>>> Running mod2imp results in a mangled imp file. A few of the lines
>>> start with a $$$ and no date, other lines start with a date, and no
>>> $$$, and some of the dates are out of order. The file only goes
>>> through Jan. 6. I do not see any ryme or reason to the placement.
>>>
>>>
>>> Chris Little wrote:
>>>
>>>> This is an error coming from ICU (not Sword) and I don't believe it
>>>> is any problem. If you have a question about whether your data is
>>>> being preserved correctly, you can do imp2mod on your input file
>>>> followed by mod2imp on the resulting module. Then diff the two imp
>>>> files and see if there's any difference.
>>>>
>>>> --Chris
>>>>
>>>> Don A. Elbourne Jr. wrote:
>>>>
>>>>> I am creating a new Daily Devo module out of Joseph Philpot's
>>>>> "Daily Portions." I have created an imp file and I'm using the
>>>>> latest imp2ld tool on WinXP.
>>>>>
>>>>> I get output like this:
>>>>>
>>>>> 12.30
>>>>> from: U_INVALID_CHAR_FOUND
>>>>> from: U_INVALID_CHAR_FOUND
>>>>> from: U_INVALID_CHAR_FOUND
>>>>> from: U_INVALID_CHAR_FOUND
>>>>> from: U_INVALID_CHAR_FOUND
>>>>> 12.31
>>>>> from: U_INVALID_CHAR_FOUND
>>>>> from: U_INVALID_CHAR_FOUND
>>>>> from: U_INVALID_CHAR_FOUND
>>>>> from: U_INVALID_CHAR_FOUND
>>>>> from: U_INVALID_CHAR_FOUND
>>>>>
>>>>> You can see my imp file here:
>>>>> http://elbourne.org/temp/philpot.imp (640 KB)
>>>>>
>>>>> What character could be flagging the errors and what should I do
>>>>> to correct it?
>>>>>
>>>>
>>>> _______________________________________________
>>>> sword-devel mailing list: sword-devel at crosswire.org
>>>> http://www.crosswire.org/mailman/listinfo/sword-devel
>>>> Instructions to unsubscribe/change your settings at above page
>>>>
>>>
>>
>> _______________________________________________
>> sword-devel mailing list: sword-devel at crosswire.org
>> http://www.crosswire.org/mailman/listinfo/sword-devel
>> Instructions to unsubscribe/change your settings at above page
>>
>
More information about the sword-devel
mailing list