[sword-devel] module making problem - U_INVALID_CHAR_FOUND

DM Smith dmsmith555 at gmail.com
Wed Apr 13 10:33:10 MST 2005


I meant to cite the Unicode Consortium's mapping of ISO-8859-1 to Unicode:
http://www.unicode.org/Public/MAPPINGS/ISO8859/8859-1.TXT
Note that everything from 0x80-0x9F is marked as <control>.
MS code pages such as cp1252 assign printable characters to most of this range.
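
To make the difference concrete, here is a minimal Python sketch (Python is only my choice for illustration; it has nothing to do with how imp2ld or ICU decode the file). It decodes the single byte 0x96 once as ISO-8859-1 and once as cp1252:

```python
import unicodedata

raw = bytes([0x96])  # the byte in question

# ISO-8859-1 maps each byte to the code point with the same value,
# so 0x96 becomes U+0096, which Unicode classifies as a control character.
as_latin1 = raw.decode("latin-1")

# cp1252 assigns a printable character to the same byte.
as_cp1252 = raw.decode("cp1252")

print(f"latin-1: U+{ord(as_latin1):04X} category {unicodedata.category(as_latin1)}")
print(f"cp1252:  U+{ord(as_cp1252):04X} {unicodedata.name(as_cp1252)}")
```

The first line reports a control character (category Cc) and the second reports EN DASH, which matches the mapping table linked above.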

On 4/13/05, DM Smith <dmsmith555 at gmail.com> wrote:
> I found that there are 302 lines with a non ISO-8859-1 character with a
> hex value of 96, which on the MS cp1252 code page is an EN DASH. I
> suspect that you have your standard code page set to something other
> than cp1252 and that imp2ld is using that. But I don't know how that
> would be done or how to tell imp2ld to use cp1252.
> 
> If you have perl installed, the following will replace every en dash with
> two hyphens (\226 is the octal form of hex 96).
> 
> perl -pe 's/\226/--/g' philpot.imp >philpot.imp2
> 
> Don A. Elbourne Jr. wrote:
> 
> > I tried
> >
> > imp2ld philpot.imp
> > imp2ld philpot.imp 4
> > imp2ld philpot.imp 2
> > imp2ld philpot.imp Philpot
> > imp2ld philpot.imp Philpot 4
> > imp2ld philpot.imp Philpot 2
> >
> > I get dozens of "from: U_INVALID_CHAR_FOUND" lines every time.
> >
> > I tried removing all the "-" altogether and still get the error.
> >
> > If you are not getting the errors with the same file, then I must have
> > something going wrong on my system. Where are these drivers? Maybe I
> > have some old ones or something.
> >
> >
> >
> >
> > Chris Little wrote:
> >
> >> Could you post the commands you issued (both to imp2ld and to
> >> mod2imp) and the .conf file you created for the module?
> >>
> >> I tried it myself and it worked fine roundtrip with the rawLD4
> >> driver. The zLD driver looks like it has a bug since it quit
> >> exporting at 12.25.
> >>
> >> I have a couple of other recommendations for the markup though:
> >> Convert the sequence "--" to an actual emdash character, when it
> >> appears. The encoding is inconsistent in this matter, since some
> >> emdashes do appear in the text.
> >> Pick any markup format other than plaintext. imp2ld doesn't seem to
> >> be respecting linebreaks, and the frontends wouldn't necessarily do
> >> so either. If you want to guarantee a linebreak toss some <p>'s in
> >> there and change the .conf accordingly. This will also permit markup
> >> of scripture references.
> >>
> >> --Chris
> >>
> >> Don A. Elbourne Jr. wrote:
> >>
> >>> Running mod2imp results in a mangled imp file. A few of the lines
> >>> start with a $$$ and no date, other lines start with a date, and no
> >>> $$$, and some of the dates are out of order. The file only goes
> >>> through Jan. 6. I do not see any rhyme or reason to the placement.
> >>>
> >>>
> >>> Chris Little wrote:
> >>>
> >>>> This is an error coming from ICU (not Sword) and I don't believe it
> >>>> is any problem. If you have a question about whether your data is
> >>>> being preserved correctly, you can do imp2mod on your input file
> >>>> followed by mod2imp on the resulting module. Then diff the two imp
> >>>> files and see if there's any difference.
> >>>>
> >>>> --Chris
> >>>>
> >>>> Don A. Elbourne Jr. wrote:
> >>>>
> >>>>> I am creating a new Daily Devo module out of Joseph Philpot's
> >>>>> "Daily Portions." I have created an imp file and I'm using the
> >>>>> latest imp2ld tool on WinXP.
> >>>>>
> >>>>> I get output like this:
> >>>>>
> >>>>> 12.30
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>> 12.31
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>> from: U_INVALID_CHAR_FOUND
> >>>>>
> >>>>> You can see my imp file here:
> >>>>> http://elbourne.org/temp/philpot.imp (640 KB)
> >>>>>
> >>>>> What character could be flagging the errors and what should I do
> >>>>> to correct it?
> >>>>>
> >>>>
> >>>> _______________________________________________
> >>>> sword-devel mailing list: sword-devel at crosswire.org
> >>>> http://www.crosswire.org/mailman/listinfo/sword-devel
> >>>> Instructions to unsubscribe/change your settings at above page
> >>>>
> >>>
> >>
> >
>
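
For anyone without perl, the scan and replacement described in the quoted message could be sketched in Python roughly as follows. This is an illustration under assumptions, not a tested fix for the module: the function names and the sample data are hypothetical, and it works on raw bytes so that no code page setting gets involved.

```python
# The byte that cp1252 uses for EN DASH; the perl one-liner writes it as octal \226.
EN_DASH_CP1252 = b"\x96"


def count_lines_with_byte(data: bytes, needle: bytes = EN_DASH_CP1252) -> int:
    """Return how many lines of data contain needle at least once."""
    return sum(1 for line in data.splitlines() if needle in line)


def replace_en_dash(data: bytes, replacement: bytes = b"--") -> bytes:
    """Replace every cp1252 en dash byte with the given replacement bytes."""
    return data.replace(EN_DASH_CP1252, replacement)


# Hypothetical sample standing in for a few lines of an imp file.
sample = b"$$$01.01\nGrace \x96 free grace\nno dash here\n"

print(count_lines_with_byte(sample), "line(s) contain byte 0x96")
print(replace_en_dash(sample))
```

For the real file, read philpot.imp with open(..., "rb"), pass the bytes through the same two functions, and write the result with open(..., "wb").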


