[sword-devel] SIL Ezra fonts

Joel Mawhorter sword-devel@crosswire.org
Fri, 27 Sep 2002 19:18:18 -0700


> > ICU also has support for converting between Unicode and many many legacy
> > encodings. Since Sword already has some dependencies on ICU perhaps this
> > would be a better tool to use first. The only problem I see with using
> > ICU for conversion is if you want to convert from encoding a to encoding
> > b where neither a nor b are Unicode. In that case you have to do a ->
> > Unicode -> b, which may not give the same results as a -> b for some
> > encoding schemes. However, it looks like TCECkit has the same limitation.
>
> This shouldn't be a problem for ICU since this is what compatability
> characters are for.  The only problem would occur if Unicode REALLY had no
> character for whatever both a & b could encode.
>
> I wasn't suggesting using TECKit in sword, just that module makers might
> find it useful for recoding documents. You can do this with ICU, if you
> want to fiddle with rules files and then recompile the data files.  TECKit
> can do the same without requiring a recompile.  Sword doesn't need to
> transcode since we only take data in Unicode. :)

Are there any encodings that we currently use besides UTF-8 (given that UTF-8 
is a superset of ISO-8859-1)?

Joel

> --Chris