[sword-devel] NFC Normalization and osis2mod
Chris Little
chrislit at crosswire.org
Thu Feb 21 14:54:25 MST 2008
I'd prefer that we not completely rewrite the NFC filter, as this patch
does. I realize the ICU tutorials demonstrate the C++ interface, but I'm
pretty certain it's still just a wrapper around the C interfaces that we
were using.
I'll take a look at the filters today.
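
For anyone who wants to push the test beyond input that is already
normalized, here is a minimal sketch of the same kind of check. It is
Python and uses the standard unicodedata module, not Sword's UTF8NFC
filter or ICU, so it only illustrates what NFC output should look like;
the function name check_nfc and the sample strings are made up for the
example.

```python
import unicodedata


def check_nfc(text):
    """Normalize decoded text to NFC; report whether it was already NFC.

    Hypothetical helper for illustration only. It is not part of Sword
    or osis2mod.
    """
    normalized = unicodedata.normalize("NFC", text)
    return normalized == text, normalized


# Already-NFC input: precomposed U+00E9 should pass through untouched.
composed = "caf\u00e9"
# Decomposed input: 'e' followed by combining acute U+0301 should be composed.
decomposed = "cafe\u0301"

print(check_nfc(composed)[0])                # True: nothing to change
print(check_nfc(decomposed)[0])              # False: input was not NFC
print(check_nfc(decomposed)[1] == composed)  # True: result is the composed form
```

A verse that comes back longer than its NFC form, or with extra bytes
after the last character, would point at the kind of trailing garbage
described below.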
--Chris
Troy A. Griffitts wrote:
> DM,
>
> The patch looks good to me.
>
> DM Smith wrote:
>> I've added a -n flag to osis2mod that will normalize UTF-8 to NFC, which
>> we've agreed on as the standard for UTF-8 modules.
>>
>> I used Sword's UTF8NFC filter to do the work, but found that it was
>> buggy, leaving trailing garbage on some verses.
>>
>> I have created a patch for both at
>> http://www.crosswire.org/~dmsmith/nfcPatch.txt and would greatly
>> appreciate some more testing of it.
>>
>> My test was fairly trivial. I took an OSIS file with limited UTF-8,
>> already in NFC, ran it through osis2mod with and without the -n flag,
>> and then compared the two files. Before I fixed UTF8NFC there were
>> differences. After fixing UTF8NFC, there were none.
>>
>> All that this shows is that it does not corrupt a UTF-8 file that is
>> already in good NFC form.
>>
>> Many thanks in advance.
>>
>> DM
>>
>> _______________________________________________
>> sword-devel mailing list: sword-devel at crosswire.org
>> http://www.crosswire.org/mailman/listinfo/sword-devel
>> Instructions to unsubscribe/change your settings at above page