[sword-devel] better UTF-sensitive sort

Matěj Cepl mcepl at cepl.eu
Wed Jan 13 02:46:40 MST 2016


On 2016-01-12, 16:52 GMT, DM Smith wrote:
> You can take the second column and sort it by each of the 
> locales mentioned.

https://mcepl.fedorapeople.org/tmp/sort-complicated.txt is the 
second column as plain text in UTF-8.

My colleague working on LibreOffice says he doesn’t know of 
anything better than ICU. Yes, it is a monster. Perhaps the 
UTF-8->UTF-16LE->UTF-8 round-trip is not that expensive after 
all?
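
A rough way to get a feel for that round-trip cost is to time it on 
some sample strings. The following is only a sketch in Python using 
the standard codecs, not ICU's converters, so the absolute numbers 
say nothing definitive about what SWORD would see in C++. The 
function name round_trip and the sample words are made up for 
illustration; they are not the contents of sort-complicated.txt.

```python
import timeit


def round_trip(utf8_bytes: bytes) -> bytes:
    """Convert UTF-8 bytes to UTF-16LE and back to UTF-8."""
    utf16le = utf8_bytes.decode("utf-8").encode("utf-16-le")
    return utf16le.decode("utf-16-le").encode("utf-8")


# Hypothetical sample data, chosen only to include non-ASCII letters.
words = ["Matěj", "Šárka", "chata", "cesta", "Zürich", "Ångström"]
sample = "\n".join(words).encode("utf-8")

# The conversion is lossless for valid UTF-8 input.
assert round_trip(sample) == sample

# Time many round trips and report the average cost of one.
runs = 10_000
seconds = timeit.timeit(lambda: round_trip(sample), number=runs)
print(f"{seconds / runs * 1e6:.2f} microseconds per round trip "
      f"of {len(sample)} bytes")
```

If the conversion time turns out to be small next to the collation 
itself, that would support the idea that the round-trip is 
affordable, but this would need measuring against the real ICU calls.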

Best,

Matěj

-- 
https://matej.ceplovi.cz/blog/, Jabber: mcepl at ceplovi.cz
GPG Finger: 89EF 4BC6 288A BF43 1BAB  25C3 E09F EF25 D964 84AC
 
He can compress the most words into the smallest idea of any man
I know.
      -- Abraham Lincoln



