<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
Sorry I was occupied! <br>
<br>
<div class="moz-cite-prefix">Il 15/05/2019 02:34, Michael H ha
scritto:<br>
</div>
<blockquote type="cite"
cite="mid:CAJ9hia-XuQEe2pzkz9Gf79CmcQxrh64D6M39HzxYTuBnk-uo8A@mail.gmail.com">
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
<div dir="ltr">
<div dir="ltr">
<div class="gmail_default"
style="font-family:garamond,serif;font-size:large">I don't
read Burmese and I don't know anyone who does. I was
suggesting you contact whoever provided you the files and
permission, and ask them if they can verify the unicode text
reads correctly. <br>
<br>
</div>
</div>
</div>
</blockquote>
I did it, and the conversion seems to be very well. Only few
characters are not well recognized. Even if I don't know
BUrmese/myanmar. It was easy to check I only look how the characters
are. I did it for Mat when trying to add the verses in the good
place, looking were it is in the pdf. Then I compare the pdf with my
converted txt file.<br>
<blockquote type="cite"
cite="mid:CAJ9hia-XuQEe2pzkz9Gf79CmcQxrh64D6M39HzxYTuBnk-uo8A@mail.gmail.com">
<div dir="ltr">
<div dir="ltr">
<div class="gmail_default"
style="font-family:garamond,serif;font-size:large">To a
native speaker, the text will quickly be recognized as
readable, or misspelled (letters are out of place.) It's
better to check that now than spend a lot of time on the
numbering. <br>
<br>
</div>
</div>
<div class="gmail_default"
style="font-family:garamond,serif;font-size:large">In regard
to the verse numbering. What I described (insert markers
based on a reg ex search, then apply markup based on a
sequential list) would take me about 2-4 hours to complete.
At the end of that the text will have proper USFM tags for
book titles, section titles, section references/ranges,
chapter numbers, and verse numbers. It will need to be
checked that the verse numbers are well placed. This is
depending on the text not having issues on my machine. I
haven't tested all my tools to respect Burmese text. </div>
<br>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<pre class="moz-quote-pre" wrap="">_______________________________________________
sword-devel mailing list: <a class="moz-txt-link-abbreviated" href="mailto:sword-devel@crosswire.org">sword-devel@crosswire.org</a>
<a class="moz-txt-link-freetext" href="http://www.crosswire.org/mailman/listinfo/sword-devel">http://www.crosswire.org/mailman/listinfo/sword-devel</a>
Instructions to unsubscribe/change your settings at above page</pre>
</blockquote>
<br>
</body>
</html>