[sword-devel] Is Delitzsch Hebrew NT available?

David Haslam dfhmch at googlemail.com
Wed Feb 23 01:11:15 MST 2011

PDF files with embedded custom fonts can be a pain for extracting text.

Have you checked document properties | fonts to see what these are?

Also, some PDF files are encrypted to prevent copying of content.
If printing is allowed, it might sometimes work to intercept the printing
output stream.
You might still get gibberish, as a result of the embedded fonts though.

btw. There are several utilities available that can convert other encodings
to Unicode.
Unless the embedded font is properly documented, it's a hard slog to remap
the encoding.
I once tried this for an Indian language, but gave up after a few hours.

View this message in context: http://sword-dev.350566.n4.nabble.com/Re-Is-Delitzsch-Hebrew-NT-available-tp3231746p3320641.html
Sent from the SWORD Dev mailing list archive at Nabble.com.

More information about the sword-devel mailing list