[sword-devel] Character Frequency

Peter von Kaehne refdoc at gmx.net
Fri Jul 8 03:06:53 MST 2011


On Fri, 2011-07-08 at 01:17 -0700, David Haslam wrote:
> For projects that begin at USFM (or earlier), it would be great to develop a
> tool that analyses character frequency of the text (for the whole Bible)
> apart from all the USFM tags, etc.

Done for USFM.

sword-tools/modules/misc_cleanup/usfm_charmap.pl

Anything build from XML (this includes files coming out of e.g. a styled
MS word document, once exported properly, e.g to abiword.xml) the
previously mentioned will do the job largely. Shortcomings there would
be verse and chapter numbers are usually part of the pain text.

Peter




More information about the sword-devel mailing list