[sword-devel] Character Frequency
Peter von Kaehne
refdoc at gmx.net
Fri Jul 8 03:06:53 MST 2011
On Fri, 2011-07-08 at 01:17 -0700, David Haslam wrote:
> For projects that begin at USFM (or earlier), it would be great to develop a
> tool that analyses character frequency of the text (for the whole Bible)
> apart from all the USFM tags, etc.
Done for USFM.
sword-tools/modules/misc_cleanup/usfm_charmap.pl
Anything build from XML (this includes files coming out of e.g. a styled
MS word document, once exported properly, e.g to abiword.xml) the
previously mentioned will do the job largely. Shortcomings there would
be verse and chapter numbers are usually part of the pain text.
Peter
More information about the sword-devel
mailing list