<html><head><meta http-equiv="Content-Type" content="text/html charset=iso-8859-1"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">It was based upon an earlier version of diff-match-patch, which was written in javascript, not java. The selection criteria I had was that it had to have a license compatible to JSword. When the original author was hired by google, the code changed to an incompatible license for porting. Since then it was ported to Java 5.<div><br><div>I ported the earlier version to Java 1.4. But I broke it out into multiple classes. (We might be able to eliminate our version and use the google version directly).</div><div><div><br></div><div>I think there is a way to have it do a word based match, but with code changes:</div><div><a href="http://code.google.com/p/google-diff-match-patch/wiki/LineOrWordDiffs">http://code.google.com/p/google-diff-match-patch/wiki/LineOrWordDiffs</a></div><div><br></div><div><br><div><div>On Aug 29, 2012, at 12:50 PM, Chris Burrell <<a href="mailto:chris@burrell.me.uk">chris@burrell.me.uk</a>> wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite">Hi all<div><br></div><div>The current diffing produces some fairly strange results from time to time. I was wondering how much work it would be to make it work for a word by word diff, rather than letter by letter. I've a quick scan through the diff-ing engine, but it looks fairly complicated and can't figure out how much of this is a copy of <a href="http://code.google.com/p/google-diff-match-patch">http://code.google.com/p/google-diff-match-patch</a> and how much has changed.</div>
<div><br></div><div>In the example below, </div><div><table class="table">
<tbody><tr class="row">
<td valign="top" dir="ltr" class="cell"><br> "And God saw th<u>at th</u>e light <font class="strike"><b>, that it was
good : and God divid</b></font><u>was good. And God separat</u>ed the
light from the darkness<font class="strike"> </font> "<br><br>The new diff would hopefully not chop "that and "the" in the first occurrence above. It would not chop "divid" off either, but rather have longer words, which would in turn make things slightly more readable.<br>
<br></td></tr></tbody></table></div><div>(bold indicates strike through)</div><div><br></div><div>Chris</div><div><br></div>
_______________________________________________<br>jsword-devel mailing list<br><a href="mailto:jsword-devel@crosswire.org">jsword-devel@crosswire.org</a><br>http://www.crosswire.org/mailman/listinfo/jsword-devel<br></blockquote></div><br></div></div></div></body></html>