<html><head><meta http-equiv="Content-Type" content="text/html; charset=iso-8859-7"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">Does Luke give you access to the counts? Are they doubled there too?<div>-- DM</div><div><br><div><div>On Feb 7, 2013, at 8:22 AM, Chris Burrell &lt;<a href="mailto:chris@burrell.me.uk">chris@burrell.me.uk</a>&gt; wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite"><div dir="ltr">No doubt that would cause issues too, but in my case the doubling affects most words, even those that are not split.<div><br></div><div style="">I think a term vector lets you store the positions and offsets of the terms in each document, so that you can accurately work out where each term was in the original sentence/verse even if the original text is no longer stored. </div>
<div style=""><br></div><div style="">For the purpose of counts I don't think a term vector is necessary, although I haven't tried without one yet.</div><div style="">Chris</div><div style=""><br></div></div><div class="gmail_extra"><br>
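<div>The term vector idea described above can be sketched roughly as follows. This is only an illustration in Python, not Lucene's or JSword's API: the function name <code>build_term_vector</code> and the dictionary layout are assumptions made for the example. For each term in one document it records the frequency, the token positions and the character offsets, which is the information needed to locate a term in the original verse.</div>
<pre>
```python
def build_term_vector(text):
    """Return a per-document map: term -> frequency, positions, offsets.

    Hypothetical helper for illustration; tokens are split on whitespace
    and lower-cased, which is far simpler than a real analyzer.
    """
    vector = {}
    search_from = 0
    for position, token in enumerate(text.split()):
        # Locate this token in the original text to get character offsets.
        start = text.index(token, search_from)
        end = start + len(token)
        search_from = end
        entry = vector.setdefault(
            token.lower(), {"freq": 0, "positions": [], "offsets": []}
        )
        entry["freq"] += 1
        entry["positions"].append(position)
        entry["offsets"].append((start, end))
    return vector


vector = build_term_vector("and they shall reign for ever and ever")
print(vector["ever"])
```
</pre>
<div>Note that the frequency here is counted per document; whether the doubled counts seen in Luke are related to term vectors at all is not established in this thread.</div>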
<br><div class="gmail_quote">On 7 February 2013 13:12, DM Smith <span dir="ltr">&lt;<a href="mailto:dmsmith@crosswire.org" target="_blank">dmsmith@crosswire.org</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Not sure if this is the problem:<br>
In the KJV, a Greek word is often split across separate English words when it is translated.<br>
<br>
For example, in Rev 22.5 look at φωτιζει αυτους which translates directly into English as "gives light to them", but is translated in the KJV as "giveth them light", so "them" splits "giveth light":<br>
&lt;verse osisID="Rev.22.5" sID="Rev.22.5"/&gt;<br>
&lt;w src="1" lemma="strong:G2532 tr:και" morph="robinson:CONJ"&gt;And&lt;/w&gt;<br>
&lt;w src="4" lemma="strong:G2071 tr:εσται" morph="robinson:V-FXI-3S"&gt;there shall be&lt;/w&gt;<br>
&lt;w src="3" lemma="strong:G3756 tr:ουκ" morph="robinson:PRT-N"&gt;no&lt;/w&gt;<br>
&lt;w src="2" lemma="strong:G3571 tr:νυξ" morph="robinson:N-NSF"&gt;night&lt;/w&gt;<br>
&lt;w src="5" lemma="strong:G1563 tr:εκει" morph="robinson:ADV"&gt;there&lt;/w&gt;;<br>
&lt;w src="6" lemma="strong:G2532 tr:και" morph="robinson:CONJ"&gt;and&lt;/w&gt;<br>
&lt;w src="9" lemma="strong:G2192 tr:εχουσιν" morph="robinson:V-PAI-3P"&gt;they&lt;/w&gt;<br>
&lt;w src="7" lemma="strong:G5532 tr:χρειαν" morph="robinson:N-ASF"&gt;need&lt;/w&gt;<br>
&lt;w src="8" lemma="strong:G3756 tr:ουκ" morph="robinson:PRT-N"&gt;no&lt;/w&gt;<br>
&lt;w src="10" lemma="strong:G3088 tr:λυχνου" morph="robinson:N-GSM"&gt;candle&lt;/w&gt;,<br>
&lt;w src="11" lemma="strong:G2532 tr:και" morph="robinson:CONJ"&gt;neither&lt;/w&gt;<br>
&lt;w src="12" lemma="strong:G5457 tr:φωτος" morph="robinson:N-GSN"&gt;light&lt;/w&gt;<br>
&lt;w src="13" lemma="strong:G2246 tr:ηλιου" morph="robinson:N-GSM"&gt;of the sun&lt;/w&gt;;<br>
&lt;w src="14" lemma="strong:G3754 tr:οτι" morph="robinson:CONJ"&gt;for&lt;/w&gt;<br>
&lt;w src="15" lemma="strong:G2962 tr:κυριος" morph="robinson:N-NSM"&gt;the Lord&lt;/w&gt;<br>
&lt;w src="16 17" lemma="strong:G3588 strong:G2316 tr:ο tr:θεος" morph="robinson:T-NSM robinson:N-NSM"&gt;God&lt;/w&gt;<br>
&lt;w src="18" lemma="strong:G5461 tr:φωτιζει" morph="robinson:V-PAI-3S" type="x-split-3868"&gt;giveth&lt;/w&gt;<br>
&lt;w src="19" lemma="strong:G846 tr:αυτους" morph="robinson:P-APM"&gt;them&lt;/w&gt;<br>
&lt;w src="18" lemma="strong:G5461 tr:φωτιζει" morph="robinson:V-PAI-3S" type="x-split-3868"&gt;light&lt;/w&gt;:<br>
&lt;w src="20" lemma="strong:G2532 tr:και" morph="robinson:CONJ"&gt;and&lt;/w&gt;<br>
&lt;w src="21" lemma="strong:G936 tr:βασιλευσουσιν" morph="robinson:V-FAI-3P"&gt;they shall reign&lt;/w&gt;<br>
&lt;w src="22" lemma="strong:G1519 tr:εις" morph="robinson:PREP"&gt;for&lt;/w&gt;<br>
&lt;w src="23 24" lemma="strong:G3588 strong:G165 tr:τους tr:αιωνας" morph="robinson:T-APM robinson:N-APM"&gt;ever&lt;/w&gt;<br>
&lt;w src="25 26" lemma="strong:G3588 strong:G165 tr:των tr:αιωνων" morph="robinson:T-GPM robinson:N-GPM"&gt;and ever&lt;/w&gt;.<br>
&lt;milestone type="x-strongsMarkup" resp="pdy 2003-12-31-00:30"/&gt;<br>
&lt;verse eID="Rev.22.5"/&gt;<br>
<br>
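To make the split concrete, here is a minimal sketch in Python of how it could inflate a count. It is not JSword code and says nothing about what StrongAnalyser actually does; the function names are hypothetical, and the data is transcribed by hand from the src and lemma attributes in the verse above. Counting one hit per w element sees G5461 twice, while counting one hit per Greek source word (keyed on src) sees it once.<br>
<pre>
```python
from collections import Counter

# (src attribute, Strong's numbers, English text), transcribed from the
# w elements of Rev 22:5 quoted above.
WORDS = [
    ("1", ["G2532"], "And"),
    ("4", ["G2071"], "there shall be"),
    ("3", ["G3756"], "no"),
    ("2", ["G3571"], "night"),
    ("5", ["G1563"], "there"),
    ("6", ["G2532"], "and"),
    ("9", ["G2192"], "they"),
    ("7", ["G5532"], "need"),
    ("8", ["G3756"], "no"),
    ("10", ["G3088"], "candle"),
    ("11", ["G2532"], "neither"),
    ("12", ["G5457"], "light"),
    ("13", ["G2246"], "of the sun"),
    ("14", ["G3754"], "for"),
    ("15", ["G2962"], "the Lord"),
    ("16 17", ["G3588", "G2316"], "God"),
    ("18", ["G5461"], "giveth"),   # first half of the split
    ("19", ["G846"], "them"),
    ("18", ["G5461"], "light"),    # second half, same src as "giveth"
    ("20", ["G2532"], "and"),
    ("21", ["G936"], "they shall reign"),
    ("22", ["G1519"], "for"),
    ("23 24", ["G3588", "G165"], "ever"),
    ("25 26", ["G3588", "G165"], "and ever"),
]


def count_per_element(words):
    """Count every Strong's number once per w element (naive)."""
    counts = Counter()
    for _src, strongs, _text in words:
        counts.update(strongs)
    return counts


def count_per_source_word(words):
    """Count every Strong's number once per Greek source position."""
    counts = Counter()
    seen = set()
    for src, strongs, _text in words:
        # src lists one position per Strong's number, in the same order.
        for position, strong in zip(src.split(), strongs):
            if position not in seen:
                seen.add(position)
                counts[strong] += 1
    return counts


print(count_per_element(WORDS)["G5461"])      # 2
print(count_per_source_word(WORDS)["G5461"])  # 1
```
</pre>
In this verse only G5461 differs between the two counts, so splits alone would not explain a doubling that affects most words.<br>
<br>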
BTW, I'm not sure what a TermVector is or how it would be used.<br>
<br>
In Him,<br>
DM<br>
<div><div class="h5"><br>
On Feb 7, 2013, at 6:36 AM, Chris Burrell &lt;<a href="mailto:chris@burrell.me.uk">chris@burrell.me.uk</a>&gt; wrote:<br>
<br>
> Hi<br>
><br>
> Looking at the indexes created by JSword, both with Luke and with my own code, shows that the term count is double what it should be.<br>
><br>
> Any ideas why that might be? I can't quite follow the logic in StrongAnalyser, but I stepped through it in the debugger and it didn't look like it was double counting. I might need to do that again.<br>
><br>
> DM, I haven't checked, but apparently the TermVector may not be what I'm using.<br>
><br>
> Chris<br>
><br>
</div></div>> _______________________________________________<br>
> jsword-devel mailing list<br>
> <a href="mailto:jsword-devel@crosswire.org">jsword-devel@crosswire.org</a><br>
> <a href="http://www.crosswire.org/mailman/listinfo/jsword-devel" target="_blank">http://www.crosswire.org/mailman/listinfo/jsword-devel</a><br>
<br>
</blockquote></div><br></div>
</blockquote></div><br></div></body></html>