<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class="">I wish you had started with this. I find it more satisfying to know what the end goal is. I had thought you merely wanted to update the wiki.<div class=""><br class=""></div><div class="">I’m wondering how a “morphological segment” should be rendered to an end user. I presume that it is of value to a student of Hebrew. Is it merely highlighting the segment differently than an adjacent one, or are there other attributes that should be provided to the end user, e.g. Strong’s Number, part of speech, ….</div><div class=""><br class=""></div><div class=""><div class="">Having / in the OSIS as a semantic marker is bad, bad, bad. Text in XML shouldn’t have to be parsed. Never. You are right to suggest a different markup. The problem with a milestone is that it marks the “between” but not the segment.</div><div class=""><br class=""></div><div class="">The incompletely supported markup for SWORD is <seg type="morph">text</seg> or <seg type="x-morph">text</seg>.</div><div class=""><br class=""></div><div class="">Not sure how it should be indexed. At least by whole word. 
Perhaps also by segment?</div><div class=""><br class=""></div><div class="">In Him,</div><div class=""><span class="Apple-tab-span" style="white-space:pre">        </span>DM</div><div class=""><br class=""></div><div><br class=""><blockquote type="cite" class=""><div class="">On Dec 30, 2017, at 3:54 AM, David Haslam <<a href="mailto:dfhdfh@protonmail.com" class="">dfhdfh@protonmail.com</a>> wrote:</div><br class="Apple-interchange-newline"><div class=""><div class="">Thanks everyone for each "pearl of wisdom".<br class=""></div><div class=""><br class=""></div><div class="">It wasn't that I didn't understand what Hebrew morpheme <b class="">segmentation</b> is.<br class=""></div><div class=""><i class="">That's pretty evident from examining the content of the modules concerned.</i><br class=""></div><div class=""><br class=""></div><div class="">Nor was it that I am unfamiliar with use of the <b class="">seg</b> element in OSIS XML.<br class=""></div><div class=""><i class="">I've been using OSIS for long enough to have seen this many times.</i><br class=""></div><div class=""><br class=""></div><div class="">My concern is that we have a listed filter that either seems to <u class="">do nothing</u></div><div class="">or that the module[s] where it's specified <u class="">do not match</u> what the filter expects.<br class=""></div><div class=""><br class=""></div><div class="">Neither scenario is good. <br class=""></div><div class=""><br class=""></div><div class=""> Recently, I've been looking at the <a href="https://github.com/openscriptures/morphhb" title="https://github.com/openscriptures/morphhb" rel="nofollow" class="">openscriptures/morphhb</a> project on GitHub. <br class=""></div><div class=""><u class="">Aside</u>: I even issued a pull request yesterday.<br class=""></div><div class="">btw. 
Some of the team members are known to us.<br class=""></div><div class=""><br class=""></div><div class="">They have adopted a simpler method to separate morpheme segments.<br class=""></div><div class="">They just use a <b class="">solidus</b> as a separator character within the Hebrew word, e.g.<br class=""></div><div class=""><w lemma="b/7225" n="1.0" morph="HR/Ncfsa"><b class="">בְּ/רֵאשִׁ֖ית</b></w><br class=""></div><div class="">That's fine in the raw XML but it would look very odd to a Hebrew reader.<br class=""></div><div class=""><br class=""></div><div class="">When the openscriptures team gets round to rebuilding the <b class="">OSHB</b> module, we require a filter that works.<br class=""></div><div class="">It need not look for the OSIS feature that doesn't seem to have any effect in our <i class="">defective</i> <b class="">WLC</b> module.<br class=""></div><div class=""><br class=""></div><div class="">My proposal would be to provide a simple mechanism in SWORD to <u class="">hide</u> or <u class="">display</u> a specified <b class="">marker</b>.<br class=""></div><div class=""><br class=""></div><div class="">Their present OSIS files could be converted to replace each solidus in the Hebrew text to the following XML element.<br class=""></div><div class=""><milestone type="x-mss" marker="/" /><br class=""></div><div class=""><i class="">The type attribute proposed is merely an abbreviation for "morpheme segment separator".</i><br class=""></div><div class=""><br class=""></div><div class="">We already have something like this in SWORD.<br class=""></div><div class="">When the KJV module switches to paragraphs, the <b class="">Pilcrow</b> signs disappear.<br class=""></div><div class="">They are coded as milestone markers.<br class=""></div><div class=""><br class=""></div><div class="">We could generalise the concept by having a filter called<br class=""></div><div class=""><b class="">GlobalOptionFilter=OSISMilestoneMarker</b><br class=""></div><div 
class=""><br class=""></div><div class="">Who knows how many further good uses it might find?<br class=""></div><div class=""><br class=""></div><div class="protonmail_signature_block"><div class="protonmail_signature_block-user"><div class="">Best regards,<br class=""></div><div class=""><br class=""></div><div class="">David<br class=""></div></div><div class=""><br class=""></div><div class="protonmail_signature_block-proton">Sent with <a href="https://protonmail.com/" class="">ProtonMail</a> Secure Email.<br class=""></div></div><div class=""><br class=""></div><blockquote class="protonmail_quote" type="cite"><div class="">-------- Original Message --------<br class=""></div><div class="">Subject: Re: [sword-devel] OSISMorphSegmentation<br class=""></div><div class="">Local Time: 30 December 2017 12:03 AM<br class=""></div><div class="">UTC Time: 30 December 2017 00:03<br class=""></div><div class="">From: <a href="mailto:scribe@crosswire.org" class="">scribe@crosswire.org</a><br class=""></div><div class="">To: <a href="mailto:sword-devel@crosswire.org" class="">sword-devel@crosswire.org</a><br class=""></div><div class=""><br class=""></div><div class=""><div class="">A few brief points:<br class=""></div><div class=""> <br class=""></div><div class=""> The logs have this as the initial commit:<br class=""></div><div class=""> <br class=""></div><div class=""> commit ecaac871e4fa607a32d81f1049e928795db4eaa1<br class=""></div><div class=""> Author: chrislit <a href="mailto:chrislit@bcd7d363-81e1-0310-97ec-a550e20fc99c" class="">chrislit@bcd7d363-81e1-0310-97ec-a550e20fc99c</a><br class=""></div></div><h2 class=""><div class="">Date: Wed Jan 11 19:45:21 2006 +0000<br class=""></div><div class=""> <br class=""></div><div class=""> added OSISMorphSegmentation files (from BibleTime) to repository;<br class=""></div><div class=""> not integrated into projects/make system yet<br class=""></div><div class=""> <br class=""></div><div class=""> git-svn-id: <a 
href="https://crosswire.org/svn/sword/trunk@1884" class="">https://crosswire.org/svn/sword/trunk@1884</a><br class=""></div><div class=""> bcd7d363-81e1-0310-97ec-a550e20fc99c<br class=""></div><div class=""> <br class=""></div></h2><h1 class=""><div class="">Maybe the BibleTime team can lend a little info on the original intent.<br class=""></div><div class=""> <br class=""></div></h1><div class=""><div class=""> <br class=""></div><div class=""> I did a little work on it about 3 years ago:<br class=""></div><div class=""> <br class=""></div><div class=""> commit 0eda5565f50a1a6b22b4b96e147e81b04e88b859<br class=""></div><div class=""> Author: scribe <a href="mailto:scribe@bcd7d363-81e1-0310-97ec-a550e20fc99c" class="">scribe@bcd7d363-81e1-0310-97ec-a550e20fc99c</a><br class=""></div></div><div class=""><div class="">Date: Mon Apr 14 16:22:11 2014 +0000<br class=""></div><div class=""> <br class=""></div><div class=""> fixed osismorphsegmentation to look for both type=morph and x-morph<br class=""></div><div class=""> fixed close seg to check inMorph before processing </seg> as close morph<br class=""></div></div><h2 class=""><div class=""> <br class=""></div><div class=""> git-svn-id: <a href="https://crosswire.org/svn/sword/trunk@3153" class="">https://crosswire.org/svn/sword/trunk@3153</a><br class=""></div><div class=""> bcd7d363-81e1-0310-97ec-a550e20fc99c<br class=""></div></h2><h1 class=""><div class="">... 
which I believe was in reference to Daniel Owen's work with the WHM<br class=""></div><div class=""> database:<br class=""></div><div class=""> <br class=""></div><div class=""> <a href="http://crosswire.org/~dowens76/swordweb/parallelstudy.jsp?add=KJV&add=WHM&key=Gen.1.1" class="">http://crosswire.org/~dowens76/swordweb/parallelstudy.jsp?add=KJV&add=WHM&key=Gen.1.1</a><br class=""></div><div class=""> <br class=""></div></h1><div class=""><div class=""> <br class=""></div><div class=""> <br class=""></div><div class=""> <br class=""></div><div class=""> <br class=""></div><div class=""> <br class=""></div><div class=""> <br class=""></div><div class=""> On 12/29/2017 04:34 PM, Tom Sullivan wrote:<br class=""></div></div><blockquote class=""><div class="">DM:<br class=""></div><div class="">There may be a terminology problem here.<br class=""></div><div class=""><div class="">Re:<br class=""></div><div class=""> <seg type="x-morph">הַ</seg><br class=""></div><div class=""> The letter He is used as the definite article and it is prepended to<br class=""></div><div class=""> the word. Example using English, L to R: "The Land" would be He-Eretz.<br class=""></div></div><div class=""><div class="">Hebrew also appends pronominal suffixes, so perhaps those are<br class=""></div><div class=""> segments as well. The pronominal suffixes also have meaning on their<br class=""></div><div class=""> own.<br class=""></div></div><div class=""><div class="">Highly inflected languages can be a bear for English speakers, so it<br class=""></div><div class=""> would make some sense to parse out the word. I am no Hebrew scholar<br class=""></div><div class=""> and cannot recall all of the exact terminology that should be used.<br class=""></div></div><div class=""><div class="">We could use some help here from someone whose Hebrew is fresh in<br class=""></div><div class=""> their mind. 
Correct terminology and a bit more explanation on all of<br class=""></div><div class=""> these kind of options would help.<br class=""></div></div><div class=""><div class="">All of us who are programmers should take heed from this issue. One<br class=""></div><div class=""> should not have to decipher code to know about the inputs and outputs.<br class=""></div></div><div class="">Tom<br class=""></div><h2 class=""><div class="">Tom Sullivan<br class=""></div><div class=""> <a href="mailto:info@BeForgiven.INFO" class="">info@BeForgiven.INFO</a><br class=""></div><div class=""> FAX: 815-301-2835<br class=""></div></h2><div class=""><div class="">Great News!<br class=""></div><div class=""> God created you, owns you and gave you commands to obey.<br class=""></div><div class=""> You have disobeyed God - as your conscience very well attests to you.<br class=""></div><div class=""> God's holiness and justice compel Him to punish you in Hell.<br class=""></div><div class=""> Jesus Christ became Man, was crucified, buried and rose from the dead<br class=""></div><div class=""> as a substitute for all who trust in Him, redeeming them from Hell.<br class=""></div><div class=""> If you repent (turn from your sin) and believe (trust) in Jesus Christ,<br class=""></div><div class=""> you will go to Heaven. Otherwise you will go to Hell.<br class=""></div><div class=""> Warning! Good works are a result, not cause, of saving trust.<br class=""></div><div class=""> More info is at <a href="http://www.esig.beforgiven.info/" class="">www.esig.beforgiven.info</a><br class=""></div><div class=""> Do you believe this? Copy this signature into your email program<br class=""></div><div class=""> and use the Internet to spread the Great News every time you email.<br class=""></div></div><div class="">On 12/29/2017 06:12 PM, DM Smith wrote:<br class=""></div><blockquote class=""><div class=""><div class="">I have no idea. 
I can read and write C++, but it’s been over 20 years<br class=""></div><div class=""> since I did it on a regular basis. I’m not interested in trying to<br class=""></div><div class=""> decipher the code or what Chris L. had in mind. Just glancing at the<br class=""></div><div class=""> code, it says it pertains to WLC and it has Morph and Segmentation in<br class=""></div><div class=""> the name. That’s quite a clue.<br class=""></div></div><div class=""><div class="">The code has a construct I’ve seen with respect to footnotes and Strong’s<br class=""></div><div class=""> numbers, though I don’t know what it does or how it is used. (Within a<br class=""></div><div class=""> verse, buf is set to 1 for the first seg, 2 for the next, and so on.<br class=""></div><div class=""> tagText is the text content of the seg element.)<br class=""></div></div><div class="">module->getEntryAttributes()["Morpheme"][buf]["body"] = tagText;<br class=""></div><div class=""><div class="">If it parallels footnotes, Strong’s, … then perhaps it is a numerical<br class=""></div><div class=""> superscript that when clicked on brings up the segment. I don’t think<br class=""></div><div class=""> that makes sense. Unless someone can make sense of it, I don’t think<br class=""></div><div class=""> it’s worthy of documenting in the wiki.<br class=""></div></div><div class=""><div class="">Perhaps the following is a clue. 
It is the content of Genesis 1:1.<br class=""></div><div class=""> <w><seg type="x-morph">בְּ</seg><seg<br class=""></div><div class=""> type="x-morph">רֵאשִׁ֖ית</seg></w> <w><seg<br class=""></div><div class=""> type="x-morph">בָּרָ֣א</seg></w> <w><seg<br class=""></div><div class=""> type="x-morph">אֱלֹהִ֑ים</seg></w> <w><seg<br class=""></div><div class=""> type="x-morph">אֵ֥ת</seg></w> <w><seg type="x-morph">הַ</seg><seg<br class=""></div><div class=""> type="x-morph">שָּׁמַ֖יִם</seg></w> <w><seg<br class=""></div><div class=""> type="x-morph">וְ</seg><seg type="x-morph">אֵ֥ת</seg></w> <w><seg<br class=""></div><div class=""> type="x-morph">הָ</seg><seg type="x-morph">אָֽרֶץ׃</seg></w><br class=""></div></div><div class=""><div class="">It appears that each w (aka word) is made up of one or more seg<br class=""></div><div class=""> (segments). Each segment is marked as x-morph. While I took 7 credits<br class=""></div><div class=""> of Biblical Hebrew, I don’t remember a lick of it. 
I’m guessing that<br class=""></div><div class=""> a segment is part of the word that has meaning on its own.<br class=""></div></div><div class="">DM<br class=""></div><blockquote class=""><div class=""><div class="">On Dec 29, 2017, at 5:14 PM, David Haslam <<a href="mailto:dfhdfh@protonmail.com" class="">dfhdfh@protonmail.com</a><br class=""></div><div class=""> <a href="mailto:dfhdfh@protonmail.com" class="">mailto:dfhdfh@protonmail.com</a>> wrote:<br class=""></div></div><div class=""><div class="">I know it's still the holiday season, yet I would still like to have<br class=""></div><div class=""> it explained what is the <i class="">difference in output</i> that we should see<br class=""></div><div class=""> when the <i class="">OSISMorphSegmentation</i> filter is applied.<br class=""></div></div><div class=""><div class="">There are modules which have this specified in the .conf file, yet<br class=""></div><div class=""> I've not seen any discernable difference in what (e.g.) <i class="">Xiphos</i><br class=""></div><div class=""> displays when this module option is ticked.<br class=""></div><div class=""> /Is that too much to ask?/<br class=""></div></div><div class="">Best regards,<br class=""></div><div class="">David<br class=""></div><div class="">Sent with ProtonMail <a href="https://protonmail.com/" class="">https://protonmail.com/</a> Secure Email.<br class=""></div><blockquote class=""><div class=""><div class="">-------- Original Message --------<br class=""></div><div class=""> Subject: Re: [sword-devel] OSISMorphSegmentation<br class=""></div><div class=""> Local Time: 26 December 2017 12:10 PM<br class=""></div><div class=""> UTC Time: 26 December 2017 12:10<br class=""></div><div class=""> From: <a href="mailto:dfhdfh@protonmail.com" class="">dfhdfh@protonmail.com</a> <a href="mailto:dfhdfh@protonmail.com" class="">mailto:dfhdfh@protonmail.com</a><br class=""></div><div class=""> To: sword-devel mailing list <<a 
href="mailto:sword-devel@crosswire.org" class="">sword-devel@crosswire.org</a><br class=""></div><div class=""> <a href="mailto:sword-devel@crosswire.org" class="">mailto:sword-devel@crosswire.org</a>><br class=""></div></div><div class=""><div class="">All very well if you're a C++ programmer, but "as clear as mud" to<br class=""></div><div class=""> those like me that aren't.<br class=""></div></div><div class=""><div class="">What exactly is the intended difference in output with the filter<br class=""></div><div class=""> enabled?<br class=""></div></div><div class=""><div class="">Where segments of a Hebrew word are in different seg elements, what<br class=""></div><div class=""> should I expect to see at the locations where the OSIS has<br class=""></div><div class=""> <i class=""></seg><seg.+></i> ?<br class=""></div></div><div class=""><div class="">Does the filter insert a space or some other character as a<br class=""></div><div class=""> <i class="">separator</i> between consecutive segments?<br class=""></div></div><div class="">Best regards,<br class=""></div><div class="">David<br class=""></div><div class="">Sent from ProtonMail <a href="https://protonmail.com/" class="">https://protonmail.com/</a> Secure Email.<br class=""></div><blockquote class=""><div class=""><div class="">-------- Original Message --------<br class=""></div><div class=""> Subject: Re: [sword-devel] OSISMorphSegmentation<br class=""></div><div class=""> Local Time: 25 December 2017 3:16 PM<br class=""></div><div class=""> UTC Time: 25 December 2017 15:16<br class=""></div><div class=""> From: <a href="mailto:dmsmith@crosswire.org" class="">dmsmith@crosswire.org</a> <a href="mailto:dmsmith@crosswire.org" class="">mailto:dmsmith@crosswire.org</a><br class=""></div><div class=""> To: David Haslam <<a href="mailto:dfhdfh@protonmail.com" class="">dfhdfh@protonmail.com</a><br class=""></div><div class=""> <a href="mailto:dfhdfh@protonmail.com" 
class="">mailto:dfhdfh@protonmail.com</a>>, SWORD Developers' Collaboration<br class=""></div><div class=""> Forum <<a href="mailto:sword-devel@crosswire.org" class="">sword-devel@crosswire.org</a> <a href="mailto:sword-devel@crosswire.org" class="">mailto:sword-devel@crosswire.org</a>><br class=""></div></div><div class=""><div class="">All of the filters are in the folder<br class=""></div><div class=""> <a href="http://www.crosswire.org/svn/sword/trunk/src/modules/filters/" class="">http://www.crosswire.org/svn/sword/trunk/src/modules/filters/</a><br class=""></div></div><div class=""><div class="">Each filter has a corresponding file whose name is in lowercase<br class=""></div><div class=""> with the extension cpp.<br class=""></div></div><div class=""><div class="">See:<br class=""></div><div class=""> <a href="http://www.crosswire.org/svn/sword/trunk/src/modules/filters/osismorphsegmentation.cpp" class="">http://www.crosswire.org/svn/sword/trunk/src/modules/filters/osismorphsegmentation.cpp</a><br class=""></div></div><div class=""><div class="">From the code:<br class=""></div><div class=""> SWFilter descendant to toggle splitting of<br class=""></div><div class=""> morphemes (for morpheme segmented Hebrew in<br class=""></div><div class=""> the WLC)<br class=""></div></div><blockquote class=""><div class=""><div class="">On Dec 25, 2017, at 9:23 AM, David Haslam <<a href="mailto:dfhdfh@protonmail.com" class="">dfhdfh@protonmail.com</a><br class=""></div><div class=""> <a href="mailto:dfhdfh@protonmail.com" class="">mailto:dfhdfh@protonmail.com</a>> wrote:<br class=""></div></div><div class=""><div class="">I want to update the wiki page for OSIS Bibles<br class=""></div><div class=""> <a href="https://crosswire.org/wiki/OSIS_Bibles#Marking_morpheme_segmentation" class="">https://crosswire.org/wiki/OSIS_Bibles#Marking_morpheme_segmentation</a>.<br class=""></div></div><div class=""><div class="">Please would someone explain exactly what is looked for in 
the<br class=""></div><div class=""> OSIS XML for SWORD to actually filter something in the module for<br class=""></div></div><div class=""><i class="">GlobalOptionFilter=OSISMorphSegmentation</i><br class=""></div><div class=""><div class="">/We seemed to have overlooked the documentation requirements<br class=""></div><div class=""> since I first enquired almost 4 years ago/.<br class=""></div></div><div class="">Best regards,<br class=""></div><div class="">David<br class=""></div><div class="">Sent with ProtonMail <a href="https://protonmail.com/" class="">https://protonmail.com/</a> Secure Email.<br class=""></div><div class=""><hr class=""><br class=""></div><div class=""><div class="">sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org" class="">sword-devel@crosswire.org</a><br class=""></div><div class=""> <a href="mailto:sword-devel@crosswire.org" class="">mailto:sword-devel@crosswire.org</a><br class=""></div><div class=""> <a href="http://www.crosswire.org/mailman/listinfo/sword-devel" class="">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br class=""></div><div class=""> Instructions to unsubscribe/change your settings at above page<br class=""></div></div></blockquote></blockquote></blockquote><div class=""><hr class=""><br class=""></div><div class=""><div class="">sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org" class="">sword-devel@crosswire.org</a><br class=""></div><div class=""> <a href="mailto:sword-devel@crosswire.org" class="">mailto:sword-devel@crosswire.org</a><br class=""></div><div class=""> <a href="http://www.crosswire.org/mailman/listinfo/sword-devel" class="">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br class=""></div><div class=""> Instructions to unsubscribe/change your settings at above page<br class=""></div></div></blockquote><div class=""><hr class=""><br class=""></div><div class=""><div class="">This email has been scanned by the Symantec Email Security.cloud<br 
class=""></div><div class=""> service.<br class=""></div><div class=""> For more information please visit <a href="http://www.symanteccloud.com/" class="">http://www.symanteccloud.com</a><br class=""></div></div><div class=""><hr class=""><hr class=""><br class=""></div><div class=""><div class="">sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org" class="">sword-devel@crosswire.org</a><br class=""></div><div class=""> <a href="http://www.crosswire.org/mailman/listinfo/sword-devel" class="">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br class=""></div><div class=""> Instructions to unsubscribe/change your settings at above page<br class=""></div></div></blockquote><div class=""><hr class=""><br class=""></div><div class=""><div class="">sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org" class="">sword-devel@crosswire.org</a><br class=""></div><div class=""> <a href="http://www.crosswire.org/mailman/listinfo/sword-devel" class="">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br class=""></div><div class=""> Instructions to unsubscribe/change your settings at above page<br class=""></div></div></blockquote><div class=""><div class=""> <br class=""></div><div class=""> <br class=""></div><div class=""> <br class=""></div></div><div class=""><hr class=""><br class=""></div><div class=""><div class="">sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org" class="">sword-devel@crosswire.org</a><br class=""></div><div class=""> <a href="http://www.crosswire.org/mailman/listinfo/sword-devel" class="">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br class=""></div><div class=""> Instructions to unsubscribe/change your settings at above page<br class=""></div></div></blockquote><div class=""><br class=""></div>_______________________________________________<br class="">sword-devel mailing list: <a href="mailto:sword-devel@crosswire.org" class="">sword-devel@crosswire.org</a><br class=""><a 
href="http://www.crosswire.org/mailman/listinfo/sword-devel" class="">http://www.crosswire.org/mailman/listinfo/sword-devel</a><br class="">Instructions to unsubscribe/change your settings at above page</div></blockquote></div><br class=""></div></body></html>