[sword-devel] Soft hyphens

Michael H cmahte at gmail.com
Thu Nov 2 06:33:51 MST 2017


CAUTION:

The soft hyphen is sometimes used in Indian and East Asian language scripts
to prevent two adjacent characters from becoming a combined ligature. This
is more common in minor languages. It is commonly used when the font in use
while being typed is designed for another language using the same base
script. This use of soft hyphens isn't really appropriate, but does
indicate an issue with the language definition that may warrant leaving the
soft hyphens or zero width spaces in place. That is, The soft hyphen (or
zero width space) tricks the font into seeing 2 discrete letters and
prevents combining them. it works until unicode and fontography catches up
to properly include the language.

That is, stripping all soft-hyphens out might render the text unreadable to
the target audience. (Assuming the language you're working on is one with
letter conjunctions in it.)



On Tue, Oct 31, 2017 at 3:53 PM, David Haslam <dfhmch at googlemail.com> wrote:

> The problem with invisible characters is that you can all too easily key
> more
> than one without realising it.
>
> This is the case with soft hyphens, which may be found in a few source
> texts.
>
> For example, in a text development currenly under my horizon, there are not
> only a large number of soft hyphens, but a significant quantity of multiple
> soft hyphens, the longest group having 8 of them.
>
> What does osis2mod do with soft hyphens?
>
> How does SWORD treat soft hyphens, particularly during search?
>
> Should we replace multiple soft hyphens by a single sogt hyphen?
>
> Should we even consider stripping the text of soft hyphens (as a command
> line option for osis2mod)?
>
> Best regards,
>
> David
>
>
>
>
>
> --
> Sent from: http://sword-dev.350566.n4.nabble.com/
>
> _______________________________________________
> sword-devel mailing list: sword-devel at crosswire.org
> http://www.crosswire.org/mailman/listinfo/sword-devel
> Instructions to unsubscribe/change your settings at above page
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.crosswire.org/pipermail/sword-devel/attachments/20171102/4a0d6711/attachment.html>


More information about the sword-devel mailing list