
Having not looked at the Wordlist for a long time, I was surprised to see a big red box mentioning inconsistent encoding.

So the question is whether or not to turn normalisation on. I read the following advice:

We do NOT recommend selecting "Off (no normalization)" unless the orthography of the project language uses diacritics in a non-standard way and the order in which the diacritics occur is important.

What counts as using diacritics in a non-standard way? For example, it is possible to encounter ŋ́, Ŋ́, ŋ̀, ɛ̀, Ɛ̀ɛ̀, ɛ́ɛ́, ɛ̀ɛ̀, ɔ̀ and ɔ́. Is this considered standard use or non-standard use?

Advice or tips on how to handle this are more than welcome.

Bart.


1 Answer

I am no expert on this, but I would assume that "non-standard" use means stacking multiple diacritics on a base character in an order that differs from the one Unicode uses in its precomposed characters. If you only have one diacritic on a base character, you are not doing anything non-standard, so it should be OK to turn normalisation on and normalise the forms.
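
If you want to check this outside Paratext, here is a rough sketch using Python's standard `unicodedata` module. It only illustrates Unicode normalisation in general and is an assumption about what the setting does, not a description of Paratext's own code. The sample strings and the helper names `codepoints` and `show` are made up for the illustration.

```python
import unicodedata


def codepoints(text):
    """Return the code points of text as a list of 'U+XXXX' strings."""
    return [f"U+{ord(ch):04X}" for ch in text]


def show(label, text):
    """Print the code points of text as typed, in NFC and in NFD."""
    print(label)
    print("  as typed:", codepoints(text))
    print("  NFC:     ", codepoints(unicodedata.normalize("NFC", text)))
    print("  NFD:     ", codepoints(unicodedata.normalize("NFD", text)))


# One diacritic on a base that has no precomposed form in Unicode:
# eng + combining acute, open e + combining grave.
ENG_ACUTE = "\u014B\u0301"
OPEN_E_GRAVE = "\u025B\u0300"

# One diacritic on a base that does have a precomposed form:
# "e" + combining acute can be stored as one or as two code points.
E_ACUTE = "e\u0301"

# Two diacritics of different combining classes (acute above, dot below):
# normalisation puts them into a fixed canonical order.
A_ACUTE_DOT_BELOW = "a\u0301\u0323"

# Two diacritics of the same combining class (both above the base):
# their relative order is meaningful, so NFD does not swap them.
A_ACUTE_DIAERESIS = "a\u0301\u0308"

if __name__ == "__main__":
    show("eng + acute", ENG_ACUTE)
    show("open e + grave", OPEN_E_GRAVE)
    show("e + acute", E_ACUTE)
    show("a + acute + dot below", A_ACUTE_DOT_BELOW)
    show("a + acute + diaeresis", A_ACUTE_DIAERESIS)
```

For characters like the ones in the question (a single tone mark on ŋ, ɛ or ɔ), normalising changes nothing or only picks one consistent spelling, which is what you want for a wordlist. Order only gets changed when several marks of different kinds sit on the same base.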

John
