0 votes

Years ago I remember seeing a Paratext newsletter with an announcement about a new feature that helps users visualize the Unicode codepoints in the text. It said something like “We know that there is often more than one combination of Unicode codepoints that can be used to represent a given character. Often, the way the character or word is composed depends a lot on the keyboard the translator is using and the particular keystrokes he used. If you’re having trouble finding words in search results or when trying to perform regex operations, you can use this tool we made for you to visualize how each word and character is represented underlyingly.”

I’m familiar with Alt+X to see the codepoints for a character, and also with Tools>Checking Inventories>Characters Inventory, but neither of these show entire words with their codepoints- that’s what I remember seeing in the printscreen in the announcement.

Does anyone know where to find this announcement or the feature it describes? I’ve searched around and can’t find it.

My ultimate goal is to better understand how different ways of encoding characters and representing words affect language projects. Be it through looking through accompanying documentation or talking with experts on the topic.

Thank you!

Paratext by (363 points)
reshown

1 Answer

+1 vote
Best answer

anon094061, I use a tool that was developed by Marc Durdin called charident.exe. Here is a link to his 2011 blog article about the tool with a link to download.

by (8.0k points)

Ah, yes, I think this is it! Thank you anon848905. The fact that it is a Keyman tool is probably why I couldn’t find it amongst Paratext resources :joy:. Thanks again.

For me, Inkey’s Unicode String Inspector works a bit better.

Related questions

Welcome to Support Bible, where you can ask questions and receive answers from other members of the community.
How good and pleasant it is when God’s people live together in unity!
Psalm 133:1
2,479 questions
5,174 answers
4,872 comments
1,283 users