0 votes

Is it possible to tell Paratext to mark uncapitalized proper names as invalid?

So:

  • “Jesus” would be marked as correct/valid
  • “jesus” would be marked as incorrect/invalid

And then check for proper names that are invalid?

Thanks,

james_post

Paratext by [Moderator]
(2.0k points)

reshown

4 Answers

0 votes
Best answer

I was researching something related to proper names and capitalization last week and noticed that I cannot get the regex feature in the wordlist to show me only all the capitalized entries. I get totally unexpected behaviour.

Normally I get very helpful results with regex, in the wordlist and elsewhere in PT. I (again) looked up some examples in the help to confirm flavour for PT9 andI tried ^[\p{Lu}] and got no joy, not a single hit, box turns red. But when I try ^[\p{Ll}] the search box turns green and I get many results, including things like “Aaɖi” as a search result, which does not feel right.

The functionality is there, minus the capitalisation. For example ^\p{Lu}{5,5}$ is giving me only all the five-letter-words as expected, but again it mixes results like “Afʊmɩ” which is wrong.

I have a hunch that the internal-data for the capitalisation is somehow more complex that what the GUI is showing. But then it would be nice to have documentation for that, rather than spending time doing “research” on a tool that is built by my own org.

I also noticed that I cannot find the regex tool in PT9, even following the inbuilt help like this:
Using RegEx Pal in a particular project
To use Paratext RegEx Pal in a particular project, do the following:

  1. Click the tab of that project to make it the active tab.
  2. Open Paratext RegEx Pal. (Click the main menu icon ![](file:///C:/Users/martin/AppData/Local/ParatextHelp/HelpSystemImages/menu_icon.png), then from the Paratext menu, point to Advanced, and select RegEx Pal.)

I am normally using my own paid tools, but would not mind “tuning” my tools against the inbuilt “helper”.

Thank you for any input about the capitalization in the wordlist.

by (855 points)
reshown

Note that you don’t look at advanced in the project menu. You look under advanced of the main menu.

Yes I came to the same conclusion a while ago and also think there should be documentation.
If you look in the file SpellingStatus.xml you will see that all words are uncapitalized. If it appears capitalized in the Wordlist tool it has an additional field <SpecificCase>

  <Status Word="bøgerne" State="R" />
  <Status Word="moses" State="R">
    <SpecificCase>Moses</SpecificCase>
  </Status>

Searching on capitalization then becomes programmatically difficult.

That said I certainly agree that it would be good to have more tools in the Wordlist to fix capitalization issues.

0 votes

If Jesus is marked as valid in the wordlist, then if there is a lowercase occurrence you should get a message indicating that not all occurrences are capitalized. You can also use the Menu > Tools > Checking Tools > Missing Capitals to look at normally capitalized words. This would report that not all of the words are capitalized and give you the option of checking them. This check works both ways and actually looks for words that are usually capitalized, but are lowercase in the list (and not currently marked “Valid”)
Screenshot 2022-02-04 16.29.20

by (8.4k points)

A small addition for clarification. If a word occurs in lower case in the Word List, and you would expect it to be uppercase, Paratext is telling you that the word occurs in both lower case and upper case. If you want it to be always uppercase, correct it to the uppercase spelling, and what anon848905 describes in his will occur. Paratext will report that this word is not always capitalized. Clicking on Show Incorrect, pictured in anon848905’s , and will get a list of where that word occurs lower case.

0 votes

I am interested in the answer to this question also. anon441204

by (250 points)
+1 vote

If you type “Regex Pal” in the search menus/help box at the top of the Paratext 9 window you should see the first entry is:
Paratext > Advanced > Regex Pal… (main menu)

As to the capitalization, the wordlist stores the words in lowercase. Therefore when you Filter in the wordlist you can’t find uppercase words. However, if you use the Wordlist Menu > Find you can search for regex:\p{Lu} and it will find the next uppercase word.

by (8.4k points)

Thank you anon848905, both your answers are very helpful.

Related questions

0 votes
1 answer
0 votes
1 answer
Paratext May 15, 2019 asked by anon180868 (188 points)
0 votes
2 answers
0 votes
0 answers
0 votes
3 answers
Welcome to Support Bible, where you can ask questions and receive answers from other members of the community.
Dear friends, since God so loved us, we also ought to love one another.
1 John 4:11
2,628 questions
5,369 answers
5,045 comments
1,420 users