Thank you very much for this!
I just tried it out, but it doesn't quite do yet what I would like it to do.
Here is my RegEx, as I adapted it:
regex:(?<!\w|\\[cv] )(kopo|menas|misika|foa|faev|siks|seven|eit|nain|ten|ileven|tuelv|tetin|fotin|fiftin|seventin|eitin|naintin|tuenti|teti|foti|fifti|siksti|seventi|eiti|nainti|andled|taosen|[0-9])
As it is, this finds far more instances than it should, for example, verb prefixes starting with ‹ten›. It would be great if we could limit the search to words being:
- at the beginning of a word or preceded by a hyphen
- and at the end of word or followed by a hyphen
On the other hand, since we have grammatical tone on clause level, most of the vowels can end up with a diacritic (áàāa᷄â), but such occurrences are not found. It seems that whenever I use RegEx in Paratext's Find, Ignore Diacritics And Vowel Points, does not work, and therefore Paratext doesn't find ‘sevén’, for example.
Any idea how I could solve these issues?
Thanks again!
PS: I'm aware of the Numbers Settings and it is all set up for the NT, but there doesn't seem to be a way to include numbers written in words. (Or is there?)