RegEx Pal flavour and scope

Question

8 Answers

It is switching out the library for another library. However, this is, by no means, a simple thing to do. No two libraries have the exact same API and they will each have their own bugs/idiosyncrasies that will need to be worked around. Our current .Net implementation is used because, well, it’s built-in to .Net and doesn’t require anything new. We could also use Python (which Paratext uses for other things), but it also does not support what is wanted.

Theoretically, we could use a Regex-only library built for .Net that does this this type of replacement, but I haven’t been able to find one (probably because .Net has Regex functionality built-in).
Using Perl would also theoretically work, but there is no good library for .Net to interact with Perl scripts (that I could find in my quick search).

So, at least for now, there is no good way to change Paratext to work the way that’s wanted unless we added in our own parsing into the mix - which would be a lot of work for not much gain.

Unfortunately, I think what you’re left with is creating a regular expression that finds the errors and then having to fix them manually.

Mar 6, 2017 commented by [Expert]

Fool Running (16.2k points)
Mar 6, 2017 reshown

Related questions

0 votes

1 answer

PT regex flavour and scope

Paratext Jul 20, 2016 asked by wdavidhj (1.4k points)

0 votes

3 answers

RegEx Pal: using regular expressions to do full search and replace, and more

Paratext Jun 5, 2019 asked by wdavidhj (1.4k points)

+1 vote

5 answers

Using a list for replacing in RegEx Pal

Paratext Mar 2, 2022 asked by Phil_Leckrone (8.8k points)

0 votes

1 answer

RegEx Pal in PTX9

Paratext Aug 23, 2021 asked by anon180868 (190 points)

0 votes

8 answers

Is there a RegEx Pal string we can use to find all \pn or \nd inside of a \ft

Paratext Aug 20, 2018 asked by MSEAIT_LT (476 points)

wdavidhj · Answer 1 · 2016-07-20T20:23:42+0000

I’ve noticed that RegEx Pal does not accept:

(?:<string>)

… and today, I could not get this to work:

(?-i)

… i.e. turn off case insensitivity.

Jul 20, 2016 answered by wdavidhj (1.4k points)

wdavidhj · Answer 2 · 2016-07-20T20:56:01+0000

It is very common for regex software to start searching immediately because
that allows the user, especially a learner, to build the expression piece
by piece seeing the immediate result and correcting it as needed. I find
that helpful myself, but do bemoan the inevitable slow down. I appreciate
anon467281’s suggestion which will be useful in the future. It would be nice if
we could limit the search to a specific chapter where we knew the targeted
text existed until we built the expression properly.

If you are doing a lot of regexes, you might find Regex Buddy a nice extra
tool. I often paste the problem text from Paratext or other data into Regex
Buddy and build my expression there. It has an excellent regex
building/teaching tool, very extensive helps, and you can create a
searchable library of regexes. I build my regexes there and paste them into
Regex Pal or Paratext itself once I have tested them. It is a one-time paid
program but not unreasonable given its abilities.

Blessings,

Shegnada J.

Language Technology and Publishing Coordinator, Nigeria

Text Processing Specialist GPS Dallas

Skype: Shegnada..

+[Phone Removed]

Jul 21, 2016 commented by Shegnada (1.3k points)

The reason for peripheral material taking a long time is because of the fact that RegExPal only searches one chapter at a time. The peripheral material have no defined max chapter number so our only option is to go through all possible ones looking for text (well, 998 of them anyways). Theoretically, we could search them by-book instead, but that would slow down the text views considerably.

This has been my favorite site for creating regular expressions as it also allows you to debug them when they don’t work. You can see step-by-step what it does.

Jul 21, 2016 commented by [Expert]

Fool Running (16.2k points)

wdavidhj · Answer 3 · 2016-07-22T10:57:08+0000

Cross-posting this: it defines the flavour

From PT regex flavour and scope (click link to see the rest of this thread):

wdavidhj · Answer 4 · 2016-07-22T11:02:56+0000

Cross-posting this, since it’s about RegEx Pal.

From PT regex flavour and scope (click link to see the thread, which has more discussion of marking Replace operations in the Project History:

wdavidhj · Answer 5 · 2018-06-26T11:22:17+0000

(?s) – i.e. DOTALL mode (the dot (.) matches new line characters (\r\n)) – does not seem to work in PT8. Is this inline modifier not supported?

Phil_Leckrone · Answer 6 · 2018-06-26T12:04:04+0000

So is rexegg.com wrong when it says:

This implies that (?-s) would disable DOTALL.

So why does this regex to find blank lines after Hebrew titles in the Psalm not work (I want to delete the blank lines)?:

(\\d.*)\\b

In the raw .SFM text, there is a newline after the \d, and the .* won’t match anything after the \d .

Jun 27, 2018 commented by wdavidhj (1.4k points)

Phil_Leckrone · Answer 7 · 2018-06-28T12:01:02+0000

Sorry - I mis-read your message. (?-s) is used in Paratext to turn off the match newline (which is on by default). In RegExPal there is an option to have . match newline or the (?s) code should work.

Try this for the search:
regex:(?<=\\d.*?)\\b

The (?s) is not needed since it is default in Paratext find. The ? after the * says “don’t be greedy”
This will find the \b and if the replace is blank it will remove the \b.

RegEx Pal flavour and scope

Please log in or register to answer this question.

8 Answers

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Please log in or register to add a comment.

Related questions

Categories