Using Glyssen output to mark up Words of Jesus in Paratext

Question

Given that teams are starting to use Glyssen to pre-process their NT texts to identify each speaker in the text for a dramatized recording, I wondered whether it would be worth piggy-backing on that work to AUTOMATICALLY mark up the Words of Jesus \wj … \wj* in the Paratext project.

Glyssen produces an Excel file with the needed data in it, which looks like this:

So it should be relatively easy to generate a script to find these places in the text and wrap them with the \wj … markup …\wj*

I tried something simple myself with a couple of generated CC tables, and managed to get 88% of the 1901 occurrences of the words of Jesus marked up successfully. However, this still leaves 200+ places where I would need to go in and fix it manually. Being unsatisfied with this result, and wanting to save others time in the future by making it automated, complete and more foolproof, I wondered about creating a Custom Script (in Python) that could do this task directly from within Paratext.

Most of the “failed” cases with the CC method are because of other markup like \w angel|angels\w* or \f footnotes and \x cross-references being embedded within the text being searched for. I’ve got some ideas about how to get around those using Regular Expressions (and searching for the start and end of strings rather than the whole string, etc.) but that is beyond CC and would need to use some Python code to make it possible.

BUT, before I attempt to make a Custom Script to do this, I’m wondering if anyone else has done something similar already. I don’t want to re-invent the wheel! Looking at the sample scripts shipped with Paratext, the closest thing I see is TransferParallelPassageRefs.py by DRM.

Does anyone else have something similar, or is anyone else more gifted at Python programming with ScriptureObjects who could pull this together better than I ever could? I’m willing to write the pseudo code if that’s helpful. And I’m willing to learn and work with someone else on this…

Paratext Aug 5, 2019 asked by Mark P (2.9k points)

2 Answers

Related questions

0 votes

1 answer

Nesting OT quotes within Words of Jesus

Paratext Nov 23, 2015 asked by mnjames (1.8k points)

0 votes

0 answers

Words of Jesus colored in Text Collection?

Paratext Mar 31, 2021 asked by [Moderator]

james_post (2.0k points)

0 votes

4 answers

Jesus words in red not exporting to PDF file

Paratext Jul 7, 2021 asked by ace541611 (123 points)

0 votes

3 answers

Will \qt-s\* replace \wj ? Is the markers check broken? So many questions

Paratext Oct 21, 2020 asked by anon150053 (286 points)

0 votes

1 answer

quote /qt within words of Jesus /wj not working (v9.5)

Paratext Jul 10 asked by ASmith (169 points)

davidc78 · Answer 1 · 2019-08-06T15:45:08+0000

If you figure this out, I’d really like to see it.
But it would make all our lives easier, if PT would allow \w \w* to span across verse number references, footnotes, and xrefs. It is tedious (and error prone) to start and stop the \w markings around things that have nothing to do with the actual words of Christ. (In fact, it’d be nice if the \w style went across paragraph marks too, and would only throw an error if the end of a chapter was reached without seeing a \w*.)

Phil_Leckrone · Answer 2 · 2023-11-08T20:42:20+0000

There are a series of tools at https://lingtran.net/Voice-Marking-Tools that can be used to help with this process.

Using Glyssen output to mark up Words of Jesus in Paratext

Please log in or register to answer this question.

2 Answers

Please log in or register to add a comment.

Please log in or register to add a comment.

Related questions

Categories