0 votes

In each project folder there is a file Lexicon.xml (at least if some back-translation has been done).

Also in many projects there are foreigners involved (consultants, facilitators) who could rather benefit from a “lexicon”, even a humble one.

I have done some thinking and some research to see how to easily “look into the PT lexicon”. My dream would be to have a very humble Python-plugin (made by me; well, in my dreams I can) that brings up what has been entered upon right-click on a word. And optionally a lexicon-window, where one could browse or search…

I have also started looking into dedicated XML tools and editors. Some can turn an xml file into a rather pretty table. But I found the structure of the PT lexica is not helping such tools by having the actual “lexemes” stored rather like “attributes” not so much like a “content”. And the tools I have seen so far (the affordable ones) also do not like the fact that some lexemes have more senses than others, i.e. that a suitable table-definition would need a structured system of index, entry, indixes and more entries.

Before I spend more time on this, I would like to hear from other supporters and users about what is out there already, what other people are doing and if any fitting tools are coming to mind.

The project I work most with, does not want to link PT and Flex because those two tools evolve (here) through versions (and re-namings) at different speeds. This is a historical decision, due to bad experieces from orthography changes, program-migrations and other reasons. There is offline-transporting of PT data into Flex, but never just-in-time so “looking up through Flex” is one option only, if keeping a forked special-Flex for that purpose on the same machine as the production-Flex is possible.

This is a very open question, no urgency for once, but any input muchly welcome. Thank you.

Paratext by (855 points)
reshown

1 Answer

+1 vote
Best answer

Look at Glossinator: http://lingtransoft.info/apps/glossinator-paratext-75-or-8x
Maybe it will do some of what you are looking to do.

There is also an utility called Glossy that looks at the same data.
https://bitbucket.org/davidrowbory/glossy/downloads/

by [Expert]
(2.9k points)

reshown

Awesome, I even love those names “Glossinator” and “Glossy”. Will certainly take a look. Thank you.

Let me know if you’re interested in Glossy at all. It aims to be a minimalist interactive live browser of the lexicon, sorting from the start or from the end of words and glosses. I wrote it for a colleague and I subsequently use it a lot myself, but making it reliable and easy to install needs more time and I’ll only prioritise that if there’s more interest. Glossinator is nice too and probably better, but glossy works on any python-running platform!

Hi and thank you. I am definitely interested. The proof is that I even wanted to write something similar myself. I am learning Python for the past few years with that motto “automate the boring stuff”. If I can look into your code I might even learn a lot…

(Getting closer to having my own first tool with a GUI: “Hyphenator”, which will take data from the PT8 hyphenated words and apply soft-hyphens to any other text; which should be very useful for publishing via Scribus or LibreOffice for minority languages. I found that PT does good hyphen-data-management (even keeping track which entries are user-confirmed) while Flex does not allow custom-fields at wordforms-level (where hyphenation-data needs to be.)

Anyway, I am not at all a real programmer, but I am not afraid to follow instructions and to do a manual install or some mild hacking if needed. Thanks.

Download glossy from the link above and you can leaf through the source. It’s all open. The interesting stuff is really the XSL that transforms the XML into an interactive HTML browser of the lexicon. The python is just there to copy the lexicon and insert the xsl reference appropriately. This probably isn’t the best way to do it, but it’s safe, easy, and gives you files you can email someone. Some kind of GUI and embedded python might be nice but I haven’t done any python GUI work yet.

I've finally got around to updating glossy for Python3 and Paratext 9.

I'd love it if you could test it. Tested on my Mac (Intel) and Windows (10, 64 bit) environments. There are some sample lexicons you can download so you can see what it is supposed to do.

Welcome to Support Bible, where you can ask questions and receive answers from other members of the community.
If anyone destroys God’s temple, God will destroy that person; for God’s temple is sacred, and you together are that temple.
1 Corinthians 3:17
2,645 questions
5,394 answers
5,065 comments
1,436 users