0 votes

In the Partnership for Applied Biblical NLP, we are defining a “shared task” for word segmentation - basically, a contest to see which software can do it best. The goal is to create NLP software that can do it better. We are considering using the format Paratext uses in Wordanalyses.xml, and we need high quality data for a variety of languages.

I have these files for some languages where I am at least an observer on the project, but I need more, particularly for languages where a lot of work has been done on the morphology in either the Interlinearizer or the Wordlist.

We are interested in both languages of wider communication and languages that are morphologically complex and represent different kinds of languages. If you have data you are willing to share, please let me know!

[Email Removed]

Paratext by (448 points)

1 Answer

0 votes
by [Expert]
(16.3k points)

Related questions

0 votes
5 answers
0 votes
1 answer
Paratext Jul 20, 2018 asked by mnjames (1.8k points)
0 votes
1 answer
0 votes
3 answers
Welcome to Support Bible, where you can ask questions and receive answers from other members of the community.
I appeal to you, brothers and sisters, in the name of our Lord Jesus Christ, that all of you agree with one another in what you say and that there be no divisions among you, but that you be perfectly united in mind and thought.
1 Corinthians 1:10
2,833 questions
5,697 answers
5,262 comments
1,711 users