With about 50% of the NT consultant checked, I recently started using the Wordlist tool in Paratext to compare the lexical choices between three related language translations. The data is remarkable in that a vast majority of the lexical choices are nearly identical, but there are significant enough differences to warrant separate (yet related) translations. The Wordlist is very helpful in being able to sort by Count, so that I can see which words occur most frequently, and it even gives the number of occurrences. It would seem a very easy step then to total up all of the counts of every word listed in the current view (filtered for whatever selection of books is desired) and list the total number of tokens for all words included in the current view. This would lend some data then towards statistics.
Thank you!