Princeton-CDH / ppa-nlp

Discovering patterns in poetry’s data with machine learning; software for use with Princeton Prosody Archive (PPA) full-text corpus

I want a list of UTF-8 characters in the corpus and their frequencies #19

Closed: mnaydan closed this issue 5 months ago

mnaydan commented 5 months ago

Here is Laure's report: https://docs.google.com/spreadsheets/d/1AsjfHyIXjkgQ_3tWUBBZgHqE2eihp6bG9SYQPvV6tXI/edit?usp=sharing
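
For readers without access to the spreadsheet, here is a minimal sketch of how a character-frequency list like the one requested could be produced. It is not the method behind the linked report. It assumes the corpus is available as a directory of UTF-8 plain-text `.txt` files, and the function names `count_characters` and `format_report` are hypothetical, not part of ppa-nlp.

```python
from collections import Counter
from pathlib import Path
import unicodedata


def count_characters(corpus_dir):
    """Count every Unicode character in all .txt files under corpus_dir.

    Assumption: the corpus is stored as UTF-8 plain-text files.
    """
    counts = Counter()
    for path in sorted(Path(corpus_dir).rglob("*.txt")):
        # errors="replace" maps undecodable bytes to U+FFFD, so encoding
        # problems show up in the frequency list instead of raising.
        text = path.read_text(encoding="utf-8", errors="replace")
        counts.update(text)
    return counts


def format_report(counts):
    """Return (code point, Unicode name, frequency) rows, most frequent first."""
    rows = []
    for char, freq in counts.most_common():
        # Control characters such as "\n" have no Unicode name, so pass a default.
        name = unicodedata.name(char, "<unnamed>")
        rows.append((f"U+{ord(char):04X}", name, freq))
    return rows
```

Calling `format_report(count_characters("path/to/corpus"))` yields rows that can be written to a CSV or spreadsheet. Including the code point and Unicode name alongside each count makes it easier to spot OCR debris and look-alike characters.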