Princeton-CDH / ppa-nlp

Discovering patterns in poetry’s data with machine learning; software for use with Princeton Prosody Archive (PPA) full-text corpus
1 stars 0 forks source link
digital-humanities ocr python text-analysis

corppa PPA full-text corpus utilities

This repository provides code and other resources associated with the Princeton Prosody Archive (PPA), with a particular focus on working with the PPA full-text corpus.

Development instructions

This repo uses git-flow branching conventions; main contains the most recent release, and work in progress will be on the develop branch. Pull requests for new features should be made against develop.

Developer setup and installation

Experimental Scripts

Experimental scripts associated with corppa are located within the scripts directory. See this directory's README for more detail.