Living-with-machines / alto2txt

Convert ALTO XML to plain text + minimal metadata
https://living-with-machines.github.io/alto2txt/
MIT License
13 stars 2 forks source link

Document assumptions about the relationship between mets and alto files #47

Open andrewphilipsmith opened 2 years ago

andrewphilipsmith commented 2 years ago

What are the assumptions in our code vs in the Alto standard?

How do we test whether or not these assumptions are valid with real-world data?