neuml / paperetl

📄 ⚙️ ETL processes for medical and scientific papers
Apache License 2.0
342 stars 27 forks source link

Use XML id for file figure processing #13

Closed davidmezzetti closed 3 years ago

davidmezzetti commented 3 years ago

Currently the file process attempts to find a caption/label/name to use as the section name for TEI files. This is error prone. xml:id is unique and more reliable.