Brown-University-Library / iip-texts

iip-texts

IIP inscriptions encoded in EpiDoc XML, plus supporting files

Contents

This repository contains the XML files of the IIP collection, encoded according to the EpiDoc schema. It also holds other versions of those files and the scripts needed to transform them.

The scripts directory has a subdirectory, word-segmentation, that holds the scripts and data directories for the pipeline developed to add <w> elements, which mark discrete words, to the files. For more information, see the README file in that directory.
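
To illustrate what word-level markup looks like, here is a minimal sketch in Python. It is not the pipeline in word-segmentation: the function name wrap_words, the enclosing <p> element, and the sample text are assumptions made for illustration, and the sketch ignores XML namespaces and any markup already present in a real EpiDoc file.

```python
import xml.etree.ElementTree as ET


def wrap_words(text):
    """Return a <p> element with each whitespace-separated token wrapped in <w>.

    Hypothetical helper for illustration only; the real pipeline is
    documented in the word-segmentation README.
    """
    p = ET.Element("p")
    for token in text.split():
        w = ET.SubElement(p, "w")
        w.text = token
        w.tail = " "  # keep a space between consecutive words
    if len(p):
        p[-1].tail = None  # no trailing space after the last word
    return p


# Illustrative sample text, not taken from the IIP collection.
segmented = wrap_words("dis manibus sacrum")
xml_string = ET.tostring(segmented, encoding="unicode")
print(xml_string)
```

Splitting on whitespace is the simplest possible segmentation; inscriptions with abbreviations, lacunae, or no word dividers would need considerably more than this.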

How to use the archival bib processor

Run python xmlprocessor.py; the output files will be written to archival-files. You may have to install some Python libraries first. If so, run these two commands separately: pip install beautifulsoup4 and pip install requests. (To check: we think this is no longer used.)
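
Before running the processor, it can help to confirm that the two libraries named above are importable. The following is a minimal sketch, not part of the repository: the helper name missing_packages and the REQUIRED mapping are assumptions, and the only fact it relies on is that beautifulsoup4 is imported as bs4.

```python
import importlib.util

# Maps import name -> pip package name for the libraries named above.
REQUIRED = {"bs4": "beautifulsoup4", "requests": "requests"}


def missing_packages(required=REQUIRED):
    """Return the pip names of packages whose modules cannot be found."""
    return [
        pip_name
        for module, pip_name in required.items()
        if importlib.util.find_spec(module) is None
    ]


missing = missing_packages()
if missing:
    print("Missing packages; install with: pip install " + " ".join(missing))
else:
    print("All listed packages are importable.")
```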

Other files