This PR:
- Extends the MEIParser class (rewritten in #806) to create MEITokenizer, which divides the content parsed from an MEI file into documents that can be indexed in Solr.
- Introduces lxml for MEI (XML) parsing.
- Refactors the previously introduced MEIParser and its tests.

Together with #806 and #833, this (hopefully) concludes the rewrite of the tools needed for MEI parsing.
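
For reviewers unfamiliar with the general approach, here is a minimal sketch of what "parse MEI with lxml, then split it into indexable documents" can look like. It is not the code in this PR: the function name `tokenize_syllables`, the sample MEI fragment, and the document fields (`id`, `text`, `pitch_names`) are all assumptions made for illustration, and the real MEITokenizer may split and structure documents differently.

```python
# Illustrative sketch only; names and document fields are hypothetical.
try:
    from lxml import etree
except ImportError:  # fall back to the stdlib API, which is compatible for this usage
    import xml.etree.ElementTree as etree

MEI_NS = "http://www.music-encoding.org/ns/mei"

# A tiny, hand-written MEI fragment (assumed shape, neume notation).
SAMPLE_MEI = b"""<mei xmlns="http://www.music-encoding.org/ns/mei">
  <music><body><mdiv><score><section><staff><layer>
    <syllable>
      <syl>Ky</syl>
      <neume><nc pname="d" oct="3"/><nc pname="e" oct="3"/></neume>
    </syllable>
    <syllable>
      <syl>ri</syl>
      <neume><nc pname="f" oct="3"/></neume>
    </syllable>
  </layer></staff></section></score></mdiv></body></music>
</mei>"""


def tokenize_syllables(mei_bytes):
    """Return one flat dict per <syllable>, suitable for sending to a search index."""
    root = etree.fromstring(mei_bytes)
    docs = []
    for index, syllable in enumerate(root.iter(f"{{{MEI_NS}}}syllable")):
        # The syllable text lives in a direct <syl> child, if present.
        syl = syllable.find(f"{{{MEI_NS}}}syl")
        text = (syl.text or "").strip() if syl is not None else ""
        # Collect pitch names from every neume component under this syllable.
        pitches = [nc.get("pname", "") for nc in syllable.iter(f"{{{MEI_NS}}}nc")]
        docs.append(
            {
                "id": f"syllable-{index}",
                "text": text,
                "pitch_names": "_".join(pitches),
            }
        )
    return docs


if __name__ == "__main__":
    for doc in tokenize_syllables(SAMPLE_MEI):
        print(doc)
```

Each returned dict is flat (strings only), which is the shape a Solr client would typically accept as a document; the actual field names would need to match the project's Solr schema.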