cu-mkp / GR8975

Repository for "What is a Book in the 21st Century" GR8975, Spring 2017

remove xml tags from txt files #1

Closed tcatapano closed 7 years ago

tcatapano commented 7 years ago

@VarshaMaragi I've added the translation txt files to the folio_files directory. Could you remove all the XML tags (i.e., anything matching the pattern '<.+>') from the files?
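One possible way to do this is sketched below in Python. It is only an illustration, not the script actually used in this repository. The helper names `strip_tags` and `strip_tags_in_directory` are hypothetical, and the sketch assumes the files are UTF-8 encoded and end in `.txt`. Note that it swaps the pattern '<.+>' for '<[^>]+>': because `.+` is greedy, '<.+>' applied to a line such as `<ab>Hello</ab> world` would match from the first `<` to the last `>` and delete the text between the tags as well.

```python
import re
from pathlib import Path

# Assumption: a tag is '<', then one or more characters that are not '>', then '>'.
# This replaces the greedy '<.+>' so that text between two tags is preserved.
TAG_PATTERN = re.compile(r"<[^>]+>")


def strip_tags(text: str) -> str:
    """Return text with every <...> span removed."""
    return TAG_PATTERN.sub("", text)


def strip_tags_in_directory(directory: str) -> int:
    """Rewrite each .txt file in directory without tags.

    Returns the number of files whose content changed.
    """
    changed = 0
    for path in sorted(Path(directory).glob("*.txt")):
        original = path.read_text(encoding="utf-8")  # encoding is an assumption
        cleaned = strip_tags(original)
        if cleaned != original:
            path.write_text(cleaned, encoding="utf-8")
            changed += 1
    return changed


# Example call, assuming the script runs from the repository root:
# strip_tags_in_directory("folio_files")
```

Since the function overwrites files in place, it would be safest to run it on a committed working tree so the changes can be reviewed with a diff and reverted if needed.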

VarshaMaragi commented 7 years ago

Yeah, sure.

VarshaMaragi commented 7 years ago

Please check and let me know if the files are fine.

tcatapano commented 7 years ago

Files look OK. We are retaining some non-transcription text. My inclination is to leave it in, as it gives us something to clean up when we start working with text processing.