simsays / Shakespeare

CS 121 Repository for Shakespeare dev
0 stars 0 forks source link

Parse Word Files #43

Closed robertmlinden closed 7 years ago

robertmlinden commented 7 years ago

As a developer, I want to parse Microsoft Word documents from http://www.shakespearescanned.com/shakespeare.html using the python-docx library so that I can solve both scansion and syllable counts in one go

robertmlinden commented 7 years ago

The output file format can be found in the /docx folder in the origin/docx branch.