Open epoz opened 12 years ago
See http://datahub.io/dataset/harvard-library-catalog/resource/1895f702-59e0-4cf8-8bcb-c3b8cb8f05fd for the data. See https://github.com/aristus/copymine-harvard for some useful heuristic for improving import.
Ran some tests using the existing parser, looking promising. Have emailed for feedback.
See http://datahub.io/dataset/harvard-library-catalog/resource/1895f702-59e0-4cf8-8bcb-c3b8cb8f05fd for the data. See https://github.com/aristus/copymine-harvard for some useful heuristic for improving import.