lib-re / dublin-core-text-parser

Cataloguing tool for converting specially formatted text files containing dublin core metadata into various formats
MIT License

Add body parsing for subject keywords #26

Closed atla5 closed 8 years ago

atla5 commented 8 years ago

TableOfContents and Contributors are examples of elements that tend to be added en masse, which is why I devoted so much of the body to them.

Subject is another element that could benefit from bulk processing, and the change could be a significant added feature, especially in the absence of a quality OCR scan (or bPress database) :p

Open questions would be whether or not...

* LCSH codes don't have to include lowercase letters, so they would often match the current criteria for a qualifier switch

ex:

-SUBJECTS-
KEYWORDS/TAGS
dogs
cats
fish
LCSH
QA76.
DDC
LCC
MESH
UDC
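
A rough sketch of how the `-SUBJECTS-` block above might be parsed, in Python. Everything here is an assumption for illustration: `KNOWN_QUALIFIERS`, `parse_subjects`, and `no_lowercase` are hypothetical names, not part of the tool, and the "no lowercase letters" check is only a guess at what the current qualifier-switch criteria look like. The idea is to switch qualifiers only on lines from a known list, so a code like `QA76.` is kept as a value instead of being mistaken for a switch.

```python
# Hypothetical qualifier names, copied from the example above (not the tool's real list).
KNOWN_QUALIFIERS = {"KEYWORDS/TAGS", "LCSH", "DDC", "LCC", "MESH", "UDC"}


def no_lowercase(line):
    """Assumed stand-in for the current switch criteria: the line has no lowercase letters."""
    return not any(ch.islower() for ch in line)


def parse_subjects(lines):
    """Group each value line under the most recent known qualifier line."""
    subjects = {}
    current = None
    for raw in lines:
        line = raw.strip()
        if not line or line == "-SUBJECTS-":
            continue  # skip blank lines and the section header
        if line in KNOWN_QUALIFIERS:
            current = line  # qualifier switch
            subjects.setdefault(current, [])
        elif current is not None:
            subjects[current].append(line)  # value for the active qualifier
    return subjects


example = """-SUBJECTS-
KEYWORDS/TAGS
dogs
cats
fish
LCSH
QA76.
DDC
LCC
MESH
UDC"""

result = parse_subjects(example.splitlines())
```

With this approach `QA76.` ends up under `LCSH` even though `no_lowercase("QA76.")` is true, which is the collision described in the question above. The trade-off is that the qualifier list has to be maintained by hand.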

atla5 commented 8 years ago

I won't remove it from the inline parsing for now. I don't see a need.