USPTO / PatentPublicData

Utility tools to help download and parse patent data made available to the public

DEV: Green Book: optional normalization of dot codes #60

Open lucboruta opened 6 years ago

lucboruta commented 6 years ago

First PR for #59. (Happy #hacktoberfest!)

This commit adds optional normalization of dot codes in Green Book documents, building on gov.uspto.patent.doc.greenbook.DotCodes. Normalization is disabled by default, so the change is backward-compatible. It can be toggled at several points in the pipeline, including in PatentReader, which provides a format-independent endpoint for dot codes and for the upcoming normalization of "brace codes".
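
To make the opt-in behavior concrete, here is a minimal sketch of what such a toggle could look like. It is not the code in this PR: the class `DotCodeNormalizerSketch`, its `process` method, and the three-entry code table are all hypothetical, and the sketch assumes dot codes take the form `.name.` (e.g. `.degree.`). The real mapping lives in gov.uspto.patent.doc.greenbook.DotCodes.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not the project's DotCodes or PatentReader API.
final class DotCodeNormalizerSketch {

    // Illustrative subset of dot codes (assumed for this example).
    private static final Map<String, String> CODES = new LinkedHashMap<>();
    static {
        CODES.put("degree", "\u00B0"); // degree sign
        CODES.put("alpha", "\u03B1");  // Greek small letter alpha
        CODES.put("mu", "\u03BC");     // Greek small letter mu
    }

    // Assumed shape of a dot code: a run of letters between two dots.
    private static final Pattern DOT_CODE = Pattern.compile("\\.([a-zA-Z]+)\\.");

    private final boolean normalize;

    // Normalization is off by default, so existing callers see no change.
    DotCodeNormalizerSketch() {
        this(false);
    }

    DotCodeNormalizerSketch(boolean normalize) {
        this.normalize = normalize;
    }

    String process(String text) {
        if (!normalize) {
            return text; // backward-compatible path: text passes through untouched
        }
        Matcher m = DOT_CODE.matcher(text);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            String replacement = CODES.get(m.group(1));
            // Unknown codes are kept verbatim rather than dropped.
            String value = (replacement != null) ? replacement : m.group();
            m.appendReplacement(out, Matcher.quoteReplacement(value));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        String raw = "heated to 100.degree. C.";
        System.out.println(new DotCodeNormalizerSketch().process(raw));     // default: unchanged
        System.out.println(new DotCodeNormalizerSketch(true).process(raw)); // opted in: normalized
    }
}
```

Keeping the flag in the constructor, with a no-argument default of `false`, is one way to let callers opt in without changing existing call sites. Unknown codes are left as-is so that enabling normalization never loses text.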

(My fork's master branch includes a handful of commits that reformat the code, organize imports, etc. I know such commits are tedious to review, but let me know if you're interested in merging them to clean up the codebase.)