funginstitute / patentprocessor

BSD 2-Clause "Simplified" License
68 stars 31 forks source link

Low memory adaptations #72

Closed gtfierro closed 10 years ago

gtfierro commented 10 years ago

When operating on the full dataset, it is difficult to get the cleaning/consolidating/integrating steps to tractably function. These fix up some of the rough edges of the scripts and provide some nice config options for doing low-memory versions of the steps.