openva / crump

A parser for the Virginia State Corporation Commission's business registration records.
https://vabusinesses.org/
MIT License
20 stars 3 forks source link

Generate Elasticsearch-compatible files #52

Closed waldoj closed 10 years ago

waldoj commented 10 years ago

Optionally, generate JSON that Elasticsearch's Bulk API can ingest. This should be a simple addition. I'm thinking that those files would be generated instead of the normal JSON files, instead of in addition to them—the program can simply be run twice if somebody wants both sets of files. But the only reason I prefer that is because I think it's going to be easier than generating two sets of files.

waldoj commented 10 years ago

The catch is that this only works for those files that have a corporate ID field. That is, this will not index table 8, which lists registered names.