thequbit / monroeminutes

Town Meeting Minutes Aggregator and Search Engine
GNU General Public License v3.0

implement mm_docprocessor.py #44

Open thequbit opened 10 years ago

thequbit commented 10 years ago

This file will grab unprocessed documents and try to match each one to a particular organization. An Entity can have multiple Organizations (think of a town board, a zoning board, and a school board, all within the entity of Henrietta, NY).

This file will be responsible for:

This can be implemented by simple pattern matching.
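As a rough illustration of what simple pattern matching could look like here, the sketch below counts regex hits per organization and picks the one with the most. The `ORGANIZATIONS` records, their `name` and `patterns` fields, and the `match_organization` function are all hypothetical; nothing in the project defines them yet.

```python
import re

# Hypothetical organization records; the field names are assumptions,
# not the project's actual schema.
ORGANIZATIONS = [
    {"name": "Town Board", "patterns": [r"\btown\s+board\b"]},
    {"name": "Zoning Board", "patterns": [r"\bzoning\s+board\b"]},
    {"name": "School Board", "patterns": [r"\bschool\s+board\b",
                                          r"\bboard\s+of\s+education\b"]},
]


def match_organization(text, organizations=ORGANIZATIONS):
    """Return the name of the organization with the most pattern hits, or None."""
    best_name, best_hits = None, 0
    for org in organizations:
        # Count case-insensitive occurrences of every pattern for this org.
        hits = sum(
            len(re.findall(pattern, text, flags=re.IGNORECASE))
            for pattern in org["patterns"]
        )
        if hits > best_hits:
            best_name, best_hits = org["name"], hits
    return best_name
```

A document with no hits returns `None`, so it can stay in the unprocessed pile rather than being assigned to the wrong organization.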

Note: a new collection needs to be created in the database to hold the list of organizations. We also need to figure out how to match entities up with orgs, or vice versa. I'm thinking an entity should have a list of organizations underneath it.
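One possible shape for the "entity has a list of organizations" idea is sketched below with plain dictionaries. The field names (`name`, `organizations`) and the `organizations_for` helper are assumptions for illustration only; the actual schema is still undecided.

```python
# Hypothetical entity record with its organizations nested underneath it.
# Field names are assumptions, not the project's actual schema.
entity = {
    "name": "Henrietta, NY",
    "organizations": [
        {"name": "Town Board"},
        {"name": "Zoning Board"},
        {"name": "School Board"},
    ],
}


def organizations_for(entity):
    """Return the organization names listed under an entity record."""
    return [org["name"] for org in entity.get("organizations", [])]
```

Nesting keeps the entity-to-org lookup trivial. The reverse lookup (org to entity) would need either a scan over entities or a separate organizations collection that stores a reference back to the entity.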

thequbit commented 10 years ago

This requires organizations to be loaded into the MongoDB database. Web APIs are now implemented to allow for inserting these.