This file will grab unprocessed documents and try and match them to a particular organization. An Entity can have multiple Organizations (think a the organizations of town board, a zoning board, a school board all within the entity of Henrietta, NY).
This file will be responsible for:
deciding which organization each document belongs to
updating the mongodb database with that information
pushing the doc and all it's meta data to elastic search
This can be implemented by simple pattern matching.
Note: a new collection within the database needs to be created that is a list of organizations. Also need to figure out how we want to match entities up with orgs, or vice versa. I'm thinking an entity should have a list of organizations underneath it.
This file will grab unprocessed documents and try and match them to a particular organization. An Entity can have multiple Organizations (think a the organizations of town board, a zoning board, a school board all within the entity of Henrietta, NY).
This file will be responsible for:
This can be implemented by simple pattern matching.
Note: a new collection within the database needs to be created that is a list of organizations. Also need to figure out how we want to match entities up with orgs, or vice versa. I'm thinking an entity should have a list of organizations underneath it.