opendenton / denton_data

A prototype of an ETL and API for Open Denton
https://www.opendenton.com/denton_data

adding main.py and name_entity.py to a new branch for sentiment anali… #17

Open tinkerofwales opened 7 years ago

tinkerofwales commented 7 years ago

Adding a datamining branch for branching and sentiment analysis. It contains parsing "libraries" that return time/location information, along with LDA-assumed sentiment topics.

newswim commented 7 years ago

This could come in really handy when we're handling more text-based documents. We should definitely think about integrating this with:

Let's think about splitting this off into its own microservice. Doing that would allow us to run background tasks and not worry about eating up limited server resources. Then, we can host both the list of documents that should be processed and the results of the analysis on the denton_data API.
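To make the split concrete, here's a rough sketch of what the worker loop in that service could look like. Everything in it is hypothetical: `InMemoryDocumentStore` is just a stand-in for whatever endpoints denton_data ends up exposing, and `word_count_analysis` is a placeholder for the actual analysis code from this PR.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class InMemoryDocumentStore:
    """Hypothetical stand-in for the denton_data API.

    It holds the documents waiting to be processed and the analysis
    results, which is the role the API would play for the microservice.
    """
    pending: Dict[str, str] = field(default_factory=dict)
    results: Dict[str, dict] = field(default_factory=dict)

    def list_pending(self) -> List[str]:
        # A document is pending until a result has been saved for it.
        return [doc_id for doc_id in self.pending if doc_id not in self.results]

    def fetch(self, doc_id: str) -> str:
        return self.pending[doc_id]

    def save_result(self, doc_id: str, result: dict) -> None:
        self.results[doc_id] = result


def word_count_analysis(text: str) -> dict:
    """Placeholder analysis step (assumption, not the PR's code)."""
    return {"word_count": len(text.split())}


def process_pending(store: InMemoryDocumentStore,
                    analyze: Callable[[str], dict]) -> int:
    """Run one background pass: analyze every pending document once."""
    processed = 0
    for doc_id in store.list_pending():
        store.save_result(doc_id, analyze(store.fetch(doc_id)))
        processed += 1
    return processed


if __name__ == "__main__":
    store = InMemoryDocumentStore(pending={"doc-1": "city council meets tuesday"})
    print(process_pending(store, word_count_analysis), store.results)
```

Because the worker only talks to the store through `list_pending`, `fetch`, and `save_result`, swapping the in-memory version for real HTTP calls later shouldn't touch the analysis code.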

Plus, this allows teams to work on adding features and maintaining the codebase in a more fine-grained way. Let's initialize a new repository under OpenDenton -- tag me or Kyle if you need any help -- but let's also continue to track this (future) service here within this PR.

Here's a few runway items for consideration:

This is awesome, y'all!