Data4Democracy / indivisible

Aggregating call to action sites into a single application.
25 stars 19 forks source link

Investigate email structure #7

Open pghosh opened 7 years ago

pghosh commented 7 years ago

This task is to go through different action emails and identify best way to extract the text data. The things we are looking for in email are

  1. action detail
  2. Date,time,place
  3. Organization
  4. anything else might be helpful for the action. This is a parent task for investigating all emails. It might be a better approach if we split this into subtasks by specific email id/group.

Acceptance criteria,

  1. What kind of informations we can get from different emails
  2. Recommendation for best way to extract the information (scrapper text based on template vs using machine learning)
  3. if the emails can be grouped such that same script can work on the group.
Phinneas commented 7 years ago

Okay, I'll start working on this issue.