datactive / bigbang

Scientific analysis of collaborative communities
http://datactive.github.io/bigbang/
MIT License
154 stars 51 forks source link

include signature and quoted text detection on emails #88

Open sbenthall opened 10 years ago

sbenthall commented 10 years ago

Include a method that detects email signatures and quoted text.

Look into the open source libraries released by Mailgun:

https://github.com/mailgun/talon

http://blog.mailgun.com/open-sourcing-our-email-signature-parsing-library/

sbenthall commented 9 years ago

We decided to roll our own solution to this. Quoted text detection has already been submitted. We've been punting for now on signature.

sbenthall commented 9 years ago

Quote detection is in thread.py. This can be generalized to other libraries.