anidata / ht-etl

Anidata 1.0: ETL and algorithm code.
0 stars 10 forks source link

Parse emails #11

Closed dlrobertson closed 7 years ago

dlrobertson commented 7 years ago

Given a csv containing the page id and the page content, find all the emails in the content. The regex should catch cases in which "at" and "dot" are used instead of "@" and ".".