shonfeder / borgmanities

A short-lived twitter bot in Haskell (and a bit of Python)
MIT License

Move all parsing to Haskell #12

Open ghost opened 9 years ago

ghost commented 9 years ago

Reasons for this change:

  1. I want to get familiar with parsing in Haskell.
  2. In order to really refine the output, we need to have more advanced parsing of the text. Immediate issues that call for this include:
    • Stripping out all links (requires parsing out relatively complex string patterns).
    • Rejecting sequences that contain tweet-speak ("RT", "u r", etc.).
    • Rejecting phrases that contain terms on the blacklist.
  3. Thus, we'll need to do more parsing on either the Python side or the Haskell side. Given (1) and the fact that I don't enjoy working in Python, the Haskell side it is.

    Todo

    • [ ] Modify the Python script to return the unfiltered Unicode text of the tweets, one per line.
    • [ ] Parse this text in Haskell.
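
For illustration only, here is a minimal sketch of what the Haskell side might look like once it receives one tweet per line. It is token-based and uses only `base`; it is not the project's actual code. The names `blacklist`, `tweetSpeak`, `stripLinks`, `hasTweetSpeak`, `hasBlacklisted`, `cleanTweet`, and `cleanTweets` are all hypothetical, and the sample blacklist entries are placeholders. Treating any whitespace-delimited token that starts with `http://` or `https://` as a link is a simplifying assumption; a real parser would likely need to handle more link forms.

```haskell
import Data.Char (toLower)
import Data.List (isInfixOf, isPrefixOf)
import Data.Maybe (mapMaybe)

-- Hypothetical blacklist; placeholder terms, not the project's real list.
blacklist :: [String]
blacklist = ["spam", "giveaway"]

-- Hypothetical tweet-speak markers, each a sequence of whole lowercase tokens.
tweetSpeak :: [[String]]
tweetSpeak = [["rt"], ["u", "r"]]

-- Assumption: a link is any token beginning with an http(s) scheme.
isLink :: String -> Bool
isLink tok = any (`isPrefixOf` lowered) ["http://", "https://"]
  where
    lowered = map toLower tok

-- Drop link tokens and normalise whitespace between the remaining tokens.
stripLinks :: String -> String
stripLinks = unwords . filter (not . isLink) . words

lowerTokens :: String -> [String]
lowerTokens = words . map toLower

-- True if any tweet-speak marker occurs as a contiguous run of tokens.
hasTweetSpeak :: String -> Bool
hasTweetSpeak s = any (`isInfixOf` toks) tweetSpeak
  where
    toks = lowerTokens s

-- True if any token matches a blacklisted term exactly (case-insensitive).
hasBlacklisted :: String -> Bool
hasBlacklisted s = any (`elem` toks) blacklist
  where
    toks = lowerTokens s

-- Strip links, then reject the tweet if it is empty or fails either check.
cleanTweet :: String -> Maybe String
cleanTweet raw
  | null s                              = Nothing
  | hasTweetSpeak s || hasBlacklisted s = Nothing
  | otherwise                           = Just s
  where
    s = stripLinks raw

-- Input is the raw text from the Python side, one tweet per line.
cleanTweets :: String -> [String]
cleanTweets = mapMaybe cleanTweet . lines

main :: IO ()
main = mapM_ putStrLn (cleanTweets "hello world http://x.y\nRT nope\nu r the best\n")
```

Because this matches whole tokens, punctuation attached to a word (for example `RT:`) slips through. That limitation is the kind of thing a proper parser would address.
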
ghost commented 9 years ago

As per @KitLiterate's advice, I am removing this from the milestone, and will mark it as an enhancement rather than a todo.