callbackwomen / callback-women-rails

App will help women who are programming speakers better connect with programming conferences' CFPs.
http://callbackwomen.com
2 stars 6 forks source link

A admin user needs app to do majority of the work of digging up relevant info from Twitter #18

Open cczona opened 10 years ago

cczona commented 10 years ago

tl;dr: Finding relevant tweets is a clunky, time-consuming, pain in the butt. Stuff needs to be collected far better, continuously, and cached.

Background

The most common types of Twitter searches currently:

  1. Seeking additional conferences to follow, via searching Twitter as a whole
  2. Seeking CFPs to share, via searching CallbackWomen's timeline for keywords (e.g. "CFP", "call for proposals", etc).
  3. Seeking, in various ways, other relevant info to share. e.g. travel funding, diversity scholarships, speaker mentorship, childcare, codes of conduct, etc.

    Good news:

  4. Twitter search supports booleans and search criteria such as date range, username, etc. http://thesocialchic.com/2013/04/26/how-to-master-twitter-search/ Sweet!

    Pain points:

  5. Clients implement Twitter boolean searches poorly, if at all. Support for search by criteria is rarer.
  6. Twitter's API doesn't support regex http://stackoverflow.com/questions/23363940/using-regular-expression-in-twitter-api
  7. There aren't (afaik) clients that let you run regexes against result sets it has obtained via the API
  8. Some conferences call things a CFP that don't meet CallbackWomen's particular criteria.
  9. Some conferences use less particular vocabulary (for instance, calls in languages other than English; or peculiaries such as "C4P")
  10. It takes a lot of searches, separately, to find the genuine hits.
  11. Because the searches have to be separate and imprecise, there's a lot of noise to wade through each time
  12. There are other relevant tweets to cull, such as regarding scholarships, childcare, mentorship, etc. These also have a range of expressions to match.
  13. Stuff rolls of of timelines too fast. If you're not checking search results daily, relevant tweets get missed. Which is a problem since some conferences only tweet a CFP announcement once ("open!") or twice ("deadline is tonight!"). So not checking frequently can cause a CFP to be missed altogether.

    Proposed solution:

Ongoing administrative searches of tweets. A collection of complex searches, whose results are merged/de-duped/refined by the app, and cached by the app so that admin(s) can pull from that narrower resultset when convenient.

Setting administrative searches via an .ini file is fine. There probably needs to be some kind of basic sanity checking before passing searches forward to the API.

Alternative solutions:

I'm open to alterative ideas for dealing with the core problem. Thoughts?