Code4HR / va-circuit-court-search

Virginia Courts Case Information - Statewide Searches ARE Possible
http://vacircuitcourtsearch.com/
MIT License

implement a task queue #7

Open prgr4m opened 9 years ago

prgr4m commented 9 years ago

It just doesn't feel right... Yes, the project works with the current solution but with the district court issue, solving captchas and stateful browser hacking (wish mechanize was up to date, but there's always robobrowser d^_^b )... definitely seems like we should be using gearman or celery for this sort of background work with server-sent events but I'm not necessarily aware of the original requirements (if there are any). Here's a couple of questions:

I'm pretty flexible but I need to know what your thoughts are on this. I noticed you opened up a ticket for this but closed it.

bschoenfeld commented 9 years ago

Yeah, technically all this work shouldn't be done while the browser is waiting for a response. A background worker would be more appropriate but I did the simplest thing that could work.
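To make the background-worker idea concrete, here is a minimal sketch using only the Python standard library. It is not the project's code: `search_court`, `submit`, `status`, and the in-memory `results` dict are hypothetical names invented for illustration, and a real deployment would more likely hand the jobs to Celery or Gearman as suggested above. The point is only that the request handler enqueues the search and returns a job id right away, while a separate worker does the slow scraping.

```python
import queue
import threading
import uuid

# Hypothetical in-memory stores. A real setup would use a broker-backed
# queue (Celery, Gearman) instead of objects living in the web process.
tasks = queue.Queue()
results = {}
results_lock = threading.Lock()


def search_court(court, name):
    """Placeholder for the slow scraping step (hypothetical, no network I/O)."""
    return {"court": court, "name": name, "cases": []}


def worker():
    """Pull jobs off the queue until a None sentinel arrives."""
    while True:
        job = tasks.get()
        if job is None:
            tasks.task_done()
            break
        job_id, court, name = job
        try:
            outcome = {"status": "done", "result": search_court(court, name)}
        except Exception as exc:
            outcome = {"status": "failed", "error": str(exc)}
        with results_lock:
            results[job_id] = outcome
        tasks.task_done()


def submit(court, name):
    """Enqueue a search and return a job id without waiting for the result."""
    job_id = uuid.uuid4().hex
    with results_lock:
        results[job_id] = {"status": "pending"}
    tasks.put((job_id, court, name))
    return job_id


def status(job_id):
    """Return a copy of the current state of a job, for the browser to poll."""
    with results_lock:
        return dict(results[job_id])
```

A web handler would call `submit(...)` and respond immediately with the job id; the browser could then poll `status(job_id)`, or the server could push updates over server-sent events. The worker itself would be started once, for example with `threading.Thread(target=worker, daemon=True).start()`.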

I'm busy with some other things, so if you want to work on this thing, go for it. I'd like to see more sites be incorporated into the search. I think that's what the users want. I tried to add Alexandria to the search but their site is very difficult to scrape. Fairfax doesn't even have a public site.

prgr4m commented 9 years ago

I know what you mean on the busy part. It's taken this long to reply. I can only dedicate time on the weekends to this project after I take care of a couple of things. Paying out of pocket can be pretty pricey (will keep that in mind)... What I meant in regards to hashing out a solution was using libraries/techniques that you would be comfortable with, since I don't know your background. Since I found out about this project through a local meetup, I figured I could contribute but would like to be congruent with the main developer on the project. That's all. I'm offline a bit but you should see something in my repo in about a week. Then I'll just add to this issue and get feedback before submitting a pull request. Have a good week.

bschoenfeld commented 9 years ago

I'm open to using and learning whatever works. I rarely use Python, but I know it well enough to get a quick web server running. I don't have plans to take this thing much further, so if it goes in a different direction, that's cool. I'm prepared to step back if someone else wants to lead the way. I think there may be some interest in Code for DC, so I hope we hear from them soon.

Did you find out about this through the Code for HR meetup?

I should be getting billed by Heroku, but I can't find any charges on my account. I hope I'm not eating up CfA's credits. At any rate, GA says I'm only getting a couple hits a day so I've backed it down to one dyno.