Kimurai is a modern web scraping framework written in Ruby that works out of the box with headless Chromium/Firefox, PhantomJS, or simple HTTP requests, and lets you scrape and interact with JavaScript-rendered websites.
While writing tests for a scraper I built, I realised that testing isn't very straightforward at the moment. It would be great to improve on that front:
Add a testing section to the documentation, showing how to set up and test scrapers, for example in Rails.
Expand the global configuration options. I would like to be able to disable the delay globally in the test environment, instead of doing this in every scraper I write: @config = { before_request: { delay: 1..2 } } unless Rails.env.test?
Add automatic detection of the test environment. Currently I have to set it manually in rails_helper: ENV['KIMURAI_ENV'] ||= 'test'
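Until something like this exists in the framework, the last two points can be approximated in application code. The following is only a rough sketch: `detect_scraper_env` and `scraper_config` are hypothetical helper names, not part of Kimurai's API, and the fallback order (KIMURAI_ENV, then RAILS_ENV, then RACK_ENV) is an assumption about what "automatic detection" could mean.

```ruby
# Hypothetical helpers for illustration; neither is provided by Kimurai.

# Resolve the environment name. An explicit KIMURAI_ENV wins, then the
# usual Rails/Rack variables, and finally "development" as a default.
# `env` is injectable so the lookup can be exercised without touching ENV.
def detect_scraper_env(env = ENV)
  env["KIMURAI_ENV"] || env["RAILS_ENV"] || env["RACK_ENV"] || "development"
end

# Build the shared scraper config, leaving out the request delay when
# running under the test environment.
def scraper_config(env_name = detect_scraper_env)
  return {} if env_name == "test"

  { before_request: { delay: 1..2 } }
end
```

Each scraper could then set `@config = scraper_config` instead of repeating the `unless Rails.env.test?` guard, which keeps the environment check in one place.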