Add search result. As a user, I can search for keyword and view search result

What happened 👀

[UI] Updated keyword table
[Backend] Implemented google_search to search each keyword and store the result in the database
[UI] Implemented a page to view HTML code of the keyword search result page

Insight 📝

For how to scrap the search result page, I use httparty and nokogiri library following this the article "Web Scraping in Ruby: Complete Guide 2024".
known issue [1]: The more keywords you have in the CSV file, the longer it takes. Using thread (or sidekiq library) might help solving this issue.
known issue [2]: mass-searching keywords is blocked. (possible solutions: user-agent rotation or in this interesting article "Web Scraping Google Without Getting Blocked") When the request get blocked, we will see this page:

How searching for each keyword works

For each keyword, I sequentially send a get request to http://www.google.com/search?q=#{keyword}&hl=en (use hl=en to get the result in English) NOTE: this can be improved by sending multiple requests simultaneously.

How extracting data from HTML works

I used nokogiri css selector to extract data from the HTML document.

total_search_results selector: div#result-stats
total_links I count <a> tag in the page
total_ads selector: div#tads[aria-label=Ads] (if not found, fallback to div#tads) Then return element.childrent.count

Proof Of Work 📹

scrappy-web-search-keywords

macropusgiganteus / scrappy-web

Add search result. As a user, I can search for keyword and view search result #4

What happened 👀

Insight 📝

How searching for each keyword works

How extracting data from HTML works

Proof Of Work 📹