elixir-crawly / crawly

Crawly, a high-level web crawling & scraping framework for Elixir.
https://hexdocs.pm/crawly
Apache License 2.0
965 stars 113 forks source link

This is actually a question, Nested scraping #229

Closed shellking4 closed 1 year ago

shellking4 commented 1 year ago

I have a scraper that is scraping data from a website. On the website we have products and each product has different colors options which come with a given set of images. My question is how to set up my scraper so that if I'm on a product page I can also check the other options pages in order to get the specific images. Sounds like a nested stuff but I cannot yet figure out how to implement it. Thanks for any tips

ivanhercaz commented 1 year ago

Hi @shellking4! I can't answer your question now, because I would need to replicate what you tried, but I suggest you to move it to Discussions section and include more details about the HTML structure and what you have tried.

Ziinc commented 1 year ago

Performing specific html parsing and extraction is up to your own personal skill, this is outside of the scope of Crawly and crawling management, nor is it an issue with the library.