Open gkzsolt opened 1 month ago
Looking at the code of ScrapeWebsiteTool, it does get stuck, indeed. It is a simple requests.get call. By the way, the site's content in question can be obtained even without enabling cookies, but there are also other problems: it redirects (301) and also has some primitive but effective scraping protection.
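To illustrate the point about the plain GET call hanging, here is a minimal sketch of a fetch with a timeout and a browser-like User-Agent. It uses the standard library's urllib rather than requests, so it is not the tool's actual code; the same idea carries over to requests.get(url, timeout=..., headers=...). The function name fetch_page and the header value are my own assumptions, and this will not necessarily get past the site's scraping protection.

```python
import urllib.request

# Assumed browser-like header; some sites reject Python's default User-Agent.
DEFAULT_HEADERS = {"User-Agent": "Mozilla/5.0 (X11; Linux x86_64)"}


def fetch_page(url, timeout=10.0, headers=None):
    """Fetch a page and return (final_url, status, text).

    Hypothetical helper, not part of ScrapeWebsiteTool.
    """
    request = urllib.request.Request(url, headers=headers or DEFAULT_HEADERS)
    # urlopen follows 301/302 redirects by default. The timeout applies to each
    # blocking socket operation, so a silent server raises instead of hanging.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = response.read()
        charset = response.headers.get_content_charset() or "utf-8"
        return response.geturl(), response.status, body.decode(charset, errors="replace")
```

Without a timeout, a server that accepts the connection but never answers can block the call indefinitely, which would look like the tool being stuck.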
I tried scraping with Selenium. This worked in my home setup, but I was unable to make it work in a custom tool (SeleniumScrapingTool failed as well). Installing a webdriver compatible with your browser version seems to be very challenging. A few years ago, it was easy to download the webdriver for the most recent browsers (I am using Chrome), but starting from version 115, that download was discontinued. Now, there is a webdriver manager that is expected to detect and download the driver for you, but I have never seen this work.
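For anyone who wants to try the same thing, here is a minimal sketch of a headless Chrome fetch with Selenium. It assumes Selenium 4.6 or later, where the bundled Selenium Manager is supposed to resolve a matching chromedriver when no driver path is given, and it assumes Chrome itself is installed. The helper names chrome_arguments and scrape_with_selenium and the chosen flags are illustrative, not taken from SeleniumScrapingTool, and I cannot promise it works on every setup.

```python
def chrome_arguments(headless=True):
    """Return Chrome command-line flags for a scraping session (illustrative choice)."""
    args = ["--no-sandbox", "--disable-dev-shm-usage", "--window-size=1920,1080"]
    if headless:
        args.insert(0, "--headless=new")
    return args


def scrape_with_selenium(url, page_load_timeout=10):
    """Load a page in Chrome and return its rendered HTML (hypothetical helper)."""
    # Imported lazily so chrome_arguments() is usable without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    for arg in chrome_arguments():
        options.add_argument(arg)

    # No driver path is passed: Selenium 4.6+ is expected to locate or download
    # a compatible chromedriver through Selenium Manager.
    driver = webdriver.Chrome(options=options)
    try:
        driver.set_page_load_timeout(page_load_timeout)
        driver.get(url)
        return driver.page_source
    finally:
        # Always close the browser, even if the page load times out.
        driver.quit()
```

The flags --no-sandbox and --disable-dev-shm-usage are commonly needed when Chrome runs inside a container; on a desktop Ubuntu install they can usually be dropped.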
Has anybody managed to run the SeleniumScrapingTool successfully, and if so, could they share it with me? I would be very grateful. I like the agents crew idea and I'd like to contribute to it as well. I am on Ubuntu 22.04. Many thanks!
Hi,
I (almost) finished your presentation course on Deeplearning.ai yesterday and I was impressed ;) My first try did not succeed, though.
I am just trying to get an agent to analyze a job posting and give a structured output of the requirements, like in L7_job_application_crew.ipynb from the presentation. I just copied the agent and task:
But when running the crew, the output is:
Did it get stuck when asked to enable JavaScript and cookies?