Longleaves closed this issue 1 month ago
@Longleaves can you share the url you're trying to scrape?
@rafaelsideguide Thank you for answering!
The URL is the one from your example: "url": "https://mendable.ai"
These showed up in the error column:
But I found that the result was still returned in the "data" column:
{
"jobData": {
"url": "https://mendable.ai",
"mode": "crawl",
"crawlerOptions": {
"allowBackwardCrawling": false
},
"pageOptions": {
"onlyMainContent": false,
"includeHtml": false,
"removeTags": [],
"parsePDF": true
},
"origin": "api"
},
"returnValue": [
{
"content": "\n\nIntroducing Firecrawl\n 🔥 - Turn entire websites into LLM-ready markdown or structured data\n\n![Mendable logo](https://www.mendable.ai/Frame%20566%20(2)Mendable\n\n Getting started\n Use Cases\n Docs\n \n Pricing\n \n* Blog\n
…………
Is this a known error that I can safely ignore? Also, how can I get the crawl result directly?
I also found "error": "No pages found" in another job, and the same errors showed up in the pnpm output:
{
  "jobData": {
    "url": "https://www.baidu.com/",
    "mode": "crawl",
    "crawlerOptions": {
      "allowBackwardCrawling": false
    },
    "pageOptions": {
      "includeHtml": false,
      "onlyMainContent": true,
      "waitFor": 5000
    },
    "origin": "api"
  },
  "returnValue": {
    "success": true,
    "result": {
      "links": []
    },
    "error": "No pages found"
  }
}
I'm having the same issue.
ccing @rafaelsideguide
@xingfanxia @Longleaves we were able to reproduce this error, and we're fixing the "missing lock for job id" error in our next release. In the meantime, you can still fetch the results of your crawls using the crawl/status endpoint. Check out the docs here for how to use it.
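For anyone landing here, a minimal sketch of that workaround in Python follows. It is not taken from the docs, so treat the details as assumptions: the `/v0/crawl/status/{job_id}` path, the `status` and `data` fields in the response, and the port 3002 base URL for a self-hosted instance should all be checked against the docs for your version. The helper names are hypothetical.

```python
import json
import urllib.request

# Assumption: a self-hosted instance listening on port 3002.
BASE_URL = "http://localhost:3002"


def status_url(base_url: str, job_id: str) -> str:
    """Build the crawl status URL. The /v0/ prefix is an assumption."""
    return f"{base_url.rstrip('/')}/v0/crawl/status/{job_id}"


def extract_pages(payload: dict) -> list:
    """Return the crawled pages once the job is done, otherwise an empty list.

    Assumes the response carries a "status" string and a "data" list.
    """
    if payload.get("status") != "completed":
        return []
    return payload.get("data") or []


def fetch_crawl_status(job_id: str, api_key: str = "", base_url: str = BASE_URL) -> dict:
    """GET the status of a crawl job and decode the JSON body."""
    request = urllib.request.Request(status_url(base_url, job_id))
    if api_key:
        # Only needed when the instance enforces authentication.
        request.add_header("Authorization", f"Bearer {api_key}")
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Calling `extract_pages(fetch_crawl_status(job_id))` with the job id returned by the crawl request should then give the page list, or an empty list while the job is still running.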
When I run it locally, I see an error in the UI (http://0.0.0.0:3002/admin/@/queues). The error is:
And my pnpm output showed:
Error: Error: All scraping methods failed for URL
Error sending webhook for team ID: undefined
Failed to parse URL from undefined
Please help me, thanks a lot!