mendableai / firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
https://firecrawl.dev
GNU Affero General Public License v3.0
15.08k stars 1.09k forks source link

[BUG]Why “Error: Missing lock for job 0677c04f-78b2-4917-ae53-e701a5611b3c finished” happened? #320

Closed Longleaves closed 1 month ago

Longleaves commented 3 months ago

When I run locally, I found an error on UI(http://0.0.0.0:3002/admin/@/queues), the error is: 图片 And my pnpm showed: Error: Error: All scraping methods failed for URL Error sending webhook for team ID: undefined Failed to parse URL from undefined 图片 图片

Please help me, thanks a lot!

rafaelsideguide commented 3 months ago

@Longleaves can you share the url you're trying to scrape?

Longleaves commented 3 months ago

@Longleaves can you share the url you're trying to scrape?

@rafaelsideguide Thank u for answering me!

the url is your example: "url": "https://mendable.ai"

these showed in the error column: 343095148-542166b2-a07f-4215-8081-1451f7a9599d

But I found that the return result showed in "data" column: { "jobData": { "url": "https://mendable.ai", "mode": "crawl", "crawlerOptions": { "allowBackwardCrawling": false }, "pageOptions": { "onlyMainContent": false, "includeHtml": false, "removeTags": [], "parsePDF": true }, "origin": "api" }, "returnValue": [ { "content": "\n\nIntroducing Firecrawl\n 🔥 - Turn entire websites into LLM-ready markdown or structured data\n\n![Mendable logo](https://www.mendable.ai/Frame%20566%20(2)Mendable\n\n Getting started\n Use Cases\n Docs\n \n Pricing\n \n* Blog\n
…………

Is this a normal mistake I can ignore? And I wanna ask how can I get the crawl result directly?

Longleaves commented 3 months ago

I also found: "error": "No pages found" and the same errors showed in pnpm.

{ "jobData": { "url": "https://www.baidu.com/", "mode": "crawl", "crawlerOptions": { "allowBackwardCrawling": false }, "pageOptions": { "includeHtml": false, "onlyMainContent": true, "waitFor": 5000 }, "origin": "api" }, "returnValue": { "success": true, "result": { "links": [] }, "error": "No pages found" } }

343095951-145f9975-7779-4248-9e20-55f08ebcefd4 343115458-b61095c5-8f26-422a-99ab-1351d1aad337

xingfanxia commented 2 months ago

having some issue

nickscamara commented 2 months ago

ccing @rafaelsideguide

rafaelsideguide commented 1 month ago

@xingfanxia @Longleaves we were able to reproduce this error and we're fixing the missing lock for job id error in our next release. In the meantime, you can still fetch the results from your crawls using the crawl/status endpoint. Check out the docs here for how to use it.