mendableai / firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
https://firecrawl.dev
GNU Affero General Public License v3.0

[Feat] Support TXT files #812

Open rafaelsideguide opened 3 weeks ago

rafaelsideguide commented 3 weeks ago

From a customer via email: when a Google Drive file is shared with everyone, Firecrawl seems able to scrape a PDF file in Google Drive, but it has issues with a text file.

Here is an example: https://drive.google.com/file/d/1CCayVbWoNRnNYyvSFW1aIbhNaFMVh3SN/view?usp=sharing

The error is:

The page returned an error while being scraped.
BAD REQUEST

sikehish commented 3 weeks ago

I'd like to take this up. Any specific file that you'd want me to look into?
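
For anyone reproducing the report with the example link from the issue, it may help to separate "the Drive viewer page fails" from "the raw text file fails". The sketch below is not Firecrawl code. It only pulls the file ID out of a `/file/d/<id>/view` share link and builds a direct-download style URL. The helper names (`drive_file_id`, `direct_download_url`) are hypothetical, and the `uc?export=download&id=...` form is a commonly used but unofficial Drive URL pattern, so treat it as an assumption that may not hold for every file.

```python
import re
from typing import Optional
from urllib.parse import urlencode

# Matches the file ID in share links shaped like
# https://drive.google.com/file/d/<id>/view?usp=sharing
_FILE_ID_RE = re.compile(r"drive\.google\.com/file/d/([A-Za-z0-9_-]+)")


def drive_file_id(share_url: str) -> Optional[str]:
    """Return the file ID from a Drive share link, or None if it does not match."""
    match = _FILE_ID_RE.search(share_url)
    return match.group(1) if match else None


def direct_download_url(share_url: str) -> Optional[str]:
    """Build a direct-download style URL (assumed, unofficial pattern)."""
    file_id = drive_file_id(share_url)
    if file_id is None:
        return None
    query = urlencode({"export": "download", "id": file_id})
    return "https://drive.google.com/uc?" + query


if __name__ == "__main__":
    example = (
        "https://drive.google.com/file/d/"
        "1CCayVbWoNRnNYyvSFW1aIbhNaFMVh3SN/view?usp=sharing"
    )
    print(direct_download_url(example))
```

Scraping both the original share link and the rewritten URL would show whether the BAD REQUEST error is tied to the Drive viewer page or to plain-text content in general.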