'CustomHTML2Text' is not defined

unclecode / crawl4ai

🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper

Apache License 2.0

16.38k stars 1.2k forks source link

@yassello55 You're using the synchronous version which is no longer maintained. I suggest switching to the main asynchronous example to avoid similar errors. Sorry for the inconvenience; try the async version instead.

import asyncio
from crawl4ai import AsyncWebCrawler

async def main():
    # Create an instance of AsyncWebCrawler
    async with AsyncWebCrawler(verbose=True) as crawler:
        # Run the crawler on a URL
        result = await crawler.arun(url="https://www.nbcnews.com/business")

        # Print the extracted content
        print(result.markdown)

# Run the async main function
asyncio.run(main())

unclecode / crawl4ai

'CustomHTML2Text' is not defined #243

Create an instance of WebCrawler

Warm up the crawler (load necessary models)

Run the crawler on a URL

Print the extracted content