Open ccstan99 opened 1 year ago
Would it help to use LangChain's WebBaseLoader as a default until the unimplemented parsers get implemented? https://python.langchain.com/docs/integrations/document_loaders/web_base
from langchain.document_loaders import WebBaseLoader
loader = WebBaseLoader("https://epochai.org/blog/")
docs = loader.load()
To handle suggestions from agisf:
Add to scrape entire blog:
Implement parsers for special_docs/indices: