MIRACLE-cowf / Powerful-Auto-Researcher

Powerful Auto Research powered by LangChain, and Anthropic.
29 stars 4 forks source link
agent ai anthropic anthropic-claude artificial-intelligence langchain langchain-python langgraph large-language-models llm rag retrieved-augmented-generation

Powerful-Auto-Researcher(PAR)

๐Ÿ‘‹ Introduction ๐Ÿ‘‹

Hello! I am a beginner developer who is greatly interested in the rapidly emerging field of LLMs.

This is an experimental project in its early stages, created for the purpose of studying LLM prompting and Python for my own study.
However, I believe it to be quite an intriguing idea, and I wish to receive ample feedback and opinions from the brilliant developers on GitHub, while honing my development skills!!

There's a new update! The performance is better than expected compared to the previous version, so I've rewritten the README!

Thank you for starred this project!

Thank you for coming by, and please keep an eye out for future updates!

๐ŸŒ  OverView ๐ŸŒ 

This project PAR is an advanced automated researcher system powered by LangChain, LangGraph, and Large Language Model(LLM) technologies.
The PAR goes beyond traditional RAG(Retrieval-Augmented Generation) systems, offering not only more powerful and efficient information retrieval but also document generation capabilities.
It aims to achieve highly accurate and efficient information retrieval and document generation by prompting the LLM to consider human perspectives, such as why a particular question might be asked or what information a human might desire.
Utilizing a wide range of search engines and data sources, The PAR collects comprehensive information and then automatically generates high-quality documents based on this collected data. These documents are then embedded and stored in a vector database for optimized future retrieval.

The PAR project seeks to efficiently extract and organize essential knowledge from the vast sea of information, aiming to transform the paradigm of knowledge work across various fields, including researchers, students, and business professionals.

While the project is still in its early experimental stages and there are many steps that I ahead, it is an endeavor that I would like to share with the brilliant developers on GitHub, discussing exciting ideas and possibilities!

Before you start, you can see Test Case and Result in Test_Case folder!

Version Average Time Average Token Document Quality
2024.04.02 About 30 ~ 40 min About 600,000 ~ 1,100,000 tokens 1.5 ~ 2
2024.05.30 About 7 ~ 15 min About 125,000 ~ 750,000 tokens 2.5 ~ 4

๐Ÿ˜‰ Help Me & Discuss Me ๐Ÿ˜‰

If you would like to see the results of the document generation, please leave a comment in the designated issue. I will be happy to provide you with the generated documents for your review. Or you can freely leave any comments in Dicussions!

๐ŸŒ  Contents ๐ŸŒ 

Before getting start, strongly recommend that you read through the contents thoroughly!

  1. Detailed
  2. How Does It Work?
  3. All Flow Chart

๐Ÿš€ HOW TO START ๐Ÿš€

โ—WARNINGโ—โ—WARNINGโ—โ—WARNINGโ—

This project may use so many tokens, so be careful!

โ—WARNINGโ—โ—WARNINGโ—โ—WARNINGโ—

๐Ÿค Main Third-party libraries ๐Ÿค

1. LangChain(Main Interface)

2. LangGraph(Recursive structure and Clear flow)

3. LangSmith(Debugging)

4. Anthropic(Language Model)

5. OpenAI(Embedding Model)

6. Tavily API(Main Search Engine)

7. PineCone(Vector Store)

๐ŸŽฏ Try it ๐ŸŽฏ

The main libraries required for running the project can be found in the requirements.txt file.

  1. Clone this repository

    git clone https://github.com/MIRACLE-cowf/Powerful-Auto-Researcher.git
  2. Move to the cloned repository

    cd Powerful-Auto-Researcher
  3. Inside the Powerful-Auto-Researcher, fill in the necessary API keys in the .env file

    • LangChain
    • AskNews
    • Anthropic
    • Tavily
    • Brave Search
    • AskNews
    • PineCone
    • Mongo DB
  4. Install the required libraries

    pip install -r requirements.txt 
  5. Run main.py

    python3 -m main

โœ… Update Log โœ…

2024.05.30

2024.04.02

๐Ÿ”ฅ FeedBack ๐Ÿ”ฅ

As a beginner developer, I am greatly seeking diverse feedback from the brilliant developers on GitHub!

I would appreciate any kind of feedback, regardless of the type, be it Python syntax, structure, prompting, readme, etc.!

Or you can freely leave comment at Issue or Dicussion!

์ ๊ทน์ ์ธ ํ”ผ๋“œ๋ฐฑ ๋ถ€ํƒ๋“œ๋ฆฝ๋‹ˆ๋‹ค!

Thank you!
miracle.cowf@gmail.com