Hello! I am a beginner developer who is greatly interested in the rapidly emerging field of LLMs.
This is an experimental project in its early stages, created so that I can study LLM prompting and Python.
However, I believe it is quite an intriguing idea, and I hope to receive plenty of feedback and opinions from the brilliant developers on GitHub while honing my development skills!
There's a new update! Performance improved over the previous version by more than I expected, so I've rewritten the README!
Thank you for starring this project!
Thank you for coming by, and please keep an eye out for future updates!
This project, PAR, is an advanced automated researcher system powered by LangChain, LangGraph, and Large Language Model (LLM) technologies.
PAR goes beyond traditional RAG (Retrieval-Augmented Generation) systems, offering not only more powerful and efficient information retrieval but also document generation capabilities.
It aims to achieve highly accurate and efficient information retrieval and document generation by prompting the LLM to consider human perspectives, such as why a particular question might be asked or what information a human might desire.
Using a wide range of search engines and data sources, PAR collects comprehensive information and then automatically generates high-quality documents from the collected data. These documents are then embedded and stored in a vector database for optimized future retrieval.
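To make the "embed, store, and retrieve" step above more concrete, here is a minimal sketch in plain Python. It is not PAR's actual implementation: the names `embed`, `cosine_similarity`, and `InMemoryVectorStore` are hypothetical, and the word-count "embedding" is only a stand-in for a real embedding model and vector database.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class InMemoryVectorStore:
    """Hypothetical in-memory store: keeps (vector, document) pairs in a list."""

    def __init__(self):
        self._entries = []

    def add(self, document: str) -> None:
        # Embed the generated document once, at storage time.
        self._entries.append((embed(document), document))

    def search(self, query: str, k: int = 1) -> list:
        # Rank stored documents by similarity to the embedded query.
        query_vector = embed(query)
        ranked = sorted(
            self._entries,
            key=lambda entry: cosine_similarity(query_vector, entry[0]),
            reverse=True,
        )
        return [document for _, document in ranked[:k]]


store = InMemoryVectorStore()
store.add("LangGraph coordinates the steps of an agent workflow.")
store.add("A vector database stores embeddings for similarity search.")
print(store.search("Where are embeddings stored?", k=1))
```

The point of the sketch is only the shape of the flow: documents are embedded when they are stored, so a later question can be answered by comparing vectors instead of searching the web again.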
The PAR project seeks to efficiently extract and organize essential knowledge from the vast sea of information, aiming to transform knowledge work for researchers, students, business professionals, and others.
While the project is still in its early experimental stages and there are many steps ahead, it is an endeavor that I would like to share with the brilliant developers on GitHub, discussing exciting ideas and possibilities!
Before you start, you can review the test cases and results in the Test_Case folder!
| Version | Average Time | Average Tokens | Document Quality |
|---|---|---|---|
| 2024.04.02 | About 30 ~ 40 min | About 600,000 ~ 1,100,000 tokens | 1.5 ~ 2 |
| 2024.05.30 | About 7 ~ 15 min | About 125,000 ~ 750,000 tokens | 2.5 ~ 4 |
If you would like to see the results of the document generation, please leave a comment in the designated issue. I will be happy to provide you with the generated documents for your review.
Or feel free to leave a comment in Discussions!
Before getting started, I strongly recommend that you read through the contents thoroughly!
The main libraries required for running the project can be found in the requirements.txt file.
Clone this repository
git clone https://github.com/MIRACLE-cowf/Powerful-Auto-Researcher.git
Move to the cloned repository
cd Powerful-Auto-Researcher
Inside Powerful-Auto-Researcher, fill in the necessary API keys in the .env file
Install the required libraries
pip install -r requirements.txt
Run main.py
python3 -m main
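If the run fails right away, an unfilled key in the .env file is one possible cause. The sketch below is an optional sanity check using only the standard library. The key names in `REQUIRED_KEYS` are placeholders I made up for illustration, not the project's real key names, so replace them with the ones listed in the repository's own .env file. The helper names `parse_env_text` and `missing_keys` are also hypothetical.

```python
from pathlib import Path


def parse_env_text(text):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def missing_keys(env_values, required):
    """Return the required names that are absent or have an empty value."""
    return [name for name in required if not env_values.get(name)]


# Placeholder names (assumption): use the key names from the project's .env file.
REQUIRED_KEYS = ["EXAMPLE_LLM_API_KEY", "EXAMPLE_SEARCH_API_KEY"]

if __name__ == "__main__":
    env_path = Path(".env")
    if not env_path.exists():
        print("No .env file found in the current directory.")
    else:
        env_values = parse_env_text(env_path.read_text(encoding="utf-8"))
        missing = missing_keys(env_values, REQUIRED_KEYS)
        print("Missing keys:", ", ".join(missing) if missing else "none")
```

Run it from inside the cloned repository so that the relative `.env` path resolves. It only reads the file and never prints the key values themselves.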
As a beginner developer, I am eager for diverse feedback from the brilliant developers on GitHub!
I would appreciate feedback of any kind, whether on Python syntax, structure, prompting, or the README!
Or feel free to leave a comment in Issues or Discussions!
I would greatly appreciate your active feedback!
Thank you!
miracle.cowf@gmail.com