yuiseki / TRIDENT

🤖🛠🗺 TRIDENT - Interactive Smart Maps Assistant
https://trident.yuiseki.net/
Do What The F*ck You Want To Public License
18 stars 1 forks source link

Consider supporting FOSS LLMs #58

Open yuiseki opened 11 months ago

yuiseki commented 11 months ago

Currently, TRIDENT is only based on the OpenAI API as LLMs. The OpenAI API is difficult to freely customize and control the base model. Supporting FOSS LLMs should make it possible to develop TRIDENT as a fair and transparent FOSS AI assistant.

kshitijrajsharma commented 11 months ago

Hi @yuiseki , Lovely you have already opened issue on this , Some open source LLM on your mind that you are looking to do research on ?

yuiseki commented 11 months ago

@kshitijrajsharma Thanks for your comment! I have already done some research on some OSS LLMs. I share an overview of them below.

In summary, I have started my trial and error with most attention to llama.cpp and OpenLLaMA, but I will continue to keep an eye on LLaMA 2.

kshitijrajsharma commented 11 months ago

Lovely , Are you seeking for help on any of these ? I can checkout few Can you lay out what should I check and test ?

kshitijrajsharma commented 11 months ago

I get little bit hands on LLama model training :

Here is the sample dataset that can be used to train LLama with RLHF

https://huggingface.co/datasets/HuggingFaceH4/stack-exchange-preferences

Now we need similar training dataset to be prepared for overpass questions and query ! There will be some challenges : We might search for training data , in order to retrain or play with it , it needs massive GPU and machine , I tried a demo with collab couldn't go through on free version

Two references : https://lightning.ai/pages/community/tutorial/accelerating-llama-with-fabric-a-comprehensive-guide-to-training-and-fine-tuning-llama/ https://huggingface.co/blog/stackllama

Looks like it can run on 8GB of GPU which is good a standard personal computer nowadays have this , but a Solid GPU and training dataset is needed ,Training dataset is something we can generate by asking community , bootstrapping overpass query examples , Challenge is the machine

yuiseki commented 11 months ago

@kshitijrajsharma Wow! I am amazed at your quickness! Sorry for my delay in responding to you.

I was thinking of having 👍 and 👎 buttons on the frontend user interface of TRIDENT, like the ones on ChatGPT, to actively receive feedback from users. It should allow the model to be continuously improved by building and updating datasets that paired natural language sentences entered by the user and Overpass API queries generated by the model.

But, currently, TRIDENT does not yet have a database to store data permanently, so this will be a major change. So that will be a time-consuming development process.

If you will find the dataset that paired natural language sentences and Overpass API query, please let me know. Narrowing it down from the Stackoverflow question and answer data set seems like a very realistic idea.

https://stackoverflow.com/questions/tagged/overpass-api https://stackoverflow.com/questions/tagged/overpass-api?tab=Votes https://stackoverflow.com/search?tab=votes&q=overpass&searchOn=3

My PC has a 12GB VRAM GPU so I would be able to run the training. I also have a contract with Google Colaboratory Pro.

kshitijrajsharma commented 11 months ago

Does stackoverflow provides API to collect questions and answers ? I need to check license if it enables us to pull data from , I heard stackoverflow is also coming up with its own AI so assuming we might get data to train , if not we can store the result and query from the tool itself , We can design a small python service with database table attach it to the app and include like and dislike button . This seems good idea this will help us collect query and questions tagged with like button and can be used to retrain other LM

yuiseki commented 2 months ago

@kshitijrajsharma Forgive me for contacting you at such a sudden moment. This Issue is my only connection to you. I feel I need your advice now.

Related:

kshitijrajsharma commented 2 months ago

@yuiseki Sorry I was away previous week Thats awesome While you have the resources , I would expect may be you can try running some of the llma with sample spatial queries to start with ? Have you found and updates regarding that issues . How do they performed ? Any closer ??

If you need training data probably this is something we can generate