Closed neokd closed 1 year ago
@neokd do we need our own dataset or can we refer external data sets.
I find some external dataset from Kaggle and other sources
We can have from external sources too.
We can have from external sources too.
I have a few external links how do want me to add them
I would suggest the 1st approach
Share in this issue we'll check and add them to the repo
Can you assign it to me and close the issue?
@neokd cam we close this issue or do we need Anything more.
You can add it as an readme or some file referencing the links something similar to https://github.com/Zjh-819/LLMDataHub this repo which u shared
Sure @neokd . I will create the readme.
Created a new PR for this issue
Description
Add datasets that can be used to fine tune LLM. This issue is open for any type of dataset for respective LLM's (LLAMA2, MISTRAL,etc)
Expected Behaviour
The dataset should be large in size and should follow formats for respective LLM's