-
Hi, could you please release the training data too, to enable further research into the model's behavior? Other projects, such as EleutherAI's Pythia, have done that, which has helped get more intere…
-
Can I use LoRA or an adapter to fine-tune on non-instruction data? Which script should I use to process the data and run the fine-tuning?
-
### Required prerequisites
- [X] I have searched the [Issue Tracker](https://github.com/camel-ai/camel/issues) and [Discussions](https://github.com/camel-ai/camel/discussions) that this hasn't alre…
-
**Is your feature request related to a problem? Please describe.**
I'm one of the maintainers of the [llm](https://github.com/rustformers/llm) project, and we're looking for a robust, cross-platform …
-
`DataChunkRecipe` does not work when used in litgpt's TinyLlama pretraining example.
Error: `AttributeError: 'SlimPajamaDataRecipe' object has no attribute 'is_generator'`
the type of SlimPajamaDataReci…
-
When I try to run the code, it doesn't work, and I can't tell why...
![image](https://github.com/karpathy/nanoGPT/assets/136958814/02189d2d-7d4b-4a3d-a7bb-57365f5b3313)
It does not talk like a norma…
-
To reproduce the Llama3 8B weight-only quantization result, should we use 4096 RedPajama with a 2048 context length for 2 epochs or 512 Pile with a 1k context length for 10 epochs? Which approach …
-
I know Stanford Alpaca is non-commercial, but what about tloen/alpaca-lora?
-
I'm trying out the data visualization built on Meerkat. The viz/main.py visualization uses sample GitHub data. Is there a script I can use to extend it to other datasets?
-
We now have an implementation of the HPLT 1.2 mono importer that can merge multiple lines from a document until the merged text reaches a maximum number of words or characters.
The idea is to provid…
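
For anyone who wants to picture the merging behavior described above, here is a rough sketch in Python. It is not the importer's actual code: the function name `merge_lines` and the parameters `max_words` and `max_chars` are made up for illustration, and it assumes a chunk is closed just before the next line would push it over either limit (the real importer may instead stop after crossing the threshold).

```python
from typing import Iterable, Iterator, List, Optional


def merge_lines(
    lines: Iterable[str],
    max_words: Optional[int] = None,
    max_chars: Optional[int] = None,
) -> Iterator[str]:
    """Greedily join consecutive lines of one document into chunks.

    Hypothetical helper: a chunk is emitted just before adding the next
    line would exceed either limit. A single line that is already over
    the limit is emitted on its own rather than split.
    """
    chunk: List[str] = []
    words = 0
    chars = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip empty lines
        line_words = len(line.split())
        # +1 accounts for the joining space when the chunk is non-empty
        line_chars = len(line) + (1 if chunk else 0)
        over_words = max_words is not None and words + line_words > max_words
        over_chars = max_chars is not None and chars + line_chars > max_chars
        if chunk and (over_words or over_chars):
            yield " ".join(chunk)
            chunk, words, chars = [], 0, 0
            line_chars = len(line)  # first line of a new chunk, no joiner
        chunk.append(line)
        words += line_words
        chars += line_chars
    if chunk:
        yield " ".join(chunk)


if __name__ == "__main__":
    doc = ["a b", "c d e", "f"]
    for merged in merge_lines(doc, max_words=4):
        print(merged)
```

With `max_words=4`, the three lines in `doc` come out as two chunks, because adding the second line to the first would make five words.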