-
Thanks for this wonderful project. I used the following script to train the model.
```
torchrun --nnodes=1 --nproc_per_node=2 /home/share/yongqi/project/AutoregressiveImageRetrieval/code/open_flamin…
-
Hi, thank you for your release. I've been reviewing the method we use to calculate the repetition score for identifying duplicate content in documents, specifically the segment where we compute this s…
-
Hi! Thank you for releasing the code.
In the [paper](https://arxiv.org/pdf/2409.04431) you report training Llama2 recipe on 300M tokens of RedPajama dataset. However, in your code I only found exampl…
-
I am trying to reproduce this repo on my macOS, and I don't have a aws account .can i get your help, i'd appreciate it
-
The RETRO model we are currently investigating: https://arxiv.org/pdf/2112.04426.pdf
An example implementation: https://github.com/lucidrains/RETRO-pytorch
Initial data set: https://huggingface.co…
-
### Required prerequisites
- [X] I have searched the [Issue Tracker](https://github.com/camel-ai/camel/issues) and [Discussions](https://github.com/camel-ai/camel/discussions) that this hasn't alre…
-
## 一言でいうと
音響モデルと大規模言語モデル(LLM)の融合によって、ターンテイキングとバックチャンネル予測の精度を向上させる新しいアプローチを提案。
「VAPに対して引用しているが、たいして触れていなかった(自身の主張を通すための一文の引用しかしていない)ため、論文自体の信ぴょう性が微妙に感じたため、スキップ」
### 論文リンク
[2401.14717v1](https://a…
-
-
Hi, thanks for making this project public.
I am trying to run training with fp16 and get the following error:
>RuntimeError: Input type (torch.cuda.HalfTensor) and weight type (torch.cuda.FloatTen…
-
Can you provide an example of how to launch a training instance? how can one choose the llama model size (350M, 750M, .. 7B, etc)? Thanks in advance