redpajama Search Results

mlfoundations/open_flamingo #232

fsdp Error report

Thanks for this wonderful project. I used the following script to train the model. ``` torchrun --nnodes=1 --nproc_per_node=2 /home/share/yongqi/project/AutoregressiveImageRetrieval/code/open_flamin…

liyongqi67 updated 1 year ago

togethercomputer/RedPajama-Data #116

Inquiry About Character-Level Basis of Duplication Calculati…

Hi, thank you for your release. I've been reviewing the method we use to calculate the repetition score for identifying duplicate content in documents, specifically the segment where we compute this s…

luc1fer3 updated 2 months ago

apple/ml-sigmoid-attention #4

reproducing Language Modeling results

Hi! Thank you for releasing the code. In the [paper](https://arxiv.org/pdf/2409.04431) you report training Llama2 recipe on 300M tokens of RedPajama dataset. However, in your code I only found exampl…

Golovneva updated 4 days ago

togethercomputer/RedPajama-Data #99

what's the specific meaning of dsir?

I am trying to reproduce this repo on my macOS, and I don't have a aws account .can i get your help, i'd appreciate it

BBetteroff updated 8 months ago

artificialwisdomai/origin #48

Load data on an RETROformer based on Deepmind Retroformer mo…

The RETRO model we are currently investigating: https://arxiv.org/pdf/2112.04426.pdf An example implementation: https://github.com/lucidrains/RETRO-pytorch Initial data set: https://huggingface.co…

rstarmer updated 1 year ago

camel-ai/camel #850

[Feature Request] Add `SmolLM` model and WebLLM

### Required prerequisites - [X] I have searched the [Issue Tracker](https://github.com/camel-ai/camel/issues) and [Discussions](https://github.com/camel-ai/camel/discussions) that this hasn't alre…

lightaime updated 3 weeks ago

personabb/survey_paper #2

【2024/01】TURN-TAKING AND BACKCHANNEL PREDICTION WITH ACOUSTI…

## 一言でいうと音響モデルと大規模言語モデル（LLM）の融合によって、ターンテイキングとバックチャンネル予測の精度を向上させる新しいアプローチを提案。「VAPに対して引用しているが、たいして触れていなかった（自身の主張を通すための一文の引用しかしていない）ため、論文自体の信ぴょう性が微妙に感じたため、スキップ」 ### 論文リンク [2401.14717v1](https://a…

personabb updated 3 months ago

Lightning-AI/lit-llama #167

Add Howto for converting RedPajama data and run pre-training…

lantiga updated 1 year ago

mlfoundations/open_flamingo #260

Mismatch input type and weight type when training with preci…

Hi, thanks for making this project public. I am trying to run training with fp16 and get the following error: >RuntimeError: Input type (torch.cuda.HalfTensor) and weight type (torch.cuda.FloatTen…

hungvo304ml updated 2 months ago

dhakalnirajan/LLaMA-BitNet #1

Usage example

Can you provide an example of how to launch a training instance? how can one choose the llama model size (350M, 750M, .. 7B, etc)? Thanks in advance

andreamigliorati updated 5 months ago

504 results for redpajama

504 results
for redpajama