redpajama Search Results

514 results
for redpajama

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

shm007g/LLaMA-Cult-and-More #1

main page

track

shm007g updated 1 year ago
7
ai-shifu/ChatALL #188

Falcon 7b[FEAT]

johnfelipe updated 1 year ago
9
togethercomputer/OpenChatKit #97

RuntimeError: Socket Timeout

# sh training/finetune_Pythia-Chat-Base-7B.sh Namespace(use_cuda=True, cuda_id=0, cuda_num=1, debug_mem=True, dist_backend='cupy_nccl', dp_backend='nccl', dist_url='tcp://127.0.0.1:7033', world_size=…

angeliababy updated 1 year ago
8
ManifoldRG/NEKO #58

v0 Dataset and Benchmark Specification

Context: @snat-s has done great work w/ the analysis of various data that may be relevant to Neko We should now, w/ the input of the team, finalize our proposed V0 dataset, justify its Output: doc…

harshsikka updated 10 months ago
4
jungwoo-ha/WeeklyArxivTalk #81

[20230423] Weekly AI ArXiv 만담 시즌2 - 15회차

### News - Conference 소식 - [CHI 2023](https://chi2023.acm.org/): 독일 함부르크, 4.23 - 28 - [ICLR 2023](https://iclr.cc/): 르완다 키갈리(Aㅏ), 5.1-5 - Google Deepmind!!! - Google Brain 과 Deepmind가 하나의 팀…

jungwoo-ha updated 1 year ago
2
irthomasthomas/undecidability #627

MTEB: Massive Text Embedding Benchmark

- [ ] [blog/mteb.md at main · huggingface/blog](https://github.com/huggingface/blog/blob/main/mteb.md?plain=1) # Title: blog/mteb.md at main · huggingface/blog **Description:** "--- title: "MTEB: …

irthomasthomas updated 8 months ago
1
irthomasthomas/undecidability #664

cohere-ai/quick-start-connectors: code for integrating workp…

- [ ] [cohere-ai/quick-start-connectors: This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and businesses to perform seamle…

irthomasthomas updated 8 months ago
1
princeton-nlp/LLM-Shearing #46

在进行Building trainer时，训练会卡住；

你好，我使用的是样例测试集，想跑通README. 但是发现，在训练的时候，会卡住，然后超时； [batch=23/3200]: Train time/batch: 22 Train time/sample: 198 Train time/batch_in_epoch: 6 Train time/sample_in_e…

coderchem updated 10 months ago
1
OpenGVLab/EfficientQAT #12

DATA FOR TRAINING

hi，in the paper you said “we use 4096 samples from RedPajama with a context length of 2048”， is it enough for QAT?

LiMa-cas updated 3 months ago
1
mlfoundations/open_flamingo #294

Failure to use https://huggingface.co/anas-awadalla/mpt-7b m…

100%|███████████████████████████████████████| 933M/933M [01:59 [18](https://file+.vscode-resource.vscode-cdn.net/home/mraway/Desktop/src/open_flamingo/~/.cache/huggingface/modules/transformers_modules…

MRAWAY77 updated 3 months ago
1

上一页 1...8 9 10 11 12 13 14...52 下一页

514 results for redpajama

514 results
for redpajama