cs224n Search Results - Githubissues

121 results
for cs224n

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

fly51fly/aicoco #3

爱可可老师24小时热门分享

微博内容精选

fly51fly updated 4 weeks ago
1906
chenyangMl/llama2.c-zh #2

请教一下训练流程

你好！请教一下咱们训练自有中文数据的流程是怎样的？我的理解是首先要分词，也就是 tokenizer . 但是我看你的介绍用 sentensepiece，这个是分词工具吗？适合中文吗？方便介绍一下您是怎么从头训练中文语料的吗？谢谢了!

edisondeng updated 1 year ago
8
antoprince001/QA_Master #2

Further information regarding the project

Did u consider any research paper for this project? If yes, please let me know. How did you calculate the accuracy of this project?

Gondhi-Sahana updated 1 year ago
1
JosselinSomervilleRoberts/BERT-Multitask-learning #31

Batch size needing to be super small with PCGrad

I kept getting out-of-memory errors (CUDA) with PCGrad until I put the batch size all the way down to 16. This might be related to the fact that we don't support AMP with PCGrad right now: https://gi…

JosselinSomervilleRoberts updated 1 year ago
1
JosselinSomervilleRoberts/BERT-Multitask-learning #27

Loss Function Improvement

marie-huynh updated 1 year ago
1
JosselinSomervilleRoberts/BERT-Multitask-learning #30

Incompatibility Scheduler and PCGrad

PCGrad (Gradient surgery) requires the loss for each task, which assumes you ran 1 batch per task, which requires round-robin scheduling. How could we integrate other schedulers into that? (especially…

JosselinSomervilleRoberts updated 1 year ago
1
JosselinSomervilleRoberts/BERT-Multitask-learning #25

Implement gradient surgery (as described in the paper)

JosselinSomervilleRoberts updated 1 year ago
1
JosselinSomervilleRoberts/BERT-Multitask-learning #13

Implement weighted based scheduling (from paper)

JosselinSomervilleRoberts updated 1 year ago
1
TylerYep/torchinfo #161

Failed Cases When Testing with Pytorch v1.12

**Describe the bug** Using current main branch (without any change in the code), several test cases fail **To Reproduce** Steps to reproduce the behavior: 1. Clone the project to your local mac…

mert-kurttutan updated 2 years ago
5
eubinecto/train-of-thoughts #17

Language Model이란 정확히 무엇을 의미할까?

# Rationale? [집현전에서 발표](https://youtu.be/AvW8zX9xG8s)를 하면서, 명준님의 이 논문을 정말 재미있게 읽었다 | --- | | 논문의 제목이 Let Language Models Learn Meaning-Text Correspondence인데. 명준님과 미리 이야기를 해보기도 했어서, 논문의 내용에 …

eubinecto updated 2 years ago
3

上一页 1...4 5 6 7 8 9 10...13 下一页

121 results for cs224n

121 results
for cs224n