-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) and didn't find any similar reports.
### Exp…
-
# LLMs from an industry perspective
The Road to AGI: Essential Techniques of Large Language Models (LLMs)
# Which large models are out there
https://zhuanlan.zhihu.com/p/611403556
# Model architecture
Why are today's LLMs all decoder-only architectures?
From a low-rank perspective
# How to train
[Ladder Side-Tuning: a "ladder over the wall" for pretrained models](https://kexue.f…
-
### Type
new chapter
### Chapter/Page
Something else
### Description
Training models or running inference is fairly easy when we have a smaller number of parameters. But when the scale of…
-
Hi!
Let's bring the documentation to the entire Korean-speaking community 🌏 (currently 9 out of 77 complete).
Would you like to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com…
-
The deaf person will also sometimes try to speak. We need a model to judge whether the transcription quality is poor, and if it falls below a particular threshold we should not show the transcription.
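The gating described above could be sketched as follows. This is a minimal illustration, not an existing API: `should_show`, `CONFIDENCE_THRESHOLD`, and the idea of a single scalar confidence score are all assumptions; a real system might instead use the recognizer's per-utterance log-probability or a dedicated quality model.

```python
# Hypothetical sketch: gate whether a transcription is shown based on a
# confidence score from the ASR system. All names here are made up.
CONFIDENCE_THRESHOLD = 0.6  # assumed tunable cutoff, not a known default

def should_show(transcription: str, confidence: float) -> bool:
    """Hide empty transcriptions and those below the quality threshold."""
    return bool(transcription.strip()) and confidence >= CONFIDENCE_THRESHOLD
```

The threshold itself would need to be tuned against human judgments of transcription quality.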
-
Hi @TimDettmers,
The paper shows that you quantize the weights to 2/4 bits using the NF format. I wonder how you handle the input activations (denoted as x). Is x also quantized to 2/4 bits?
If…
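For context on what such a scheme typically looks like, here is a toy sketch of blockwise absmax quantization against a small normalized codebook. This is not the paper's code: the 4-level codebook below is made up for illustration (the real NF4 codebook has 16 values derived from the quantiles of a standard normal), and in QLoRA-style schemes the activations stay in full precision while weights are dequantized on the fly for the matmul.

```python
# Hypothetical sketch: blockwise absmax quantization of weights to a small
# normalized codebook; activations x stay in full precision.
import numpy as np

# Toy 2-bit (4-level) codebook of normalized values in [-1, 1].
CODEBOOK = np.array([-1.0, -0.33, 0.33, 1.0])

def quantize_block(w: np.ndarray):
    """Store one block of weights as an absmax scale plus codebook indices."""
    scale = float(np.max(np.abs(w)))
    scale = scale if scale > 0 else 1.0
    # Map each normalized weight to its nearest codebook entry.
    idx = np.argmin(np.abs(w[:, None] / scale - CODEBOOK[None, :]), axis=1)
    return scale, idx

def dequantize_block(scale: float, idx: np.ndarray) -> np.ndarray:
    return CODEBOOK[idx] * scale

w = np.array([0.9, -0.5, 0.1, -0.05])
scale, idx = quantize_block(w)
w_hat = dequantize_block(scale, idx)

# Full-precision activations multiply the dequantized weights.
x = np.array([1.0, 2.0, -1.0, 0.5])
y = float(x @ w_hat)
```

Whether the paper in question also quantizes x is exactly what the comment above asks; the sketch only shows the common weight-only arrangement.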
-
Using ChatGLM as the base model, how do I continue pretraining on my own corpus (not fine-tuning)? How should this be done?
-
Ollama - local models on your machine
https://youtu.be/Ox8hhpgrUi0?si=LxpAd1n29InncB78
Open-weight models
- Llama3
- Mistral 7B v0.3
Use cases:
- interactive vs non-interactive
- local RAG…
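A minimal way to drive a locally running Ollama server programmatically is its REST API. The sketch below only builds the request; the `/api/generate` endpoint, default port 11434, and the `model`/`prompt`/`stream` fields follow Ollama's documented REST API, while the model name and prompt are just examples.

```python
# Hypothetical sketch: building a request for a local Ollama server's
# /api/generate endpoint.
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"

def build_request(model: str, prompt: str) -> urllib.request.Request:
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False})
    return urllib.request.Request(
        OLLAMA_URL,
        data=payload.encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

req = build_request("llama3", "What is retrieval-augmented generation?")
# urllib.request.urlopen(req) would send this to a running `ollama serve`.
```

With `stream=False` the server returns one JSON object containing the full response instead of a token stream.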
-
# URL
- https://arxiv.org/abs/2310.16789
# Affiliations
- Weijia Shi, N/A
- Anirudh Ajith, N/A
- Mengzhou Xia, N/A
- Yangsibo Huang, N/A
- Daogao Liu, N/A
- Terra Blevins, N/A
- Danqi Ch…
-
Thanks for the excellent work! Could you please release the training code when you have a chance?