-
### Your current environment
The output of `python collect_env.py`
```text
C:\Users\bobni\OneDrive\Desktop\Projects\p2pIssue>bash
training@Training:/mnt/c/Users/bobni/OneDrive/Desktop/Projects…
-
### Model introduction
WaveCoder 🌊 is a series of large language models (LLMs) for the coding domain, designed to solve code-related problems through instruction-following learning. …
-
Hi everyone interested in Grok-1:
We are the ModelScope team. We trained the HF version of Grok-1 (https://www.modelscope.cn/models/colossalai/grok-1-pytorch/summary) with our training framework SWIFT (http…
-
Now that the new version of xtuner has added dispatch, is fine-tuning of chatglm3-6b no longer supported?
File "/mnt/afs/xtuner/xtuner/model/sft.py", line 93, in __init__
dispatch_modules(self.llm, use_varlen_attn=use_varlen_attn)
File "/mnt/afs/xtuner/…
-
### 🐛 Describe the bug
I use pytorch==2.3.0 and peft to train llama3 8b. When I run my code, it raises an error like:
```text
torch._amp_foreach_non_finite_check_and_unscale_(
RuntimeError:…
-
# URL
- https://arxiv.org/abs/2306.08302
# Affiliations
- Shirui Pan, N/A
- Linhao Luo, N/A
- Yufei Wang, N/A
- Chen Chen, N/A
- Jiapu Wang, N/A
- Xindong Wu, N/A
# Abstract
- Large lang…
-
# 🚀 Feature
We need a kind of AttentionBias like BlockDiagonalCausalMask, but with optional padding.
## Motivation
When training LLMs, the training data may be packed. It may look like
…
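To make the request concrete, here is a minimal sketch (plain Python, function name hypothetical) of the attention pattern such a bias would describe: each packed sequence gets its own causal block, and the optional padding slots after each sequence are masked out for every query.

```python
def packed_causal_mask(seq_lens, pad_lens):
    """Build a boolean attention mask for packed, padded sequences.

    seq_lens[i] tokens of sequence i are followed by pad_lens[i] padding slots.
    mask[q][k] is True when query q may attend to key k: only earlier (or same)
    positions within its own sequence, never padding or other sequences.
    """
    total = sum(l + p for l, p in zip(seq_lens, pad_lens))
    mask = [[False] * total for _ in range(total)]
    start = 0
    for length, pad in zip(seq_lens, pad_lens):
        for q in range(start, start + length):  # queries of this sequence
            for k in range(start, q + 1):       # causal: keys up to the query
                mask[q][k] = True
        start += length + pad                   # skip this sequence's padding
    return mask

# Two packed sequences of lengths 2 and 3, each followed by 1 padding slot.
m = packed_causal_mask([2, 3], [1, 1])
```

This is only an illustration of the desired semantics, not a proposal for the xformers implementation; the real bias would of course be materialized lazily on the GPU rather than as a Python list.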
-
# URL
- https://arxiv.org/abs/2305.09731
# Affiliations
- Jane Pan, N/A
- Tianyu Gao, N/A
- Howard Chen, N/A
- Danqi Chen, N/A
# Abstract
- Large language models (LLMs) exploit in-context le…
-
Hi, thanks for your great work! I want to reproduce the training process, but some errors occurred, as follows. Could you please take a look? Thanks!
Training scripts (I just have 4xA100, so the…
-
Hi @dirkgr! Here is a feature that would be very desirable for decontamination, but I'm not sure how difficult it would be to implement in BFF:
The essential part of the feature would be to …