-
## 📚 Documentation
Create an example showing how to train a small LLM.
Add it to the examples directory here:
https://github.com/pytorch/xla/tree/master/examples
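A rough sketch of what such an example might contain, assuming `torch_xla` is installed; the tiny model, vocabulary size, and random data below are placeholders, not anything from the repo:

```python
# Minimal sketch: train a tiny causal LM on an XLA device.
# All hyperparameters and the random data are illustrative placeholders.
import torch
import torch.nn as nn
import torch_xla.core.xla_model as xm

VOCAB, SEQ, DIM = 1000, 128, 256

class TinyLM(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, DIM)
        layer = nn.TransformerEncoderLayer(DIM, nhead=4, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(DIM, VOCAB)

    def forward(self, x):
        # causal mask so each position only attends to earlier tokens
        mask = nn.Transformer.generate_square_subsequent_mask(x.size(1)).to(x.device)
        return self.head(self.blocks(self.embed(x), mask=mask))

device = xm.xla_device()                      # XLA device (e.g. TPU core)
model = TinyLM().to(device)
opt = torch.optim.AdamW(model.parameters(), lr=3e-4)

for step in range(10):
    batch = torch.randint(0, VOCAB, (8, SEQ), device=device)  # random stand-in data
    logits = model(batch[:, :-1])                             # predict the next token
    loss = nn.functional.cross_entropy(
        logits.reshape(-1, VOCAB), batch[:, 1:].reshape(-1)
    )
    opt.zero_grad()
    loss.backward()
    xm.optimizer_step(opt, barrier=True)      # step + materialize the lazy XLA graph
```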
-
### Feature request
Hi,
I may have a misunderstanding regarding training LLMs. When we train the model, we calculate the loss by having the model predict the next word and then computing the difference …
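For reference, the standard next-token objective shifts the targets by one position and applies cross-entropy; a minimal sketch with made-up tensor shapes:

```python
# Next-token prediction loss: the prediction at position t is compared
# against the actual token at position t+1.
import torch
import torch.nn.functional as F

batch, seq, vocab = 2, 16, 100
logits = torch.randn(batch, seq, vocab)         # model outputs, one per position
tokens = torch.randint(0, vocab, (batch, seq))  # the input token ids

shift_logits = logits[:, :-1, :]   # predictions for positions 0..seq-2
shift_labels = tokens[:, 1:]       # the "next words" they should predict

loss = F.cross_entropy(
    shift_logits.reshape(-1, vocab),  # (batch*(seq-1), vocab)
    shift_labels.reshape(-1),         # (batch*(seq-1),)
)
print(loss)  # scalar cross-entropy over all next-token predictions
```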
-
When running `pretrain.py` with 1 or 4 GPUs and the DDPStrategy as described in the docs, I get the following error:
```bash
"PyTorch/1.12.0-foss-2022a-CUDA-11.7.0/lib/python3.10/.../torch/distribu…
-
When I run build_win.bat, I finally get a DeepSpeed .whl file, so the build problem seems to be resolved. However, when I ran the program, the following issue occurred:
```
File "D:\anaconda3\envs\llm\li…
```
-
LLM Summarizer takes data in a standard format and summarizes it in English.
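As a rough illustration of this kind of tool, using the Hugging Face `transformers` summarization pipeline as a stand-in (not this project's actual implementation):

```python
# Illustrative stand-in: summarize English text with a generic summarization model.
from transformers import pipeline

summarizer = pipeline("summarization")   # downloads a default summarization model
text = "Long standard-format input text goes here ..."
print(summarizer(text, max_length=60, min_length=10)[0]["summary_text"])
```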
-
Dear authors, thanks for your insightful work! I was able to reproduce your model, and my trained video-LLM produces meaningful outputs. However, since this is my first time training a video-LLM, I…
-
Hi, for the text-to-image generation, why not try using existing LLMs, plus the decoder in the tokenizer, and training the whole model with LoRA?
Just like SEED does.
So that it at least won't harm the …
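A minimal sketch of the LoRA wrapping being suggested here, using the `peft` library; the base model id and `target_modules` are illustrative assumptions:

```python
# Hypothetical LoRA wrapping of an existing causal LM with peft;
# the model id and target_modules below are placeholders.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("gpt2")  # stand-in base LLM
config = LoraConfig(
    r=8,                        # low-rank adapter dimension
    lora_alpha=16,
    target_modules=["c_attn"],  # gpt2's fused attention projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()  # only the LoRA adapters are trainable
```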
-
Is the following error caused by the input text exceeding max_length? Does the current repo automatically truncate the input?
```
[rank0]: File "/opt/tiger/Swift-Training/swift/cli/sft.py", line 5, in
[rank0]: sft_main()
[rank0]: File "/opt/tiger/Swift-Training/swif…
-
**Context**
According to this [paper](http://proceedings.mlr.press/v139/zhao21c/zhao21c.pdf), ChatGPT (and likely other LLMs) suffers from a recency bias. Whatever class comes last has a higher probabi…
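The fix that paper proposes is contextual calibration: estimate the model's label probabilities on a content-free input (e.g. "N/A") and divide them out. A tiny sketch with made-up probabilities:

```python
# Contextual calibration (Zhao et al., 2021): correct label bias by
# normalizing against probabilities obtained from a content-free prompt.
import numpy as np

p_real = np.array([0.3, 0.7])  # label probs on a real input (made up)
p_cf   = np.array([0.1, 0.9])  # label probs on a content-free input like "N/A"

calibrated = p_real / p_cf     # divide out the prompt-induced bias
calibrated /= calibrated.sum() # renormalize to a distribution
print(calibrated)              # ~[0.79, 0.21]: class 0 wins once bias is removed
```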
-
Training time ≈ (8 × training tokens × model parameters) / (GPU count × GPU peak FLOPS × GPU utilization)
Unfortunately, the actual time spent does not match this. Does anyone have the co…
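As a sanity check of the formula, a worked example with illustrative numbers (7B parameters, 1T tokens, 64 A100s at 312 TFLOPS, 40% utilization):

```python
# Worked example of the training-time estimate; all numbers are illustrative.
tokens = 1e12          # 1T training tokens
params = 7e9           # 7B model parameters
n_gpus = 64
peak_flops = 312e12    # A100 bf16 peak, FLOP/s
utilization = 0.40     # achieved fraction of peak (MFU)

total_flops = 8 * tokens * params              # ~5.6e22 FLOPs, per the formula above
effective = n_gpus * peak_flops * utilization  # ~8.0e15 FLOP/s across the cluster
seconds = total_flops / effective
print(seconds / 86400)                         # ~81 days
```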