llm-training Search Results

1000+ results
for llm-training

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

allenai/bff #3

Ngram instead of paragraph removal?

Hi @dirkgr! Here is a feature that would be very much desirable for decontamination, but I'm not sure how difficult it would be to implement into BFF: The essential part of the feature would be to …

IanMagnusson updated 1 year ago
1
AkihikoWatanabe/paper_notes #1092

Auto-Instruct: Automatic Instruction Generation and Ranking …

# URL - https://arxiv.org/abs/2310.13127 # Affiliations - Zhihan Zhang, N/A - Shuohang Wang, N/A - Wenhao Yu, N/A - Yichong Xu, N/A - Dan Iter, N/A - Qingkai Zeng, N/A - Yang Liu, N/A …

AkihikoWatanabe updated 1 year ago
2
zhangfaen/finetune-Qwen2-VL #10

pretrain 是否会支持呢

请问大佬，qwen2-vl 的pretrain是否有计划支持呢

Wangman1 updated 1 month ago
2
weaviate/weaviate #3289

Proposal: `Auto` Query API

### The `Auto` API: Enhancing Developer and LLM Experience with Weaviate User-friendliness and intuitiveness of interaction are becoming as crucial as a system's technical capabilities. Recognizing…

CShorten updated 8 months ago
8
SylphAI-Inc/AdalFlow #228

Training with float loss

I'm trying to run prompt training with an LLMasJudge float loss alike G-Eval: 0-0.2-0.4-0.6-0.8-1 values. And the Trainer crashes since it expects the eval values to be 0 or 1 ``` ValueError: acc_sc…

mrdrprofuroboros updated 1 month ago
2
XpressAI/xai-llm-server #2

Feature Request: Add support for Llama-3.2-11B-vision/

### Problem We want to add support for this new model that unlike the previous ones also supports vision. The readme for the model is described below: --- language: - en - de - fr - it - pt…

wmeddie updated 2 months ago
3
ray-project/ray #28860

[Backlog][Collective] Facilitate NCCL test in ray cluster

### Description LLM training in GPU cluster constantly run into NCCL / bad host issues. Ray can help to make running NCCL test in a cluster much easier. We should be able to: - Make it easy t…

jiaodong updated 2 years ago
2
declare-lab/flan-alpaca #21

Any plan to support trl-peft load_in_8bit for training.py ?

Hello, I am fairly new with LLM in general (only started to study 2 weeks ago). So if I say/ask something silly, please excuse me. And I stumble upon this blog post from HuggingFace https://hug…

the-unsoul updated 1 year ago
1
autogluon/autogluon #4082

[AutoMM] Enhancing Multi-GPU Support in Multimodal Training …

## Description: In AutoGluon's multimodal framework, Distributed Data Parallel (DDP) is the primary strategy employed for leveraging multiple GPUs across most problem types. A known limitation of D…

FANGAreNotGnu updated 7 months ago
1
unslothai/unsloth #857

Mlflow Support

Can we log fine tuned llama models using mlflow ?

shaunck96 updated 1 month ago
6

上一页 1...84 85 86 87 88 89 90...100 下一页

1000+ results for llm-training

1000+ results
for llm-training