llm-training Search Results

1000+ results
for llm-training


NVIDIA/nccl #1368

NCCL test, Tree is slower than Ring

We have GPU cluster nodes with 8 * H100 and 4*400 RoCE. I ran the NCCL tests on this cluster with the same nodes, but I find that the tree bus bandwidth (150 GB/s) is lower than the ring bus bandwidth (190 GB/s). From my…

wangdaw2023 updated 5 days ago
2
metavoiceio/metavoice-src #157

Finetuning 1B first-stage on non-English datasets: thoughts

According to original [discord message](https://discord.com/channels/902229215993282581/913488649734213672/1242600853882535946) Hello everyone! I am fine-tuning a model in a non-English language. T…

Ar4ikov updated 4 weeks ago
3
huggingface/text-embeddings-inference #261

Support gte-Qwen1.5-7B-instruct

### Model description Here is the model description > gte-Qwen1.5-7B-instruct is the latest addition to the gte embedding family. This model has been engineered starting from the [Qwen1.5-7B](https:…

reverland updated 3 weeks ago
1
HydPy/HydPy-meetups #66

The Guide to Building Open Indic LLMs Today

**Title of the talk/workshop** The Guide to Building Open Indic LLMs Today **Abstract of the talk/workshop** - Steps in training modern LLMs - Challenges specific to Indic Models (Tokeni…

ramsrigouthamg updated 2 months ago
1
modelscope/swift #1399

Finetuning LLaVA Mistral, gets error when working on multipl…

conda activate swift CUDA_VISIBLE_DEVICES=0,1,2,3 swift sft --model_type llava1_6-mistral-7b-instruct --dataset dataset/abc.jsonl \ Command that i am using, dont know whats wrong in it …

HimanshuBaurai updated 1 week ago
10
e-p-armstrong/augmentoolkit #25

how to use the output/which file to use?

I am using the version from Pinokio, which installs the script by itself. After running it I have 3 output files (my input.txt is 77 KB): master_list.jsonl, processed_master_list.json, simplified_data…

ares0027 updated 2 weeks ago
6
kubeflow/training-operator #2032

[SDK] Use HuggingFace Data Collator for more Transformers i…

More context: https://github.com/kubeflow/training-operator/pull/2031#discussion_r1526533371. Currently, we apply [HuggingFace Data Collator](https://huggingface.co/docs/transformers/en/main_classes/…

andreyvelich updated 3 weeks ago
6
TinyLLaVA/TinyLLaVA_Factory #81

require_grad

![微信图片_20240614110705](https://github.com/TinyLLaVA/TinyLLaVA_Factory/assets/138667911/2006b591-3bda-4bfe-882e-4710dc9d02b7) ![微信图片_20240614110705](https://github.com/TinyLLaVA/TinyLLaVA_Factory/asse…

1764758458 updated 3 weeks ago
6
kubeflow/training-operator #2101

Export Fine-Tuned LLM after Trainer is Complete

We discussed here: https://github.com/kubeflow/website/pull/3718#issuecomment-2096619898 that [our LLM Trainer](https://github.com/kubeflow/training-operator/blob/bb8bba00ff0b48de922c523b0d3051f8b2d4e…

andreyvelich updated 2 months ago
3
fiatrete/OpenDAN-Personal-AI-OS #88

Discussion on the call process for training individual portr…

# Destination AIOS integrates AI portrait process. Includes personal ID photos, artistic photos, pictures of hairstyle changes, clothing changes, sense changes, etc # Basic process - Connect to a …

alexsunxl updated 1 week ago
3

