-
We have GPU cluster nodes with 8 * H100 and 4*400 RoCE. I try nccl test on this cluster with the same nodes. But I find tree bus bandwidth(150GB/s) is slower than ring bandwidth (190GB/s). From my…
-
According to original [discord message](https://discord.com/channels/902229215993282581/913488649734213672/1242600853882535946)
Hello everyone! I am fine-tuning a model in a non-English language. T…
-
### Model description
Here is the model description
> gte-Qwen1.5-7B-instruct is the latest addition to the gte embedding family. This model has been engineered starting from the [Qwen1.5-7B](https:…
-
**Title of the talk/workshop**
The Guide to Building Open Indic LLMs Today
**Abstract of the talk/workshop**
- Steps in training modern LLMs
- Challenges specific to Indic Models (Tokeni…
-
conda activate swift
CUDA_VISIBLE_DEVICES=0,1,2,3 swift sft --model_type llava1_6-mistral-7b-instruct --dataset dataset/abc.jsonl \
Command that i am using, dont know whats wrong in it
…
-
i am using the version from Pinokio, it installs the script by itself. after running it i have 3 output files (my input.txt is 77KB) ;
master_list.jsonl
processed_master_list.json
simplified_data…
-
More context: https://github.com/kubeflow/training-operator/pull/2031#discussion_r1526533371.
Currently, we apply [HuggingFace Data Collator](https://huggingface.co/docs/transformers/en/main_classes/…
-
![微信图片_20240614110705](https://github.com/TinyLLaVA/TinyLLaVA_Factory/assets/138667911/2006b591-3bda-4bfe-882e-4710dc9d02b7)
![微信图片_20240614110705](https://github.com/TinyLLaVA/TinyLLaVA_Factory/asse…
-
We discussed here: https://github.com/kubeflow/website/pull/3718#issuecomment-2096619898 that [our LLM Trainer](https://github.com/kubeflow/training-operator/blob/bb8bba00ff0b48de922c523b0d3051f8b2d4e…
-
# Destination
AIOS integrates AI portrait process. Includes personal ID photos, artistic photos, pictures of hairstyle changes, clothing changes, sense changs, etc
# Basic process
- Connect to a …