-
**Describe what's wrong**
We're evaluating different ClickHouse cluster topoligies with the ClickHouse Operator but we seem be facing an issue specially in sharded clusters.
Currently we're work…
-
This line in my custom recipe does not work (the only one that I have added):
from torchtune.datasets import text_completion_dataset
When I run tune, the message is:
ImportError: cannot import n…
-
# Description
When you load a dataset from HF with remote code, the load_dataset function prompts the user for permission to run remote code. This prompt only happens the first time the user downlo…
-
**Describe the bug**
While playing around with the benchmark, I saw that at least some datasets are not correctly normalized. Is this intended behaviour?
**To Reproduce**
```
python generate_dat…
-
I am following the [getting started](https://epfllm.github.io/Megatron-LLM/guide/getting_started.html) guide with mistal-7B model.
- I am able to (1) convert `mistralai/Mistral-7B-v0.1` and (2) …
-
### System Info
(myenv) ubuntu@i~$ python --version
Python 3.10.14
(myenv) ubuntu@i~$ pip --version
pip 23.0.1 from /home/ubuntu/myenv/lib/python3.10/site-packages/pip (python 3.10)
### Informa…
-
any way to fix this issue?
(dlt) PS C:\Users\skyne\omages> python -m src.trainer --opts src/models/omages64_DiT/cfgs/pipeline_N2G2M.yaml --gpus 0 --mode 'test'
**** INFO **** Choosing GPUS: [0]
*…
-
**Describe the bug**
If the training data does not live on NFS but on node-specific storage, the current logic in https://github.com/NVIDIA/Megatron-LM/blob/0bc3547702464501feefeb5523b7a17e591b21fa/m…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
_No …
-
### Feature request
Any documentations for the the `load_dataset(streaming=True)` for (multi-node multi-GPU) DDP training?
### Motivation
Given a bunch of data files, it is expected to split…