distributed-datasets Search Results

1000+ results
for distributed-datasets

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

OptimalScale/LMFlow #482

`preprocessing_num_workers` can not use in `scripts/run_fine…

**Describe the bug** tokenizer map in `hf_decoder_model` use multi `preprocessing_num_workers` will return `TypeError: cannot pickle 'torch._C._distributed_c10d.ProcessGroup' object` **To Reprodu…

csyourui updated 3 months ago
6
InternLM/xtuner #719

执行 NPROC_PER_NODE=2 xtuner train /root/StableDiffusionGPT/co…

error log： Generating train split: 3457 examples [00:00, 14292.20 examples/s] Map (num_proc=32): 0%| | 0/3457 [00:00

LTtt456c updated 3 months ago
2
NVIDIA/Megatron-LM #756

[QUESTION] Training Mixtral 8x7B on 16 x H100 only achieves …

As the title says, I wonder if this is normal. If not, how should I optimize it? Logs ``` using world size: 16, data-parallel size: 4, context-parallel size: 1 tensor-model-parallel size: 4…

ShinoharaHare updated 1 week ago
28
dask/distributed #7726

Removal of Finalizer Causes Breakage for UCX Protocol

As mentioned in https://github.com/dask/distributed/issues/7639#issuecomment-1489013077 , we are seeing what we think is a bug due to the removal of the [finalizer for a ThreadPoolExecutor](https://g…

quasiben updated 1 year ago
16
AILab-CVC/YOLO-World #84

I can run with the detector, but segmentation has an issue. …

Here is the error: File "/home/server/Python_Project/django/yolo/YOLO-World/yolo_world/datasets/utils.py", line 28, in yolow_collate masks = datasamples.gt_instances.masks.to_tensor( AttributeE…

POVTUASTHOV updated 6 months ago
4
huggingface/datasets #6437

Problem in training iterable dataset

### Describe the bug I am using PyTorch DDP (Distributed Data Parallel) to train my model. Since the data is too large to load into memory at once, I am using load_dataset to read the data as an it…

21Timothy updated 4 months ago
5
dynamic-superb/dynamic-superb #132

[Task] Audio Tagging on Multiple Datasets

# Task Name Audio Tagging on Multiple Datasets ## Task Objective This task is a variation of "Audio Tagging on AudioSet" before. For details of the original task, please refer to https://gith…

theSillyDinosaur updated 3 months ago
4
huggingface/accelerate #3013

compute_metrics returns duplicated labels if dataset streami…

### System Info ```Shell !pip install transformers==4.44.0 !pip install accelerate==0.33.0 !pip install datasets==2.21.0 !pip install evaluate==0.4.2 !pip install scipy scikit-learn run on a H…

MoritzLaurer updated 1 week ago
6
TRI-ML/prismatic-vlms #43

Using multiple GPUs for inference

Hi, I am trying to run inference with `llama2+13b` and I have 4 RTX3090 each with 24GB Memory, however I noticed that when I use the sample inference code, it only uses one GPU which causes out of …

yunbinmo updated 2 months ago
1
NVIDIA/Megatron-LM #807

[core dataset compilation error]

**Describe the bug** When I am using the most recent Megatrone-LM fork I get the following error ``` make: Entering directory '/workspace/megatron-lm/megatron/core/datasets' g++ -O3 -Wall -sha…

shamanez updated 1 week ago
3

上一页 1...7 8 9 10 11 12 13...100 下一页

1000+ results for distributed-datasets

1000+ results
for distributed-datasets