-
Source: https://learn.microsoft.com/en-us/training/modules/build-api-azure-functions/4-exercise-create-function-project
Two blocking issues while going through this training module:
Step 5: can'…
-
AttributeError: 'DataParallel' object has no attribute 'device'
Traceback:
Skipping iteration due to error: Caught RuntimeError in replica 0 on device 0.
Original Traceback (most recent call last…
-
after I use the "yolo export model=yolov11_checkpoints/yolo11x-seg.pt format=onnx dynamic=False opset=12" to generate the onnx. When I use the code to inference,there are no results.
-
**How to customise the train.sh for a distributed Mamba Training ?**
Hello,
As i've seen in the megatron modules, there isn't a pre-defined bash script to pre-train a mamba model on multi-gpu, ho…
-
I am encountering an issue while training the PathFormer model with my own custom dataset: NaN values appear during some epochs, causing the training process to halt. Below is the specific error messa…
-
Great work!
I commented all the push_to_hub in the code. Is synthetic_data_llama-3-8b-instruct-sppo-iter3_score dataset generated by PairRM?
[rank4]: Traceback (most recent call last):
[rank4]:…
-
### Search before asking
- [X] I have searched the YOLOv5 [issues](https://github.com/ultralytics/yolov5/issues) and [discussions](https://github.com/ultralytics/yolov5/discussions) and found no simi…
-
**optimum-neuron version: 0.0.10**
aws-neuronx-runtime-discovery 2.9
libneuronxla 0.5.391
neuronx-cc 2.8.0.25+a3ad0f342
neuronx-hwm 2.8.0.3+…
-
(som) ygz@Ubuntu:/shape-of-motion$ python run_training.py --work-dir ./data/som2/ data:custom --data.seq-name cam01 --data.root-dir ./data/cook_spinach/
2024-07-24 11:23:13.397 | DEBUG | flow3d.da…
-
### What happened?
I've been experiencing a periodic crash during training SDXL finetune. My training settings have been identical for the past 2 months and this bug is recent as of a week ago. I h…