-
https://arxiv.org/abs/1903.06586
-
### Describe the bug
Running the tests from the 0.6.0 tag, the accelerate tests fail with the following stack trace:
```
self =
@require_accelerate
@unittest.skipIf(torch_device != "…
-
We are using `examples/nlp/language_modeling/megatron_gpt_pretraining.py` with a small GPT model (1.35 billion parameters) on 4 H100 DGX nodes (8 GPUs each). Our DGX nodes are connected using Infini…
-
I am working on implementing tiny-YOLO using PipeCNN, was just looking for some advice and guidance for the best way to do it and the steps I should take.
I'm going to convert the tiny-YOLO weights…
-
**Describe the bug**
Context parallel does not work in some cases, such as pretrain llama-34b with 64 A800 GPUs and seqlen>=32768. **But using megatron-lm directly has no problem with the same conf…
-
i want to build an application which can predict `single picture`, where should i modify in`test.py`;
another question is can i use `CPU` to run `test.py`
-
I trained a Motion Deeplab using the MOTChallenge according to the instructions. The training process seems to have run fine (sidenote: is it reasonable that I could only use batch size = 1 due to CUD…
-
E:\anaconda3\envs\pytorch\python.exe C:/yolov5-master/detect.py
detect: weights=C:/Users/10980/PycharmProjects/best.pt, source=runs/detect/exp25/zidane.jpg, imgsz=[640, 640], conf_thres=0.25, iou_thr…
-
Just want to share how to run on the latest version of Detectron2 (v0.6):
## 1. Environment
CUDA 11.1
Torch>=1.9.0 ```pip install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio==0.9.0 …
-
Implemented weighted-multi_input-`[shortcut]` layer with weights-normalization, added:
New [shortcut] can:
* can take more than 2 input layers for adding: `from = -2, -3` (and -1 by default)
*…