-
I wanted to disable the CUDA collective module using the mca parameter `coll_cuda_disable_cuda_coll` but as far as I can tell it isn't actually used in the code.
Is there another way to disable the…
-
I have converted R2D2 model from https://github.com/naver/r2d2/blob/master/extract.py
using fp16_mode=True on T4(Tensor Core supported).
But it shows almost no speed up(just 10~14%).
The model is…
-
## 🐛 Bug
Traceback (most recent call last):
File "tools/train_net.py", line 15, in
from maskrcnn_benchmark.data import make_data_loader
File "/home/harsh/maskrcnn-benchmark/maskrcnn_ben…
-
I know support for multiple GPUs has been requested a few times (#78, #148), but It's now becoming more of issue now that I have much more data.
I was able to get DataParallel to work by modifying …
-
I have Trainable_Segmentation-Trainable_Segmentation-3.2.12.jar in /usr/share/imagej/plugins, where also other installed plugins (jars) are and are working, ie. "Volume Viewer" and "3d Viewer"
It …
-
Hi, I tested EfficientFormerV2-s0 and EfficientFormerV2-s2 on 2080Ti, the input size is 1x3x224x224, and got the result as follows:
EfficientFormerV2-s2: about 24ms/per input,
EfficientFormerV2-s0: …
-
Sometimes it is useful to take a small-ish index and expand it into a large index with K segments for perf/stress testing.
This tool does that. See attached class.
---
Migrated from [LUCENE-2159]…
-
### What happened?
I successfully imported and compiled GPT2 TF with IREE but when running it through the Python bindings, I get a segmentation fault:
```
collected 4 items / 3 deselected / 1 s…
-
teacher:
When I run show_cls.py
i:1 loss: 2.079445 accuracy: 0.031250
i:2 loss: 2.079467 accuracy: 0.000000
i:3 loss: 2.079436 accuracy: 0.062500
i:4 loss: 2.079478 accuracy: 0.000000
i:5 loss: …
-
Really appreciate your fascinating work!
There is no documentation in the examples regarding reproducing the paper's results on the LongBench dataset. Is there any plan to release the scripts used …