-
Hello,
I'm trying to run the dcgan/main.py file to train a GAN.
I'm using a Windows 7 system with python 3.7 (anaconda)
I run the following line
%run main.py --dataset lsun --dataroot bedroom…
-
I download the [pretrained model](https://drive.google.com/drive/folders/1UC3XOoezeum0uck4KBVGa8osahs6rKUY?usp=sharing), and run
```bash
python test.py --dataset Synapse --cfg configs/swin_tiny_patc…
-
Hi.
I used the ft_net_swin for loading the model. When I run my code in my device there is no problem but in the kaggle I got this error.
```
File /opt/conda/lib/python3.10/site-packages/torch/nn…
-
After creating a model with the builder and loading data into it, one then selects parameters to fit.
If any parameter is selected, is usually appears on the summary tab with a reasonable range aroun…
-
I'm putting it here for backlog.
Looking at the example below, the two models produces the same result. By pulling the reduction op ahead of the broadcast+mul, codegen seems to be getting much bett…
-
The ones we need for Transformer Engine are the following:
1) CUBLASLT_EPILOGUE_GELU_AUX
step 1 : matrix multiplication
step 2 : apply gelu
step 3 : store the result to s…
-
just parking a note for computational shortcut
Khuran et al 2014 top of page 5252 for standard parameterization of generalized Ridge
Most of the jackknife-Ridge literature uses canonical parame…
-
### Search before asking
- [x] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### YOLOv8 Component
Integrations, Other
###…
-
In the MultiheadAttention implementation on the attention branch, attention masking is not implemented. Is that because it is difficult/impossible to do using KeOps? If that's not the case, how would …
-
It looks like the NHWC, with an outer reduction, takes double the time of the NCHW layout. Two things that look suspect about the NHWC kernel is that it uses 128 register compared to 36 registers for…