-
- Zhang L, Agrawala M. [Adding Conditional Control to Text-to-Image Diffusion Models](https://arxiv.org/abs/2302.05543).
- Zhang L. [Ablation Study: Why ControlNets use deep encoder? What if it was…
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
Insightful work!!!
During the study of your paper, I have some questions (My English is not very good, and I am not aggressive, just some confusion):
1. The first problem is Figure 2. After Sigmoi…
-
I have a GPU with 15GB and it seems it runs out of space when I try to train the network with 50 videos at a time. Do you think it would be better to repeat the loss training video per video, instead …
-
The [BoofCV QR Benchmark page](http://boofcv.org/index.php?title=Performance:QrCode)
shows that recognizing blurry QR codes is still a problem.
Unfortunately, for mobile QR-code scanning with smar…
-
I would like to use the memory saving gradients to train a U-net model with bigger patches or/and increased batch size. I implemented a toy example to assess the memory usage when switching from tf.Op…
-
I get the following error when starting my training:
```
Traceback (most recent call last):
File "tr_baseline.py", line 75, in
optimizer = Ranger21(params=model.parameters(), lr=learning_ra…
-
I'm also working on DFL recently, but I didn't expect to use gray and white photos as a dataset, I watched your experimental process from start to finish, I think binarized images can only reduce the …
-
Really nice work and great idea for the DeepFillv2!! Any plan and expected date to release the DeepFillv2 code?
-
## 🐛 Bug
When using `Conv1d` with a large kernel size (1024 for instance) on gpu, the cudnn implementation is very slow and gets slower as I increase the kernel size. I thought it was using FFT but…