-
Tesseract has always included its own, internal binarization – which is **not** based on Leptonica and is of rather bad quality (custom global Otsu implementation without normalization). Leptonica doe…
-
### Describe the bug
I am working with the EfficientAD model and I have been training the model in AWS Sagemaker. I have noticed the GPU memory usage explodes during validation. I was wondering if th…
j99ca updated
8 months ago
-
## 🚀 Feature
This feature is to use a moving windowed median or exponential moving average for gradient clipping and normalization.
Additionally, it'd be nice to support batch skipping if the gra…
-
The system has a nice stat tracking feature. Make the frontend record everything possible about the client experiences.
-
It'd be amazing to have support for a pytorch LayerNormMLP implementation that supports a scale and offset tensor to be applied after the layernorm but before the MLP. Would be curious to hear what it…
-
## Exploration
The element of a group giving the best results is marked in bold. If multiple elements are used together, all of them are marked. \
If a minor positive changes were noted in the early…
-
### Community Note
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the…
-
Hi, thanks for your awesome project!
When I dive into the detail of adaptive_teacher, I find that the vgg16 backbone has BN layers by default.
https://github.com/facebookresearch/adaptive_teach…
-
## Why
Machine Learning 輪講は最新の技術や論文を追うことで、エンジニアが「技術で解決できること」のレベルをあげていくことを目的にした会です。
prev. #84
## What
話したいことがある人はここにコメントしましょう!
面白いものを見つけた時点でとりあえず話すという宣言だけでもしましょう!
-
Hi, @hunto.
Thanks for your answers to my previous questions :[https://github.com/hunto/DiffKD/issues/3](url)
Your work is very meaningful, and it can bring new changes to knowledge distillation. Th…