-
**URL**: http://nicoco.net/mylist/51304858
**Browser / Version**: IE 6.0
**Operating System**: Windows XP
**Tested Another Browser**: Yes Chrome
**Problem type**: Site is not usable
**Descripti…
-
**URL**: https://www.nicovideo.jp/watch/sm26869400
**Browser / Version**: IE 6.0
**Operating System**: Windows XP
**Tested Another Browser**: Yes Edge
**Problem type**: Video or audio doesn't p…
-
https://weiren1998.github.io/archives/164bcfad.html
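I have run into many problems during my recent internship. I need to write them up gradually so that I remember them thoroughly.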
-
Hi, I plan to reproduce the results of the WMT-17 translation task as presented in the deepnet paper. Could you please let me know what the command for running the script shou…
-
Thank you for your interesting research. I have some questions regarding the paper:
1. I'm curious about the adaptability of GHNs to other standard-sized datasets, particularly in different tasks s…
-
### What happened?
Since version 1.1.1a3, the RSTUF CLI has required Python 3.10 or above.
This is not clearly stated and confuses users.
We need changes in our documentation and our `pyproject.toml`.
…
-
When I use dart as the booster, I always get very poor l2 results on regression tasks,
even with a small drop_rate such as 0.01 or a large one such as 0.3.
When I use dart in xgboost on same d…
-
Do the post-LayerNorm, residual-branch scaling, and initialization in DeepNet also support vision tasks, such as ImageNet classification and masked image modeling?
-
Hi, I am currently looking into the initialization of BEiT. By default, the output projection and FFN weights are scaled by 1 / sqrt(2*N), where N is the current layer id. When I see the paper …
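
For reference, the scaling rule described in this question can be sketched as follows. This is a minimal illustration only: it assumes layer ids start at 1, and the helper names `residual_init_scale` and `rescale` are hypothetical, not taken from the BEiT code base.

```python
import math


def residual_init_scale(layer_id: int) -> float:
    """Return 1 / sqrt(2 * layer_id), the factor quoted in the question.

    Assumption of this sketch: layer ids are 1-based.
    """
    if layer_id < 1:
        raise ValueError("layer_id must be >= 1")
    return 1.0 / math.sqrt(2.0 * layer_id)


def rescale(weights, layer_id):
    """Scale a weight matrix (list of rows of floats) by the layer's factor."""
    scale = residual_init_scale(layer_id)
    return [[w * scale for w in row] for row in weights]


if __name__ == "__main__":
    # Deeper layers receive a smaller factor.
    for layer_id in (1, 2, 8):
        print(layer_id, residual_init_scale(layer_id))
```

With this rule, the factor shrinks as the layer id grows, so the output projection and FFN weights of deeper layers start out smaller.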
-
**Need code of deepnet for reproduction**
Failed to reproduce the DeepNet paper using the TL;DR section
**Modifications for post-LN transformers**
1. calculate the alpha and beta for encoder and de…