-
Hi! Thank you for releasing the paper code!
I had some issues understanding the implementation that are solved by now. However, I expect that many of the people who decide to use PowerNorm in their…
-
**Is your feature request related to a problem? Please describe.**
The famous Phi-3 series models offer SOTA performance, especially in reasoning, math and coding. Microsoft released the models under…
-
Greetings. I love what I see here, are you implementing from a specific research paper or is this a home-rolled implementation? I'd love to see some docs on the design/thought process or some statisti…
-
## 🚀 Feature
I would like to contribute to torchmetrics, by implementing the Brier score and its associated decomposition.
### Motivation
The Brier score is widely used when measuring the ca…
-
**Submitting author:** @rodoulak (Rodoula Ktori)
**Repository:** https://github.com/rodoulak/Desalination-and-Brine-Treatment-Simulation-.git
**Branch with paper.md** (empty if default branch): main
*…
-
### Model description
RetNet / Retentive Networks is a new model *archetype* released by microsoft; the research paper is [here](https://arxiv.org/pdf/2307.08621.pdf). As of now, there is *one* model…
-
Hi, Chengcheng Guo and Bo Zhao:
Thanks for your thorough research and clean codes. However, I have some questions about uncertainty based implementation.
As mentioned in the DeepCore paper, samp…
-
Hello, thank you for your good research first of all.
I was trying to reproduce the performance reported in your paper with SCER-GoPro dataset that you shared as a link.
(Before I started training…
-
As suggested by @angeloskath' s code review https://github.com/ml-explore/mlx-examples/pull/315#issuecomment-1894388667, an implementation of `BytePairTokenizer` seems useful for many use cases, but i…
-
Hi, thanks for your good job.
```
# Latent Fusion
def fusion(self, audio_tokens, visual_tokens):
# shapes
BS = audio_tokens.shape[0]
# concat all the tokens
…