-
### Reproduction:
```rust
// Correct
// The kernel call @ candle-metal-kernels/src/lib.rs:2151 receives the following args:
// nrows: 1 ncols: 1024 ncols_pad: 1024
let d = Tensor::rand(-256_f…
-
## Abstract
- present the `Insertion Transformer`, an iterative and partially autoregressive model for sequence generation based on insertion operations
- can generate with an arbitrary ordering
…
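
A minimal sketch of what "generation by insertion with an arbitrary ordering" means, not the paper's model. It assumes a hypothetical `Insertion { token, slot }` operation, and the hard-coded operation lists stand in for what a trained model would predict. Only the fully serial case (one insertion per step) is shown; the partially autoregressive case would apply several insertions per step.

```rust
/// One insertion operation (hypothetical type, for illustration only).
/// `slot` i means "insert before the i-th token of the current canvas";
/// `slot == canvas.len()` appends at the end.
#[derive(Debug, Clone, PartialEq)]
struct Insertion {
    token: String,
    slot: usize,
}

/// Small helper to build an `Insertion` from a string slice.
fn ins(token: &str, slot: usize) -> Insertion {
    Insertion { token: token.to_string(), slot }
}

/// Start from an empty canvas and apply the insertions one at a time.
fn apply_insertions(ops: &[Insertion]) -> Vec<String> {
    let mut canvas: Vec<String> = Vec::new();
    for op in ops {
        // Clamp the slot so a bad index cannot panic in this toy example.
        let slot = op.slot.min(canvas.len());
        canvas.insert(slot, op.token.clone());
    }
    canvas
}

/// Ordinary left-to-right order: every insertion goes at the end.
fn left_to_right() -> Vec<String> {
    apply_insertions(&[ins("three", 0), ins("friends", 1), ins("ate", 2), ins("lunch", 3)])
}

/// A different ordering that starts in the middle of the sentence.
fn middle_out() -> Vec<String> {
    apply_insertions(&[ins("ate", 0), ins("friends", 0), ins("lunch", 2), ins("three", 0)])
}

fn main() {
    // Both orderings print: three friends ate lunch
    println!("{}", left_to_right().join(" "));
    println!("{}", middle_out().join(" "));
}
```

Both orderings reach the same final sequence, which is the sense in which the generation order is arbitrary.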
-
# URL
- https://arxiv.org/abs/2411.04996
# Authors
- Weixin Liang
- Lili Yu
- Liang Luo
- Srinivasan Iyer
- Ning Dong
- Chunting Zhou
- Gargi Ghosh
- Mike Lewis
- Wen-tau Yih
- Luk…
-
Hi, thanks for your amazing work in ["Mixture of Tokens"](https://arxiv.org/abs/2310.15961). However, for training on the causal language modelling task, the paper says that you perform token mixing…
-
[Local collaborative autoencoders](https://sci-hub.ru/https://dl.acm.org/doi/abs/10.1145/3437963.3441808)
[Local latent space models for top-n recommendation](https://sci-hub.ru/https://dl.acm.org/do…
-
Add authors and linked presentations for http://liveoak.github.io/NlsyLinks/research_publications.html
**2015: Association for Research in Personality Conference**
- [x] Additive effects of maternal…
-
**Is your feature request related to a current problem? Please describe.**
When using a mixture of local and global models, the user needs to distinguish the model types.
Here's a list of pract…
-
`tfp.sts.Autoregressive` provides AR modeling. Does `tfp.sts` have a class or function that provides MA or ARMA modeling?
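
For reference, the textbook definitions of the three model families in question are given below. The symbols ($c$, $\mu$, $\phi_i$, $\theta_j$, $\varepsilon_t$) are standard notation assumed here, not names from the `tfp.sts` API.

$$
\begin{aligned}
\text{AR}(p):\quad & x_t = c + \sum_{i=1}^{p} \phi_i\, x_{t-i} + \varepsilon_t \\
\text{MA}(q):\quad & x_t = \mu + \varepsilon_t + \sum_{j=1}^{q} \theta_j\, \varepsilon_{t-j} \\
\text{ARMA}(p, q):\quad & x_t = c + \sum_{i=1}^{p} \phi_i\, x_{t-i} + \varepsilon_t + \sum_{j=1}^{q} \theta_j\, \varepsilon_{t-j}
\end{aligned}
$$

Here $\varepsilon_t$ is white noise. An AR model regresses on past values of the series, an MA model on past noise terms, and ARMA combines both.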
-
First of all, thank you for making all of the lectures and other content public. This is really helpful.
I took a look at the demo implementations for lecture 3 and found some bugs which I am report…
-
Hello folks,
I went through all the issues here, and although we keep talking about data science, I didn't find any proposals for machine learning using either R or Python.
Who is willing to co-start a series of m…