-
I saw that the paper uses argmax to obtain the sequence.
As I understand it, that would be the Viterbi algorithm, whose complexity is again O(n).
I'm confused about how it is faster than Au…
-
### Anything you want to discuss about vllm.
This document includes the features in vLLM's roadmap for Q3 2024. Please feel free to discuss and contribute, as this roadmap is shaped by the vLLM com…
-
## Summary
`audio_slice_frames` seems to be deprecated in v0.2.
Was the 10-bit model trained with this version?
## Context
The conditioning network (rrn1) and the auto-regressive network (rrn2) used diff…
-
Hi @hengyuan-hu ,
Thanks a lot for your informative answers!
Do you have an approximate estimate of when you would be able to release the Off-Belief Learning (OBL) code?
Also, have you rel…
-
Dear author,
In the eval_foreard function below, this does not seem to be real autoregressive decoding. Since you concatenate the input and answer_ids together to form the new input_ids, it performs decoding in th…
-
### Model/Pipeline/Scheduler description
Novel-view synthesis through diffusion models has demonstrated remarkable potential for generating diverse and high-quality images. Yet, the independent proce…
-
Hi All! Thanks for this wonderful toolbox and documentation!
I've known about glmdenoise for a while and have periodically thought about using it on our data, and seeing this toolbox as a drop-in r…
-
Hi,
It seems that you're trying to decode auto-regressively using BERT representations as a drop-in replacement for word embeddings. But BERT is bi-directional; the representation at token i has in…
-
**Is your feature request related to a current problem? Please describe.**
When using a mixture of local and global models, the user needs to distinguish the model types.
Here's a list of pract…
-
I use DenseReparameterization for the transition function of a simple state-space model. To sample posterior sequences, I need to auto-regressively apply the layer to its own output inside of a symbol…