-
I get this error when i Try to run a query
Truncation was not explicitly activated but `max_length` is provided a specific value, please use `truncation=True` to explicitly truncate examples to max…
-
Hi @rprabhavalkar @tonybruguier-google
I was wondering if the code for this paper ("On the Choice of Modeling Unit for Sequence-to-Sequence Speech Recognition") was open source:
[https://www-i6.inf…
-
Hi, I just follow your architecture and run the code based on https://github.com/Toshihiro-Ota/decision-mamba. But the training time is unacceptable, one epoch needs 8 hours. Do you have any suggestio…
-
https://github.com/state-spaces/mamba
_Mamba: Linear-Time Sequence Modeling with Selective State Spaces_
Albert Gu, Tri Dao
[Paper](https://arxiv.org/abs/2312.00752)
-
Thank you for maintaining such an important repository. I really enjoyed and learned a lot from reading your DPO paper.
I have one question regarding the SFT loss implementation in the repository. …
-
Token indices sequence length is longer than the specified maximum sequence length for this model (749 > 512). Running this sequence through the model will result in indexing errors
Traceback (most r…
-
**Describe the bug:**
BUSCO step in eukaryotic binning fails, failing to create database in tmp
**Versions**
e.g.,
```
veba_binning-eukaryotic_2.2.0.sif
```
**Command used to produce erro…
-
Hi,
I recently came across an issue when using context parallelism for splitting long sequence with NeMo and Transformer Engine. The context parallelism splits sequence length across GPUs and use p…
-
https://arxiv.org/abs/2010.06065
-
https://arxiv.org/pdf/1803.01271.pdf
#### Subject
+ An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
+ 시퀀스 모델링을 위한 일반 컨볼루션 및 반복 네트워크의 실증적 평가
#### A…