-
This issue collects a wishlist of de-novo implementations of torch based models.
Anyone can suggest models to implement, we will have to prioritize along a impact/cost analysis.
FYI @fnhirwa.
C…
-
numpy 2.0.0 pypi_0 pypi
Why is there no numpy package?
-
---
### Bug Report: In-Place Operation Causes Gradient Error in `conv1d_step` Function
**Issue Description:**
While training the model, I encountered a runtime error related to gradient compu…
-
An Introduction to Vision-Language Modeling
https://arxiv.org/abs/2405.17247
-
-
Consistency Large Language Models: A Family of Efficient Parallel Decoders
https://hao-ai-lab.github.io/blogs/cllm/
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
https://arxiv.or…
-
Would you mind sharing your run configs?
With random guessing of lr, if used revin, scheduler, num epochs etc. i could hardly reproduce your reported result.
Thanks
-
It would be very helpful for verifying the effect of xLSTM in same environment.
-
Multilingual Amazon Reviews Corpus ([MARC](https://paperswithcode.com/dataset/marc)) {En, Jp, De, Fr, Es, Zh} [2015 2019] text classification: review text, the review title, the star rating, an anonym…
-
I was running xLSTM_shape_verification.py with lstm_type changed to mlstm and found that the hidden layer output was incorrectly output as:
AttributeError: 'tuple' object has no attribute 'shape'
Ou…