-
### Model description
https://github.com/noanabeshima/tiny_model
It's a small language model trained on TinyStories for interpretability with sparse autoencoders and transcoders added. It has no…
-
### Model description
Hello,
The data2vec 2.0 paper was released quite a while ago and has achieved impressive performance across different modalities: speech, text, and image (results similar or bet…
-
### Describe the issue
I am attempting to export a Hugging Face model from PyTorch to ONNX. After exporting the model, I am trying to confirm that the outputs are still correct; however, it appears that…
-
### Describe the issue
The following problem occurred when I optimized Babelscape/mrebel-large:
warnings.warn(
Some non-default generation parameters are set in the model config. These should go …
-
I want to use a custom loss function with ViT. How should I proceed, given that the VisionClassifierTrainer makes no use of a loss?
-
Hi, decoding is too slow: about 5 seconds per utterance.
How can I use multiple threads or multiple CPUs to decode?
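
One generic way to spread decoding over several workers is Python's standard `concurrent.futures` module. The sketch below is a minimal illustration, not the toolkit's own API: `decode_in_parallel` and `fake_decode` are hypothetical names, and `fake_decode` stands in for whatever call actually decodes one utterance. Threads only help if the real decoder releases the GIL (native code often does); otherwise `ProcessPoolExecutor` can be passed as `executor_cls`. This raises throughput across utterances and does not shorten the time for a single one.

```python
from concurrent.futures import Executor, ThreadPoolExecutor
from typing import Callable, List, Sequence, Type


def decode_in_parallel(
    utterances: Sequence[str],
    decode_fn: Callable[[str], str],
    max_workers: int = 4,
    executor_cls: Type[Executor] = ThreadPoolExecutor,
) -> List[str]:
    """Decode utterances concurrently, returning results in input order."""
    with executor_cls(max_workers=max_workers) as pool:
        # Executor.map yields results in the same order as the inputs.
        return list(pool.map(decode_fn, utterances))


def fake_decode(utterance: str) -> str:
    # Hypothetical stand-in for the real per-utterance decoder call.
    return utterance.upper()


if __name__ == "__main__":
    print(decode_in_parallel(["utt1.wav", "utt2.wav"], fake_decode, max_workers=2))
```

When switching to `ProcessPoolExecutor`, the decode function must be defined at module level so it can be pickled, and any model it uses is loaded once per worker process.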
-
Video Title: Natural Language Processing: Trends, Challenges and Opportunities | PyData Global 2021
Link: https://youtu.be/Y2WZEV-Ds-o
0:04 - PyData Co-Chair Welcome Remarks
0:18 - Speaker Marco …
-
### Model description
[MobileViT](https://openreview.net/forum?id=vh-0sUt8HlG) is a computer vision model that combines CNNs with transformers and has already been added to Transformers.
[Mobile…
-
I've been trying to run TinyBERT with OpenCL as the backend, but it fails because aten::index_select isn't implemented.
Running this code gives me the error message as shown in the attached log fil…
-
### Bug Description
Hi
I'm trying to fine-tune Nomic-ai-embedding using SentenceTransformersFinetuneEngine and am running into an issue:
![image](https://github.com/run-llama/llama_index/assets/1…