-
### Feature request
In Wav2Vec 2.0, the first few convolution layers affect the attention mask. Thus, if I want to use all Wav2Vec 2.0 outputs (last_hidden_state), I need access to the updated attent…
-
See keras-team/keras#512 and the [main Keras issue](https://github.com/keras-team/tf-keras/issues/183) for some previous discussion on this topic.
`AdaptivePool` is a pooling layer in PyTorch that …
-
i use unsloth to fine tune llama 3-8B..., after traning complete i save this model to hugging face by using 'push_to_hub', but it shows these files :
.gitattributes
README.md
adapter_config.json
…
-
Hi,
Thanks for your great work! I have some questions about the two-stage training. I'd appreciate it if you could share more details.
1. In `Stage 2 - Once-for-All Training`, which model is us…
-
### System Info
```shell
Last version of transformers and Optimum libraries.
```
### Who can help?
@JingyaHuang , @echarlaix, @mi
### Information
- [X] The official example scripts
- [ ] My own …
-
i have 3* A100 40G GPU and i'm trying to train wav2vec2 the pretraining model
the GPU memory is consumed and the utilization is really low
i've tried to increase the max-token till i get out o…
-
### Context
This task regards enabling tests for **baichuan2-7b-chat**. You can find more details under openvino_notebooks [LLM chatbot README.md](https://github.com/openvinotoolkit/openvino_notebook…
-
## ❓ Questions and Help
### Before asking:
1. search the issues.
2. search the docs.
#### What is your question?
When I evaluate a CTC model on wav2cec2.0 according to `fairseq/examples/w…
-
# Export Error Summary Dashboard ##
- This report is generated from branch of https://github.com/huggingface/optimum/pull/1712
- Produced by `RUN_SLOW=1 pytest tests/exporters/onnx -k "test_export…
-
## Why
推薦・機械学習勉強会は、推薦や機械学習、その周辺技術を通じてサービスを改善することにモチベーションのある人達の集まりです。ニュースやブログから論文まで、気になったものについてお互い共有しましょう!
発信のため、ここは **public** にしてあります。外部からの参加をご希望の方は松村(https://twitter.com/yu__ya4) まで DM を送るか、Wa…