-
Is there a way to convert the ggml model format into ONNX format, just like we convert transformers to ggml?
Or even convert ggml into transformers format?
My goal is to have onnx format from popular g…
-
Hello, thanks for your good work! Text-generation-inference (TGI) supports the deployment of non-core models according to the official documentation:
> https://huggingface.co/docs/text-generation-inferen…
-
# URL
- https://www.arxiv.org/abs/2409.10309
# Affiliations
- Vojtěch Vančura, N/A
- Pavel Kordík, N/A
- Milan Straka, N/A
# Abstract
- Recommender systems often use text-side information to i…
-
### Link to the paper
[[arXiv:2004.03844] Poor Man's BERT: Smaller and Faster Transformer Models](https://arxiv.org/abs/2004.03844)
### Authors and affiliations
Hassan Sajjad, Fahim Dalvi, Nadir Durrani, Preslav Nakov
…
-
When I run validate.py, I encounter the following error:
Loading checkpoint shards: 100%|██████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:05
-
## Paper link
https://arxiv.org/abs/1901.02860
## Publication date (yyyy/mm/dd)
2019/01/09
## Summary
Fixed-length models such as the Transformer have the problem that they cannot capture longer dependencies. To avoid this, the fixed segment length … the previous se…
-
Theoretically, if you're using transformers, it is possible to train on AWS Neuron instances (trn1).
With Optimum Neuron it should be possible: https://huggingface.co/docs/optimum/main/en/index, https://hug…
-
### System Info
I tried `from transformers.models.t5.modeling_t5 import T5_BiLDModel`, but it doesn't work. I built the library from the transformers repo.
### Who can help?
_No response_
### Information
…
-
Hello! I am working on training a new machine translation model for Thai-English and English-Thai. It may not be done by the v5.0.0 deadline, but I hope the new model will be included in the next release of PyThaiNLP (…
-
**Describe the bug**
When defining a value for `repetition_penalty` and fine-tuning the model, predictions fail with the following error:
```
Prediction: 0%| …