transformer-models Search Results

1000+ results
for transformer-models

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ggerganov/ggml #886

Convert ggml file to onnx format

Is there a way to convert ggml model format into onnx format? just like we convert transformers to ggml? Or even convert ggml into transformers format? My goal is to have onnx format from popular g…

thewh1teagle updated 4 months ago
1
huggingface/text-generation-inference #2447

Generation kwargs assignment when processing a request

Hello, thanks for your good work! Text-generation-inference (tgi) supports the deployment of non-core model according to the official documents: > https://huggingface.co/docs/text-generation-inferen…

ChenlongDeng updated 3 months ago
2
AkihikoWatanabe/paper_notes #1421

beeFormer: Bridging the Gap Between Semantic and Interaction…

# URL - https://www.arxiv.org/abs/2409.10309 # Affiliations - Vojtěch Vančura, N/A - Pavel Kordík, N/A - Milan Straka, N/A # Abstract - Recommender systems often use text-side information to i…

AkihikoWatanabe updated 2 months ago
1
shimopino/papers-challenge #74

Poor Man's BERT: Smaller and Faster Transformer Models

### 論文へのリンク [[arXiv:2004.03844] Poor Man's BERT: Smaller and Faster Transformer Models](https://arxiv.org/abs/2004.03844) ### 著者・所属機関 Hassan Sajjad, Fahim Dalvi, Nadir Durrani, Preslav Nakov …

shimopino updated 4 years ago
1
sail-sg/Agent-Smith #4

ValueError: Expected input batch_size (609) to match target …

When I run the validate.py,I encounter the following error: Loading checkpoint shards: 100%|██████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:05

bifenghaiziyou updated 1 week ago
1
yoheikikuta/paper-reading #30

[1901.02860] Transformer-XL: Attentive Language Models Beyon…

## 論文リンク https://arxiv.org/abs/1901.02860 ## 公開日（yyyy/mm/dd） 2019/01/09 ## 概要 transformer などの fixed length のモデルだと longer dependence を取り込むことができないという問題がある。それを回避するために fixed segment length を前の se…

yoheikikuta updated 3 months ago
8
UKPLab/sentence-transformers #3010

Training/Finetune in trn1

Teorically it you're using transformers, it is possible to train in aws neuron instances (trn1) With optimun neuron should be possible https://huggingface.co/docs/optimum/main/en/index, https://hug…

sonic182 updated 1 month ago
3
kssteven418/BigLittleDecoder #3

How to import T5_BiLD model in run_translation task.

### System Info I try from transformers.models.t5.modeling_t5 import T5_BiLDModel, but it doesn't work. I build the library from transformer repo. ### Who can help? _No response_ ### Information …

sufeidechabei updated 1 month ago
1
PyThaiNLP/pythainlp #899

Retraining Machine Translation model for Thai-English and En…

Hello! I am working train new Machine Translation model for Thai-English and English-Thai. It's may doesn't done in v5.0.0 deadline but I hope new model will include in the next release of PyThaiNLP (…

wannaphong updated 1 month ago
4
ludwig-ai/ludwig #3876

repetition_penalty bugged out

**Describe the bug** When defining a value for `repetition_penalty` & fine-tuning the model, predictions fail with the following error: ``` Prediction: 0%| …

rlleshi updated 1 month ago
2

上一页 1...87 88 89 90 91 92 93...100 下一页

1000+ results for transformer-models

1000+ results
for transformer-models