-
-
Trying to train with the examples (trainer.yaml, unified_metric.yaml), I am facing some memory issues.
I changed `precision: 16-mixed` in trainer.yaml but it does not help. It starts OK with low mem…
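If trainer.yaml follows the PyTorch Lightning Trainer schema (an assumption, since the file itself is not shown), the usual memory knobs beyond precision look roughly like this sketch:

```yaml
# Hypothetical trainer.yaml excerpt -- key names assume a PyTorch Lightning
# Trainer config; check the actual file for the exact schema.
precision: 16-mixed          # already tried above
accumulate_grad_batches: 4   # smaller per-step batch, same effective batch
devices: 1
```

Lowering the dataloader batch size while raising `accumulate_grad_batches` keeps the effective batch size constant but cuts peak activation memory.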
-
I tried to fine-tune the XLM-RoBERTa Large model in a Google Colab environment for 3 epochs using a `1e-5` learning rate, batch size 16, 2 gradient accumulation steps, and 120 warmup steps, but the loss didn't converge…
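For reference, those hyperparameters imply an effective batch size of 16 × 2 = 32, and with linear warmup the learning rate ramps to `1e-5` over the first 120 optimizer steps. A minimal sketch, assuming a linear warmup schedule (the snippet does not state which scheduler was used):

```python
def lr_at_step(step, base_lr=1e-5, warmup_steps=120):
    """Linear warmup then constant LR (a common transformer schedule)."""
    if step < warmup_steps:
        return base_lr * step / warmup_steps
    return base_lr

# Effective batch size = per-device batch size x gradient accumulation steps.
effective_batch = 16 * 2  # -> 32
```

Checking a few steps of such a schedule (e.g. that the LR at step 60 is half of `base_lr`) is a quick way to rule out a misconfigured warmup when the loss fails to converge.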
-
Zhengzhou intelligent-computing project: we need to migrate the https://huggingface.co/maidalun1020/bce-embedding-base_v1 model to an NPU. An introduction to bce-embedding-base_v1 is at https://github.com/netease-youdao/BCEmbedding/blob/master/README_zh.md
-
GPU: V100
CUDA version: 12.2
Thanks for your great work. Now I want to deploy XLMRoberta with TensorRT-LLM, which only has a small tweak to the position_ids in bert_embeddings, so following the issue…
ehuaa updated
6 months ago
-
Multilingual reranker
https://huggingface.co/corrius/cross-encoder-mmarco-mMiniLMv2-L12-H384-v1
Apache
-
I am trying to convert the DeBERTa model to ONNX for faster inference but got the following exception:
Exception: The current ONNX conversion only support 'BERT', 'RoBERTa', and 'XLMRoberta' mod…
-
I'm working on deploying the huggingface roberta model to TensorRT-LLM, which has a little tweak from BERT in the embeddings.
In RobertaEmbedding the position_ids are calculated as follows:
![image](https…
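For context, RoBERTa derives position_ids from the input ids rather than using a fixed arange like BERT: non-padding tokens get consecutive positions starting at `padding_idx + 1`, and padding tokens keep `padding_idx`. A pure-Python sketch of that behavior (mirroring, not reproducing, `create_position_ids_from_input_ids` in the transformers library):

```python
def create_position_ids(input_ids, padding_idx=1):
    # Non-pad tokens get consecutive positions starting at padding_idx + 1;
    # pad tokens keep position == padding_idx. This is the tweak a
    # TensorRT-LLM port of RobertaEmbedding has to reproduce.
    position_ids, pos = [], padding_idx
    for tok in input_ids:
        if tok == padding_idx:
            position_ids.append(padding_idx)
        else:
            pos += 1
            position_ids.append(pos)
    return position_ids

print(create_position_ids([0, 31414, 328, 2, 1, 1]))  # [2, 3, 4, 5, 1, 1]
```

Note that because positions start at `padding_idx + 1`, a ported embedding layer must also account for the extra offset when sizing the position-embedding table.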
ehuaa updated
10 months ago
-
**Description**
After deploying a transformer model using Triton Inference Server, I am getting different output from the local copy of the same model.
**Triton Information**
23.03-py
Are you usi…
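When local and Triton outputs differ, a first check is whether the difference is within the tolerance expected from precision changes (e.g. FP16 or TF32 kernels on the server). A debugging sketch, assuming both outputs are available as numpy arrays (`local_out` and `triton_out` are placeholder names):

```python
import numpy as np

def outputs_close(local_out, triton_out, atol=1e-3, rtol=1e-3):
    # Small numeric drift between runtimes is normal; exact equality is not.
    # Large drift usually points at a preprocessing or dtype mismatch instead.
    return np.allclose(local_out, triton_out, atol=atol, rtol=rtol)
```

If the outputs fail even a loose tolerance, compare the tokenized inputs and input dtypes before suspecting the server itself.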
-
## Description
I'm trying to test [Vespa](https://docs.vespa.ai/) application with [custom Embedder](https://docs.vespa.ai/en/embedding.html) that uses DJL's HuggingFaceTokenizer under the hood.
…
dnmca updated
11 months ago