-
## 🚀 Feature
HF transformers implements 8-bit and 4-bit quantization. It would be nice if that feature could be leveraged for the xlm-r-xxl machine translation evaluation model.
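For reference, a minimal sketch of the HF-side loading this would rely on. The checkpoint name and the `device_map` choice are assumptions on my part, and COMET's own model-loading path would still need to expose the quantization config:

```python
from transformers import AutoModel, AutoTokenizer, BitsAndBytesConfig

# Sketch, assuming bitsandbytes is installed and that
# "facebook/xlm-roberta-xxl" stands in for the eval model's encoder.
quant_config = BitsAndBytesConfig(load_in_8bit=True)
model = AutoModel.from_pretrained(
    "facebook/xlm-roberta-xxl",
    quantization_config=quant_config,
    device_map="auto",  # if the class doesn't support "auto" yet, an explicit map like {"": 0} works
)
tokenizer = AutoTokenizer.from_pretrained("facebook/xlm-roberta-xxl")
```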
### Motivation
The lar…
-
```bash
python finetune/run_classifier.py --pretrained_model_path models/roberta-base-finetuned-dianping-chinese/pytorch_model.bin \
                                  --vocab_path models/google_zh_vocab.txt…
```
-
## ❓ Questions and Help
### Before asking:
1. Search for similar [issues](https://github.com/Unbabel/COMET/issues).
2. Search the [docs](https://unbabel.github.io/COMET/html/index.html).
…
-
### System Info
I have trained a model and am now trying to load and quantise it, but I get the error:
```
BertForSequenceClassification does not support 'device_map':"auto" yet
```
Code for loading …
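One possible workaround (an assumption on my part, not a confirmed fix): skip the `"auto"` map and place the whole model on a single device with an explicit map, which bypasses the auto-map support check:

```python
from transformers import AutoModelForSequenceClassification, BitsAndBytesConfig

# Hypothetical workaround: an explicit single-device map avoids the
# "does not support device_map='auto'" check. "path/to/finetuned-bert"
# is a placeholder for the trained checkpoint.
quant_config = BitsAndBytesConfig(load_in_8bit=True)
model = AutoModelForSequenceClassification.from_pretrained(
    "path/to/finetuned-bert",
    quantization_config=quant_config,
    device_map={"": 0},  # put the entire model on GPU 0
)
```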
-
I have to say, it is quite astonishing how fast everything moves forward here, and the community too!
If you pause for a month, you feel you have lost track of all the progress. :)
My experiments…
-
### Describe the issue
CUDA 10.2
onnx=1.8
onnxruntime-gpu=1.6
For a sequence labeling task (input: token ids; output: start_pos, end_pos), PyTorch uses 1.8 GB, but ONNX uses 1.9 GB (although …
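For comparison, a minimal sketch of how the ONNX side of such a measurement is typically run; the file name, input name, output layout, and shapes below are placeholders, not taken from the issue:

```python
import numpy as np
import onnxruntime as ort

# Run the exported model on GPU via the CUDA execution provider.
sess = ort.InferenceSession("model.onnx", providers=["CUDAExecutionProvider"])

# Placeholder batch of token ids: batch size 1, sequence length 128,
# arbitrary vocabulary size.
input_ids = np.random.randint(0, 30000, size=(1, 128), dtype=np.int64)
start_pos, end_pos = sess.run(None, {"input_ids": input_ids})
```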
-
Is it possible to use a different tokenizer with multilingual support in the Donut processor, e.g. the mBART tokenizer instead of XLMRobertaTokenizerFast?
@NielsRogge
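An untested sketch of the idea, assuming a recent `transformers` release where `DonutProcessor` accepts an arbitrary tokenizer; whether the decoder works without retraining against the new vocabulary is an open question:

```python
from transformers import AutoTokenizer, DonutProcessor

# Reuse Donut's image processor, but swap the tokenizer for mBART's.
base = DonutProcessor.from_pretrained("naver-clova-ix/donut-base")
mbart_tokenizer = AutoTokenizer.from_pretrained("facebook/mbart-large-50")
processor = DonutProcessor(
    image_processor=base.image_processor,
    tokenizer=mbart_tokenizer,
)

# Note: the decoder's embedding matrix would still have to be resized
# (and likely retrained) to match the mBART vocabulary.
```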
-
Fine-tune 3 (or more) popular models and compare their performance to DistilBERT on the movie sentiment analysis task; a baseline training sketch follows the list below.
Some choices:
- GPT-3
- LaMDA
- Turing-NLG
- XGen
- Llama 2 (7 billion)
- Gemini
- Pic…
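As a starting point, a sketch of the DistilBERT baseline with the HF `Trainer`; the IMDB dataset and the hyperparameters are assumptions, and the same harness can then be pointed at the open checkpoints above:

```python
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

dataset = load_dataset("imdb")
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=256)

dataset = dataset.map(tokenize, batched=True)
model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2
)

args = TrainingArguments(
    output_dir="distilbert-imdb",
    per_device_train_batch_size=16,
    num_train_epochs=1,
    evaluation_strategy="epoch",
)
trainer = Trainer(
    model=model,
    args=args,
    train_dataset=dataset["train"],
    eval_dataset=dataset["test"],
    tokenizer=tokenizer,  # enables dynamic padding via the default collator
)
trainer.train()
```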
-
@dirkneuhaeuser Thanks for making the world a better place; your classifier is extremely helpful for natural language understanding.
Unfortunately, 91% accuracy is still not really great for widespre…
-
Hi @frankaging,
when I run causal_training.py
I get the error: `forward() got an unexpected keyword argument 'interchanged_variables'`
Log:
```
01/14/2022 07:41:12 - INFO - utils - PID: 2342567 - Using M…
```
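For context, a generic illustration of what raises this `TypeError` (not the repo's actual classes): the model's `forward()` simply does not declare the keyword the training loop passes:

```python
import torch.nn as nn

class Student(nn.Module):
    # 'interchanged_variables' is missing from this signature.
    def forward(self, input_ids=None):
        return input_ids

model = Student()
model(input_ids=None, interchanged_variables=[])
# TypeError: forward() got an unexpected keyword argument 'interchanged_variables'
```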