-
A current disadvantage of using large language models for NER is that they cannot match the performance of a fine-tuned BERT. Is there any way to address this, for example through prompting? If the lar…
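One possible direction is few-shot prompting. Below is a minimal sketch, assuming an OpenAI-compatible chat client; the model name, label set, and few-shot example are placeholders, not a recommendation from this thread:

```python
# Few-shot prompting sketch for NER with an LLM.
# Assumes an OpenAI-compatible client; model name, label set,
# and the in-context example are placeholders.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

FEW_SHOT_PROMPT = """Extract named entities as JSON with keys PER, ORG, LOC.

Text: Tim Cook visited Berlin to meet Siemens executives.
Entities: {"PER": ["Tim Cook"], "ORG": ["Siemens"], "LOC": ["Berlin"]}

Text: <TEXT>
Entities:"""

def llm_ner(text: str) -> str:
    # .replace() instead of .format() so the JSON braces need no escaping
    prompt = FEW_SHOT_PROMPT.replace("<TEXT>", text)
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name
        messages=[{"role": "user", "content": prompt}],
        temperature=0.0,
    )
    return response.choices[0].message.content

print(llm_ner("Angela Merkel spoke in Paris."))
```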
-
I have seen the new BERT-related changes and I'm trying to use this model. Will the documentation be updated with the BERT parameters and an example of pre-training or fine-tuning?
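Until documentation lands, a minimal fine-tuning sketch using the Hugging Face `transformers` Trainer may help; the dataset (`glue/sst2`) and hyperparameters here are illustrative assumptions, not this project's official recipe:

```python
# Minimal sketch: fine-tune BERT for sequence classification.
# Dataset and hyperparameters are illustrative, not official values.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2)

dataset = load_dataset("glue", "sst2")
encoded = dataset.map(lambda ex: tokenizer(ex["sentence"], truncation=True),
                      batched=True)

args = TrainingArguments(
    output_dir="bert-finetuned",
    learning_rate=2e-5,
    per_device_train_batch_size=16,
    num_train_epochs=3,
)
trainer = Trainer(
    model=model,
    args=args,
    train_dataset=encoded["train"],
    eval_dataset=encoded["validation"],
    tokenizer=tokenizer,  # enables dynamic padding via the default collator
)
trainer.train()
```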
-
https://arxiv.org/abs/2006.04884
# Background
- The behavior of fine-tuning BERT, RoBERTa, and ALBERT is not fully understood.
- Varying only the random seed leads to a large standard deviation of the fine-tuning acc…
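A simple way to quantify this instability is to repeat fine-tuning while varying only the seed and report the spread. A minimal sketch, assuming a hypothetical `fine_tune(seed)` routine that runs one full training and returns dev-set accuracy:

```python
# Sketch: measure fine-tuning instability by varying only the random seed.
# fine_tune() is a hypothetical placeholder for one full training run.
import random
import statistics

import numpy as np
import torch

def set_seed(seed: int) -> None:
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)

def fine_tune(seed: int) -> float:
    """Placeholder: run one fine-tuning run and return dev-set accuracy."""
    raise NotImplementedError("plug in your training loop here")

accuracies = []
for seed in range(10):  # same data and hyperparameters, different seeds
    set_seed(seed)
    accuracies.append(fine_tune(seed))

print(f"mean={statistics.mean(accuracies):.4f}  "
      f"stdev={statistics.stdev(accuracies):.4f}")
```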
-
Thank you very much for your work; it is very clear.
I'd like to ask a question: have you compared the performance of the following two settings?
1. Load BERT weights, fine-tune on domain-specific data, and perform aspect classification
2. Load BERT weights, fine-tune on domain-specific data with an added contrastive learning loss (see the sketch below), and perform aspect classification
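For concreteness, setting 2 could look like the sketch below: a SupCon-style contrastive term on the [CLS] embeddings added to the usual cross-entropy. The temperature and the 0.1 weight are illustrative assumptions, not values from any paper discussed here:

```python
# Sketch of setting 2: cross-entropy plus a supervised contrastive term
# on the [CLS] embeddings. Temperature and weighting are illustrative.
import torch
import torch.nn.functional as F

def supervised_contrastive_loss(features, labels, temperature=0.07):
    """SupCon-style loss: pull same-label [CLS] embeddings together."""
    features = F.normalize(features, dim=1)
    sim = features @ features.T / temperature            # pairwise similarities
    mask = labels.unsqueeze(0) == labels.unsqueeze(1)    # positives share a label
    mask.fill_diagonal_(False)                           # a sample is not its own positive
    # log-softmax over all other samples, excluding self-similarity
    not_self = ~torch.eye(len(labels), dtype=torch.bool, device=features.device)
    exp_sim = torch.exp(sim) * not_self
    log_prob = sim - torch.log(exp_sim.sum(dim=1, keepdim=True))
    mean_log_prob_pos = (log_prob * mask).sum(1) / mask.sum(1).clamp(min=1)
    return -mean_log_prob_pos.mean()

# Total loss during fine-tuning (the 0.1 weight is a tunable assumption):
# loss = F.cross_entropy(logits, labels) \
#        + 0.1 * supervised_contrastive_loss(cls_emb, labels)
```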
-
Given the discussion about which layer to keep as a token's representation in downstream analysis (Jawahar et al., 2019; Ethayarajh, 2019), for example when using a pre-trained BERT model, I was …
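For reference, here is a minimal sketch of inspecting each layer's token representations with Hugging Face `transformers`, so the common choices (last layer, second-to-last, mean of the last four) can be compared; the model name and input sentence are placeholders:

```python
# Sketch: expose every layer's token representations so you can choose
# which one (or which combination) to keep for downstream analysis.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased",
                                  output_hidden_states=True)

inputs = tokenizer("The quick brown fox", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# hidden_states: tuple of 13 tensors (embeddings + 12 layers),
# each of shape (batch, seq_len, hidden_size)
hidden_states = outputs.hidden_states
last_layer = hidden_states[-1]
second_to_last = hidden_states[-2]                            # a common choice
mean_last_four = torch.stack(hidden_states[-4:]).mean(dim=0)  # another common choice
```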
-
WIP project roadmap for LoRAX. We'll continue to update this over time.
# v0.10
- [ ] Speculative decoding adapters
- [ ] AQLM
# v0.11
- [ ] Prefix caching
- [ ] BERT support
- [ ] Embe…
-
Currently we are using the pre-trained [Universal Sentence Encoder (large)](https://tfhub.dev/google/universal-sentence-encoder-large/5) from TensorFlow Hub.
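For reference, a minimal sketch of embedding sentences with this module via `tensorflow_hub`; the example sentences are placeholders:

```python
# Sketch: embed sentences with the Universal Sentence Encoder (large)
# from TensorFlow Hub, as linked above.
import tensorflow_hub as hub

embed = hub.load("https://tfhub.dev/google/universal-sentence-encoder-large/5")
embeddings = embed(["The quick brown fox jumps over the lazy dog.",
                    "I am a sentence for which I would like an embedding."])
print(embeddings.shape)  # (2, 512): one 512-dim vector per sentence
```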
**Open area for investigation:**
The …
-
Data Augmentation for BERT Fine-Tuning in Open-Domain Question Answering
Wei Yang, Yuqing Xie, Luchen Tan, Kun Xiong, Ming Li, Jimmy Lin
https://arxiv.org/abs/1904.06652
-
Recently you updated "BERT pretrained on mixed large Chinese corpus (bert-large 24-layers)" in the README. What hyperparameters (learning rate, batch size, max epochs) did you use when fine-tuning on CLUE?
-
Hi, authors of Bert-FP, the SOTA in Response Selection tasks. I'm excited to see that the post-training strategy works so well with the sub-context-response pairs. Recently, I have been trying to reimplement this work…