The last thing is "domain-adaptive pre-training". If you're using SciBERT you're most of the way there already, so it might not buy you much. But the idea is to take vanilla BERT, continue training it as a masked language model on your entire corpus of domain-specific documents, and then fine-tune that adapted checkpoint for the classification task. Dirk
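The two-stage recipe above can be sketched with Hugging Face Transformers: stage 1 continues masked-language-model training on the domain corpus, stage 2 loads the saved checkpoint for classification. This is a minimal sketch, not a production script -- the model name, hyperparameters, chunking helper, and label count are placeholders to adapt to your own corpus and task (swap in `allenai/scibert_scivocab_uncased` if you start from SciBERT).

```python
def chunk_token_ids(token_ids, block_size=128):
    """Split one long token-id sequence into fixed-size blocks for MLM
    training; drops the ragged tail, as common LM pre-training scripts do."""
    usable = (len(token_ids) // block_size) * block_size
    return [token_ids[i:i + block_size] for i in range(0, usable, block_size)]

def domain_adapt(corpus_texts, base_model="bert-base-uncased",
                 out_dir="dapt-checkpoint"):
    """Stage 1: continue masked-LM training of a vanilla BERT on the
    domain corpus, then save the adapted weights."""
    # Imported here so the pure helper above stays dependency-free.
    from transformers import (AutoModelForMaskedLM, AutoTokenizer,
                              DataCollatorForLanguageModeling, Trainer,
                              TrainingArguments)
    tokenizer = AutoTokenizer.from_pretrained(base_model)
    model = AutoModelForMaskedLM.from_pretrained(base_model)
    ids = tokenizer(" ".join(corpus_texts), add_special_tokens=False)["input_ids"]
    train_set = [{"input_ids": block} for block in chunk_token_ids(ids)]
    # Standard BERT-style masking: 15% of tokens are masked for prediction.
    collator = DataCollatorForLanguageModeling(tokenizer, mlm_probability=0.15)
    Trainer(
        model=model,
        args=TrainingArguments(output_dir=out_dir, num_train_epochs=1),
        train_dataset=train_set,
        data_collator=collator,
    ).train()
    model.save_pretrained(out_dir)
    tokenizer.save_pretrained(out_dir)

def load_classifier(checkpoint="dapt-checkpoint", num_labels=2):
    """Stage 2: fine-tune the domain-adapted encoder for classification
    exactly as you would fine-tune stock BERT."""
    from transformers import AutoModelForSequenceClassification
    return AutoModelForSequenceClassification.from_pretrained(
        checkpoint, num_labels=num_labels)
```

The classifier in stage 2 is trained with your usual fine-tuning loop; the only change from a stock pipeline is that it loads the stage-1 checkpoint instead of the original pre-trained weights.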