Open akash418 opened 1 year ago
Great job!
Few comments:
Great job!
Few comments:
- Did you train the BERT model really for 100 epochs? Or was that max_epochs=100 and flair automatically stopped at one point?
- Please also run the evaluation with a "classifier-only" fine-tuning (transformer weights frozen). See Flair + Huggingface: finetuning classifier or whole model? flairNLP/flair#2934
- Regarding GPT: What batch size / GPU memory did you use? Try to decrease the batch size until it fits to memory. And maybe also start with a smaller model, like https://huggingface.co/malteos/gpt2-wechsel-german-ds-meg
I used 1 RTXA6000 GPU with a maximum of 89 GB of memory allocated to GPU, batch size 32, and hidden size 32. The best option is to try and decrease the batch size to 8 and try and see if it works. In worst case I will work with the smaller model.
You can even decrease the batch size to 1. Generally, please try to not always use the big GPUs on the cluster. For example, the RTX6000 should be totally sufficient.
Model Fine-tuned: https://huggingface.co/bert-base-german-cased
Task:
GERMEVAL_2018_OFFENSIVE_LANGUAGE
: Type: Classification (Full Model Fine Tuning)Results:
Task:
GERMEVAL_2018_OFFENSIVE_LANGUAGE
: Type: Classification (Classifer Only Tuning)Results:
Settings:
Task:
NER_GERMAN_LEGAL
: Type NER (Full Model Fine Tuning)Results:
Task:
NER_GERMAN_LEGAL
: Type NER (Classifier Only Tuning)Results:
Settings:
Model Fine-tuned: https://huggingface.co/malteos/gpt2-wechsel-german-ds-meg
Task:
GERMEVAL_2018_OFFENSIVE_LANGUAGE
: Type: Classification (Full Model Fine Tuning)Results:
Accuracy 0.7894
Task:
GERMEVAL_2018_OFFENSIVE_LANGUAGE
: Type: Classification (Classifier Only Tuning)Results:
Accuracy 0.7868
Settings:
Model-Fine Tuned https://huggingface.co/malteos/gpt2-xl-wechsel-german
Task:
GERMEVAL_2018_OFFENSIVE_LANGUAGE
: Type: Classification (Full Model Fine Tuning)Results:
Accuracy 0.8058
Task:
GERMEVAL_2018_OFFENSIVE_LANGUAGE
: Type: Classification (Classifier Only Tuning)Results:
Accuracy 0.8058