Open tangzhy opened 3 years ago
Is the model loaded from your fine-tuned checkpoint properly? BTW, no logging info like INIT_FROM_CKPT occurs.
It seems that the checkpoint is not being loaded.
Why is the trained model performance so poor? MRR@10 = 0.01 for the top 100 eval examples. Is that expected? Since I only run for 100 eval examples (100 * 1000 entries are actually predicted.)
MRR@10 should be at least 0.30.
If the model is loaded improperly, how shall I load the model instead? Any example code?
I would first try to use a "dummy" path in which no checkpoint exists. If the log is identical to what you have now, then the problem is in BERT_PRETRAINED_DIR
.
Hi, I use the colab code exactly from your demo.
Model config
The general is the same as follows, except that
OUTPUT_DIR
is changed to your decompressed bert based directory, and thebatch_size
is set to 8 since I'm running on V100-16GB. I also changed theMAX_EVAL_EXAMPLES=100
because it takes too much time to get the full eval performance.Logging
The logging and performance are listed as follows. My concerns are:
*INIT_FROM_CKPT*
occurs.MRR@10 = 0.01
for the top 100 eval examples. Is that expected? Since I only run for 100 eval examples (100 * 1000 entries are actually predicted.)