Closed LeonCrashCode closed 4 years ago
Seems like the run_squad script is in bad shape now. It just doesn't work.
same question.. also running this script with XLNet on Squad, is ~10 F1 points below BERT-Large-WWM. The difference in preprocessing as pointed out above could be one of the reasons.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
❓ Questions & Help
This is the padding problem. In GLUE codes in the examples, the padding for XLNet is on the left of the input, but in Squad codes, the padding is on right. I was wondering which one is correct.
Also, the inputs of
convert_examples_to_features
are different in Glue and Squad, where Squad uses most of default values likepad_token, sep_token, pad_token_segment_id and cis_token_segment_id
, but Glue use the value oftokenizer
. which one is correct?Or the example codes are out-of-date? Thanks