Open mratsim opened 6 years ago
See fast.ai
A new paper on pre-trained language model: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding and discussion
See fast.ai