peleiden / daluke

A Danish-speaking language model with entity-aware self-attention
MIT License

Investigate correctness of data, masking, and accuracy in pretraining #77

Closed by asgerius 3 years ago

asgerius commented 3 years ago

Everything seems correct. Word masking, entity masking, and entity spans all behave as expected.