mim-solutions / bert_for_longer_texts

BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation allows fine-tuning.
Other
129 stars 30 forks source link

Major refactor #9

Closed MichalBrzozowski91 closed 1 year ago

MichalBrzozowski91 commented 1 year ago