mim-solutions / bert_for_longer_texts

BERT classification model for processing texts longer than 512 tokens. The text is first divided into smaller chunks, each chunk is fed to BERT, and the intermediate per-chunk results are pooled into a single prediction. The implementation allows fine-tuning.
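The chunk-then-pool idea described above can be sketched in a few lines. This is a minimal illustration, not the repository's actual code: the names `split_into_chunks`, `pool_scores`, and `classify_long_text`, the parameters `chunk_size` and `stride`, and the stand-in `score_chunk` callable are all hypothetical. The default `chunk_size=510` assumes two positions per chunk are reserved for the `[CLS]` and `[SEP]` tokens within BERT's 512-token limit.

```python
from statistics import mean
from typing import Callable, List, Sequence


def split_into_chunks(
    token_ids: Sequence[int], chunk_size: int = 510, stride: int = 510
) -> List[List[int]]:
    """Split token ids into windows of at most `chunk_size` tokens.

    A `stride` smaller than `chunk_size` makes consecutive chunks overlap.
    Hypothetical helper: names and defaults are assumptions for illustration.
    """
    if chunk_size <= 0 or stride <= 0:
        raise ValueError("chunk_size and stride must be positive")
    chunks: List[List[int]] = []
    for start in range(0, len(token_ids), stride):
        chunks.append(list(token_ids[start:start + chunk_size]))
        # Stop once a window has reached the end of the text.
        if start + chunk_size >= len(token_ids):
            break
    return chunks


def pool_scores(scores: Sequence[float], pooling: str = "mean") -> float:
    """Combine per-chunk scores into one score for the whole text."""
    if not scores:
        raise ValueError("no chunk scores to pool")
    if pooling == "mean":
        return mean(scores)
    if pooling == "max":
        return max(scores)
    raise ValueError(f"unknown pooling strategy: {pooling}")


def classify_long_text(
    token_ids: Sequence[int],
    score_chunk: Callable[[List[int]], float],
    pooling: str = "mean",
    chunk_size: int = 510,
    stride: int = 510,
) -> float:
    """Score every chunk with `score_chunk`, then pool the results.

    `score_chunk` stands in for a BERT forward pass on a single chunk;
    here it is any callable that maps a list of token ids to a float.
    """
    chunks = split_into_chunks(token_ids, chunk_size, stride)
    return pool_scores([score_chunk(chunk) for chunk in chunks], pooling)


if __name__ == "__main__":
    tokens = list(range(1200))  # placeholder ids for a 1200-token text
    # Dummy scorer used only to show the data flow, not a real model.
    print(classify_long_text(tokens, lambda chunk: len(chunk) / 510, "max"))
```

In a real setup, `score_chunk` would wrap the model's forward pass and return a class probability for the chunk. Whether mean or max pooling works better depends on the task, so both are exposed here.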

add notebook with getting embedding vectors #6

Closed by MichalBrzozowski91 1 year ago

MichalBrzozowski91 commented 2 years ago

Closes #1