issues
search
mim-solutions
/
bert_for_longer_texts
BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation allows fine-tuning.
Other
129
stars
30
forks
source link
Major refactor
#9
Closed
MichalBrzozowski91
closed
1 year ago
MichalBrzozowski91
commented
1 year ago
major simplification of model classes
change env installation instructiion to requirements.txt
delete stripped notebooks
change directory name from "ipython" to "notebooks"
add simple model tests
add typing hints
autoformat with black