dmmiller612 / bert-extractive-summarizer

Easy to use extractive text summarization with BERT
MIT License

Reproducibility bug on run_embeddings method #137

Open DraceniY opened 2 years ago

DraceniY commented 2 years ago

The run_embeddings method has a reproducibility issue on long texts: running it on the same input on different machines produces different embeddings.

I have attached reproducibility_summarization.zip, which contains the input scientific article and my embedding output for comparison. I set random.seed in the init function of the BertEmbedding class, but the embeddings still differ between the two machines.
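To make the comparison concrete, here is a minimal sketch of how two embedding outputs could be checked against each other. It assumes the embeddings from each machine have been loaded as lists of equal-length float lists. The function name `compare_embeddings` and the tolerance values are hypothetical choices for illustration and are not part of the library.

```python
import math


def compare_embeddings(a, b, rel_tol=1e-5, abs_tol=1e-6):
    """Compare two embedding matrices (lists of equal-length float lists).

    Hypothetical helper, not part of bert-extractive-summarizer.
    Returns (max_abs_diff, mismatches), where mismatches counts the
    elements that differ by more than the given tolerances.
    """
    if len(a) != len(b):
        raise ValueError("embedding matrices have different numbers of rows")

    max_abs_diff = 0.0
    mismatches = 0
    for row_a, row_b in zip(a, b):
        if len(row_a) != len(row_b):
            raise ValueError("embedding rows have different lengths")
        for x, y in zip(row_a, row_b):
            # Track the largest element-wise gap seen so far.
            max_abs_diff = max(max_abs_diff, abs(x - y))
            # Count elements outside the tolerance (assumed thresholds).
            if not math.isclose(x, y, rel_tol=rel_tol, abs_tol=abs_tol):
                mismatches += 1
    return max_abs_diff, mismatches
```

A result with zero mismatches and a very small maximum difference would point to floating-point drift between machines, while many mismatches would suggest a genuinely different computation.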

Could you please fix this issue? It would be very helpful.

Thanks.