dmmiller612 / bert-extractive-summarizer

Easy to use extractive text summarization with BERT
MIT License
1.37k stars 307 forks source link

Reproducibility bug on run_embeddings method #137

Open DraceniY opened 1 year ago

DraceniY commented 1 year ago

The run_embeddings has reproducibility issue in long text, when it runs in different machines.

I put this reproducibility_summarization.zip file with input scientific article and my embedding output for comparison, I set random.seed in the init function of the class BertEmbedding but still had differences in the embeddings between two machines.

Please could you fix this issue, it will be very helpful.

Thanks.