CypherousSkies / reading-for-listeners

A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!
GNU Affero General Public License v3.0
23 stars 3 forks source link

BART summarizer to improve BERT results #5

Open CypherousSkies opened 2 years ago

CypherousSkies commented 2 years ago

DistilBERT can only take 512 tokens which is not enough for most passages. So... maybe using a summarizer to give context outside of the sentence with a mask token to improve results. Huggingface has a DistilBART model that I could use, but that would only increase the (already considerable) runtime. Maybe make it optional?

CypherousSkies commented 2 years ago

This could be replaced by TrOCR (#13), although it might be neat to test which of these methods is faster.