rmusser01 / tldw

tl/dw (Too Long, Didn't Watch): Your Personal Research Multi-Tool - a naive attempt at 'A Young Lady's Illustrated Primer'
Apache License 2.0
330 stars 11 forks source link

Improvement: Improve general summarization pipeline #45

Open rmusser01 opened 5 months ago

rmusser01 commented 5 months ago

Title.

Whatever we can do to improve accuracy and speed of the summarization pipeline.

This issue can act as a tracker for ideas, approaches and information.

Articles: https://github.com/voidism/Lookback-Lens https://github.com/Vaibhavs10/optimise-my-whisper

Videos:

Optimizing Whisper:

Flan-T5 Finetuning:

Improving summarization approach:

Unsorted Papers:

https://github.com/AugmendTech/treeseg https://huggingface.co/migtissera/HelixNet https://arxiv.org/pdf/2407.18521 https://github.com/patched-codes/patchwork

rmusser01 commented 2 weeks ago

https://huggingface.co/google/gemma-7b-aps-it https://huggingface.co/google/gemma-2b-aps-it https://arxiv.org/abs/2406.19803