Closed: djchan closed this 12 months ago
Hey! Thanks for reaching out. I don't think there's anything wrong with the package; this is most likely a model limitation. You can try adjusting the inference parameters, but it looks more like a generalization issue with the model: it was trained on 'literature' books and their SparkNotes summaries, so there's not a lot of technical math in there.
Personally, I've abandoned LED in favor of long-t5 based models, so you might see better performance there. Also worth noting: I've got a long-t5 finetune on scientific docs, and I just added some test models as options in the demo. Feedback on whether those perform better, and why, is welcome :)
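For anyone who wants to try the "adjust the inference parameters" route first, here is a rough sketch using the Hugging Face transformers summarization pipeline. The parameter values are assumed starting points, not settings confirmed anywhere in this thread, and `build_generation_kwargs` and `summarize` are hypothetical helper names. It also assumes a transformers version that still ships the "summarization" pipeline task.

```python
from typing import Any, Dict

# Assumed starting values for generation; tune them for your own documents.
DEFAULT_GENERATION_KWARGS: Dict[str, Any] = {
    "min_length": 16,
    "max_length": 256,
    "num_beams": 4,
    "no_repeat_ngram_size": 3,
    "encoder_no_repeat_ngram_size": 3,
    "repetition_penalty": 3.5,
    "early_stopping": True,
}


def build_generation_kwargs(**overrides: Any) -> Dict[str, Any]:
    """Return a copy of the defaults with any overrides applied."""
    kwargs = dict(DEFAULT_GENERATION_KWARGS)
    kwargs.update(overrides)
    return kwargs


def summarize(
    text: str,
    model_name: str = "pszemraj/led-large-book-summary",
    **overrides: Any,
) -> str:
    """Summarize `text` with the given checkpoint (downloads the model on first use)."""
    # Imported lazily so the kwargs helper works without transformers installed.
    from transformers import pipeline

    summarizer = pipeline("summarization", model=model_name)
    result = summarizer(text, **build_generation_kwargs(**overrides))
    return result[0]["summary_text"]


if __name__ == "__main__":
    # Placeholder input; replace with the text of the page you want to summarize.
    sample = "Linear regression models the relationship between a response and predictors. " * 20
    print(summarize(sample, num_beams=2, max_length=128))
```

Passing a different `model_name` (for example, one of the long-t5 checkpoints) should work the same way, as long as the checkpoint supports the summarization pipeline. No tuning here is guaranteed to fix out-of-domain inputs like math-heavy pages.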
Understood. Thanks Peter!
Dear Author,
I tried to use the pszemraj/led-large-book-summary model for summarization. I tested it on a wiki page about linear regression, but the output is nowhere close to the page's content. Is this use case not expected, are there plans to improve the model, or am I just not using it the right way? Please see below for the code and environment.
Thanks, DJ
Code:
pip freeze: