Open jbukuts opened 2 months ago
Jul 25 Notes
Recently tried various models and parameters to see what summaries they yield. Quick recap:
mixtral: Good at understanding instructions. Bad output though.
flan-ul20: Reasonable output, but it can't understand instructions, so it ignores my prompt for the most part.
granite: Decent but not optimal outputs. Understands prompts, but the outputs are not as good as wanted.
llama2-70b: Very bad output.
llama3-403b: Very good at understanding the prompt. Results include important numbers and a summary of key points within the sentence limit given in the prompt.

With the release of Meta's new 403b parameter model, I decided to test it. Very pleased with the results, so I'm going to continue with that model.
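For anyone repeating this comparison, here is a rough sketch of how the "within sentence limit" check could be automated. It is not the prompt or code actually used for these notes: the prompt wording, the function names (`build_summary_prompt`, `count_sentences`, `within_limit`), and the naive sentence splitter are all assumptions made for illustration.

```python
import re


def build_summary_prompt(text: str, max_sentences: int = 3) -> str:
    """Build a hypothetical summarization prompt with an explicit sentence limit."""
    return (
        f"Summarize the following content in at most {max_sentences} sentences. "
        "Keep important numbers and key points.\n\n"
        f"Content:\n{text}\n\nSummary:"
    )


def count_sentences(summary: str) -> int:
    """Naive sentence count: split after '.', '!' or '?' when followed by whitespace.

    Decimals such as 3.5 are not split because no whitespace follows the dot.
    Abbreviations like "e.g. this" would be miscounted, so treat it as a rough check.
    """
    parts = re.split(r"(?<=[.!?])\s+", summary.strip())
    return len([p for p in parts if p])


def within_limit(summary: str, max_sentences: int) -> bool:
    """Return True if the model's summary respects the requested sentence limit."""
    return count_sentences(summary) <= max_sentences
```

The same prompt can be sent to each model and `within_limit` run on every response, which gives a quick signal of whether a model is following the instruction or ignoring it.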
I love the llama3-403b. Let's stay the course with it.
This story pertains to the generation of summaries from scraped content.
Things to keep in mind: