The Llama2-7B model has been shown to have increasing better performance on language understanding tasks. Evaluate the model for the task of text summarization on the CNN news summarization dataset. Tasks include:
[ ] Setup the base pipeline for Llama2-7B
[ ] Run the pipeline for a single news article and see if appropriate response is being generated
[ ] Run the pipeline for the entire dataset and generate summaries
[ ] Find the performance using the evaluation metric (ROUGE / BLEU / BERTScore)
The Llama2-7B model has been shown to have increasing better performance on language understanding tasks. Evaluate the model for the task of text summarization on the CNN news summarization dataset. Tasks include: