iiakshat / BriefComm

Simplifying audio content through transcription and summarization using Meta's LLAMA 2 & OpenAI's Whisper.
MIT License
1 stars 1 forks source link

Slow inference #2

Closed 09Kanika closed 1 month ago

09Kanika commented 1 month ago

Description

The current BriefComm summarizer efficiently converts audio content into text and provides summaries using the Whisper and Llama2 models. However, it has been observed that the summarization process is slower than expected, impacting user experience. Additionally, the Llama2 model has now become outdated with the release of Llama3, which offers improved performance and capabilities.

Proposed Enhancements

1. Speed Optimization with Groq Inference:

2. Upgrade to Llama3:

Feature Integration and Testing:

Milestone:

Labels:

Additional Information:

iiakshat commented 1 month ago

Thanks I'll do it. Closing this issue for now.