pigeonai-org / ViDove

🐦ViDove: RAG-Augmented End-to-end Multimodal Translation Agent
GNU General Public License v3.0
93 stars 9 forks source link

added qwen2 for extracting the number of speakers #20

Open Taka499 opened 1 month ago