Voice-Over Vision's Chrome Extension
Apache License 2.0

Voice-Over Vision: The future of the internet is accessible

We present Voice-Over Vision, a tool that transforms YouTube watching for the visually impaired, making every video more accessible and enjoyable. Like a friend sitting next to you, this Chrome Extension narrates the unseen parts of a video, filling in the blanks where audio alone falls short. It smartly sifts through videos, picking out details that you might miss otherwise, and uses text-to-speech technology to bring those visuals to life through vivid descriptions. With Voice-Over Vision, every story is fully told, ensuring everyone gets the complete picture, no matter what.

🎬 Demos

Voice-Over Video Demo

🚀 Features

Work In Progress

- [ ] **Customizable Speech Parameters**: Adjust voice selection, speech rate, and volume to tailor the audio descriptions to your preferences.
- [ ] **Detail Level Settings**: Choose the level of detail for descriptions, from basic overviews to in-depth analysis of physical appearances and emotions.
- [ ] **Interruption Frequency Control**: Select how often you'd like the video's original audio to be interrupted with descriptions, ensuring a balanced experience.

💻 Installation

Follow these instructions to install and run Voice-Over Vision. The extension will soon be released on the Chrome Web Store.

Prerequisites

Installing the Chrome Extension

1. Clone the repository:

git clone https://github.com/voice-over-vision/vov-chrome-extension.git

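2. Load the extension in Chrome as an unpacked extension: open chrome://extensions, enable Developer mode, click "Load unpacked", and select the folder that contains the extension's manifest.json (see Chrome's developer documentation on loading an unpacked extension for details).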

3. Enjoy the magic of Voice-Over Vision!✨

🌟 Contribution

Davi Giordano
Guilherme Mariano
Mariana Serrão
Murillo Teixeira

💎 Acknowledgments

Chroma DB

We extend our heartfelt thanks to the developers and community behind Chroma DB for their AI-native open-source embedding database, a crucial component in our mission to create an accessibility tool for the visually impaired. Chroma DB's robust and efficient data management has been pivotal to that effort.

GPT-4

Our appreciation goes to the OpenAI team for providing the foundational AI technology for our project. GPT-4 was instrumental in the project's natural language processing and image processing capabilities.

📄 Citation

@software{voice-over-vision,
  author = {Davi Giordano and Guilherme Mariano and Mariana Serrao and Murillo Teixeira},
  title = {Voice-Over Vision: The future of the internet is accessible},
  month = {March},
  year = {2024},
  url = {https://github.com/voice-over-vision}
}