Demos • Features • Installation • Contribution • Acknowledgments • Citation
We present Voice-Over Vision, a tool that transforms YouTube watching for the visually impaired, making every video more accessible and enjoyable. Like a friend sitting next to you, this Chrome Extension narrates the unseen parts of a video, filling in the blanks where audio alone falls short. It smartly sifts through videos, picking out details that you might miss otherwise, and uses text-to-speech technology to bring those visuals to life through vivid descriptions. With Voice-Over Vision, every story is fully told, ensuring everyone gets the complete picture, no matter what.
Instructions on how to install and run Voice-Over Vision (soon to be released at Google Chrome Extensions marketplace)
git clone https://github.com/voice-over-vision/vov-chrome-extension.git
chrome://extensions/
in your Chrome browser.
Davi Giordano |
Guilherme Mariano |
Mariana Serrão |
Murillo Teixeira |
We extend our heartfelt thanks to the developers and community behind Chroma DB for their exceptional AI-native open-source embedding database, a crucial component in our mission to create an accessibility tool for the visually impaired. ChromaDB's robust and efficient data management capabilities have been pivotal in our efforts to make a positive impact.
Our appreciation goes to the OpenAI team for providing foundational AI technology for our project. The robustness of GPT-4 was instrumental in our project's natural language processing and image processing capabilities.
@software{voice-over-vision,
author = {Davi Giordano, Guilherme Mariano, Mariana Serrao and Murillo Teixeira},
title = {Voice-Over Vision: The future of the internet is accessible},
month = {March},
year = {2024},
url = {https://github.com/voice-over-vision}
}