algoritmist / portable-translator

MIT License
0 stars 0 forks source link

General idea & requirments #1

Open algoritmist opened 3 months ago

algoritmist commented 3 months ago

The aim of this project is to create a portable device that provides a convenient way for communication between people speeking different languages.

Functional requirements:

  1. The device should support English to Chinese communication and vise versa
  2. The device should be able to able to support speech-to-speech and speech-to-text communication
  3. The conversation history should be displayed as a chat

User interface requirments:

  1. User should be able to start new conversation
  2. User should be able to connect to the network

Hardware requirements:

  1. LCD display support
  2. Microphone & speaker support
  3. Internet support

Software requirements:

  1. The system should be running on Linux
  2. Python support

Features that could be added in future:

  1. Other languages
  2. More voices

Wheter should we should support offline translation and what type of Internet access to use (wifi or mobile network) is a subject of discussion.

Suggestions and thoughts are welcome

algoritmist commented 3 months ago

"The system should be running on Linux..." I think it is actually not necessary, bare metal also seems to be fine

algoritmist commented 3 months ago

I suggest to use LTE with speech-to-text API instead of offline self-trained NN. The device won't need much computing power and will be much more power efficient and small. The only disadvantage that I see is that APIs are not free and not all of them support Chinese. Right now I am researching the APIs, so if you like the idea you can help me with it. Right now I am looking at assemblyAI

algoritmist commented 3 months ago

For locally running models we can use Whisper or RealTimeSTT, or deploy it on a server and connect to it from the device