Computer listens people.
python RUNME.py
[x] Get voice sample
[x] Convert sample into text (recognize)
[x] Select command (match)
[x] Execute command
[x] Single top level command to start the process
[ ] Signal to start and end recording
[ ] Show "..processing.." state while recognizing
[ ] Show text
[x] Recorder (mic -> sample.wav)
[ ] (lev.2) UI (press button -- record -- unpress)
[x] Recognizer (sample.wav -> sample.txt)
[x] Matcher (sample.txt -> action.txt)
[x] Executor (action.txt -> ...)
[ ] Continuous voice scan
(no technology available, no time to reseach
current version of Sphinx CMU)
Requisites:
Every component is shell script:
[x] Console scripts
Requisites:
Go cross-platform
[x] Python (Lin/Win/OS) recording
[x] Python Google API request (requests lib)
[x] Python Matcher
[x] Python Action Executor
Graphics UI
or anything (web/kivy + websockets)
[ ] three static screens [ ] state machine [ ] animations