pannous / tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Other
2.17k stars 638 forks source link

spoken_words_wav.tar #33

Open altbit opened 7 years ago

altbit commented 7 years ago

Could you, please, upload spoken_words_wav.tar somewhere? Thank you.

pannous commented 7 years ago

Dropbox blocked that file, but you can generate it yourself. Look at ./generate_speech_data.py if you are on a Mac, the same approach would also work on other systems.

altbit commented 7 years ago

OK, thanks. I'll try.

psilva07 commented 7 years ago

I'm also trying to generate this data, but I haven't succeeded. When running that script the function check_voices() is called before main and it gives the following error:

Traceback (most recent call last):
  File "generate_speech_data.py", line 125, in <module>
    check_voices()
  File "generate_speech_data.py", line 108, in check_voices
    voice_infos=str(subprocess.check_output(["say", "-v?"])).split("\n")[:-2]
  File "/usr/lib/python2.7/subprocess.py", line 567, in check_output
    process = Popen(stdout=PIPE, *popenargs, **kwargs)
  File "/usr/lib/python2.7/subprocess.py", line 711, in __init__
    errread, errwrite)
  File "/usr/lib/python2.7/subprocess.py", line 1343, in _execute_child
    raise child_exception
OSError: [Errno 2] No such file or directory

UPDATE: check_voices() works when running in a MAC. The other errors remain though.

Even if I comment that the script fails because there is no file called wordlist.txt:

Traceback (most recent call last):
  File "generate_speech_data.py", line 218, in <module>
    main()
  File "generate_speech_data.py", line 214, in main
    spoken_words()
  File "generate_speech_data.py", line 195, in spoken_words
    words= open(wordlist).readlines()
IOError: [Errno 2] No such file or directory: 'wordlist.txt'

Is this just a file with words we want to be generated? If so, which words are allowed? I added a wordlist.txt file with the following words:

answer
call
dismiss
alarm
go
back

Got another error:

Traceback (most recent call last):
  File "generate_speech_data.py", line 5, in <module>
    from scikits.talkbox.features import mfcc # 2'nd lib
  File "/Users/Pedro/anaconda/lib/python3.5/site-packages/scikits/talkbox/__init__.py", line 5, in <module>
    __all__ += tools.__all__
AttributeError: module 'tools' has no attribute '__all__'

So I removed the scikits.talkbox.features import for the time being, since it seems like it's used only in the genreate_mcc later step, but it failed again before getting there, in the swift part:

fatal error: unexpectedly found nil while unwrapping an Optional value
Current stack trace:
0    libswiftCore.dylib                 0x0000000111d62130 swift_reportError + 129
1    libswiftCore.dylib                 0x0000000111d7eb50 _swift_stdlib_reportFatalError + 60
2    libswiftCore.dylib                 0x0000000111b6f250 specialized specialized StaticString.withUTF8Buffer<A> ((UnsafeBufferPointer<UInt8>) -> A) -> A + 342
3    libswiftCore.dylib                 0x0000000111ce9e90 partial apply for (_fatalErrorMessage(StaticString, StaticString, file : StaticString, line : UInt, flags : UInt32) -> Never).(closure #2) + 109
4    libswiftCore.dylib                 0x0000000111b6f250 specialized specialized StaticString.withUTF8Buffer<A> ((UnsafeBufferPointer<UInt8>) -> A) -> A + 342
5    libswiftCore.dylib                 0x0000000111c9c9a0 specialized _fatalErrorMessage(StaticString, StaticString, file : StaticString, line : UInt, flags : UInt32) -> Never + 96
7    swift                              0x0000000108f20b40 llvm::MCJIT::runFunction(llvm::Function*, llvm::ArrayRef<llvm::GenericValue>) + 655
8    swift                              0x0000000108f26700 llvm::ExecutionEngine::runFunctionAsMain(llvm::Function*, std::__1::vector<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> >, std::__1::allocator<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > > > const&, char const* const*) + 707
9    swift                              0x0000000108422f30 swift::RunImmediately(swift::CompilerInstance&, std::__1::vector<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> >, std::__1::allocator<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > > > const&, swift::IRGenOptions&, swift::SILOptions const&) + 3385
10   swift                              0x00000001083ebff0 swift::performFrontend(llvm::ArrayRef<char const*>, char const*, void*, swift::FrontendObserver*) + 50738
11   swift                              0x00000001083a3a10 main + 9052
12   libdyld.dylib                      0x00007fff8d8f3234 start + 1
0  swift                    0x000000010b5d84f7 PrintStackTraceSignalHandler(void*) + 39
1  swift                    0x000000010b5d79a6 SignalHandler(int) + 646
2  libsystem_platform.dylib 0x00007fff8db02b3a _sigtramp + 26
3  libsystem_platform.dylib 0x00000000eb1de9b7 _sigtramp + 1567473303
4  libsystem_platform.dylib 0x000000010cac1576 _sigtramp + 2130438742
5  swift                    0x0000000108f20dcf llvm::MCJIT::runFunction(llvm::Function*, llvm::ArrayRef<llvm::GenericValue>) + 655
6  swift                    0x0000000108f269c3 llvm::ExecutionEngine::runFunctionAsMain(llvm::Function*, std::__1::vector<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> >, std::__1::allocator<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > > > const&, char const* const*) + 707
7  swift                    0x0000000108423c69 swift::RunImmediately(swift::CompilerInstance&, std::__1::vector<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> >, std::__1::allocator<std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > > > const&, swift::IRGenOptions&, swift::SILOptions const&) + 3385
8  swift                    0x00000001083f8622 swift::performFrontend(llvm::ArrayRef<char const*>, char const*, void*, swift::FrontendObserver*) + 50738
9  swift                    0x00000001083a5d6c main + 9052
10 libdyld.dylib            0x00007fff8d8f3235 start + 1
Stack dump:
0.  Program arguments: /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/swift -frontend -interpret ./word_to_phonemes.swift -enable-objc-interop -sdk /Applications/Xcode.app/Contents/Developer/Platforms/MacOSX.platform/Developer/SDKs/MacOSX10.12.sdk -color-diagnostics -module-name word_to_phonemes -- answer call

Traceback (most recent call last):
  File "generate_speech_data.py", line 218, in <module>
    main()
  File "generate_speech_data.py", line 214, in main
    spoken_words()
  File "generate_speech_data.py", line 196, in spoken_words
    generate(words, path)
  File "generate_speech_data.py", line 172, in generate
    generate_phonemes(word, path)
  File "generate_speech_data.py", line 148, in generate_phonemes
    pronounced=subprocess.check_output(["./word_to_phonemes.swift", word]).decode('UTF-8').strip()
  File "/Users/Pedro/anaconda/lib/python3.5/subprocess.py", line 626, in check_output
    **kwargs).stdout
  File "/Users/Pedro/anaconda/lib/python3.5/subprocess.py", line 708, in run
    output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command '['./word_to_phonemes.swift', 'answer call\n']' returned non-zero exit status -4
haneesh11 commented 6 years ago

I get errors too while generating speech data. Is there any other place, where we can download the speech_words.tar file?

https://dl.dropboxusercontent.com/u/23615316/spoken_words.tar

4emkay commented 6 years ago

from this link you can download the data. http://pannous.net/files/ Thank You.