marytts / marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
https://marytts.github.io/

Sox wrong version #951

Closed seblemaguer closed 4 years ago

seblemaguer commented 4 years ago

Hi! I ran it again. It worked for quite a while and then stopped, but I don't know whether it finished correctly. Here are the last few lines; the whole log is in the attachment (command.txt). It looks like something is wrong with sox: I have the wrong version installed. I will install the correct one and report back.

HTK Configuration Parameters[24]
  Module/Tool     Parameter                  Value
                  CDGV                        TRUE
                  GVOFFMODEL          StrVec 2 _ _
                  OPTKIND                   NEWTON
                  GVWEIGHT                       1
                  HMMWEIGHT                      1
                  STEPDEC                 0.500000
                  STEPINC                 1.200000
                  STEPINIT                       1
                  MINEUCNORM              0.010000
                  GVEPSILON               0.000100
                  MAXGVITER                     50
                  GVHMMLIST       /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gv/qst001/ver1/tiedlist
                  GVMODELMMF      /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gv/qst001/ver1/clustered_all.mmf
                  USEGV                       TRUE
                  EMEPSILON               0.000100
                  MAXEMITER                     20
                  WINDIR          /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/data/win
                  WINFN           StrVec 3 mgc.win1 mgc.win2 mgc.win3 StrVec 3 lf0.win1 lf0.win2 lf0.win3 StrVec 3 str.win1 str.win2 str.win3 
                  PDFSTREXT       StrVec 3 mgc lf0 str
                  PDFSTRORDER      IntVec 3 35 1 5
                  PDFSTRSIZE        IntVec 3 1 3 1
                  USEALIGN                    TRUE
                  NATURALWRITEORDER              TRUE
                  NATURALREADORDER              TRUE

=====================================================================
Start synthesizing waveforms (1mix) at tor feb 18 22:20:00 CET 2020
=====================================================================

Processing directory /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0:
 Synthesizing a speech waveform from gen_stavek109.mgc and gen_stavek109.lf0.../usr/bin/sox WARN getopt: option `s' not recognized
/usr/bin/sox:      SoX v14.4.2

/usr/bin/sox FAIL sox: invalid option

Usage summary: [gopts] [[fopts] infile]... [fopts] outfile [effect [effopt]]...

SPECIAL FILENAMES (infile, outfile):
-                        Pipe/redirect input/output (stdin/stdout); may need -t
-d, --default-device     Use the default audio device (where available)
-n, --null               Use the `null' file handler; e.g. with synth effect
-p, --sox-pipe           Alias for `-t sox -'

SPECIAL FILENAMES (infile only):
"|program [options] ..." Pipe input from external program (where supported)
http://server/file       Use the given URL as input file (where supported)

GLOBAL OPTIONS (gopts) (can be specified at any point before the first effect):
--buffer BYTES           Set the size of all processing buffers (default 8192)
--clobber                Don't prompt to overwrite output file (default)
--combine concatenate    Concatenate all input files (default for sox, rec)
--combine sequence       Sequence all input files (default for play)
-D, --no-dither          Don't dither automatically
--dft-min NUM            Minimum size (log2) for DFT processing (default 10)
--effects-file FILENAME  File containing effects and options
-G, --guard              Use temporary files to guard against clipping
-h, --help               Display version number and usage information
--help-effect NAME       Show usage of effect NAME, or NAME=all for all
--help-format NAME       Show info on format NAME, or NAME=all for all
--i, --info              Behave as soxi(1)
--input-buffer BYTES     Override the input buffer size (default: as --buffer)
--no-clobber             Prompt to overwrite output file
-m, --combine mix        Mix multiple input files (instead of concatenating)
--combine mix-power      Mix to equal power (instead of concatenating)
-M, --combine merge      Merge multiple input files (instead of concatenating)
--magic                  Use `magic' file-type detection
--multi-threaded         Enable parallel effects channels processing
--norm                   Guard (see --guard) & normalise
--play-rate-arg ARG      Default `rate' argument for auto-resample with `play'
--plot gnuplot|octave    Generate script to plot response of filter effect
-q, --no-show-progress   Run in quiet mode; opposite of -S
--replay-gain track|album|off  Default: off (sox, rec), track (play)
-R                       Use default random numbers (same on each run of SoX)
-S, --show-progress      Display progress while processing audio data
--single-threaded        Disable parallel effects channels processing
--temp DIRECTORY         Specify the directory to use for temporary files
-T, --combine multiply   Multiply samples of corresponding channels from all
                         input files (instead of concatenating)
--version                Display version number of SoX and exit
-V[LEVEL]                Increment or set verbosity level (default 2); levels:
                           1: failure messages
                           2: warnings
                           3: details of processing
                           4-6: increasing levels of debug messages
FORMAT OPTIONS (fopts):
Input file format options need only be supplied for files that are headerless.
Output files will have the same format as the input file where possible and not
overridden by any of various means including providing output format options.

-v|--volume FACTOR       Input file volume adjustment factor (real number)
--ignore-length          Ignore input file length given in header; read to EOF
-t|--type FILETYPE       File type of audio
-e|--encoding ENCODING   Set encoding (ENCODING may be one of signed-integer,
                         unsigned-integer, floating-point, mu-law, a-law,
                         ima-adpcm, ms-adpcm, gsm-full-rate)
-b|--bits BITS           Encoded sample size in bits
-N|--reverse-nibbles     Encoded nibble-order
-X|--reverse-bits        Encoded bit-order
--endian little|big|swap Encoded byte-order; swap means opposite to default
-L/-B/-x                 Short options for the above
-c|--channels CHANNELS   Number of channels of audio data; e.g. 2 = stereo
-r|--rate RATE           Sample rate of audio
-C|--compression FACTOR  Compression factor for output format
--add-comment TEXT       Append output file comment
--comment TEXT           Specify comment text for the output file
--comment-file FILENAME  File containing comment text for the output file
--no-glob                Don't `glob' wildcard match the following filename

AUDIO FILE FORMATS: 8svx aif aifc aiff aiffc al amb amr-nb amr-wb anb au avr awb caf cdda cdr cvs cvsd cvu dat dvms f32 f4 f64 f8 fap flac fssd gsm gsrt hcom htk ima ircam la lpc lpc10 lu mat mat4 mat5 maud nist ogg paf prc pvf raw s1 s16 s2 s24 s3 s32 s4 s8 sb sd2 sds sf sl sln smp snd sndfile sndr sndt sou sox sph sw txw u1 u16 u2 u24 u3 u32 u4 u8 ub ul uw vms voc vorbis vox w64 wav wavpcm wv wve xa xi
PLAYLIST FORMATS: m3u pls
AUDIO DEVICE DRIVERS: alsa

EFFECTS: allpass band bandpass bandreject bass bend biquad chorus channels compand contrast dcshift deemph delay dither divide+ downsample earwax echo echos equalizer fade fir firfit+ flanger gain highpass hilbert input# ladspa loudness lowpass mcompand noiseprof noisered norm oops output# overdrive pad phaser pitch rate remix repeat reverb reverse riaa silence sinc spectrogram speed splice stat stats stretch swap synth tempo treble tremolo trim upsample vad vol
  * Deprecated effect    + Experimental effect    # LibSoX-only effect
EFFECT OPTIONS (effopts): effect dependent; see --help-effect
Error in /usr/bin/excite -p 80 /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0/gen_stavek109.pit | /usr/bin/dfs -b -0.0055231 0.00997018 -0.0087294 -0.000428110000000001 0.01195534 -0.022844 0.0177785 0 -0.02604947 0.04951625 -0.03915007 0.00219802999999999 0.07492151 -0.1614324 0.22659094 0.76356262 0.22659094 -0.1614324 0.07492151 0.00219802999999999 -0.03915007 0.04951625 -0.02604947 0 0.0177785 -0.022844 0.01195534 -0.000428110000000001 -0.0087294 0.00997018 -0.0055231 | /usr/bin/vopr -a /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0/gen_stavek109.unv | /usr/bin/mglsadf -m 34 -p 80 -a 0.42 -c 0 /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0/gen_stavek109.mgc | /usr/bin/x2x +fs -o | /usr/bin/sox -c 1 -s -2 -t raw -r 16000 - -c 1 -s -2 -t wav -r 16000 /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0/gen_stavek109.wav

Originally posted by @nikigre in https://github.com/marytts/marytts/issues/950#issuecomment-589115362
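For readers who cannot install an older SoX: the log above shows SoX v14.4.2 rejecting `-s`, and the usage summary it prints lists `-e|--encoding` and `-b|--bits` instead. The legacy pair `-s -2` (signed integer, 2 bytes per sample) corresponds to `-e signed-integer -b 16`. The following is only a sketch of that translation; `modernize_sox_flags` is a hypothetical helper, not part of MaryTTS or the HTS scripts, and this thread itself was resolved by installing SoX v14.2.0 rather than by changing the flags.

```shell
# Hypothetical helper (not part of MaryTTS or HTS): rewrite the legacy
# SoX format flags "-s -2" to the spelling accepted by newer releases.
#   -s  -> -e signed-integer   (sample encoding)
#   -2  -> -b 16               (2 bytes per sample = 16 bits)
modernize_sox_flags() {
    printf '%s\n' "$1" | sed -e 's/ -s -2 / -e signed-integer -b 16 /g'
}

# Example: the tail of the failing pipeline from the log, with a
# placeholder output file name (out.wav is an assumption).
modernize_sox_flags "sox -c 1 -s -2 -t raw -r 16000 - -c 1 -s -2 -t wav -r 16000 out.wav"
```

If this route is taken, the same substitution would have to be applied wherever the training scripts build the sox command; where exactly that happens is not shown in this thread.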

seblemaguer commented 4 years ago

@nikigre could you tell me if you are using the gradle script or the marytts-builder?

nikigre commented 4 years ago

Hi! I am using marytts-builder.

nikigre commented 4 years ago

Hi! I have installed the correct version of sox.

nikigre@Lenovo-G505:~/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM$ sox --version
sox: SoX v14.2.0

Then I ran the same command as before:

nikigre@Lenovo-G505:~/marytts-5.1.2/target/marytts-builder-5.1.2$ sudo /usr/bin/perl /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/scripts/Training.pl /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/scripts/Config.pm

After some time I got this error:

=====================================================================
Start synthesizing waveforms (1mix) at pet feb 21 01:26:47 CET 2020
=====================================================================

Processing directory /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0:
 Synthesizing a speech waveform from gen_stavek109.mgc and gen_stavek109.lf0...sh: 1: /usr/bin/sox: not found
Error in /usr/bin/excite -p 80 /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0/gen_stavek109.pit | /usr/bin/dfs -b -0.0055231 0.00997018 -0.0087294 -0.000428110000000001 0.01195534 -0.022844 0.0177785 0 -0.02604947 0.04951625 -0.03915007 0.00219802999999999 0.07492151 -0.1614324 0.22659094 0.76356262 0.22659094 -0.1614324 0.07492151 0.00219802999999999 -0.03915007 0.04951625 -0.02604947 0 0.0177785 -0.022844 0.01195534 -0.000428110000000001 -0.0087294 0.00997018 -0.0055231 | /usr/bin/vopr -a /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0/gen_stavek109.unv | /usr/bin/mglsadf -m 34 -p 80 -a 0.42 -c 0 /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0/gen_stavek109.mgc | /usr/bin/x2x +fs -o | /usr/bin/sox -c 1 -s -2 -t raw -r 16000 - -c 1 -s -2 -t wav -r 16000 /home/nikigre/marytts-5.1.2/target/marytts-builder-5.1.2/NikGlas_2_HMM/hts/gen/qst001/ver1/1mix/0/gen_stavek109.wav

I checked where sox was installed, and it was not in /usr/bin, so I copied it from /usr/local/bin into /usr/bin. Now when I run perl scripts/Training.pl scripts/Config.pm, I get this:

 TB: Stats 10346->143 [1.4%]  { 10346->143 [1.4%] total }

TB 0.00 lf0_s3_ {*.state[3].stream[2-4]}
 Tree based clustering based on MDL criterion, threshold=4.793334e+01

 TB: Stats 10346->208 [2.0%]  { 20692->351 [1.7%] total }

TB 0.00 lf0_s4_ {*.state[4].stream[2-4]}
 Tree based clustering based on MDL criterion, threshold=4.891688e+01

 TB: Stats 10346->297 [2.9%]  { 31038->648 [2.1%] total }

TB 0.00 lf0_s5_ {*.state[5].stream[2-4]}
 Tree based clustering based on MDL criterion, threshold=4.855858e+01

 TB: Stats 10346->117 [1.1%]  { 41384->765 [1.8%] total }

TB 0.00 lf0_s6_ {*.state[6].stream[2-4]}
 Tree based clustering based on MDL criterion, threshold=4.572270e+01

It looks like it is working! I will report back on how it goes.
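The "not found" error above came from the pipeline calling sox by the absolute path /usr/bin/sox while the newly installed binary lived in /usr/local/bin. A check like the one below can catch that before starting a long training run. It is a minimal sketch; `check_tool_path` is a hypothetical helper name, not something the training scripts provide.

```shell
# Hypothetical helper: verify that a tool exists at the absolute path a
# script expects, and say where it was found on PATH if it is elsewhere.
check_tool_path() {
    expected="$1"
    name=$(basename "$expected")
    if [ -x "$expected" ]; then
        echo "ok: $expected"
        return 0
    fi
    found=$(command -v "$name" 2>/dev/null || true)
    if [ -n "$found" ]; then
        echo "missing: $expected (found $name at $found)"
    else
        echo "missing: $expected ($name is not on PATH)"
    fi
    return 1
}

# Example: check the path used in the failing pipeline.
check_tool_path /usr/bin/sox || true
```

When the tool turns out to live in /usr/local/bin, a symbolic link (for example `sudo ln -s /usr/local/bin/sox /usr/bin/sox`) is an alternative to copying the binary, since it keeps a single copy to update.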

nikigre commented 4 years ago

Hi all! The command completed successfully! Thank you all for your help!