-
**Describe your question**
I am training a TTS model using FastSpeech2. I started training my model about 6 hours ago (~40,000 steps) and the loss dropped from `~4` down to `~0.8`. I tried running …
-
I have been testing aTrain on both mp3 and m4a files, and I noticed that it systematically crashes silently towards the end of the transcription, with no transcription output, only the wav file and me…
-
This needs a review of which WAV header fields need to change.
NAudio also doesn't seem to support it.
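For reference when reviewing the fields, here is a minimal sketch that builds the canonical 44-byte PCM WAV header; the comments flag the fields that typically need to change when the encoding changes. The function name and default parameters are illustrative, not from any existing codebase.

```python
import struct

def pcm_wav_header(num_samples, sample_rate=16000, channels=1, bits=16):
    """Build a canonical 44-byte PCM WAV (RIFF) header.

    AudioFormat, BlockAlign, ByteRate, and BitsPerSample are the fields
    that depend on the encoding and would need review for a new format.
    """
    byte_rate = sample_rate * channels * bits // 8
    block_align = channels * bits // 8
    data_size = num_samples * block_align
    return struct.pack(
        "<4sI4s4sIHHIIHH4sI",
        b"RIFF", 36 + data_size,   # ChunkID, ChunkSize
        b"WAVE",
        b"fmt ", 16,               # Subchunk1ID, Subchunk1Size (16 for PCM)
        1,                         # AudioFormat: 1 = PCM (changes per codec)
        channels,
        sample_rate,
        byte_rate,                 # SampleRate * NumChannels * BitsPerSample/8
        block_align,               # NumChannels * BitsPerSample/8
        bits,
        b"data", data_size,        # Subchunk2ID, Subchunk2Size
    )
```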
-
I'd like to raise a concern about how quantization is currently handled in SpeechBrain. While training my own k-means quantizer on the last layer of an ASR model, I noticed that the interface was not …
-
## Dataset Format
The pre-processing script expects data to be a directory with:
* `metadata.csv` - CSV file with text, audio filenames, and speaker names
* `wav/` - directory with audio files
The …
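As a sanity check, a minimal stdlib-only sketch of reading this layout; the column order (text, audio filename, speaker name) is assumed from the description above, and the actual pre-processing script may differ.

```python
import csv
from pathlib import Path

def load_metadata(root):
    """Pair each metadata.csv row with its audio file under wav/.

    Assumes plain comma-separated rows: text, audio filename, speaker.
    """
    root = Path(root)
    entries = []
    with open(root / "metadata.csv", newline="", encoding="utf-8") as f:
        for text, wav_name, speaker in csv.reader(f):
            entries.append({
                "text": text,
                "wav": root / "wav" / wav_name,
                "speaker": speaker,
            })
    return entries
```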
-
It seems from the code that the datasets have to be in .tar.gz format for the train/validation/test to work. We need to pass the data as a datamodule, as seen in main.py line 378: `result = trainer.test(n…`
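If the .tar.gz requirement stands, a minimal stdlib-only sketch of packing a dataset directory into that format (function name is illustrative; the exact archive layout the loader expects should be checked against the code):

```python
import tarfile
from pathlib import Path

def pack_dataset(src_dir, out_path):
    """Archive a dataset directory as a gzip-compressed tarball,
    keeping the directory name as the top-level entry."""
    src = Path(src_dir)
    with tarfile.open(out_path, "w:gz") as tar:
        tar.add(src, arcname=src.name)
```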
-
segments, _ = model.transcribe(
wav_name+'.wav',
language="zh",
)
e.g. output "二零一四" rather than "2014"
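Not a fix in the model itself, but as an illustration of the desired output, a minimal post-processing sketch that maps digit-by-digit Chinese numerals (as in the example above) back to Arabic digits; positional forms (e.g. with 千/百/十) would need a real numeral normalizer.

```python
# Digit-by-digit mapping only; everything else passes through unchanged.
CN_DIGITS = {"零": "0", "一": "1", "二": "2", "三": "3", "四": "4",
             "五": "5", "六": "6", "七": "7", "八": "8", "九": "9"}

def digits_to_arabic(text):
    """Replace each Chinese digit character with its Arabic equivalent."""
    return "".join(CN_DIGITS.get(ch, ch) for ch in text)
```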
-
Great work! I want to ask whether you have tried using mel spectrograms as input. If mel is used as input and the same bitrate is maintained (e.g. `frameshift=256`, encoder downsampled by a factor of 3), will the performance o…
-
This refers to this section of the instruction documentation. It is not really an issue, but rather a documentation improvement/observation.
"Must contain a base-level folder called "LightSho…
-
ValueError: You are trying to return timestamps, but the generation config is not properly set. Make sure to initialize the generation config with the correct attributes that are needed such as `no_ti…
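This error usually comes from the Transformers Whisper generation path when timestamps are requested but the model's generation config lacks the timestamp-related attributes. A common remedy, assuming a standard Whisper checkpoint (the checkpoint name below is a placeholder, not from the report), is to reload the generation config from the pretrained checkpoint before generating:

```python
from transformers import GenerationConfig, WhisperForConditionalGeneration

# Placeholder checkpoint; substitute the one actually in use.
model = WhisperForConditionalGeneration.from_pretrained("openai/whisper-tiny")

# Reload the generation config from the same checkpoint so the
# timestamp attributes are populated before calling
# model.generate(..., return_timestamps=True).
model.generation_config = GenerationConfig.from_pretrained("openai/whisper-tiny")
```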