Open bagsorbet opened 11 months ago
Yes. This was poorly documented.
You need to adjust your audio array to write to the wav file if you are running in float16.
audio_array = speech_output.cpu().numpy().squeeze()
audio_array /=1.414
audio_array *= 32767
audio_array = audio_array.astype(np.int16)
# print(audio_array)
scipy.io.wavfile.write("bark_out_bet.wav", rate=sampling_rate, data=audio_array)
Hi all,
I'm trying to get bark up and running, and used the example code to see if it's working.
OS is Ubuntu 22.04. Running the latest stable release of python3, using pytorch for CUDA 12.2, can provide more details if necessary (I am very inexperienced with these tools so please pardon me if there is a glaring omission in details pertinent to diagnosing the problem).
Here's what happens when I use the example:
Is this related to the deprecated packages? I searched around for this error, and the only thing I found seems entirely unrelated (something about seconds since 1970, but since this is about the format and not time, I am pretty sure that has no bearing whatsoever on my problem).
So, any ideas? :-)