rejuce closed this 9 months ago
I'm not really sure what could be happening here. Between runs, nothing caches the previously supplied wave files, as far as I am aware.
Can you share output showing where it's loading anything from the previous speaker so we can get a better look at what might be going on?
The first part stayed the same: it lists the chapters and loads the model.
Part: 27
Then, once we proceeded halfway to the demon's camp, the black cube, Mao, sent a number of demons forward too. Is that large man with a doglike face a kobold? There was a woman who looked like a vampire in armor, and a heavily armored lizardman too. B
Length: 5599
Part: 28
Extra Story, The Prickly Girl Wants to Be Spoiled Too It was a little after Souma and his group returned from the Demon Lord's Domain to the Kingdom. Around this time, the two countries were sorting out how they were going to announce the complete l
Number of chapters to read: 28
Saving to How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-badr-odhiambo.m4b
Total characters: 390646
Using GPU
VRAM: 4294443008
Loading model: /home/jk/.local/share/tts/tts_models--multilingual--multi-dataset--xtts_v2
tts_models/multilingual/multi-dataset/xtts_v2 is already downloaded.
Using model: xtts
Previously, at this point it listed all the already converted chapters, showed the next batch of sentences to send to TTS, processed them, and showed the real-time factor, etc. That's all gone now.
But since I used the .wav files once (even from a different directory), the already finished chapters of this book are ignored, and the output looks the same as when I called it with the .wav files:
0%| | 0/2 [00:13<?, ?it/s]
Since that one call with the .wav files, the intermediate files it creates are .wav.
It looks like it is detecting existing ones, but it now looks for .wav intermediate files,
whereas the intermediate files created before were .flac, and it was also looking for .flac before.
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-1.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-1.wav'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-10.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-11.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-12.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-13.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-14.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-15.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-16.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-2.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-2.wav'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-3.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-4.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-5.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-6.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-7.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-8.flac'
'How a Realist Hero Rebuilt the Kingdom - LN 17 Premium-9.flac'
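A quick way to see which chapters exist in which intermediate format is to group the files by extension. The following is only a rough sketch: it assumes the naming visible in the listing, "<book title>-<N>.flac" or "<book title>-<N>.wav", and the function name chapters_by_format is made up for illustration; it is not part of epub2tts.

```python
import re
from collections import defaultdict
from pathlib import Path

# Assumed naming, taken from the listing above: "<book title>-<N>.<ext>".
# This pattern is a guess based on the file names, not on the epub2tts source.
CHAPTER_FILE = re.compile(r"^(?P<book>.+)-(?P<num>\d+)\.(?P<ext>flac|wav)$")


def chapters_by_format(names):
    """Map each extension to the sorted chapter numbers found for it."""
    found = defaultdict(set)
    for name in names:
        match = CHAPTER_FILE.match(Path(name).name)
        if match:
            found[match["ext"]].add(int(match["num"]))
    return {ext: sorted(nums) for ext, nums in found.items()}


if __name__ == "__main__":
    # Report on the current directory.
    report = chapters_by_format(p.name for p in Path(".").iterdir())
    for ext, nums in report.items():
        print(f".{ext}: chapters {nums}")
```

If both extensions show up for the same book, the chapters were produced with two different intermediate formats, which would explain why the newer run does not pick up the older files.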
It is just surprising that calling it once with the input .wav options also changes the behaviour of calling it without them. Or did this change due to recent commits?
Ahh, I just saw that the last merge, ffmpeg, changed the format of the intermediate files.
It would still be nice to have the real-time factor display back as well, though.
AH! I understand what happened... If epub2tts is interrupted before completing, it leaves behind the intermediate files it has already finished. When I first started working on this, it would occasionally crash, sometimes after having already done a lot of the work. For instance, each chapter is made up of a bunch of small files that get concatenated into one file per chapter. So if it crashes and you leave those temp files sitting there, epub2tts will pick up where it left off. BUT if you want to start fresh after a crash, you need to delete all the temp-N.wav (or .flac, if you were using an earlier version) files. The same goes for each chapter file: for instance, if the epub was "mybook.epub" and you had --start 5, and it crashed while working on the third part, you will have mybook-5.wav and mybook-6.wav sitting in the directory.
SO - if you remove all those temp files and start again, it will start fresh from the beginning. If you leave the temp files, it will try to pick up where it left off.
Hope that explanation is clear enough and helps, please feel free to ask about anything else that comes up!
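For anyone who wants to script the "start fresh" cleanup described above, here is a minimal sketch. It assumes the file names mentioned in this thread (temp-N.wav / temp-N.flac and <book>-N.wav / <book>-N.flac); the helper names find_intermediate_files and clean are hypothetical and not part of epub2tts. It defaults to a dry run so nothing is deleted by accident.

```python
import re
from pathlib import Path


def find_intermediate_files(directory, book_stem):
    """Return leftover files that look like epub2tts intermediates.

    Assumed patterns (from the description in this thread, not from the
    epub2tts source):
      temp-<N>.wav / temp-<N>.flac              small per-chunk pieces
      <book_stem>-<N>.wav / <book_stem>-<N>.flac  finished chapter files
    """
    pattern = re.compile(
        r"^(?:temp|" + re.escape(book_stem) + r")-\d+\.(?:wav|flac)$"
    )
    return sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and pattern.match(p.name)
    )


def clean(directory, book_stem, dry_run=True):
    """List (dry run) or delete the intermediates; return the affected paths."""
    leftovers = find_intermediate_files(directory, book_stem)
    for path in leftovers:
        if dry_run:
            print("would delete", path)
        else:
            path.unlink()
    return leftovers


if __name__ == "__main__":
    # "mybook" matches the example epub name used above; change as needed.
    clean(".", "mybook", dry_run=True)
```

Because the pattern requires a numeric suffix, the final .m4b and any speaker sample .wav files with other names are left alone. Review the dry-run output first, then call clean(..., dry_run=False) to actually remove the files.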
Yeah, it's a very useful feature. I had just started the book, then updated and expected to continue it. It would have worked, but the commit changed the intermediate format. All good.
I got it working with the latest version, so I could load the speech files for refining.
I did not like the result, though. Now I want to continue converting my books with the default models set by --engine xtts --speaker ...
But somehow it still tries to load the latent speaker data from before, when I supplied the three wave files. I tried deleting the model completely, but nothing changed.
How do I get back the original behaviour I had when calling python3 /home/jk/epub2tts/epub2tts.py ~/epubconvert/....epub --engine xtts --speaker "Badr Odhiambo", before I called it with the .wav files as input?