Open slopezpereyra opened 1 year ago
The plugin saves the recordings under media/voiceovers/
. Can you play the ones you recorded with "default" device? They will properly not play properly with a media player, but just trying to narrow the problem down.
I have the same issue, and would like to contribute by answering that the recording is properly saved in in the voiceovers folder, the problem occurs when combining it with mp4 file
Description of bug / unexpected behavior
I took the following example from the VoiceOver Website:
I then ran
manim -pqh myfile.py MyScene --disable_caching
. I was requested to chose from which input device to record. I chose "default" (13). I recorded my voice as instructed, holding the 'r' key.Upon finishing my recording, the following message appeared on the console:
I run Ubuntu 23.04. All dependencies were installed and are up-to-date.
Expected behavior
After executing
manim -pqh myfile.py MyScene --disable_caching
and recording my voice with my (functioning) microphone, I expected the recording to be succesfully embedded to the video and an .mp4 file to be outputted with the recording.How to reproduce the issue
Code for reproducing the problem
```py from manim import * from manim_voiceover import VoiceoverScene from manim_voiceover.services.recorder import RecorderService class MyScene(VoiceoverScene): def construct(self): self.set_speech_service(RecorderService( )) with self.voiceover(text="This circle is drawn as I speak.") as tracker: self.play(Create(circle), run_time=tracker.duration) ```System specifications
System Details
- OS: Ubuntu 23.04 - RAM: 16 GB - Python version 3.11.2 - Installed modules (provide output from `pip list`): ``` Package Version ------------------------------ ---------- azure-cognitiveservices-speech 1.28.0 build 0.10.0 certifi 2022.12.7 charset-normalizer 3.1.0 click 8.1.3 click-default-group 1.2.2 cloup 0.13.1 cmake 3.26.3 colour 0.1.5 decorator 5.1.1 docstring-to-markdown 0.12 evdev 1.6.1 ffmpeg-python 0.2.0 filelock 3.12.0 fsspec 2023.4.0 future 0.18.3 glcontext 2.3.7 greenlet 2.0.2 gTTS 2.3.2 huggingface-hub 0.14.1 humanhash3 0.0.6 idna 3.4 isosurfaces 0.1.0 jedi 0.17.2 Jinja2 3.1.2 lit 16.0.2 llvmlite 0.40.0 manim 0.17.3 manim-voiceover 0.3.0 ManimPango 0.4.3 mapbox-earcut 1.0.1 markdown-it-py 2.2.0 MarkupSafe 2.1.2 mdurl 0.1.2 moderngl 5.8.2 moderngl-window 2.4.3 more-itertools 9.1.0 mpmath 1.3.0 msgpack 1.0.5 multipledispatch 0.6.0 mutagen 1.46.0 networkx 2.8.8 numba 0.57.0 numpy 1.24.3 nvidia-cublas-cu11 11.10.3.66 nvidia-cuda-cupti-cu11 11.7.101 nvidia-cuda-nvrtc-cu11 11.7.99 nvidia-cuda-runtime-cu11 11.7.99 nvidia-cudnn-cu11 8.5.0.96 nvidia-cufft-cu11 10.9.0.58 nvidia-curand-cu11 10.2.10.91 nvidia-cusolver-cu11 11.4.0.1 nvidia-cusparse-cu11 11.7.4.91 nvidia-nccl-cu11 2.14.3 nvidia-nvtx-cu11 11.7.91 openai-whisper 20230314 packaging 23.1 parso 0.7.1 Pillow 9.5.0 pip 23.1.2 pip-tools 6.13.0 playsound 1.3.0 pluggy 1.0.0 PyAudio 0.2.13 pycairo 1.23.0 pydub 0.25.1 pyglet 2.0.5 Pygments 2.15.1 pynput 1.7.6 pynvim 0.4.3 pyproject_hooks 1.0.0 pyrr 0.10.3 python-dotenv 0.21.1 python-jsonrpc-server 0.4.0 python-language-server 0.36.2 python-lsp-jsonrpc 1.0.0 python-lsp-server 1.7.2 python-xlib 0.33 PyYAML 6.0 regex 2023.5.5 requests 2.29.0 rich 13.3.5 scipy 1.10.1 screeninfo 0.8.1 setuptools 66.1.1 six 1.16.0 skia-pathops 0.7.4 sox 1.4.1 srt 3.5.3 stable-ts 2.5.3 svgelements 1.9.3 sympy 1.11.1 tiktoken 0.3.1 tokenizers 0.13.3 torch 2.0.0 torchaudio 2.0.1 tqdm 4.65.0 transformers 4.28.1 triton 2.0.0 typing_extensions 4.5.0 ujson 5.7.0 urllib3 1.26.15 watchdog 2.3.1 wheel 0.40.0 ```FFMPEG
Output of `ffmpeg -version`: ``` ffmpeg version 5.1.2-3ubuntu1 Copyright (c) 2000-2022 the FFmpeg developers built with gcc 12 (Ubuntu 12.2.0-14ubuntu2) configuration: --prefix=/usr --extra-version=3ubuntu1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libglslang --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librist --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libsvtav1 --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --disable-sndio --enable-libjxl --enable-pocketsphinx --enable-librsvg --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-libplacebo --enable-librav1e --enable-shared libavutil 57. 28.100 / 57. 28.100 libavcodec 59. 37.100 / 59. 37.100 libavformat 59. 27.100 / 59. 27.100 libavdevice 59. 7.100 / 59. 7.100 libavfilter 8. 44.100 / 8. 44.100 libswscale 6. 7.100 / 6. 7.100 libswresample 4. 7.100 / 4. 7.100 libpostproc 56. 6.100 / 56. 6.100 ```Aditional comments
Choosing HDA Intel PCH: ALC897 Analog or HDA Intel PCH: ALC897 Alt Analog as input devices, instead of default, did not produce the same issue. However, the recordings were of terrible quality (not a microphone issue, tested the same microphone on an online recorder and had good quality).