Open slopezpereyra opened 1 year ago
The plugin saves the recordings under media/voiceovers/
. Can you play the ones you recorded with "default" device? They will properly not play properly with a media player, but just trying to narrow the problem down.
Description of bug / unexpected behavior
I took the following example from the VoiceOver Website:
I then ran
manim -pqh myfile.py MyScene --disable_caching
. I was requested to chose from which input device to record. I chose "default" (13). I recorded my voice as instructed, holding the 'r' key.Upon finishing my recording, the following message appeared on the console:
I run Ubuntu 23.04. All dependencies were installed and are up-to-date.
Expected behavior
After executing
manim -pqh myfile.py MyScene --disable_caching
and recording my voice with my (functioning) microphone, I expected the recording to be succesfully embedded to the video and an .mp4 file to be outputted with the recording.How to reproduce the issue
Code for reproducing the problem
```py from manim import * from manim_voiceover import VoiceoverScene from manim_voiceover.services.recorder import RecorderService class MyScene(VoiceoverScene): def construct(self): self.set_speech_service(RecorderService( )) with self.voiceover(text="This circle is drawn as I speak.") as tracker: self.play(Create(circle), run_time=tracker.duration) ```System specifications
System Details
- OS: Ubuntu 23.04 - RAM: 16 GB - Python version 3.11.2 - Installed modules (provide output from `pip list`): ``` Package Version ------------------------------ ---------- azure-cognitiveservices-speech 1.28.0 build 0.10.0 certifi 2022.12.7 charset-normalizer 3.1.0 click 8.1.3 click-default-group 1.2.2 cloup 0.13.1 cmake 3.26.3 colour 0.1.5 decorator 5.1.1 docstring-to-markdown 0.12 evdev 1.6.1 ffmpeg-python 0.2.0 filelock 3.12.0 fsspec 2023.4.0 future 0.18.3 glcontext 2.3.7 greenlet 2.0.2 gTTS 2.3.2 huggingface-hub 0.14.1 humanhash3 0.0.6 idna 3.4 isosurfaces 0.1.0 jedi 0.17.2 Jinja2 3.1.2 lit 16.0.2 llvmlite 0.40.0 manim 0.17.3 manim-voiceover 0.3.0 ManimPango 0.4.3 mapbox-earcut 1.0.1 markdown-it-py 2.2.0 MarkupSafe 2.1.2 mdurl 0.1.2 moderngl 5.8.2 moderngl-window 2.4.3 more-itertools 9.1.0 mpmath 1.3.0 msgpack 1.0.5 multipledispatch 0.6.0 mutagen 1.46.0 networkx 2.8.8 numba 0.57.0 numpy 1.24.3 nvidia-cublas-cu11 11.10.3.66 nvidia-cuda-cupti-cu11 11.7.101 nvidia-cuda-nvrtc-cu11 11.7.99 nvidia-cuda-runtime-cu11 11.7.99 nvidia-cudnn-cu11 8.5.0.96 nvidia-cufft-cu11 10.9.0.58 nvidia-curand-cu11 10.2.10.91 nvidia-cusolver-cu11 11.4.0.1 nvidia-cusparse-cu11 11.7.4.91 nvidia-nccl-cu11 2.14.3 nvidia-nvtx-cu11 11.7.91 openai-whisper 20230314 packaging 23.1 parso 0.7.1 Pillow 9.5.0 pip 23.1.2 pip-tools 6.13.0 playsound 1.3.0 pluggy 1.0.0 PyAudio 0.2.13 pycairo 1.23.0 pydub 0.25.1 pyglet 2.0.5 Pygments 2.15.1 pynput 1.7.6 pynvim 0.4.3 pyproject_hooks 1.0.0 pyrr 0.10.3 python-dotenv 0.21.1 python-jsonrpc-server 0.4.0 python-language-server 0.36.2 python-lsp-jsonrpc 1.0.0 python-lsp-server 1.7.2 python-xlib 0.33 PyYAML 6.0 regex 2023.5.5 requests 2.29.0 rich 13.3.5 scipy 1.10.1 screeninfo 0.8.1 setuptools 66.1.1 six 1.16.0 skia-pathops 0.7.4 sox 1.4.1 srt 3.5.3 stable-ts 2.5.3 svgelements 1.9.3 sympy 1.11.1 tiktoken 0.3.1 tokenizers 0.13.3 torch 2.0.0 torchaudio 2.0.1 tqdm 4.65.0 transformers 4.28.1 triton 2.0.0 typing_extensions 4.5.0 ujson 5.7.0 urllib3 1.26.15 watchdog 2.3.1 wheel 0.40.0 ```FFMPEG
Output of `ffmpeg -version`: ``` ffmpeg version 5.1.2-3ubuntu1 Copyright (c) 2000-2022 the FFmpeg developers built with gcc 12 (Ubuntu 12.2.0-14ubuntu2) configuration: --prefix=/usr --extra-version=3ubuntu1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libglslang --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librist --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libsvtav1 --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzimg --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --disable-sndio --enable-libjxl --enable-pocketsphinx --enable-librsvg --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-libplacebo --enable-librav1e --enable-shared libavutil 57. 28.100 / 57. 28.100 libavcodec 59. 37.100 / 59. 37.100 libavformat 59. 27.100 / 59. 27.100 libavdevice 59. 7.100 / 59. 7.100 libavfilter 8. 44.100 / 8. 44.100 libswscale 6. 7.100 / 6. 7.100 libswresample 4. 7.100 / 4. 7.100 libpostproc 56. 6.100 / 56. 6.100 ```Aditional comments
Choosing HDA Intel PCH: ALC897 Analog or HDA Intel PCH: ALC897 Alt Analog as input devices, instead of default, did not produce the same issue. However, the recordings were of terrible quality (not a microphone issue, tested the same microphone on an online recorder and had good quality).