Plachtaa / seed-vc

State-of-the-Art zero-shot voice conversion & singing voice conversion with in context learning
GNU General Public License v3.0
596 stars 66 forks source link

Issue installing `requirements.txt` in linux kaggle envrionment #44

Open slooi opened 3 days ago

slooi commented 3 days ago

When running the below command, I get pip dependency issues as shown in the terminal output section. Does anyone know how to solve this?

!git clone https://github.com/Plachtaa/seed-vc
%cd ./seed-vc
!pip install -r requirements.txt

Full Terminal Output:

Cloning into 'seed-vc'...
remote: Enumerating objects: 525, done.
remote: Counting objects: 100% (75/75), done.
remote: Compressing objects: 100% (28/28), done.
remote: Total 525 (delta 56), reused 48 (delta 47), pack-reused 450 (from 1)
Receiving objects: 100% (525/525), 62.92 MiB | 24.60 MiB/s, done.
Resolving deltas: 100% (233/233), done.
/kaggle/working/seed-vc
Looking in indexes: https://pypi.org/simple, https://download.pytorch.org/whl/cu113
Collecting git+https://github.com/openai/whisper.git (from -r requirements.txt (line 11))
  Cloning https://github.com/openai/whisper.git to /tmp/pip-req-build-s6nq1_x0
  Running command git clone --filter=blob:none --quiet https://github.com/openai/whisper.git /tmp/pip-req-build-s6nq1_x0
  Resolved https://github.com/openai/whisper.git to commit 271445b2f24f00f8175c4fb7ae91876f7451dfc1
  Installing build dependencies ... done
  Getting requirements to build wheel ... done
  Preparing metadata (pyproject.toml) ... done
Requirement already satisfied: torch in /opt/conda/lib/python3.10/site-packages (from -r requirements.txt (line 2)) (2.4.0+cpu)
Requirement already satisfied: torchvision in /opt/conda/lib/python3.10/site-packages (from -r requirements.txt (line 3)) (0.19.0+cpu)
Requirement already satisfied: torchaudio in /opt/conda/lib/python3.10/site-packages (from -r requirements.txt (line 4)) (2.4.0+cpu)
Collecting scipy==1.13.1 (from -r requirements.txt (line 5))
  Downloading scipy-1.13.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (60 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 60.6/60.6 kB 1.6 MB/s eta 0:00:00
Collecting librosa==0.10.2 (from -r requirements.txt (line 6))
  Downloading librosa-0.10.2-py3-none-any.whl.metadata (8.6 kB)
Requirement already satisfied: huggingface-hub in /opt/conda/lib/python3.10/site-packages (from -r requirements.txt (line 7)) (0.25.1)
Collecting munch (from -r requirements.txt (line 8))
  Downloading munch-4.0.0-py2.py3-none-any.whl.metadata (5.9 kB)
Collecting einops (from -r requirements.txt (line 9))
  Downloading einops-0.8.0-py3-none-any.whl.metadata (12 kB)
Collecting descript-audio-codec (from -r requirements.txt (line 10))
  Downloading descript_audio_codec-1.0.0-py3-none-any.whl.metadata (7.8 kB)
Collecting gradio (from -r requirements.txt (line 12))
  Downloading gradio-5.5.0-py3-none-any.whl.metadata (16 kB)
Requirement already satisfied: pydub in /opt/conda/lib/python3.10/site-packages (from -r requirements.txt (line 13)) (0.25.1)
Collecting resemblyzer (from -r requirements.txt (line 14))
  Downloading Resemblyzer-0.1.4-py3-none-any.whl.metadata (5.8 kB)
Collecting jiwer (from -r requirements.txt (line 15))
  Downloading jiwer-3.0.5-py3-none-any.whl.metadata (2.7 kB)
Requirement already satisfied: transformers in /opt/conda/lib/python3.10/site-packages (from -r requirements.txt (line 16)) (4.45.1)
Collecting FreeSimpleGUI (from -r requirements.txt (line 17))
  Downloading FreeSimpleGUI-5.1.1-py3-none-any.whl.metadata (976 bytes)
Requirement already satisfied: soundfile in /opt/conda/lib/python3.10/site-packages (from -r requirements.txt (line 18)) (0.12.1)
Collecting sounddevice (from -r requirements.txt (line 19))
  Downloading sounddevice-0.5.1-py3-none-any.whl.metadata (1.4 kB)
Requirement already satisfied: numpy<2.3,>=1.22.4 in /opt/conda/lib/python3.10/site-packages (from scipy==1.13.1->-r requirements.txt (line 5)) (1.26.4)
Requirement already satisfied: audioread>=2.1.9 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (3.0.1)
Requirement already satisfied: scikit-learn>=0.20.0 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (1.2.2)
Requirement already satisfied: joblib>=0.14 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (1.4.2)
Requirement already satisfied: decorator>=4.3.0 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (5.1.1)
Requirement already satisfied: numba>=0.51.0 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (0.60.0)
Requirement already satisfied: pooch>=1.1 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (1.8.2)
Requirement already satisfied: soxr>=0.3.2 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (0.5.0.post1)
Requirement already satisfied: typing-extensions>=4.1.1 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (4.12.2)
Requirement already satisfied: lazy-loader>=0.1 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (0.4)
Requirement already satisfied: msgpack>=1.0 in /opt/conda/lib/python3.10/site-packages (from librosa==0.10.2->-r requirements.txt (line 6)) (1.0.8)
Requirement already satisfied: filelock in /opt/conda/lib/python3.10/site-packages (from torch->-r requirements.txt (line 2)) (3.15.1)
Requirement already satisfied: sympy in /opt/conda/lib/python3.10/site-packages (from torch->-r requirements.txt (line 2)) (1.12)
Requirement already satisfied: networkx in /opt/conda/lib/python3.10/site-packages (from torch->-r requirements.txt (line 2)) (3.3)
Requirement already satisfied: jinja2 in /opt/conda/lib/python3.10/site-packages (from torch->-r requirements.txt (line 2)) (3.1.4)
Requirement already satisfied: fsspec in /opt/conda/lib/python3.10/site-packages (from torch->-r requirements.txt (line 2)) (2024.6.1)
Requirement already satisfied: pillow!=8.3.*,>=5.3.0 in /opt/conda/lib/python3.10/site-packages (from torchvision->-r requirements.txt (line 3)) (10.3.0)
Requirement already satisfied: packaging>=20.9 in /opt/conda/lib/python3.10/site-packages (from huggingface-hub->-r requirements.txt (line 7)) (21.3)
Requirement already satisfied: pyyaml>=5.1 in /opt/conda/lib/python3.10/site-packages (from huggingface-hub->-r requirements.txt (line 7)) (6.0.2)
Requirement already satisfied: requests in /opt/conda/lib/python3.10/site-packages (from huggingface-hub->-r requirements.txt (line 7)) (2.32.3)
Requirement already satisfied: tqdm>=4.42.1 in /opt/conda/lib/python3.10/site-packages (from huggingface-hub->-r requirements.txt (line 7)) (4.66.4)
Collecting argbind>=0.3.7 (from descript-audio-codec->-r requirements.txt (line 10))
  Downloading argbind-0.3.9.tar.gz (17 kB)
  Preparing metadata (setup.py) ... done
Collecting descript-audiotools>=0.7.2 (from descript-audio-codec->-r requirements.txt (line 10))
  Downloading descript_audiotools-0.7.2-py2.py3-none-any.whl.metadata (3.4 kB)
Requirement already satisfied: more-itertools in /opt/conda/lib/python3.10/site-packages (from openai-whisper==20240930->-r requirements.txt (line 11)) (10.3.0)
Collecting tiktoken (from openai-whisper==20240930->-r requirements.txt (line 11))
  Downloading tiktoken-0.8.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.6 kB)
Collecting triton>=2.0.0 (from openai-whisper==20240930->-r requirements.txt (line 11))
  Downloading https://download.pytorch.org/whl/triton-3.1.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (209.5 MB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 209.5/209.5 MB 5.8 MB/s eta 0:00:0000:0100:01
Requirement already satisfied: aiofiles<24.0,>=22.0 in /opt/conda/lib/python3.10/site-packages (from gradio->-r requirements.txt (line 12)) (22.1.0)
Requirement already satisfied: anyio<5.0,>=3.0 in /opt/conda/lib/python3.10/site-packages (from gradio->-r requirements.txt (line 12)) (4.4.0)
Collecting fastapi<1.0,>=0.115.2 (from gradio->-r requirements.txt (line 12))
  Downloading fastapi-0.115.4-py3-none-any.whl.metadata (27 kB)
Collecting ffmpy (from gradio->-r requirements.txt (line 12))
  Downloading ffmpy-0.4.0-py3-none-any.whl.metadata (2.9 kB)
Collecting gradio-client==1.4.2 (from gradio->-r requirements.txt (line 12))
  Downloading gradio_client-1.4.2-py3-none-any.whl.metadata (7.1 kB)
Requirement already satisfied: httpx>=0.24.1 in /opt/conda/lib/python3.10/site-packages (from gradio->-r requirements.txt (line 12)) (0.27.0)
Requirement already satisfied: markupsafe~=2.0 in /opt/conda/lib/python3.10/site-packages (from gradio->-r requirements.txt (line 12)) (2.1.5)
Requirement already satisfied: orjson~=3.0 in /opt/conda/lib/python3.10/site-packages (from gradio->-r requirements.txt (line 12)) (3.10.4)
Requirement already satisfied: pandas<3.0,>=1.0 in /opt/conda/lib/python3.10/site-packages (from gradio->-r requirements.txt (line 12)) (2.2.3)
Requirement already satisfied: pydantic>=2.0 in /opt/conda/lib/python3.10/site-packages (from gradio->-r requirements.txt (line 12)) (2.9.2)
Collecting python-multipart==0.0.12 (from gradio->-r requirements.txt (line 12))
  Downloading python_multipart-0.0.12-py3-none-any.whl.metadata (1.9 kB)
Collecting ruff>=0.2.2 (from gradio->-r requirements.txt (line 12))
  Downloading ruff-0.7.3-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (25 kB)
Collecting safehttpx<1.0,>=0.1.1 (from gradio->-r requirements.txt (line 12))
  Downloading safehttpx-0.1.1-py3-none-any.whl.metadata (4.1 kB)
Collecting semantic-version~=2.0 (from gradio->-r requirements.txt (line 12))
  Downloading semantic_version-2.10.0-py2.py3-none-any.whl.metadata (9.7 kB)
Collecting starlette<1.0,>=0.40.0 (from gradio->-r requirements.txt (line 12))
  Downloading starlette-0.41.2-py3-none-any.whl.metadata (6.0 kB)
Collecting tomlkit==0.12.0 (from gradio->-r requirements.txt (line 12))
  Downloading tomlkit-0.12.0-py3-none-any.whl.metadata (2.7 kB)
Requirement already satisfied: typer<1.0,>=0.12 in /opt/conda/lib/python3.10/site-packages (from gradio->-r requirements.txt (line 12)) (0.12.3)
Requirement already satisfied: uvicorn>=0.14.0 in /opt/conda/lib/python3.10/site-packages (from gradio->-r requirements.txt (line 12)) (0.30.1)
Requirement already satisfied: websockets<13.0,>=10.0 in /opt/conda/lib/python3.10/site-packages (from gradio-client==1.4.2->gradio->-r requirements.txt (line 12)) (12.0)
Collecting webrtcvad>=2.0.10 (from resemblyzer->-r requirements.txt (line 14))
  Downloading webrtcvad-2.0.10.tar.gz (66 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 66.2/66.2 kB 1.8 MB/s eta 0:00:00
  Preparing metadata (setup.py) ... done
Collecting typing (from resemblyzer->-r requirements.txt (line 14))
  Downloading typing-3.7.4.3.tar.gz (78 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 78.6/78.6 kB 2.7 MB/s eta 0:00:00
  Preparing metadata (setup.py) ... done
Requirement already satisfied: click<9.0.0,>=8.1.3 in /opt/conda/lib/python3.10/site-packages (from jiwer->-r requirements.txt (line 15)) (8.1.7)
Collecting rapidfuzz<4,>=3 (from jiwer->-r requirements.txt (line 15))
  Downloading rapidfuzz-3.10.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (11 kB)
Requirement already satisfied: regex!=2019.12.17 in /opt/conda/lib/python3.10/site-packages (from transformers->-r requirements.txt (line 16)) (2024.5.15)
Requirement already satisfied: safetensors>=0.4.1 in /opt/conda/lib/python3.10/site-packages (from transformers->-r requirements.txt (line 16)) (0.4.5)
Requirement already satisfied: tokenizers<0.21,>=0.20 in /opt/conda/lib/python3.10/site-packages (from transformers->-r requirements.txt (line 16)) (0.20.0)
Requirement already satisfied: cffi>=1.0 in /opt/conda/lib/python3.10/site-packages (from soundfile->-r requirements.txt (line 18)) (1.16.0)
Requirement already satisfied: idna>=2.8 in /opt/conda/lib/python3.10/site-packages (from anyio<5.0,>=3.0->gradio->-r requirements.txt (line 12)) (3.7)
Requirement already satisfied: sniffio>=1.1 in /opt/conda/lib/python3.10/site-packages (from anyio<5.0,>=3.0->gradio->-r requirements.txt (line 12)) (1.3.1)
Requirement already satisfied: exceptiongroup>=1.0.2 in /opt/conda/lib/python3.10/site-packages (from anyio<5.0,>=3.0->gradio->-r requirements.txt (line 12)) (1.2.0)
Requirement already satisfied: docstring-parser in /opt/conda/lib/python3.10/site-packages (from argbind>=0.3.7->descript-audio-codec->-r requirements.txt (line 10)) (0.16)
Requirement already satisfied: pycparser in /opt/conda/lib/python3.10/site-packages (from cffi>=1.0->soundfile->-r requirements.txt (line 18)) (2.22)
Collecting pyloudnorm (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10))
  Downloading pyloudnorm-0.1.1-py3-none-any.whl.metadata (5.6 kB)
Requirement already satisfied: importlib-resources in /opt/conda/lib/python3.10/site-packages (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (6.4.0)
Collecting julius (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10))
  Downloading julius-0.2.7.tar.gz (59 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 59.6/59.6 kB 1.6 MB/s eta 0:00:00
  Preparing metadata (setup.py) ... done
Requirement already satisfied: ipython in /opt/conda/lib/python3.10/site-packages (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (8.21.0)
Requirement already satisfied: rich in /opt/conda/lib/python3.10/site-packages (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (13.7.1)
Requirement already satisfied: matplotlib in /opt/conda/lib/python3.10/site-packages (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (3.7.5)
Collecting pystoi (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10))
  Downloading pystoi-0.4.1-py2.py3-none-any.whl.metadata (4.0 kB)
Collecting torch-stoi (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10))
  Downloading torch_stoi-0.2.3-py3-none-any.whl.metadata (3.6 kB)
Collecting flatten-dict (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10))
  Downloading flatten_dict-0.4.2-py2.py3-none-any.whl.metadata (9.2 kB)
Collecting markdown2 (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10))
  Downloading markdown2-2.5.1-py2.py3-none-any.whl.metadata (2.2 kB)
Collecting randomname (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10))
  Downloading randomname-0.2.1.tar.gz (64 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 64.2/64.2 kB 1.7 MB/s eta 0:00:00
  Preparing metadata (setup.py) ... done
Collecting protobuf<3.20,>=3.9.2 (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10))
  Downloading protobuf-3.19.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (787 bytes)
Requirement already satisfied: tensorboard in /opt/conda/lib/python3.10/site-packages (from descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (2.16.2)
Requirement already satisfied: certifi in /opt/conda/lib/python3.10/site-packages (from httpx>=0.24.1->gradio->-r requirements.txt (line 12)) (2024.8.30)
Requirement already satisfied: httpcore==1.* in /opt/conda/lib/python3.10/site-packages (from httpx>=0.24.1->gradio->-r requirements.txt (line 12)) (1.0.5)
Requirement already satisfied: h11<0.15,>=0.13 in /opt/conda/lib/python3.10/site-packages (from httpcore==1.*->httpx>=0.24.1->gradio->-r requirements.txt (line 12)) (0.14.0)
Requirement already satisfied: llvmlite<0.44,>=0.43.0dev0 in /opt/conda/lib/python3.10/site-packages (from numba>=0.51.0->librosa==0.10.2->-r requirements.txt (line 6)) (0.43.0)
Requirement already satisfied: pyparsing!=3.0.5,>=2.0.2 in /opt/conda/lib/python3.10/site-packages (from packaging>=20.9->huggingface-hub->-r requirements.txt (line 7)) (3.1.2)
Requirement already satisfied: python-dateutil>=2.8.2 in /opt/conda/lib/python3.10/site-packages (from pandas<3.0,>=1.0->gradio->-r requirements.txt (line 12)) (2.9.0.post0)
Requirement already satisfied: pytz>=2020.1 in /opt/conda/lib/python3.10/site-packages (from pandas<3.0,>=1.0->gradio->-r requirements.txt (line 12)) (2024.1)
Requirement already satisfied: tzdata>=2022.7 in /opt/conda/lib/python3.10/site-packages (from pandas<3.0,>=1.0->gradio->-r requirements.txt (line 12)) (2024.1)
Requirement already satisfied: platformdirs>=2.5.0 in /opt/conda/lib/python3.10/site-packages (from pooch>=1.1->librosa==0.10.2->-r requirements.txt (line 6)) (3.11.0)
Requirement already satisfied: annotated-types>=0.6.0 in /opt/conda/lib/python3.10/site-packages (from pydantic>=2.0->gradio->-r requirements.txt (line 12)) (0.7.0)
Requirement already satisfied: pydantic-core==2.23.4 in /opt/conda/lib/python3.10/site-packages (from pydantic>=2.0->gradio->-r requirements.txt (line 12)) (2.23.4)
Requirement already satisfied: charset-normalizer<4,>=2 in /opt/conda/lib/python3.10/site-packages (from requests->huggingface-hub->-r requirements.txt (line 7)) (3.3.2)
Requirement already satisfied: urllib3<3,>=1.21.1 in /opt/conda/lib/python3.10/site-packages (from requests->huggingface-hub->-r requirements.txt (line 7)) (1.26.18)
Requirement already satisfied: threadpoolctl>=2.0.0 in /opt/conda/lib/python3.10/site-packages (from scikit-learn>=0.20.0->librosa==0.10.2->-r requirements.txt (line 6)) (3.5.0)
Requirement already satisfied: shellingham>=1.3.0 in /opt/conda/lib/python3.10/site-packages (from typer<1.0,>=0.12->gradio->-r requirements.txt (line 12)) (1.5.4)
Requirement already satisfied: mpmath>=0.19 in /opt/conda/lib/python3.10/site-packages (from sympy->torch->-r requirements.txt (line 2)) (1.3.0)
Requirement already satisfied: six>=1.5 in /opt/conda/lib/python3.10/site-packages (from python-dateutil>=2.8.2->pandas<3.0,>=1.0->gradio->-r requirements.txt (line 12)) (1.16.0)
Requirement already satisfied: markdown-it-py>=2.2.0 in /opt/conda/lib/python3.10/site-packages (from rich->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (3.0.0)
Requirement already satisfied: pygments<3.0.0,>=2.13.0 in /opt/conda/lib/python3.10/site-packages (from rich->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (2.18.0)
Requirement already satisfied: jedi>=0.16 in /opt/conda/lib/python3.10/site-packages (from ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.19.1)
Requirement already satisfied: matplotlib-inline in /opt/conda/lib/python3.10/site-packages (from ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.1.7)
Requirement already satisfied: prompt-toolkit<3.1.0,>=3.0.41 in /opt/conda/lib/python3.10/site-packages (from ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (3.0.47)
Requirement already satisfied: stack-data in /opt/conda/lib/python3.10/site-packages (from ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.6.2)
Requirement already satisfied: traitlets>=5 in /opt/conda/lib/python3.10/site-packages (from ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (5.14.3)
Requirement already satisfied: pexpect>4.3 in /opt/conda/lib/python3.10/site-packages (from ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (4.9.0)
Requirement already satisfied: contourpy>=1.0.1 in /opt/conda/lib/python3.10/site-packages (from matplotlib->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (1.2.1)
Requirement already satisfied: cycler>=0.10 in /opt/conda/lib/python3.10/site-packages (from matplotlib->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.12.1)
Requirement already satisfied: fonttools>=4.22.0 in /opt/conda/lib/python3.10/site-packages (from matplotlib->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (4.53.0)
Requirement already satisfied: kiwisolver>=1.0.1 in /opt/conda/lib/python3.10/site-packages (from matplotlib->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (1.4.5)
Requirement already satisfied: future>=0.16.0 in /opt/conda/lib/python3.10/site-packages (from pyloudnorm->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (1.0.0)
Collecting fire (from randomname->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10))
  Downloading fire-0.7.0.tar.gz (87 kB)
     ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 87.2/87.2 kB 2.0 MB/s eta 0:00:00ta 0:00:01
  Preparing metadata (setup.py) ... done
Requirement already satisfied: absl-py>=0.4 in /opt/conda/lib/python3.10/site-packages (from tensorboard->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (1.4.0)
Requirement already satisfied: grpcio>=1.48.2 in /opt/conda/lib/python3.10/site-packages (from tensorboard->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (1.64.1)
Requirement already satisfied: markdown>=2.6.8 in /opt/conda/lib/python3.10/site-packages (from tensorboard->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (3.6)
Requirement already satisfied: setuptools>=41.0.0 in /opt/conda/lib/python3.10/site-packages (from tensorboard->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (70.0.0)
Requirement already satisfied: tensorboard-data-server<0.8.0,>=0.7.0 in /opt/conda/lib/python3.10/site-packages (from tensorboard->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.7.2)
Requirement already satisfied: werkzeug>=1.0.1 in /opt/conda/lib/python3.10/site-packages (from tensorboard->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (3.0.4)
Requirement already satisfied: parso<0.9.0,>=0.8.3 in /opt/conda/lib/python3.10/site-packages (from jedi>=0.16->ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.8.4)
Requirement already satisfied: mdurl~=0.1 in /opt/conda/lib/python3.10/site-packages (from markdown-it-py>=2.2.0->rich->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.1.2)
Requirement already satisfied: ptyprocess>=0.5 in /opt/conda/lib/python3.10/site-packages (from pexpect>4.3->ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.7.0)
Requirement already satisfied: wcwidth in /opt/conda/lib/python3.10/site-packages (from prompt-toolkit<3.1.0,>=3.0.41->ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.2.13)
Requirement already satisfied: termcolor in /opt/conda/lib/python3.10/site-packages (from fire->randomname->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (2.4.0)
Requirement already satisfied: executing>=1.2.0 in /opt/conda/lib/python3.10/site-packages (from stack-data->ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (2.0.1)
Requirement already satisfied: asttokens>=2.1.0 in /opt/conda/lib/python3.10/site-packages (from stack-data->ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (2.4.1)
Requirement already satisfied: pure-eval in /opt/conda/lib/python3.10/site-packages (from stack-data->ipython->descript-audiotools>=0.7.2->descript-audio-codec->-r requirements.txt (line 10)) (0.2.2)
Downloading scipy-1.13.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (38.6 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 38.6/38.6 MB 16.9 MB/s eta 0:00:0000:0100:01m
Downloading librosa-0.10.2-py3-none-any.whl (260 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 260.0/260.0 kB 6.7 MB/s eta 0:00:00:00:01
Downloading munch-4.0.0-py2.py3-none-any.whl (9.9 kB)
Downloading einops-0.8.0-py3-none-any.whl (43 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 43.2/43.2 kB 1.2 MB/s eta 0:00:00
Downloading descript_audio_codec-1.0.0-py3-none-any.whl (26 kB)
Downloading gradio-5.5.0-py3-none-any.whl (56.7 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 56.7/56.7 MB 21.5 MB/s eta 0:00:00:00:0100:01
Downloading gradio_client-1.4.2-py3-none-any.whl (319 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 319.8/319.8 kB 10.3 MB/s eta 0:00:00
Downloading python_multipart-0.0.12-py3-none-any.whl (23 kB)
Downloading tomlkit-0.12.0-py3-none-any.whl (37 kB)
Downloading Resemblyzer-0.1.4-py3-none-any.whl (15.7 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 15.7/15.7 MB 53.1 MB/s eta 0:00:00:00:0100:01
Downloading jiwer-3.0.5-py3-none-any.whl (21 kB)
Downloading FreeSimpleGUI-5.1.1-py3-none-any.whl (720 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 720.0/720.0 kB 18.3 MB/s eta 0:00:0000:01
Downloading sounddevice-0.5.1-py3-none-any.whl (32 kB)
Downloading descript_audiotools-0.7.2-py2.py3-none-any.whl (106 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 106.7/106.7 kB 2.5 MB/s eta 0:00:00ta 0:00:01
Downloading fastapi-0.115.4-py3-none-any.whl (94 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 94.7/94.7 kB 2.8 MB/s eta 0:00:00
Downloading rapidfuzz-3.10.1-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (3.1 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 3.1/3.1 MB 45.6 MB/s eta 0:00:00:00:01
Downloading ruff-0.7.3-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (11.0 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 11.0/11.0 MB 39.9 MB/s eta 0:00:0000:010:01m
Downloading safehttpx-0.1.1-py3-none-any.whl (8.4 kB)
Downloading semantic_version-2.10.0-py2.py3-none-any.whl (15 kB)
Downloading starlette-0.41.2-py3-none-any.whl (73 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 73.3/73.3 kB 2.3 MB/s eta 0:00:00
Downloading ffmpy-0.4.0-py3-none-any.whl (5.8 kB)
Downloading tiktoken-0.8.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.2 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.2/1.2 MB 27.5 MB/s eta 0:00:00:00:01
Downloading protobuf-3.19.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.1 MB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.1/1.1 MB 23.0 MB/s eta 0:00:00:00:01
Downloading flatten_dict-0.4.2-py2.py3-none-any.whl (9.7 kB)
Downloading markdown2-2.5.1-py2.py3-none-any.whl (48 kB)
   ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 48.4/48.4 kB 1.4 MB/s eta 0:00:00
Downloading pyloudnorm-0.1.1-py3-none-any.whl (9.6 kB)
Downloading pystoi-0.4.1-py2.py3-none-any.whl (8.2 kB)
Downloading torch_stoi-0.2.3-py3-none-any.whl (8.1 kB)
Building wheels for collected packages: openai-whisper, argbind, webrtcvad, typing, julius, randomname, fire
  Building wheel for openai-whisper (pyproject.toml) ... done
  Created wheel for openai-whisper: filename=openai_whisper-20240930-py3-none-any.whl size=803557 sha256=4a80f03a048cb4bb1358a0a5a536ff202a81c011e4c47f13d5eb987b536aa7b8
  Stored in directory: /tmp/pip-ephem-wheel-cache-bjdgq9xm/wheels/8b/6c/d0/622666868c179f156cf595c8b6f06f88bc5d80c4b31dccaa03
  Building wheel for argbind (setup.py) ... done
  Created wheel for argbind: filename=argbind-0.3.9-py2.py3-none-any.whl size=11729 sha256=341081f7f0b0ec8fd3852b02f160f56cc2ea36d68b9231c9942ba8a6d2e6ceed
  Stored in directory: /root/.cache/pip/wheels/ed/ab/ff/64eb14a776ae6525e1a7d6ad38b73ba020ecc4262d83a7889d
  Building wheel for webrtcvad (setup.py) ... done
  Created wheel for webrtcvad: filename=webrtcvad-2.0.10-cp310-cp310-linux_x86_64.whl size=27295 sha256=bec2eedc66bd6a962c525deabe91ac824d0ac1e664b97d2dbb6787911b7263b5
  Stored in directory: /root/.cache/pip/wheels/2a/2b/84/ac7bacfe8c68a87c1ee3dd3c66818a54c71599abf308e8eb35
  Building wheel for typing (setup.py) ... done
  Created wheel for typing: filename=typing-3.7.4.3-py3-none-any.whl size=26306 sha256=e9003cfbf16917f1b7966ee1b5909f0b5e5440fad8342542d97cd750debda7ae
  Stored in directory: /root/.cache/pip/wheels/7c/d0/9e/1f26ebb66d9e1732e4098bc5a6c2d91f6c9a529838f0284890
  Building wheel for julius (setup.py) ... done
  Created wheel for julius: filename=julius-0.2.7-py3-none-any.whl size=21870 sha256=c37e66b88730ff2cc4eaf0769e72d9ef6c810ed05712b68bafbedc0c96798f99
  Stored in directory: /root/.cache/pip/wheels/b9/b2/05/f883527ffcb7f2ead5438a2c23439aa0c881eaa9a4c80256f4
  Building wheel for randomname (setup.py) ... done
  Created wheel for randomname: filename=randomname-0.2.1-py3-none-any.whl size=89195 sha256=e270655277800973d987fb80093acb80782f75a9a2f92004d7af795de6b4ec68
  Stored in directory: /root/.cache/pip/wheels/10/50/8a/25f3820d26a431ffed1834d72ff2eb349123cf2b44c5a45727
  Building wheel for fire (setup.py) ... done
  Created wheel for fire: filename=fire-0.7.0-py3-none-any.whl size=114248 sha256=a2d8ec994922fe9f13f8b3a0f1123233c5ec5dcb881f3f82baf6c674c5fb024f
  Stored in directory: /root/.cache/pip/wheels/19/39/2f/2d3cadc408a8804103f1c34ddd4b9f6a93497b11fa96fe738e
Successfully built openai-whisper argbind webrtcvad typing julius randomname fire
Installing collected packages: webrtcvad, FreeSimpleGUI, typing, triton, tomlkit, semantic-version, scipy, ruff, rapidfuzz, python-multipart, protobuf, munch, markdown2, flatten-dict, fire, ffmpy, einops, argbind, tiktoken, starlette, sounddevice, randomname, pystoi, pyloudnorm, jiwer, safehttpx, openai-whisper, librosa, julius, gradio-client, fastapi, torch-stoi, resemblyzer, gradio, descript-audiotools, descript-audio-codec
  Attempting uninstall: tomlkit
    Found existing installation: tomlkit 0.13.2
    Uninstalling tomlkit-0.13.2:
      Successfully uninstalled tomlkit-0.13.2
  Attempting uninstall: scipy
    Found existing installation: scipy 1.14.1
    Uninstalling scipy-1.14.1:
      Successfully uninstalled scipy-1.14.1
  Attempting uninstall: python-multipart
    Found existing installation: python-multipart 0.0.9
    Uninstalling python-multipart-0.0.9:
      Successfully uninstalled python-multipart-0.0.9
  Attempting uninstall: protobuf
    Found existing installation: protobuf 3.20.3
    Uninstalling protobuf-3.20.3:
      Successfully uninstalled protobuf-3.20.3
  Attempting uninstall: starlette
    Found existing installation: starlette 0.37.2
    Uninstalling starlette-0.37.2:
      Successfully uninstalled starlette-0.37.2
  Attempting uninstall: librosa
    Found existing installation: librosa 0.10.2.post1
    Uninstalling librosa-0.10.2.post1:
      Successfully uninstalled librosa-0.10.2.post1
  Attempting uninstall: fastapi
    Found existing installation: fastapi 0.111.0
    Uninstalling fastapi-0.111.0:
      Successfully uninstalled fastapi-0.111.0
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
apache-beam 2.46.0 requires cloudpickle~=2.2.1, but you have cloudpickle 3.0.0 which is incompatible.
apache-beam 2.46.0 requires dill<0.3.2,>=0.3.1.1, but you have dill 0.3.8 which is incompatible.
apache-beam 2.46.0 requires numpy<1.25.0,>=1.14.3, but you have numpy 1.26.4 which is incompatible.
apache-beam 2.46.0 requires pyarrow<10.0.0,>=3.0.0, but you have pyarrow 17.0.0 which is incompatible.
cesium 0.12.3 requires numpy<3.0,>=2.0, but you have numpy 1.26.4 which is incompatible.
google-ai-generativelanguage 0.6.10 requires protobuf!=4.21.0,!=4.21.1,!=4.21.2,!=4.21.3,!=4.21.4,!=4.21.5,<6.0.0dev,>=3.20.2, but you have protobuf 3.19.6 which is incompatible.
google-cloud-aiplatform 0.6.0a1 requires google-api-core[grpc]<2.0.0dev,>=1.22.2, but you have google-api-core 2.11.1 which is incompatible.
google-cloud-automl 1.0.1 requires google-api-core[grpc]<2.0.0dev,>=1.14.0, but you have google-api-core 2.11.1 which is incompatible.
google-cloud-language 2.14.0 requires protobuf!=4.21.0,!=4.21.1,!=4.21.2,!=4.21.3,!=4.21.4,!=4.21.5,<6.0.0dev,>=3.20.2, but you have protobuf 3.19.6 which is incompatible.
google-cloud-spanner 3.47.0 requires protobuf!=3.20.0,!=3.20.1,!=4.21.0,!=4.21.1,!=4.21.2,!=4.21.3,!=4.21.4,!=4.21.5,<5.0.0dev,>=3.20.2, but you have protobuf 3.19.6 which is incompatible.
google-cloud-videointelligence 2.13.5 requires protobuf!=4.21.0,!=4.21.1,!=4.21.2,!=4.21.3,!=4.21.4,!=4.21.5,<6.0.0dev,>=3.20.2, but you have protobuf 3.19.6 which is incompatible.
kfp 2.5.0 requires google-cloud-storage<3,>=2.2.1, but you have google-cloud-storage 1.44.0 which is incompatible.
libpysal 4.9.2 requires packaging>=22, but you have packaging 21.3 which is incompatible.
libpysal 4.9.2 requires shapely>=2.0.1, but you have shapely 1.8.5.post1 which is incompatible.
onnx 1.17.0 requires protobuf>=3.20.2, but you have protobuf 3.19.6 which is incompatible.
tensorboardx 2.6.2.2 requires protobuf>=3.20, but you have protobuf 3.19.6 which is incompatible.
tensorflow 2.16.1 requires protobuf!=4.21.0,!=4.21.1,!=4.21.2,!=4.21.3,!=4.21.4,!=4.21.5,<5.0.0dev,>=3.20.3, but you have protobuf 3.19.6 which is incompatible.
tensorflow-datasets 4.9.6 requires protobuf>=3.20, but you have protobuf 3.19.6 which is incompatible.
tensorflow-serving-api 2.16.1 requires protobuf!=4.21.0,!=4.21.1,!=4.21.2,!=4.21.3,!=4.21.4,!=4.21.5,<5.0.0dev,>=3.20.3, but you have protobuf 3.19.6 which is incompatible.
tsfresh 0.20.3 requires scipy>=1.14.0; python_version >= "3.10", but you have scipy 1.13.1 which is incompatible.
Successfully installed FreeSimpleGUI-5.1.1 argbind-0.3.9 descript-audio-codec-1.0.0 descript-audiotools-0.7.2 einops-0.8.0 fastapi-0.115.4 ffmpy-0.4.0 fire-0.7.0 flatten-dict-0.4.2 gradio-5.5.0 gradio-client-1.4.2 jiwer-3.0.5 julius-0.2.7 librosa-0.10.2 markdown2-2.5.1 munch-4.0.0 openai-whisper-20240930 protobuf-3.19.6 pyloudnorm-0.1.1 pystoi-0.4.1 python-multipart-0.0.12 randomname-0.2.1 rapidfuzz-3.10.1 resemblyzer-0.1.4 ruff-0.7.3 safehttpx-0.1.1 scipy-1.13.1 semantic-version-2.10.0 sounddevice-0.5.1 starlette-0.41.2 tiktoken-0.8.0 tomlkit-0.12.0 torch-stoi-0.2.3 triton-3.1.0 typing-3.7.4.3 webrtcvad-2.0.10

Note I have also tried running inference and it produces this output (also contains an error):

(…)er_base_f0_44k_bigvgan_pruned_ft_ema.pth: 100%|█| 821M/821M [00:19<00:00, 42.
(…)it_mel_seed_uvit_whisper_base_f0_44k.yml: 100%|█| 2.25k/2.25k [00:00<00:00, 7
rmvpe.pt: 100%|███████████████████████████████| 181M/181M [00:00<00:00, 206MB/s]
Warning: Skipped loading some keys due to shape mismatch: {'estimator.input_pos'}
cfm loaded
length_regulator loaded
campplus_cn_common.bin: 100%|██████████████| 28.0M/28.0M [00:00<00:00, 38.8MB/s]
config.json: 100%|█████████████████████████| 1.40k/1.40k [00:00<00:00, 5.68MB/s]
Loading weights from nvidia/bigvgan_v2_44khz_128band_512x
bigvgan_generator.pt: 100%|██████████████████| 489M/489M [00:13<00:00, 36.7MB/s]
Removing weight norm...
config.json: 100%|█████████████████████████| 1.97k/1.97k [00:00<00:00, 6.16MB/s]
model.safetensors: 100%|█████████████████████| 967M/967M [00:17<00:00, 55.6MB/s]
preprocessor_config.json: 100%|██████████████| 185k/185k [00:00<00:00, 2.16MB/s]
It is strongly recommended to pass the `sampling_rate` argument to this function. Failing to do so can result in silent errors that might be hard to debug.
Traceback (most recent call last):
  File "/kaggle/working/seed-vc/inference.py", line 276, in <module>
    main(args)
  File "/opt/conda/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 116, in decorate_context
    return func(*args, **kwargs)
  File "/kaggle/working/seed-vc/inference.py", line 163, in main
    alt_inputs = whisper_feature_extractor([converted_waves_16k.squeeze(0).cpu().numpy()],
  File "/opt/conda/lib/python3.10/site-packages/transformers/models/whisper/feature_extraction_whisper.py", line 282, in __call__
    padded_inputs = self.pad(
  File "/opt/conda/lib/python3.10/site-packages/transformers/feature_extraction_sequence_utils.py", line 163, in pad
    if is_tf_tensor(first_element):
  File "/opt/conda/lib/python3.10/site-packages/transformers/utils/generic.py", line 208, in is_tf_tensor
    return False if not is_tf_available() else _is_tensorflow(x)
  File "/opt/conda/lib/python3.10/site-packages/transformers/utils/generic.py", line 199, in _is_tensorflow
    import tensorflow as tf
  File "/opt/conda/lib/python3.10/site-packages/tensorflow/__init__.py", line 45, in <module>
    from tensorflow._api.v2 import __internal__
  File "/opt/conda/lib/python3.10/site-packages/tensorflow/_api/v2/__internal__/__init__.py", line 8, in <module>
    from tensorflow._api.v2.__internal__ import autograph
  File "/opt/conda/lib/python3.10/site-packages/tensorflow/_api/v2/__internal__/autograph/__init__.py", line 8, in <module>
    from tensorflow.python.autograph.core.ag_ctx import control_status_ctx # line: 34
  File "/opt/conda/lib/python3.10/site-packages/tensorflow/python/autograph/core/ag_ctx.py", line 21, in <module>
    from tensorflow.python.autograph.utils import ag_logging
  File "/opt/conda/lib/python3.10/site-packages/tensorflow/python/autograph/utils/__init__.py", line 17, in <module>
    from tensorflow.python.autograph.utils.context_managers import control_dependency_on_returns
  File "/opt/conda/lib/python3.10/site-packages/tensorflow/python/autograph/utils/context_managers.py", line 19, in <module>
    from tensorflow.python.framework import ops
  File "/opt/conda/lib/python3.10/site-packages/tensorflow/python/framework/ops.py", line 33, in <module>
    from tensorflow.core.framework import attr_value_pb2
  File "/opt/conda/lib/python3.10/site-packages/tensorflow/core/framework/attr_value_pb2.py", line 5, in <module>
    from google.protobuf.internal import builder as _builder
ImportError: cannot import name 'builder' from 'google.protobuf.internal' (/opt/conda/lib/python3.10/site-packages/google/protobuf/internal/__init__.py)
Plachtaa commented 2 days ago

uninstall tensorflow from your enrionment probably solve it