RVC-Project / Retrieval-based-Voice-Conversion

in preparation...
MIT License
265 stars 42 forks

RVC API Problem with Gradio in F0 Curve File #38

Open MohamedhediHamdi1 opened 3 months ago

MohamedhediHamdi1 commented 3 months ago

I am trying to call the RVC API method infer_convert through the Gradio client, but I am running into problems with the F0 curve file argument. I have tried passing None, an empty string, and other files, and none of them work. Could you please provide an example or guidance on how to use the F0 curve file with this API?

MohamedhediHamdi1 commented 3 months ago

```python
from gradio_client import Client

client = Client("http://127.0.0.1:7865/")
result = client.predict(
    0,  # float (numeric value between 0 and 2333) in 'Select Speaker/Singer ID:' Slider component
    "C:/Users/Administrator/Downloads/test/wav.wav",  # str in 'Enter the path of the audio file to be processed (default is the correct format example):' Textbox component
    5,  # float in 'Transpose (integer, number of semitones, raise by an octave: 12, lower by an octave: -12):' Number component
    None,  # str (filepath on your computer (or URL) of file) in 'F0 curve file (optional). One pitch per line. Replaces the default F0 and pitch modulation:' File component
    "rmvpe",  # str in 'Select the pitch extraction algorithm ('pm': faster extraction but lower-quality speech; 'harvest': better bass but extremely slow; 'crepe': better quality but GPU intensive), 'rmvpe': best quality, and little GPU requirement' Radio component
    "",  # str in 'Path to the feature index file. Leave blank to use the selected result from the dropdown:' Textbox component
    "logs/agadmator.index",  # str (Option from: ['logs/agadmator.index']) in 'Auto-detect index path and select from the dropdown:' Dropdown component
    0,  # float (numeric value between 0 and 1) in 'Search feature ratio (controls accent strength, too high has artifacting):' Slider component
    0,  # float (numeric value between 0 and 7) in 'If >=3: apply median filtering to the harvested pitch results. The value represents the filter radius and can reduce breathiness.' Slider component
    0,  # float (numeric value between 0 and 48000) in 'Resample the output audio in post-processing to the final sample rate. Set to 0 for no resampling:' Slider component
    0,  # float (numeric value between 0 and 1) in 'Adjust the volume envelope scaling. Closer to 0, the more it mimicks the volume of the original vocals. Can help mask noise and make volume sound more natural when set relatively low. Closer to 1 will be more of a consistently loud volume:' Slider component
    0,  # float (numeric value between 0 and 0.5) in 'Protect voiceless consonants and breath sounds to prevent artifacts such as tearing in electronic music. Set to 0.5 to disable. Decrease the value to increase protection, but it may reduce indexing accuracy:' Slider component
    api_name="/infer_convert",
)
print(result)
```
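For anyone experimenting with the F0 argument, here is a minimal sketch of one thing to try. It is not a confirmed fix. It writes a curve file with one pitch per line, which is all the component label above specifies, and builds the positional arguments in the same order as the call above. The helper names `write_f0_curve`, `build_infer_args`, and `convert_with_f0` are hypothetical. The exact line format the server parses is an assumption and should be checked against the RVC source. `handle_file` is available in newer `gradio_client` releases for file inputs; whether it works against this particular server version is untested here.

```python
import tempfile
from pathlib import Path


def write_f0_curve(pitches_hz, path):
    """Write one pitch value per line (format assumed from the UI label)."""
    path = Path(path)
    text = "\n".join(f"{p:.2f}" for p in pitches_hz) + "\n"
    path.write_text(text, encoding="utf-8")
    return path


def build_infer_args(audio_path, f0_file=None):
    """Return the 12 positional arguments in the order used in the call above."""
    return [
        0,                       # speaker/singer ID
        audio_path,              # input audio path
        5,                       # transpose, in semitones
        f0_file,                 # F0 curve file, or None to skip it
        "rmvpe",                 # pitch extraction algorithm
        "",                      # feature index path (blank: use dropdown)
        "logs/agadmator.index",  # index selected from the dropdown
        0,                       # search feature ratio
        0,                       # median filter radius
        0,                       # resample rate (0 = no resampling)
        0,                       # volume envelope scaling
        0,                       # consonant/breath protection
    ]


def convert_with_f0(audio_path, f0_path, url="http://127.0.0.1:7865/"):
    """Call /infer_convert with an F0 file (needs a running RVC server)."""
    # Imported lazily so the helpers above work without gradio_client installed.
    # Assumption: a gradio_client release that provides handle_file.
    from gradio_client import Client, handle_file

    client = Client(url)
    args = build_infer_args(audio_path, handle_file(str(f0_path)))
    return client.predict(*args, api_name="/infer_convert")


if __name__ == "__main__":
    # Offline demo: write a short curve file and show the argument list.
    with tempfile.TemporaryDirectory() as tmp:
        f0_path = write_f0_curve([220.0, 221.5, 223.0], Path(tmp) / "f0.txt")
        print(f0_path.read_text(encoding="utf-8"))
        print(build_infer_args("wav.wav", str(f0_path)))
```

If the call still fails, running `client.view_api()` against the server prints the parameter types the endpoint expects, which may help narrow down how the File input needs to be passed.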

KungFuFurniture commented 1 month ago

I am having the same issue. Any ideas would be welcome. Please and thank you.