MTG / essentia

C++ library for audio and music analysis, description and synthesis, including Python bindings
http://essentia.upf.edu
GNU Affero General Public License v3.0

Bulk Inference #1268

Closed piyp791 closed 1 year ago

piyp791 commented 1 year ago

Hello,

I have a set of 100,000 mp3 files on which I want to run effnet-discogs model inference. Can you suggest anything to help speed up the inference?

I am using a ThreadPool right now for parallel inference. Is this approach okay, or can it be corrected/optimized further?

from multiprocessing.pool import ThreadPool
import glob

from essentia.standard import MonoLoader, TensorflowPredictEffnetDiscogs

pool = ThreadPool(960)
model = TensorflowPredictEffnetDiscogs(graphFilename="discogs-effnet-bs64-1.pb")

def inference_for_audio(audio_file):
    audio = MonoLoader(filename=audio_file, sampleRate=16000)()
    activations = model(audio)
    print(activations)

audio_files = glob.glob("/home/research/Songs/test/*.wav")
results = pool.map(inference_for_audio, audio_files)

Any help would be appreciated. Thanks!

palonso commented 1 year ago

Hi @piyp791, please note that Essentia algorithms are not thread-safe, so you should use process-based parallelization instead.
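A minimal sketch of the process-based pattern, using `multiprocessing.Pool` with an `initializer` so that each worker process builds its own model exactly once. The helper names (`run_bulk`, `_init_worker`, `_infer`) and the stand-in model are illustrative, not Essentia API; in a real script the initializer would construct `TensorflowPredictEffnetDiscogs` and `_infer` would load audio with `MonoLoader`, as indicated in the comments.

```python
from multiprocessing import Pool

_model = None  # one model instance per worker process


def _init_worker():
    # Runs once in each worker. With Essentia this would be:
    #   global _model
    #   _model = TensorflowPredictEffnetDiscogs(
    #       graphFilename="discogs-effnet-bs64-1.pb")
    global _model
    _model = lambda n: n * n  # stand-in so the sketch runs without Essentia


def _infer(item):
    # With Essentia, `item` would be a filename:
    #   audio = MonoLoader(filename=item, sampleRate=16000)()
    #   return _model(audio)
    return _model(item)


def run_bulk(items, processes=4):
    # Each worker initializes its own model, avoiding any shared
    # Essentia state across processes.
    with Pool(processes=processes, initializer=_init_worker) as pool:
        return pool.map(_infer, items)


if __name__ == "__main__":
    print(run_bulk([1, 2, 3]))  # → [1, 4, 9]
```

Keeping model construction in the initializer matters: building the model in the parent and passing it to workers would require pickling it, which Essentia objects do not support.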

The neural-network inference is the most computationally expensive part of your script. It could be sped up with GPU parallelization, which is implemented internally in Essentia and enabled automatically when:

  1. There is a CUDA-capable GPU installed in the system
  2. The CUDA and cuDNN libraries are installed and visible to Essentia. The current Essentia pip package requires CUDA 11.2 and cuDNN 8.1

Note that in this case every process blocks a GPU, so don't use more processes than available GPUs.
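To verify whether condition 1 and 2 are met, one quick check is whether the TensorFlow runtime (which Essentia's TensorFlow models rely on) can see a GPU. This is a sketch, assuming a standard TensorFlow 2.x installation; the `gpu_available` helper is illustrative, not part of Essentia.

```python
def gpu_available():
    """Return True if TensorFlow can see a CUDA-capable GPU.

    Returns False when TensorFlow is not installed or no GPU is
    visible (e.g. CUDA/cuDNN missing or the wrong version).
    """
    try:
        import tensorflow as tf
        return len(tf.config.list_physical_devices("GPU")) > 0
    except ImportError:
        return False


if __name__ == "__main__":
    print("GPU visible:", gpu_available())
```

If this prints `False` on a machine with a CUDA-capable GPU, the CUDA/cuDNN installation (or its version) is the usual culprit.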