lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.
https://lhotse.readthedocs.io/en/latest/
Apache License 2.0
902 stars 204 forks source link

Is there any possible to use multi GPU to inside the compute_and_store_features_batch function #1331

Open OswaldoBornemann opened 1 month ago

OswaldoBornemann commented 1 month ago

So suppose I have 4 million audio files, and I want to extract the features using Encodec. I found that the extracting process is kind of slow, where Computing features in batches: 183301it [19:36, 176.40it/s]. The extracting process only leverages single GPU. So I was wonder whether we can use multi-GPU to accelerate the extraction process in the compute_and_store_features_batch.

pzelasko commented 1 month ago

It will be more robust if you split your manifest into parts and process each part separately. You can launch the script multiple times with GPU ID as an argument.

OswaldoBornemann commented 1 month ago

I see. Thank you.

OswaldoBornemann commented 1 month ago

So how to split the manifest like the cut file did?