Closed markovivl closed 8 months ago
Greetings! Thank you very much for your amazing work!
I am currently trying to better understand the pipeline, and already had a question. In the file prepare_data.py you have the following assertion
prepare_data.py
def __getitem__(self, index): fn = self.file_names[index] logger.debug(f"Reading audio file {fn}") x = self.read(fn) assert x.dim() == 2 and x.shape[0] <= 16, f"Got sample {fn} with unexpected shape {x.shape}" n_samples = x.shape[1] x = encode(x, self.sr, self.codec, self.compression) return {"file_name": fn, "data": x, "n_samples": n_samples}
where you check that the number of channels in the input file is less than 16. In DNS4 dataset there are RIRs that have 30 channels.
You can just remove this line. This was just a workaround to make sure that the signal has 'channels first' shape and is not transposed.
Greetings! Thank you very much for your amazing work!
I am currently trying to better understand the pipeline, and already had a question. In the file
prepare_data.py
you have the following assertionwhere you check that the number of channels in the input file is less than 16. In DNS4 dataset there are RIRs that have 30 channels.