File loading currently uses audio2numpy to load audio cut into fixed lengths during preprocessing.
This is hard to reason about and is a common source of errors in the code.
Using librosa.stream for loading and analysis also seems more straightforward than using tf.keras.utils.audio_dataset_from_directory, as working with tf.data.Dataset objects still often evades my intuition.