LumenPallidium / neural-file-sorter

A neural network based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the OpenAI CLIP model.
MIT License

add basic utilities for audio autoencoder #3

Open LumenPallidium opened 3 years ago

LumenPallidium commented 3 years ago

At minimum, the audio autoencoder must be its own class, the dataloader needs options for handling audio, and we need the necessary methods for working with audio data (windowed Fourier transforms? 1D convolutions? transformers?).
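To make the "windowed FTs" option concrete, here is a minimal sketch of a windowed (short-time) Fourier transform that turns a 1D signal into a magnitude spectrogram, which an autoencoder could then consume in place of image pixels. The names `hann_window` and `windowed_ft` and the `window_size` and `hop` defaults are hypothetical and not part of this repo. It uses only the standard library so it runs anywhere; a real implementation would more likely rely on something like `torch.stft`.

```python
import cmath
import math


def hann_window(size):
    """Return a periodic Hann window with `size` samples."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * n / size) for n in range(size)]


def windowed_ft(signal, window_size=64, hop=32):
    """Return a magnitude spectrogram as a list of frames.

    Each frame holds window_size // 2 + 1 magnitudes (the non-negative
    frequency bins of a real-valued signal). Trailing samples that do
    not fill a whole window are dropped.
    """
    window = hann_window(window_size)
    n_bins = window_size // 2 + 1
    frames = []
    for start in range(0, len(signal) - window_size + 1, hop):
        # Taper the chunk so its edges do not leak into other bins.
        chunk = [signal[start + n] * window[n] for n in range(window_size)]
        spectrum = []
        for k in range(n_bins):
            # Naive DFT for bin k: O(window_size) per bin, fine for a sketch.
            acc = sum(
                chunk[n] * cmath.exp(-2j * math.pi * k * n / window_size)
                for n in range(window_size)
            )
            spectrum.append(abs(acc))
        frames.append(spectrum)
    return frames


# Toy input: a sine wave with exactly 8 cycles per 64-sample window.
signal = [math.sin(2 * math.pi * 8 * t / 64) for t in range(256)]
frames = windowed_ft(signal)
peak_bins = [max(range(len(f)), key=f.__getitem__) for f in frames]
```

For the toy sine wave, every frame should have its strongest bin at index 8, which is a quick sanity check that the transform is wired up correctly. Each frame could then be fed to 1D convolutions along the frequency axis, or the frame sequence treated as tokens for a transformer; which of these works best for sorting audio is an open question in this issue.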