afrancl / BinauralLocalizationCNN

BinauralLocalizationCNN

Code to create networks that localize sound sources in 3D environments

  1. The main training/testing Python script is call_model_training_valid_pad_francl.py.

    • This script is responsible for processing the experiment parameters, validating the model folder, saving a copy of the experiment parameters there and ensuring the same folder isn't used for two different training rounds.
    • An example set of parameters can be found in slurm_CNN_spherical_testing_francl.sh (testing) and in slurm_CNN_spherical_training_francl.sh (training).
  2. Network weights can be downloaded at: https://www.dropbox.com/sh/af6vaotxt41i7pe/AACfTzMxMLfv-Edmn33S4gTpa?dl=0

  3. The model input should be nervegrams, together with their associated metadata, saved as TensorFlow records. The cochlear model we use is the PyCochleagram package. We have a wrapper that transforms stereo .wav files into the proper input, available here: https://github.com/afrancl/BinauralDataGen

Note: Before running, please change the model save folder to point to your directory containing the model architecture config file, and the data folder to point to your data. Both are set in the associated shell scripts. The code itself contains no absolute paths.
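
The folder handling described for the main script (validate the model folder, save a copy of the experiment parameters, refuse to reuse a folder) can be sketched roughly as follows. This is a minimal illustration, not the code in call_model_training_valid_pad_francl.py: the function name prepare_model_folder, the file name experiment_params.json, and the use of JSON are all assumptions made for the example.

```python
import json
from pathlib import Path


def prepare_model_folder(model_dir, params, params_name="experiment_params.json"):
    """Validate a model folder and store a copy of the experiment parameters.

    Hypothetical helper: the name, the JSON format and the file name are
    illustrative assumptions, not taken from the repository.
    """
    model_dir = Path(model_dir)
    if not model_dir.is_dir():
        raise FileNotFoundError(f"Model folder does not exist: {model_dir}")

    params_path = model_dir / params_name
    if params_path.exists():
        # A saved parameter file marks the folder as already used by an
        # earlier training round, so refuse to write into it again.
        raise FileExistsError(f"{params_path} already exists; use a new folder.")

    # Keep the parameters next to the model so the run can be reproduced.
    with params_path.open("w") as f:
        json.dump(params, f, indent=2, sort_keys=True)
    return params_path
```

Calling the helper a second time on the same folder raises FileExistsError, which is one simple way to keep two training rounds from sharing a folder.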

Setup

To aid reproducibility and decrease setup time, we provide a Singularity image that contains all packages necessary to run the code without any further setup. The image is available on Dropbox here: https://www.dropbox.com/s/ey74fiw4uquww0n/tfv1.13_tcmalloc.simg?dl=0
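
As a rough sketch of how the image might be used, the snippet below assembles a `singularity exec` command line for the main script. It assumes the image has been downloaded to the working directory and that a GPU should be exposed with the `--nv` flag. The helper singularity_command is hypothetical, and any arguments the script needs are left to the provided shell scripts.

```python
import shlex


def singularity_command(image, script, script_args=(), use_gpu=True):
    """Build the argument list for running a Python script inside an image.

    Hypothetical helper for illustration only.
    """
    cmd = ["singularity", "exec"]
    if use_gpu:
        cmd.append("--nv")  # expose the host's NVIDIA GPU to the container
    cmd += [image, "python", script, *script_args]
    return cmd


cmd = singularity_command(
    "tfv1.13_tcmalloc.simg",  # assumed local path of the downloaded image
    "call_model_training_valid_pad_francl.py",
)
print(" ".join(shlex.quote(part) for part in cmd))
# prints: singularity exec --nv tfv1.13_tcmalloc.simg python call_model_training_valid_pad_francl.py
```

The resulting list can be passed to subprocess.run, or the printed line can be pasted into a shell script.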