cfoster0 / CLAP

Contrastive Language-Audio Pretraining
BSD 3-Clause "New" or "Revised" License

TFRecords #25

Closed cfoster0 closed 3 years ago

cfoster0 commented 3 years ago

Switch the dataset format to TFRecords from the current version, which just uses .pt files within a directory. This should make streaming possible, which may be important for real training. It's slightly awkward because the preprocessing is in PyTorch and the training is in JAX. I'm told the best way to read TFRecords during training is through tf.data.
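
For context on why TFRecords stream well: the file is a flat sequence of length-prefixed, checksummed records, so a reader can consume it front to back without loading everything or listing a directory. Below is a minimal pure-Python sketch of that framing. The helper names (`write_record`, `read_records`, `masked_crc`) are hypothetical and are not code from this repo, and the payloads are placeholder bytes rather than real serialized examples. In practice one would more likely use `tf.io.TFRecordWriter` on the preprocessing side and `tf.data.TFRecordDataset` on the training side instead of hand-rolling this.

```python
import io
import struct

_MASK_DELTA = 0xA282EAD8  # constant used by the TFRecord masked-CRC scheme


def _make_crc32c_table():
    # Table for CRC-32C (Castagnoli), reflected polynomial 0x82F63B78.
    table = []
    for i in range(256):
        crc = i
        for _ in range(8):
            crc = (crc >> 1) ^ 0x82F63B78 if crc & 1 else crc >> 1
        table.append(crc)
    return table


_CRC_TABLE = _make_crc32c_table()


def crc32c(data: bytes) -> int:
    crc = 0xFFFFFFFF
    for byte in data:
        crc = _CRC_TABLE[(crc ^ byte) & 0xFF] ^ (crc >> 8)
    return crc ^ 0xFFFFFFFF


def masked_crc(data: bytes) -> int:
    # Rotate right by 15 bits, then add the mask delta (all modulo 2**32).
    crc = crc32c(data)
    rotated = ((crc >> 15) | (crc << 17)) & 0xFFFFFFFF
    return (rotated + _MASK_DELTA) & 0xFFFFFFFF


def write_record(stream, payload: bytes) -> None:
    # Layout: uint64 length, uint32 CRC of length, payload, uint32 CRC of payload.
    header = struct.pack("<Q", len(payload))
    stream.write(header)
    stream.write(struct.pack("<I", masked_crc(header)))
    stream.write(payload)
    stream.write(struct.pack("<I", masked_crc(payload)))


def read_records(stream):
    # Generator: yields one payload at a time, so memory use stays bounded.
    while True:
        header = stream.read(8)
        if not header:
            return  # clean end of file
        header_crc_bytes = stream.read(4)
        if len(header) < 8 or len(header_crc_bytes) < 4:
            raise ValueError("truncated record header")
        if struct.unpack("<I", header_crc_bytes)[0] != masked_crc(header):
            raise ValueError("corrupt length header")
        (length,) = struct.unpack("<Q", header)
        payload = stream.read(length)
        payload_crc_bytes = stream.read(4)
        if len(payload) < length or len(payload_crc_bytes) < 4:
            raise ValueError("truncated record payload")
        if struct.unpack("<I", payload_crc_bytes)[0] != masked_crc(payload):
            raise ValueError("corrupt payload")
        yield payload


# Round trip through an in-memory buffer standing in for a .tfrecord file.
payloads = [b"clip-0", b"", b"x" * 1000]
buf = io.BytesIO()
for p in payloads:
    write_record(buf, p)
buf.seek(0)
recovered = list(read_records(buf))
```

In a real pipeline each payload would typically be a serialized `tf.train.Example` holding the audio and text features, with the tensors from the PyTorch preprocessing converted to NumPy arrays or bytes before writing.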

cfoster0 commented 3 years ago

Working on this on the tfrecords branch.