LAION-AI / audio-dataset

Audio Dataset for training CLAP and other models

files for preprocessing #89

Open yasser-sulaiman opened 1 year ago

yasser-sulaiman commented 1 year ago

Where do we get the following files from? Any help would be appreciated. Thanks in advance.

https://github.com/LAION-AI/audio-dataset/blob/572ebd29163dcecc0d1ed68f9cbf63c1c808c319/data_preprocess/preprocess_freesound.py#L26
https://github.com/LAION-AI/audio-dataset/blob/572ebd29163dcecc0d1ed68f9cbf63c1c808c319/data_preprocess/preprocess_freesound.py#L27
https://github.com/LAION-AI/audio-dataset/blob/572ebd29163dcecc0d1ed68f9cbf63c1c808c319/data_preprocess/preprocess_freesound.py#L28

YuchenHui22314 commented 1 year ago

The preprocessing scripts are provided for reference only, so you will have to adapt them for your own use. For example, instead of using freesound_parquet.parquet, you can use https://github.com/LAION-AI/audio-dataset/blob/main/laion-audio-630k/README.md
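
One way to adapt a script like this is to stop hardcoding the input locations and pass them in instead. The sketch below is only an illustration and is not taken from the repository: the flag names `--metadata` and `--audio-dir` and the helper names are hypothetical, and it assumes you have already obtained a metadata file and the audio files yourself.

```python
import argparse
from pathlib import Path


def parse_args(argv=None):
    """Read input locations from the command line instead of hardcoding them."""
    parser = argparse.ArgumentParser(description="Preprocessing inputs (sketch)")
    # Hypothetical flag names; point them at files you obtained yourself.
    parser.add_argument("--metadata", type=Path, required=True,
                        help="metadata file, e.g. your own freesound_parquet.parquet")
    parser.add_argument("--audio-dir", type=Path, required=True,
                        help="directory that holds the audio files")
    return parser.parse_args(argv)


def missing_inputs(args):
    """Return the configured paths that do not exist on disk."""
    return [p for p in (args.metadata, args.audio_dir) if not p.exists()]


def main(argv=None):
    """Fail early with a clear message if any input is missing."""
    args = parse_args(argv)
    missing = missing_inputs(args)
    if missing:
        raise SystemExit("Missing input(s): " + ", ".join(str(p) for p in missing))
    return args
```

Calling `main(["--metadata", "my_metadata.parquet", "--audio-dir", "audio/"])` at the top of an adapted script would then stop with a readable error when a path is wrong, rather than failing later inside the preprocessing code.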