facebookresearch AudioMAE issues

facebookresearch / AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Other

547 stars 45 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

ESC-50/Speechcommands recipe/documentation

#30 IvanBirkmaier opened 1 month ago
0
Linear probing recipe missing

#29 Peipi98 opened 2 months ago
0
converting the resulting fbank back to .wav?

#28 Desync-o-tron opened 5 months ago
0
[BUG] No module named 'torch._six'

#27 ps4vs opened 7 months ago
2
Question Regarding _roll_mag_aug Function Implementation in AudioMAE

#26 unoct opened 11 months ago
0
kaldi fbank

#25 lix4 opened 1 year ago
1
RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.

#24 wuhongsheng opened 1 year ago
1
Cannot reproduce finetuning result on Audioset-20k

#23 kxgong opened 1 year ago
0
Submitted job triggered an exception

#22 wuhongsheng closed 1 year ago
0
where is 2M audioset data and pretrain_audioset2M.sh?

#21 JHjang223 opened 1 year ago
3
VIT-L checkpoint and reproducing the visualization results

#20 i-need-sleep opened 1 year ago
3
reproduce inference results

#19 hanlin-lu opened 1 year ago
0
Can't find train_all_video.json for main_pretrain.py

#18 joo-young-lee opened 1 year ago
0
Can't download any ckpt files in github

#17 joo-young-lee closed 1 year ago
1
Unable to utilize main_pretrain.py for speechcommands dataset training

#16 unoct closed 11 months ago
1
Could u release the finetuned checkpoint of dataset 'AudioSet20K' ?

#15 yanjies opened 1 year ago
0
May I ask how to visualize the data in esc50

#14 yangyangshuyang opened 1 year ago
0
Vit-S pretraining checkpoint

#13 G-JWLee opened 1 year ago
0
Issues with loading model weights to reproduce the demo notebook

#12 bpiyush opened 1 year ago
2
Could u release self-supervised weights without fine-tuning?

#11 haidog-yaqub closed 1 year ago
1
Where is the mae_env.yml ?

#10 unoct closed 1 year ago
1
where is the weight_train.csv?

#9 LinB203 closed 1 year ago
1
Reproducing the downstream task performance

#8 Sara-Ahmed opened 1 year ago
1
Evaluation on ASR

#7 LeyuanQu opened 1 year ago
0
When will you release the code?

#6 jinx2018 closed 1 year ago
0
Open Sourcing through Hugging Face

#5 osanseviero opened 2 years ago
0
When will you release the code?

#4 linmou closed 1 year ago
0
Do you think this work will apply to music super resolution task

#3 mengdexing closed 2 years ago
1
When will you release the code?

#2 hou821 closed 2 years ago
2
goodwork

#1 qixing-ai closed 2 years ago
1