issues
search
facebookresearch
/
AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
Other
547
stars
45
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
ESC-50/Speechcommands recipe/documentation
#30
IvanBirkmaier
opened
1 month ago
0
Linear probing recipe missing
#29
Peipi98
opened
2 months ago
0
converting the resulting fbank back to .wav?
#28
Desync-o-tron
opened
5 months ago
0
[BUG] No module named 'torch._six'
#27
ps4vs
opened
7 months ago
2
Question Regarding _roll_mag_aug Function Implementation in AudioMAE
#26
unoct
opened
11 months ago
0
kaldi fbank
#25
lix4
opened
1 year ago
1
RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.
#24
wuhongsheng
opened
1 year ago
1
Cannot reproduce finetuning result on Audioset-20k
#23
kxgong
opened
1 year ago
0
Submitted job triggered an exception
#22
wuhongsheng
closed
1 year ago
0
where is 2M audioset data and pretrain_audioset2M.sh?
#21
JHjang223
opened
1 year ago
3
VIT-L checkpoint and reproducing the visualization results
#20
i-need-sleep
opened
1 year ago
3
reproduce inference results
#19
hanlin-lu
opened
1 year ago
0
Can't find train_all_video.json for main_pretrain.py
#18
joo-young-lee
opened
1 year ago
0
Can't download any ckpt files in github
#17
joo-young-lee
closed
1 year ago
1
Unable to utilize main_pretrain.py for speechcommands dataset training
#16
unoct
closed
11 months ago
1
Could u release the finetuned checkpoint of dataset 'AudioSet20K' ?
#15
yanjies
opened
1 year ago
0
May I ask how to visualize the data in esc50
#14
yangyangshuyang
opened
1 year ago
0
Vit-S pretraining checkpoint
#13
G-JWLee
opened
1 year ago
0
Issues with loading model weights to reproduce the demo notebook
#12
bpiyush
opened
1 year ago
2
Could u release self-supervised weights without fine-tuning?
#11
haidog-yaqub
closed
1 year ago
1
Where is the mae_env.yml ?
#10
unoct
closed
1 year ago
1
where is the weight_train.csv?
#9
LinB203
closed
1 year ago
1
Reproducing the downstream task performance
#8
Sara-Ahmed
opened
1 year ago
1
Evaluation on ASR
#7
LeyuanQu
opened
1 year ago
0
When will you release the code?
#6
jinx2018
closed
1 year ago
0
Open Sourcing through Hugging Face
#5
osanseviero
opened
2 years ago
0
When will you release the code?
#4
linmou
closed
1 year ago
0
Do you think this work will apply to music super resolution task
#3
mengdexing
closed
2 years ago
1
When will you release the code?
#2
hou821
closed
2 years ago
2
goodwork
#1
qixing-ai
closed
2 years ago
1