RetroCirce/HTS-Audio-Transformer
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
https://arxiv.org/abs/2202.00874
MIT License
344 stars
62 forks
issues
Can the HTS-AT model be exported as ONNX?
#64
chenyangzhen
opened
1 day ago
0
add cog quick inference
#63
allenhung1025
opened
3 weeks ago
1
[The testing result on a siren audio file seems not working from my end]
#62
allenhung1025
opened
3 weeks ago
2
RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.
#61
zhiziwy
opened
2 months ago
2
balanced_audioset pretrain
#60
cxy0022
opened
2 months ago
0
RuntimeError: Default process group has not been initialized, please make sure to call init_process_group.
#59
LiupengNew
closed
3 months ago
1
Where do I get the MD5 for the Audio Set
#58
TFEI-Nagato
opened
5 months ago
0
1
#57
WangXD-8
opened
6 months ago
0
Question about the font used in the block diagram
#55
Evigouse
opened
6 months ago
0
SEDWrapper sed_model problem
#54
Roon311
closed
6 months ago
1
FileNotFoundError: [Errno 2] No such file or directory: 'audio_32k/1-100032-A-0.wav'
#53
Darcy0218
opened
10 months ago
2
cannot pickle 'module' object
#52
gillesmeyhi
opened
10 months ago
2
Spectrogram encoding
#51
haloolahh
opened
10 months ago
1
type of GPU
#50
haloolahh
opened
10 months ago
2
the size of the input spectrum
#49
haloolahh
opened
10 months ago
1
Error message: ValueError: The provided lr scheduler "<torch.optim.lr_scheduler.LambdaLR object at 0x7fe3d759bb50>" is invalid
#48
kkkjjjj1
opened
11 months ago
4
cyclic window shifting in the (256,256) tensor
#47
tsw123tsw
opened
1 year ago
1
Validation loss metric
#46
OhadCohen97
opened
1 year ago
0
Usage on Strongly labelled Dataset for SED
#45
urvashi07
closed
1 year ago
2
Key to checkpoints in drive
#44
Sreyan88
opened
1 year ago
3
Error during training: Segmentation fault (core dumped)
#43
ammerser
opened
1 year ago
0
Model Checkpoints
#42
wonyangcho
closed
1 year ago
5
Reproducing training on ESC-50 has an error
#41
visionchan
closed
1 year ago
3
About shape of input wav
#40
wangqian621
closed
1 year ago
1
RuntimeError: Input and output sizes should be greater than 0, but got input (H: 0, W: 64) output (H: 1024, W: 64)
#39
Mizuho32
closed
1 year ago
1
Audioset dataset for pretraining
#38
youngwhite
opened
1 year ago
3
Training will get stuck and stop without reporting an error
#37
YooWang
opened
1 year ago
3
Error during AudioSet training
#36
fuguanyu
closed
1 year ago
0
Has this framework's output been compared with other features?
#35
MisakaMikoto96
opened
1 year ago
1
Getting started with a custom dataset
#34
OhadCohen97
opened
1 year ago
4
upsample_bicubic2d_backward_out_cuda
#33
DuQingChen
closed
1 year ago
1
How can run this project with one GPU?!
#32
saeedmaroof
closed
1 year ago
2
Training and inferring with dataset containing 4 classes
#31
JonathanFL
closed
1 year ago
1
How to finetune on strong label dataset?
#30
wengstA
closed
1 year ago
2
How can i use my own dataset in this model:
#29
yyssxxx
closed
1 year ago
1
Question about AudioSet and finetune learning rate.
#28
MichaelLynn1996
closed
1 year ago
2
cannot pickle 'module' object when running the htsat_esc_training
#27
kremHabashy
closed
1 year ago
1
How can I test model?
#26
yyssxxx
closed
1 year ago
2
How to perform localization and generate heatmap with AudioSet
#25
samuelladyanov
closed
1 year ago
1
Questions about models.py
#24
PrShi113
closed
1 year ago
1
How to choose loss functions for a different dataset.
#23
the6thsense
closed
1 year ago
1
Unexpectedly high accuracy of 99 percent
#22
the6thsense
closed
1 year ago
1
TypeError: cannot pickle 'module' object
#21
JonathanFL
closed
1 year ago
2
About the semantic module
#20
dong-0412
closed
2 years ago
1
About audio event detection
#19
dong-0412
closed
2 years ago
2
Learning rate for small datasets
#18
EmreOzkose
closed
2 years ago
3
Pretrained .ckpt files
#17
dong-0412
closed
2 years ago
1
bug
#16
dong-0412
closed
2 years ago
1
Question about reshape log_mel to img size
#15
lyc1993
closed
2 years ago
1
QUESTION
#14
dong-0412
closed
2 years ago
1