YuanGongND / ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
BSD 3-Clause "New" or "Revised" License
1.13k stars 212 forks source link

Some question about AST #21

Closed ooobsidian closed 3 years ago

ooobsidian commented 3 years ago

Hello Yuan, I sent an email to your email(yuangong@mit.edu) and hope to discuss with you about AST.

YuanGongND commented 3 years ago

Hi there,

Sorry I am not familiar with your downstream task and it is hard for me to answer that. A general suggestion is to tune the hyper-parameter, especially the learning rate, batch size, epochs for your task. AST generally needs a smaller learning rate than other models.

-Yuan