topel / listen-attend-tell

Audio captioning system based on LAS, used in the DCASE2020 challenge
5 stars 2 forks source link