rst0070 / Rawformer-implementation-anti-spoofing

Pytorch implementation of "LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION"
20 stars 0 forks source link

Rawformer Implementation for Anti-Spoofing

This is my own implementation of Rawformer model (LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION - Xiaohui Liu, Meng Liu, Longbiao Wang, Kong Aik Lee2, Hanyi Zhang1, Jianwu Dang)
fig1
WARNING

Rawformer-S vs Rawformer-L vs SE-Rawformer

In the paper, authors developed three types of Rawformer, Rawformer-S, Rawformer-L and SE-Rawformer. I implemented all of these models only with 1-dimesional positional encoding. N is the number of Conv2D-based Blocks and M is the number of Transformer Encoders.