Open xwan0527 opened 4 months ago
Why use convolutions instead? Since upsampling is already employed to obtain the mask matrix, it seems like transformers could also be used.
Why use convolutions instead? Since upsampling is already employed to obtain the mask matrix, it seems like transformers could also be used.