Can MAGNET be used for Extreme Multi-label classification?

@Owaiskhan9654 Thank you for your question. First, this is not my paper or research; I am only trying to reproduce the author's work. I contacted the author to ask some questions and to request the code, but he said the research was implemented in TensorFlow 1, so the code is no longer reliable. He suggested that I reproduce it by following https://github.com/Megvii-Nanjing/ML-GCN and replacing the GCN with a GAT. Unfortunately, I have not been able to reproduce the best result using BERT embeddings. With GloVe embeddings, I think I get similar results (see the bar chart in Figure 2 of the paper).

As for your question, the paper mentions that the model becomes difficult to train when the number of labels is large, because a large label set requires a correspondingly large correlation matrix. That takes a lot of computational resources. With hundreds of thousands of labels, I think it would be impossible without a very large amount of resources such as RAM.
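
To give a rough sense of scale, here is a small back-of-the-envelope sketch. It assumes the correlation matrix is stored densely as one float32 value per label pair; the function name `dense_matrix_gib` and the label counts are only illustrative, not taken from the paper. It also ignores gradients, attention scores, and optimizer state, which would add more on top.

```python
def dense_matrix_gib(num_labels: int, bytes_per_value: int = 4) -> float:
    """Memory in GiB for a dense num_labels x num_labels matrix.

    Assumption: one value per label pair, float32 (4 bytes) by default.
    """
    return num_labels * num_labels * bytes_per_value / 2**30


# Illustrative label counts, not specific datasets.
for n in (100, 10_000, 100_000):
    print(f"{n:>7} labels -> {dense_matrix_gib(n):.4f} GiB")
```

Because the size grows with the square of the number of labels, 100,000 labels already needs roughly 37 GiB for the matrix alone under these assumptions.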

I have also tried BERT embeddings (word embeddings, not sentence embeddings), following https://mccormickml.com/2019/05/14/BERT-word-embeddings-tutorial/, but I am not getting results like those in the paper; the results are even worse than with GloVe embeddings. If you know another way to use BERT as an embedding, please let me know.
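
For reference, here is a minimal sketch of one pooling strategy that is often used to turn BERT hidden states into word vectors: sum the last four hidden layers for each token, then average the word pieces that belong to the same word. This is only an assumption about a reasonable setup, not necessarily what the paper did. The function names and the toy numbers are hypothetical, and plain Python lists stand in for real model outputs so the example runs without downloading a model.

```python
def sum_last_layers(hidden_states, num_layers=4):
    """hidden_states[layer][token] is a vector (list of floats).

    Returns one vector per token: the element-wise sum over the
    last `num_layers` layers.
    """
    last = hidden_states[-num_layers:]
    num_tokens = len(last[0])
    return [
        [sum(vals) for vals in zip(*(layer[t] for layer in last))]
        for t in range(num_tokens)
    ]


def merge_word_pieces(token_vectors, word_ids):
    """Average the token vectors that share a word id.

    Tokens whose word id is None (e.g. special tokens) are skipped.
    """
    groups = {}
    for vec, wid in zip(token_vectors, word_ids):
        if wid is None:
            continue
        groups.setdefault(wid, []).append(vec)
    return [
        [sum(col) / len(vecs) for col in zip(*vecs)]
        for _, vecs in sorted(groups.items())
    ]


# Toy input: 5 layers, 3 tokens, 2 dimensions; layer l, token t -> [l, t].
hidden_states = [[[float(l), float(t)] for t in range(3)] for l in range(5)]
word_ids = [0, 0, 1]  # the first two tokens are pieces of the same word

token_vectors = sum_last_layers(hidden_states)
word_vectors = merge_word_pieces(token_vectors, word_ids)
print(word_vectors)
```

With a real model, `hidden_states` would come from the per-layer outputs and `word_ids` from the tokenizer's token-to-word mapping (Hugging Face fast tokenizers expose one through `word_ids()`). Whether summing, concatenating, or using only the last layer works best for MAGNET is something I have not verified.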

As for a discussion on Google Meet, I will have to see when I have some spare time.

Lastly, I am also unsure about the text preprocessing, because the paper says very little about it, so I cleaned the text in my own way.

Thank you.

If you have another question, or if anything in my explanation is unclear, please let me know.
