Tiiiger / SGC

official implementation for the paper "Simplifying Graph Convolutional Networks"
MIT License
831 stars 145 forks source link

Preprocess for Text Classification #30

Closed kentwhf closed 3 years ago

kentwhf commented 3 years ago

Hi,

I am interested in applying SGC to some other text classification datasets. How did you prepossess the dataset?

Many thanks ahead

kentwhf commented 3 years ago

Also, is it possible to release the original text data in .txt format (eg. R8.txt)? I might want try some sensitivity analysis on embedding size as well

felixgwu commented 3 years ago

Hi @Kentwhf, Thank you for your interest. Our text classification code is a modified clone of the TextGCN repo. Unfortunately, we simply use their preprocessed data, so we don't have the details of how they preprocess the data. Please contact the author of TextGCN for the data preprocessing.