I'm just wondering what type of data preprocessing for SIF embedding I need to do for the sentences. For example,
1) do I need to remove punctuations? In the example, sentences don't have punctuations.
2) should I tokenize negations?
3) what other preprocessing needs to be done?
Thanks a lot!!
I'm just wondering what type of data preprocessing for SIF embedding I need to do for the sentences. For example, 1) do I need to remove punctuations? In the example, sentences don't have punctuations. 2) should I tokenize negations? 3) what other preprocessing needs to be done? Thanks a lot!!