ruidan / IMN-E2E-ABSA

Code and dataset for the ACL 2019 paper "An Interactive Multi-Task Learning Network for End-to-End Aspect-Based Sentiment Analysis".
Apache License 2.0

about dataset #2

Open sjm1992st opened 5 years ago

sjm1992st commented 5 years ago

There are four files in the data_preprocessed folder. I think the 0, 1, 2 in opinion.txt correspond to the pos, neg, neu mentioned in the paper, but I don't understand the meaning of the 0, 1, 2 in target.txt or of the 0, 1, 2, 3, 4 in target_polarity.txt.

ruidan commented 5 years ago

The 0, 1, 2 in target.txt and opinion.txt give the annotations of aspect terms and opinion terms respectively. A word labeled with 0 means that it is outside the span of an aspect (opinion) term. A word labeled with 1 indicates that it is the beginning word of an aspect (opinion) term, and a word labeled with 2 indicates that it is inside of an aspect (opinion) term.
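
As a rough illustration of this labeling scheme, the sketch below turns a 0/1/2 label sequence back into term spans. The example sentence, its labels, and the helper name `extract_spans` are made up for illustration and are not taken from the repository; how the label files are laid out on disk is not covered here.

```python
def extract_spans(labels):
    """Return (start, end) index pairs, end exclusive, one per labeled term.

    labels uses the scheme described above:
    0 = outside, 1 = beginning of a term, 2 = inside a term.
    """
    spans = []
    start = None
    for i, label in enumerate(labels):
        if label == 1:
            # A new term begins; close any term that was still open.
            if start is not None:
                spans.append((start, i))
            start = i
        elif label == 2:
            # Inside a term. Tolerate a stray "inside" label by opening a span.
            if start is None:
                start = i
        else:
            # Outside any term; close the open term, if any.
            if start is not None:
                spans.append((start, i))
                start = None
    if start is not None:
        spans.append((start, len(labels)))
    return spans


# Hypothetical sentence with hypothetical labels (not from the dataset).
words = "The battery life is great".split()
target_labels = [0, 1, 2, 0, 0]   # "battery life" is the aspect term
opinion_labels = [0, 0, 0, 0, 1]  # "great" is the opinion term

aspect_terms = [" ".join(words[s:e]) for s, e in extract_spans(target_labels)]
opinion_terms = [" ".join(words[s:e]) for s, e in extract_spans(opinion_labels)]
print(aspect_terms, opinion_terms)
```

Note that two adjacent terms stay separate because each one starts with its own 1 label.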

In target_polarity.txt, 0 means that the word is outside the span of any aspect term, as only aspect terms have sentiment labels. The labels 1, 2, 3, and 4 refer to positive, negative, neutral, and conflict sentiment polarity, respectively.
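
A minimal sketch of reading these polarity labels follows. It assumes that every word of a multi-word aspect term carries the same polarity label, which the thread does not state explicitly. The sentence, the labels, and the names `POLARITY` and `term_polarities` are hypothetical.

```python
# Label meanings as described in this thread; 0 is "not part of an aspect term".
POLARITY = {1: "pos", 2: "neg", 3: "neu", 4: "conflict"}


def term_polarities(words, polarity_labels):
    """Group consecutive words sharing a non-zero label into (term, polarity) pairs.

    Assumption: all words of one aspect term share one polarity label.
    Adjacent terms with the same polarity would be merged by this sketch;
    the begin/inside labels in target.txt would be needed to split them.
    """
    results = []
    current_words, current_label = [], 0
    for word, label in zip(words, polarity_labels):
        if label != 0 and label == current_label:
            current_words.append(word)
        else:
            # Label changed: flush the open term, then maybe start a new one.
            if current_words:
                results.append((" ".join(current_words), POLARITY[current_label]))
            current_words = [word] if label != 0 else []
            current_label = label
    if current_words:
        results.append((" ".join(current_words), POLARITY[current_label]))
    return results


# Hypothetical sentence with hypothetical labels (not from the dataset).
words = "The battery life is great but the screen is dim".split()
labels = [0, 1, 1, 0, 0, 0, 0, 2, 0, 0]
pairs = term_polarities(words, labels)
print(pairs)
```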

You can refer to Sections 3.1 and 4.1 of the paper for more details on data preparation and the experimental setup.