The Dataset for the paper Open Intent Extraction from Natural Language Interactions (best paper, Web Conference 2020) is shared here.
The dataset is a tab-separated text file. Each line of this file contains three columns: Inputs
, Intent Labels
and Intent Tags
separated by a 'tab'.
The first column Inputs
contains one or more sentences of Input text.
The second column Intent Labels
contains human annotated intents for the input text. The number of intents per input text varies from 0 to 3.
Each intent is a phrase containing an ACTION
phrase and an OBJECT
phrase. For more description of the ACTION
and OBJECT
phrases, please refer to the above mentioned paper.
The third column Intent Tags
contains the tag label (ACTION
, OBJECT
or NONE
) for each word of the intents in the second column.
If you use this dataset, please cite the following paper:
@inproceedings{vedula2020open,
title={Open Intent Extraction from Natural Language Interactions},
author={Vedula, Nikhita and Lipka, Nedim and Maneriker, Pranav and Parthasarathy, Srinivasan},
booktitle={Proceedings of The Web Conference 2020},
pages={2009--2020},
year={2020}
}