thuiar / TEXTOIR

TEXTOIR is the first open-source toolkit for text open intent recognition. (ACL 2021)
https://github.com/thuiar/TEXTOIR
MIT License
185 stars, 30 forks

How to use this work to cluster on my own dataset? #14

Closed: jzh1082 closed this issue 2 years ago

jzh1082 commented 2 years ago

I have a dataset in which about 10% of the data is labeled, and I want to use this labeled data to cluster the rest. But the datasets in the 'data' folder all come with ground-truth labels. How should I prepare my data to use this work?

HanleiZhang commented 2 years ago

Hi, thanks for your interest in our work! You can achieve this with the pipeline framework in our other repo, TEXTOIR-DEMO.