I have created a simple script that processes data from the GoEmotions dataset (Google):
Link: https://github.com/google-research/google-research/tree/master/goemotions
Description: GoEmotions is a corpus of 58k carefully curated comments extracted from Reddit, with human annotations to 27 emotion categories or Neutral.
Attached is a ZIP with the script (ipynb) and the data in JSONL and Parquet formats: GoEmotions_script_jsonl_parquet.zip
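For anyone who wants a rough idea of the kind of conversion involved before opening the ZIP, here is a minimal sketch of turning CSV rows into JSONL. It is not the notebook from the ZIP. It assumes the input has a `text` column plus one 0/1 column per emotion; the function name `rows_to_jsonl`, the three-emotion subset, and the sample rows are all made up for illustration.

```python
import csv
import io
import json

# Hypothetical subset of emotion columns, for illustration only.
# The full dataset has 27 emotion categories plus Neutral.
EMOTION_COLUMNS = ["admiration", "amusement", "neutral"]


def rows_to_jsonl(csv_file, jsonl_file, emotion_columns=EMOTION_COLUMNS):
    """Read CSV rows with one 0/1 column per emotion and write one JSON object per line.

    Returns the number of records written.
    """
    reader = csv.DictReader(csv_file)
    count = 0
    for row in reader:
        # Collect the names of the emotion columns flagged with "1" for this row.
        labels = [name for name in emotion_columns if row.get(name) == "1"]
        record = {"text": row["text"], "labels": labels}
        jsonl_file.write(json.dumps(record, ensure_ascii=False) + "\n")
        count += 1
    return count


# Placeholder input written for this example; these are not real dataset rows.
SAMPLE_CSV = "text,admiration,amusement,neutral\nGreat job!,1,0,0\nOkay.,0,0,1\n"

source = io.StringIO(SAMPLE_CSV)
target = io.StringIO()
written = rows_to_jsonl(source, target)
jsonl_lines = target.getvalue().splitlines()
```

The in-memory `io.StringIO` objects stand in for real files; swapping them for `open(...)` handles gives the same result on disk. For the Parquet side, one option (assuming pandas with pyarrow or fastparquet installed) is to load the JSONL with `pandas.read_json(path, lines=True)` and call `DataFrame.to_parquet` on the result.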