Closed syedzubeen closed 10 months ago
Dataset Details: The dataset contains CNN news articles published from 2011 to 2022 and has already undergone basic cleaning. Each example has two components: a category label and the complete article text.
The dataset was sourced from Kaggle (https://www.kaggle.com/datasets/hadasu92/cnn-articles-after-basic-cleaning) and has been divided into two sets:
A training set containing 32,218 examples.
A test set containing 5,686 examples.
Dataset URL: https://huggingface.co/datasets/AyoubChLin/CNN_News_Articles_2011-2022
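For anyone who wants to check the split sizes quoted above, here is a minimal sketch using the Hugging Face `datasets` library. It assumes the repository exposes splits named `train` and `test` and that it loads with the default configuration; neither is confirmed in this thread. The helper names (`check_split_sizes`, `load_cnn_articles`, `report`) are hypothetical and only for illustration.

```python
# Split sizes as stated in the dataset description above.
EXPECTED_SIZES = {"train": 32218, "test": 5686}

# Repository id taken from the Dataset URL above.
REPO_ID = "AyoubChLin/CNN_News_Articles_2011-2022"


def check_split_sizes(actual, expected=EXPECTED_SIZES):
    """Return {split: (actual_size, expected_size)} for every split that differs.

    A split missing from `actual` is reported with an actual size of None.
    An empty dict means every expected split has the expected size.
    """
    return {
        name: (actual.get(name), want)
        for name, want in expected.items()
        if actual.get(name) != want
    }


def load_cnn_articles():
    """Download the dataset from the Hugging Face Hub (needs network access)."""
    # Imported here so the helper above works without the library installed.
    from datasets import load_dataset  # pip install datasets

    return load_dataset(REPO_ID)


def report():
    """Load the dataset, print its splits and columns, and flag size mismatches."""
    ds = load_cnn_articles()
    # Assumption: load_dataset returns a DatasetDict keyed by split name.
    sizes = {name: split.num_rows for name, split in ds.items()}
    print("split sizes:", sizes)
    for name, split in ds.items():
        # Column names are printed rather than assumed, since the thread
        # does not list them.
        print(name, "columns:", split.column_names)
    print("mismatches:", check_split_sizes(sizes) or "none")
```

Calling `report()` downloads the data on first use and prints the column names, which is the quickest way to see what the label and text fields are actually called.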