CNN News Articles (2011-2022) - Githubissues

DagsHub / open-source-ml-datasets

This repository holds open-source datasets for various machine learning domains, with links to download and use them.

https://dagshub.com/DagsHub/open-source-ml-datasets

8 stars, 8 forks

CNN News Articles (2011-2022) #54

Closed: syedzubeen closed this issue 10 months ago

syedzubeen commented 11 months ago

Dataset Details: The dataset, which has undergone initial cleaning, comprises CNN news articles published from 2011 to 2022. Each record has two components: a category label and the complete article text.

The dataset was sourced from Kaggle and is available at https://www.kaggle.com/datasets/hadasu92/cnn-articles-after-basic-cleaning. It is split into two sets:

A training set containing 32,218 examples.

A test set containing 5,686 examples.
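For a quick sense of the split proportions, the two counts above can be turned into fractions. This is only a small arithmetic sketch; the helper name `split_fractions` is made up for illustration and is not part of the dataset or any library.

```python
# Example counts quoted in this issue.
TRAIN_EXAMPLES = 32_218
TEST_EXAMPLES = 5_686


def split_fractions(train: int, test: int) -> tuple[int, float, float]:
    """Return (total, train fraction, test fraction) for a two-way split."""
    total = train + test
    return total, train / total, test / total


total, train_frac, test_frac = split_fractions(TRAIN_EXAMPLES, TEST_EXAMPLES)
print(f"total={total}, train={train_frac:.1%}, test={test_frac:.1%}")
# prints: total=37904, train=85.0%, test=15.0%
```

So the split is roughly 85/15 between training and test.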

Dataset URL: https://huggingface.co/datasets/AyoubChLin/CNN_News_Articles_2011-2022
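A minimal sketch of how the Hugging Face copy might be loaded, assuming the `datasets` library is installed and the repository loads with its default configuration. The helper names `load_cnn_articles` and `describe_splits` are hypothetical, and the split and column names are not confirmed by this issue, so inspect the returned object rather than relying on any particular name.

```python
# Dataset identifier taken from the Dataset URL above.
DATASET_ID = "AyoubChLin/CNN_News_Articles_2011-2022"


def load_cnn_articles(dataset_id: str = DATASET_ID):
    """Download the dataset from the Hugging Face Hub (needs network access)."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(dataset_id)


def describe_splits(dataset_dict) -> dict[str, int]:
    """Map each split name to its number of examples."""
    return {name: len(split) for name, split in dataset_dict.items()}
```

Calling `describe_splits(load_cnn_articles())` should report the size of each split, which can be checked against the counts quoted in this issue.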

dagshub[bot] commented 11 months ago

Join the discussion on DagsHub!