Open mytnguyen26 opened 7 months ago
We will collect as much data as we can from Chinese sites, Vietnamese sites, and any English open-source sites. Material includes:
Let's save this unprocessed data in a ShareDrive (TBD) in .txt format, organized by type (music, short stories, arts).
https://www.kaggle.com/datasets/carlosgdcj/genius-song-lyrics-with-language-information
https://www.kaggle.com/datasets/rickyjli/chinese-fine-art
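One possible way to lay out the raw files by type is sketched below. This is only an illustration, not an agreed convention: the function name `save_raw_text`, the folder names in `CATEGORIES`, and the local root directory standing in for the ShareDrive (still TBD) are all assumptions.

```python
from pathlib import Path

# Hypothetical folder names, based on the types mentioned in this issue.
CATEGORIES = ("music", "short_stories", "arts")


def save_raw_text(root: Path, category: str, name: str, text: str) -> Path:
    """Write one unprocessed document to <root>/<category>/<name>.txt."""
    if category not in CATEGORIES:
        raise ValueError(f"unknown category: {category!r}")
    target_dir = root / category
    # Create the category folder on first use.
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / f"{name}.txt"
    # UTF-8 keeps Chinese and Vietnamese text intact.
    target.write_text(text, encoding="utf-8")
    return target


if __name__ == "__main__":
    # "raw_data" is a placeholder for the ShareDrive mount point (TBD).
    saved = save_raw_text(Path("raw_data"), "music", "song_001", "Xin chào")
    print(saved)
```

Writing everything as UTF-8 from the start avoids mixed encodings later when Chinese, Vietnamese, and English sources are processed together.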