ravi-prakash1907 opened this issue 1 year ago (status: Open)
I plan to curate a diverse dataset of poems from online repositories and public domain collections. With a focus on balanced sentiment representation and accurate annotations, I will ensure the dataset's quality and integrity.
That will be nice, @Nabanita29. Please try to collect multilingual data and, as mentioned, treat short poems as a priority. You may join the community's Discord server for further discussion, queries, and suggestions!
Note: As the milestone "Dataset Collection" is nearing its deadline, pull requests associated with this issue will be considered a priority.
Description:
This project requires an NLP model trained on a poetry dataset that covers multiple languages, with a current focus on English and Hindi. The dataset should meet the following constraints:
For longer poems, consider the following contributions:
Note: