Closed nkandpa2 closed 8 months ago
This is not a huge source of data (a couple thousand essays) but it requires very little work to scrape/clean and the text is high-quality (seems to be authored by people who study and write about art/history/culture professionally). I have the code for scraping/cleaning the text from their site here. Only thing left to do is to convert the text to Dolma.
Looks good! Let's include it.
Closed via #67
This is a blog containing relatively long-form essays about works that have entered the public domain. The essays themselves are under a CC-BY SA license (see here for license info).