The goal is to create a dataset with all Armenian-related subjects in the USC Digital Folklore Archives.
Tasks
You should collect entries from http://folklore.usc.edu website with the author, date, and tags, preferably with categories somehow indicated subheadings. Please, saved collected data in machine-readable formats such as JSON or csv files. Please save documents to any temporary public storage and provide link to transfer it to the permanent storage.
A word of warning: not all articles on this page are related to Armenia (example).
The website uses Google for searching its content and sometimes outputs unrelated articles in the results due to how they were rendered to the Google bot.
Goal
The goal is to create a dataset with all Armenian-related subjects in the USC Digital Folklore Archives.
Tasks
You should collect entries from http://folklore.usc.edu website with the author, date, and tags, preferably with categories somehow indicated subheadings. Please, saved collected data in machine-readable formats such as JSON or csv files. Please save documents to any temporary public storage and provide link to transfer it to the permanent storage.
Context
USC Digital Folklore Archives is a database of folklore performances. Armenian-related topics can be found at http://folklore.usc.edu/search_gcse/?q=armenian.
Requirements
Wishes
Please write your code as reusable code that could be launched by someone else later since we could need to update this dataset later.
Resources
Prepared by
This task was prepared by the Open Data Armenia team