This repository contains scripts used for processing WikiData dumps, specifically for the "claims" and "labels" data.
The claims package contains the scripts for processing claims data from WikiData dumps. Its scripts are:

do_text.py
: Processes the claims data and generates a textual report.

read_dump.py
: Reads and parses the claims data from the WikiData dump.

save.py
: Saves the processed claims data to a specified location.

The labels package consists of scripts for handling labels data from WikiData dumps. The primary features are:

do_text.py
: Generates a text report based on the labels data.

read_dump.py
: Reads and processes the labels data from the WikiData dump.

save.py
: Saves the processed labels data to a designated location.
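
As a rough illustration of the kind of work the read_dump.py scripts do, the sketch below walks a WikiData JSON dump line by line and pulls out labels and per-property claim counts. It is not the code in this repository: the function names `iter_entities`, `extract_labels`, and `count_claims` are hypothetical, and it assumes the standard JSON dump layout (one entity per line inside a JSON array, each line except the last ending with a comma).

```python
import io
import json


def iter_entities(lines):
    """Yield one entity dict per line of a WikiData JSON dump (hypothetical helper)."""
    for line in lines:
        # Each entity line ends with a comma; the array brackets sit on their own lines.
        line = line.strip().rstrip(",")
        if line in ("", "[", "]"):
            continue
        yield json.loads(line)


def extract_labels(entity, languages=("en",)):
    """Return {language: label text} for the requested languages that are present."""
    labels = entity.get("labels", {})
    return {lang: labels[lang]["value"] for lang in languages if lang in labels}


def count_claims(entity):
    """Return {property id: number of statements} for one entity."""
    claims = entity.get("claims", {})
    return {prop: len(statements) for prop, statements in claims.items()}


# A tiny in-memory stand-in for a dump file, using made-up sample data.
sample_dump = io.StringIO(
    "[\n"
    '{"id": "Q1", "labels": {"en": {"language": "en", "value": "universe"}}, '
    '"claims": {"P31": [{}, {}]}},\n'
    '{"id": "Q2", "labels": {}, "claims": {}}\n'
    "]\n"
)

entities = list(iter_entities(sample_dump))
for entity in entities:
    print(entity["id"], extract_labels(entity), count_claims(entity))
```

Real dumps are usually distributed compressed, so in practice the same generator could be fed a file object opened in text mode, for example with `bz2.open(path, "rt")`, which avoids loading the whole dump into memory.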