NBU-DSCM-2020 / dscm006-semtech-group-project

NBU Data Science 2020 DSCM006 Semantic Technologies Group Project
0 stars 0 forks source link

DSCM006 Semantic Technologies (NBU 2020) Group Project

Тук се съдържа скрейпър на декларациите по ЗЗДПДПОРДМУ от 2020г. базиран на Python Scrapy 2.4.0, както и детиали за транформация на данните в RDF

Scraper Install & Run

cd dscm006-semtech-group-project/scraper
python3 -m venv .venv/
source .venv/bin/activate
pip3 install -r requirements.txt
scrapy crawl declarations_register 

Data Model

The data model is located in the [model] folder.

Public endpoint

http://edu.ontotext.com/ repository:media-registry contains the endpoint. It is still a work in progress.

Workflow data fixes