VCityTeam / UD-Demo-Workspace-GratteCiel

GNU Lesser General Public License v2.1
0 stars 0 forks source link

automatically load data into blazegraph container #2

Closed DiegoVinasco closed 1 month ago

DiegoVinasco commented 1 year ago

upon building a blazegraph container, identified data should be loaded automatically though a script

See https://github.com/blazegraph/database/wiki/Quick_Start#load-data for an example

DiegoVinasco commented 1 year ago

from blazegraph api doc, a blazegraph REST API to add files by URL/URI:

curl -X POST --data-binary 'uri=file:///Users/bryan/Documents/workspace/BIGDATA_RELEASE_1_2_0/bigdata-rdf/src/resources/data/foaf/data-0.nq' http://localhost:8080/bigdata/sparql

from @JPugetGil, a script to download and upload files (to an SQL db) that can be adapted for blazegraph:

mkdir dataset

cd dataset || exit

# Get the data from the liris dataset server
curl --request GET -sL \
  --url 'https://dataset-dl.liris.cnrs.fr/rdf-owl-urban-data-ontologies/Datasets/GratteCiel_Workspace_2009_2018/3.0/GratteCiel_2018_split.ttl' \
  --output 'GratteCiel_2018_split.ttl'

curl --request GET -sL \
  --url 'https://dataset-dl.liris.cnrs.fr/rdf-owl-urban-data-ontologies/Datasets/GratteCiel_Workspace_2009_2018/3.0/GratteCiel_2015_split.ttl' \
  --output 'GratteCiel_2015_split.ttl'

curl --request GET -sL \
  --url 'https://dataset-dl.liris.cnrs.fr/rdf-owl-urban-data-ontologies/Datasets/GratteCiel_Workspace_2009_2018/3.0/GratteCiel_2012_split.ttl' \
  --output 'GratteCiel_2012_split.ttl'

curl --request GET -sL \
  --url 'https://dataset-dl.liris.cnrs.fr/rdf-owl-urban-data-ontologies/Datasets/GratteCiel_Workspace_2009_2018/3.0/GratteCiel_2012_alt_split.ttl' \
  --output 'GratteCiel_2012_alt_split.ttl'

curl --request GET -sL \
  --url 'https://dataset-dl.liris.cnrs.fr/rdf-owl-urban-data-ontologies/Datasets/GratteCiel_Workspace_2009_2018/3.0/GratteCiel_2009_split.ttl' \
  --output 'GratteCiel_2009_split.ttl'

curl --request GET -sL \
  --url 'https://dataset-dl.liris.cnrs.fr/rdf-owl-urban-data-ontologies/Datasets/GratteCiel_Workspace_2009_2018/3.0/GratteCiel_2009_alt_split.ttl' \
  --output 'GratteCiel_2009_alt_split.ttl'

# Import the data
curl --location 'http://localhost:8080/import/add' \
  --header 'Content-Type: multipart/form-data' \
  --form 'files=@"GratteCiel_2018_split.ttl"'

curl --location 'http://localhost:8080/import/add' \
  --header 'Content-Type: multipart/form-data' \
  --form 'files=@"GratteCiel_2015_split.ttl"'

curl --location 'http://localhost:8080/import/add' \
  --header 'Content-Type: multipart/form-data' \
  --form 'files=@"GratteCiel_2012_split.ttl"'

curl --location 'http://localhost:8080/import/add' \
  --header 'Content-Type: multipart/form-data' \
  --form 'files=@"GratteCiel_2012_alt_split.ttl"'

curl --location 'http://localhost:8080/import/add' \
  --header 'Content-Type: multipart/form-data' \
  --form 'files=@"GratteCiel_2009_split.ttl"'

curl --location 'http://localhost:8080/import/add' \
  --header 'Content-Type: multipart/form-data' \
  --form 'files=@"GratteCiel_2009_alt_split.ttl"'