astrolabsoftware / grafink

Grafink is a Spark ETL job to load data into JanusGraph [GSoC 2020]
Apache License 2.0

Add a Spark job to load Fink-processed data into JanusGraph #1

Closed saucam closed 4 years ago

saucam commented 4 years ago

We want to add a Spark ETL job that pushes Fink's output data into JanusGraph. There should also be a configurable option to load the data schema: when enabled, the job reads the schema from the parquet data and writes it to JanusGraph before loading the data.

fink --schema and data--> HBase
  |
  |__> parquet files --grafink--> JanusGraph
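As a rough illustration of the "schema first, then data" ordering described above, here is a minimal Python sketch. It is not grafink's implementation. The type mapping, the function names (`property_key_statements`, `plan_job`), the `load_schema` flag name, and the column names are all assumptions made up for this example. The generated strings follow the shape of JanusGraph's management calls (`makePropertyKey(...).dataType(...).make()`), but the sketch only builds them as text and never talks to a graph.

```python
# Hypothetical mapping from Spark SQL type names (as read from parquet)
# to JanusGraph property data types. Assumed for illustration only.
SPARK_TO_JANUS = {
    "string": "String",
    "boolean": "Boolean",
    "integer": "Integer",
    "long": "Long",
    "float": "Float",
    "double": "Double",
}


def property_key_statements(fields):
    """Build one management statement per (column_name, spark_type) pair."""
    statements = []
    for name, spark_type in fields:
        janus_type = SPARK_TO_JANUS.get(spark_type)
        if janus_type is None:
            # Fail loudly rather than guess a type for an unmapped column.
            raise ValueError(f"no JanusGraph type mapped for {name}: {spark_type}")
        statements.append(
            f"mgmt.makePropertyKey('{name}').dataType({janus_type}.class).make()"
        )
    return statements


def plan_job(fields, load_schema):
    """Return the ordered steps: optional schema statements, then the data load."""
    steps = property_key_statements(fields) if load_schema else []
    steps.append("load data")  # placeholder for the actual Spark write
    return steps


if __name__ == "__main__":
    # Hypothetical columns; a real job would take them from the parquet schema.
    example_fields = [("objectId", "string"), ("ra", "double")]
    for step in plan_job(example_fields, load_schema=True):
        print(step)
```

With `load_schema=False` the plan contains only the data-load step, which corresponds to the case where the schema already exists in JanusGraph.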

saucam commented 4 years ago

Addressed by #4