benchflow / data-transformers

Spark scripts utilised to transform data to the BenchFlow internal formats

Develop tasks that read a MySQL dump, transform the data, and store it in Cassandra #1

Closed: VincenzoFerme closed this issue 8 years ago

VincenzoFerme commented 8 years ago

Develop a task that:

Take a look at Spark SQL, which provides a powerful tool for performing ETL on data. A nice example can be found at the following link: http://chapeau.freevariable.com/2014/10/fedmsg-and-spark.html

This task is a starting point for understanding what we need to do to simplify this process for users, by providing an already implemented library that does most of the work.
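
As a rough illustration of the "read a MySQL dump and transform the data" part, here is a minimal sketch in plain Python. It is not the implementation that closed this issue. It assumes the dump uses the default `mysqldump` layout, with one `INSERT INTO ... VALUES (...),(...);` statement per line and no column list. The names `parse_rows`, `parse_dump`, `to_records`, and `convert`, as well as the `users` table and its columns, are hypothetical. In a Spark job the resulting records could be turned into a DataFrame and queried with Spark SQL; writing them to Cassandra needs a separate connector or driver, which is not shown.

```python
import re
from typing import Dict, Iterable, Iterator, List, Optional, Sequence, Tuple, Union

Value = Union[None, int, float, str]

# Head of a mysqldump INSERT statement; captures the table name.
INSERT_RE = re.compile(r"^INSERT INTO `?(\w+)`? VALUES\s*", re.IGNORECASE)
# Backslash escapes that map to control characters; any other escaped
# character (quote, backslash) is kept literally.
ESCAPES = {"n": "\n", "t": "\t", "r": "\r", "0": "\0"}


def convert(token: str, quoted: bool) -> Value:
    """Turn one SQL literal into a Python value."""
    if quoted:
        return token
    if token.upper() == "NULL":
        return None
    for cast in (int, float):
        try:
            return cast(token)
        except ValueError:
            pass
    return token


def parse_rows(values_sql: str) -> Iterator[List[Value]]:
    """Yield one list of values per "(...)" tuple of a VALUES clause."""
    row: Optional[List[Value]] = None  # None while between tuples
    buf: List[str] = []
    in_string = False
    quoted = False
    i = 0
    while i < len(values_sql):
        ch = values_sql[i]
        if in_string:
            if ch == "\\" and i + 1 < len(values_sql):
                nxt = values_sql[i + 1]
                buf.append(ESCAPES.get(nxt, nxt))
                i += 1
            elif ch == "'":
                if values_sql[i + 1:i + 2] == "'":  # doubled quote inside a string
                    buf.append("'")
                    i += 1
                else:
                    in_string = False
            else:
                buf.append(ch)
        elif ch == "'":
            in_string = True
            quoted = True
        elif ch == "(" and row is None:
            row = []
        elif ch in ",)" and row is not None:
            row.append(convert("".join(buf), quoted))
            buf, quoted = [], False
            if ch == ")":
                yield row
                row = None
        elif row is not None and not ch.isspace():
            buf.append(ch)
        i += 1


def parse_dump(lines: Iterable[str]) -> Iterator[Tuple[str, List[Value]]]:
    """Yield (table_name, row) for every INSERT line; other lines are skipped."""
    for line in lines:
        match = INSERT_RE.match(line)
        if match:
            for row in parse_rows(line[match.end():]):
                yield match.group(1), row


def to_records(rows: Iterable[Sequence[Value]], columns: Sequence[str]) -> List[Dict[str, Value]]:
    """Transform positional rows into dicts keyed by caller-supplied column names."""
    return [dict(zip(columns, row)) for row in rows]


# Hypothetical sample input, for illustration only.
SAMPLE_DUMP = [
    "-- MySQL dump",
    "INSERT INTO `users` VALUES (1,'Ann',NULL),(2,'O\\'Neil',3.5);",
]
```

Calling `to_records(row for _, row in parse_dump(SAMPLE_DUMP))` with the column names `["id", "name", "score"]` gives one dictionary per dumped row. The column names have to be supplied by the caller because this sketch does not read the `CREATE TABLE` statements in the dump.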

Something to evaluate:

Pay attention to:

Cerfoglg commented 8 years ago

Additional details:

VincenzoFerme commented 8 years ago

Closed by #2