scylladb / scylla-migrator

Migrate data extract using Spark to Scylla, normally from Cassandra
Apache License 2.0
54 stars 34 forks source link

Build spark-cassandra-connector as a usual dependency instead of a fat-jar #151

Closed julienrf closed 2 weeks ago

julienrf commented 3 weeks ago

The assembly of the migrator project requires a huge amount of resources because it has to process thousands of files that are duplicated between the fat-jar of the spark-cassandra-connector and scylla-migrator’s dependencies.

On my machine, the OS often crashes when I try to build the assembly…

Instead, we should build the spark-cassandra-connector as a usual Maven/Ivy package, publish it locally, and depend on it in the scylla-migrator project.

This PR needs https://github.com/scylladb/spark-cassandra-connector/pull/4 to be merged first.

julienrf commented 2 weeks ago

Superseded by #155