cylondata / twister2

A composable framework for fast and scalable data analytics
https://twister2.org
Apache License 2.0

unifying disk configurations #941

Closed ahmet-uyar closed 4 years ago

ahmet-uyar commented 4 years ago

Currently there are multiple configuration parameters for specifying local/HDFS disk locations for checkpointing, saving tsets, and spilling over to disk during network operations.

We have:
- twister2.checkpointing.store.fs.dir and twister2.checkpointing.store.hdfs.dir in checkpoint.yaml
- twister2.data.fs.root and twister2.data.hdfs.root in data.yaml
- twister2.network.ops.persistent.dirs in network.yaml

Can we have a single parameter each for the local and HDFS disk locations? That would make Twister2 simpler for users to configure.

We can create a per-job directory inside each of these locations to avoid conflicts between concurrent jobs.
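To make the per-job directory idea concrete, here is a minimal sketch of how paths could be derived from one shared root. The class `JobDirs`, the method `jobDir`, and the `purpose` names ("checkpoint", "tset", "network") are all hypothetical and are not part of the Twister2 API; the root would come from whatever unified parameter is eventually chosen.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

/** Hypothetical helper for illustration only; not part of Twister2. */
public final class JobDirs {
  private JobDirs() { }

  /**
   * Builds root/jobId/purpose, so concurrent jobs sharing one configured
   * root never write into the same directory.
   */
  public static Path jobDir(String root, String jobId, String purpose) {
    if (jobId == null || jobId.isEmpty() || purpose == null || purpose.isEmpty()) {
      throw new IllegalArgumentException("jobId and purpose must be non-empty");
    }
    Path rootPath = Paths.get(root).normalize();
    Path dir = rootPath.resolve(jobId).resolve(purpose).normalize();
    // Reject ids such as ".." or "a/b" that would escape or nest
    // outside the expected root/jobId/purpose layout.
    if (!dir.startsWith(rootPath)
        || dir.getNameCount() != rootPath.getNameCount() + 2) {
      throw new IllegalArgumentException("invalid jobId or purpose: " + jobId + ", " + purpose);
    }
    return dir;
  }
}
```

With a root of `/tmp/twister2`, two jobs `job-1` and `job-2` would checkpoint into `/tmp/twister2/job-1/checkpoint` and `/tmp/twister2/job-2/checkpoint` respectively. The same layout could apply to an HDFS root, although that would need the Hadoop path API rather than `java.nio.file`.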