datastax / dsbulk

DataStax Bulk Loader (DSBulk) is an open-source, Apache-licensed, unified tool for loading data into and unloading data from Apache Cassandra®, DataStax Astra, and DataStax Enterprise (DSE).
Apache License 2.0

Split when unloading into smaller files #486

Closed ustndagsemih closed 1 year ago

ustndagsemih commented 1 year ago

Hello,

I am trying to export a large Cassandra table, but dsbulk appears to base the number of output files on the total CPU count: it creates, for example, 5 files, each with a different size. I want my exported data to be split into files of a fixed size, for example 1 GB each. In cqlsh there is an option for this, maxoutputsize, but I couldn't find an equivalent option in dsbulk.

Am I missing something here? Or is there a way to achieve this?

Thanks.
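
To make the request concrete, here is a minimal sketch of the behavior I am after, written as a post-processing step on an already exported CSV file. This is not a dsbulk feature: the function name `split_csv_by_size`, the `part-NNNNNN.csv` naming, and the assumption that every record sits on a single line (no quoted fields containing newlines) are all mine.

```python
import os


def split_csv_by_size(src_path, out_dir, max_bytes=1024 ** 3, has_header=True):
    """Split a CSV file into parts of at most ~max_bytes, cutting only at line ends.

    Hypothetical helper: assumes one record per line. The header line, if any,
    is repeated at the top of every part. Returns the list of part paths.
    """
    os.makedirs(out_dir, exist_ok=True)
    paths = []
    out = None
    written = 0
    with open(src_path, "rb") as src:
        header = src.readline() if has_header else b""
        for line in src:
            # Open a new part if none is open yet, or if this line would push
            # the current part over the limit (each part keeps at least one row).
            overflow = written + len(line) > max_bytes and written > len(header)
            if out is None or overflow:
                if out is not None:
                    out.close()
                path = os.path.join(out_dir, f"part-{len(paths) + 1:06d}.csv")
                out = open(path, "wb")
                out.write(header)
                written = len(header)
                paths.append(path)
            out.write(line)
            written += len(line)
    if out is not None:
        out.close()
    return paths
```

Calling `split_csv_by_size("output-000001.csv", "split", max_bytes=1024 ** 3)` on each exported file would give roughly 1 GB parts, though a single row larger than the limit still ends up in a part of its own. As far as I can tell, the dsbulk CSV connector documentation also lists a `connector.csv.maxRecords` setting, but that caps records per file rather than bytes, so it would need checking against the docs before relying on it here.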