DataStax Bulk Loader (DSBulk) is an open-source, Apache-licensed, unified tool for loading into and unloading from Apache Cassandra(R), DataStax Astra and DataStax Enterprise (DSE)
I am trying to export a large Cassandra table but dsbulk is considering total CPU and create for example 5 files and each of them with a different size.
I want my exported files to splitted into for example 1G of files. In cqlsh there is an option to do that, maxoutputsize, however in dsbulk I couldn't find such an option.
Am I missing something here? Or is there a way to achieve this?
Hello,
I am trying to export a large Cassandra table but dsbulk is considering total CPU and create for example 5 files and each of them with a different size. I want my exported files to splitted into for example 1G of files. In cqlsh there is an option to do that, maxoutputsize, however in dsbulk I couldn't find such an option.
Am I missing something here? Or is there a way to achieve this?
Thanks.