NVIDIA / spark-rapids-tools

User tools for Spark RAPIDS
Apache License 2.0

Add Arguments for Distributed Mode in Qualification Tool CLI #1429

Open parthosa opened 4 days ago

parthosa commented 4 days ago

Fixes #1430.

This PR adds the initial CLI changes needed to support distributed execution in the Qualification Tool. It introduces the arguments that enable distributed mode and lays the groundwork for follow-up implementation PRs.

Note:

Changes Overview

CMD:

spark_rapids qualification --platform onprem --eventlogs /path/to/eventlogs --verbose --filter_apps all \
  --distributed --tools_config_file /path/to/custom_conf_file.yaml
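
For reviewers who want to see the shape of the two new arguments in isolation, here is a minimal sketch using Python's standard `argparse`. It is not the tool's actual argument handling, which is not shown in this description; `build_parser` is a hypothetical helper, and the sketch only assumes that `--distributed` is a boolean switch and `--tools_config_file` takes a path, as the command above suggests.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical stand-in parser; the real CLI wiring may differ."""
    parser = argparse.ArgumentParser(prog="spark_rapids")
    sub = parser.add_subparsers(dest="command", required=True)
    qual = sub.add_parser("qualification")
    # Arguments that appear in the sample command.
    qual.add_argument("--platform", required=True)
    qual.add_argument("--eventlogs", required=True)
    qual.add_argument("--verbose", action="store_true")
    qual.add_argument("--filter_apps")
    # Arguments discussed in this PR (assumed: a switch and a file path).
    qual.add_argument("--distributed", action="store_true")
    qual.add_argument("--tools_config_file")
    return parser


args = build_parser().parse_args([
    "qualification",
    "--platform", "onprem",
    "--eventlogs", "/path/to/eventlogs",
    "--verbose",
    "--filter_apps", "all",
    "--distributed",
    "--tools_config_file", "/path/to/custom_conf_file.yaml",
])
print(args.distributed, args.tools_config_file)
```

With this sketch, omitting `--distributed` leaves `args.distributed` as `False` and `args.tools_config_file` as `None`, so existing invocations are unaffected.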

Sample Config File:

api_version: '1.0'
distributed_tools:
  spark_properties:
    - name: 'spark.executor.memory'
      value: '20g'
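
To illustrate how the sample config could be consumed, here is a minimal sketch that validates the structure and flattens `spark_properties` into a name-to-value mapping. The names `SparkProperty`, `DistributedToolsConfig`, and `parse_tools_config` are hypothetical and not taken from this PR. The input is the dict a YAML loader would return for the file above, so the sketch needs only the standard library. Treating `api_version` as required is an assumption.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class SparkProperty:
    name: str
    value: str


@dataclass
class DistributedToolsConfig:
    spark_properties: List[SparkProperty] = field(default_factory=list)

    def as_dict(self) -> Dict[str, str]:
        """Flatten the property list into a {name: value} mapping."""
        return {p.name: p.value for p in self.spark_properties}


def parse_tools_config(raw: Dict[str, Any]) -> DistributedToolsConfig:
    """Validate an already-loaded config dict (hypothetical helper)."""
    # Assumption: api_version is mandatory; the PR does not state this.
    if "api_version" not in raw:
        raise ValueError("missing required key: api_version")
    section = raw.get("distributed_tools") or {}
    props = []
    for entry in section.get("spark_properties") or []:
        if "name" not in entry or "value" not in entry:
            raise ValueError(f"spark property needs 'name' and 'value': {entry!r}")
        props.append(SparkProperty(str(entry["name"]), str(entry["value"])))
    return DistributedToolsConfig(spark_properties=props)


# The sample config file, as a YAML loader would return it.
sample = {
    "api_version": "1.0",
    "distributed_tools": {
        "spark_properties": [
            {"name": "spark.executor.memory", "value": "20g"},
        ],
    },
}
config = parse_tools_config(sample)
print(config.as_dict())
```

Keeping the properties as a list of name/value pairs in the file and flattening them only at the point of use preserves the file's layout while giving callers a simple lookup.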

Details:

Enhancements to argument processing:

Platform class updates:

Other improvements: