NVIDIA / spark-rapids-tools

User tools for Spark RAPIDS
Apache License 2.0

[FEA] Support for Dataproc Serverless platform in spark-rapids-user-tools #682

Open SurajAralihalli opened 11 months ago

SurajAralihalli commented 11 months ago

Adding support for the Dataproc Serverless platform in spark-rapids-user-tools would be useful to Dataproc Serverless users, specifically for the following tools:

  1. Qualification Tool
  2. Profiling Tool

### Tasks
- [ ] https://github.com/NVIDIA/spark-rapids-tools/issues/663

amahussein commented 10 months ago

By "support", do we mean the ability to run the spark-rapids-user-tools Python package on Dataproc Serverless, or the ability to analyze the logs generated by apps running on the Serverless platform?

mattahrens commented 10 months ago

P0 scope:

P1 scope:

SurajAralihalli commented 10 months ago

> By "support", do we mean the ability to run the spark-rapids-user-tools Python package on Dataproc Serverless, or the ability to analyze the logs generated by apps running on the Serverless platform?

Hello @amahussein, I meant the latter: the ability to analyze the logs generated by Dataproc Serverless applications (qualification, profiling). On a similar note, #663 is a feature request to add the Dataproc Serverless job creation command to the qual tool output for Dataproc (for users migrating from classic Dataproc to Serverless).

amahussein commented 8 months ago

After assessing this feature, we concluded that we need to be able to detect the cluster shape from the eventlogs. The reason is that users are unlikely to keep the batch ID of a Spark submission around for long. This means that we cannot use the batch ID or cluster configs as inputs for the tools. Instead, we have to rely on extracting the information from the eventlogs/driver logs.

So, this issue should depend on #581
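To illustrate the idea of reading the cluster shape from an eventlog, here is a minimal Python sketch. It is not the implementation tracked in #581. It assumes an uncompressed, line-delimited JSON eventlog and looks only at the `SparkListenerEnvironmentUpdate` event, whose `Spark Properties` map carries the submitted Spark configuration. The function name `extract_shape_properties`, the `SHAPE_KEYS` selection, and the sample log values are hypothetical and exist only for illustration.

```python
import json
from typing import Dict, Iterable

# Standard Spark properties that hint at the cluster shape.
# Which keys the tools would actually rely on is an assumption here.
SHAPE_KEYS = (
    "spark.executor.instances",
    "spark.executor.cores",
    "spark.executor.memory",
    "spark.driver.cores",
    "spark.driver.memory",
    "spark.dynamicAllocation.enabled",
)


def extract_shape_properties(lines: Iterable[str]) -> Dict[str, str]:
    """Return shape-related Spark properties from the first environment-update event."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip truncated or non-JSON lines
        if not isinstance(event, dict):
            continue
        if event.get("Event") != "SparkListenerEnvironmentUpdate":
            continue
        props = event.get("Spark Properties", {})
        if not isinstance(props, dict):
            return {}
        return {key: props[key] for key in SHAPE_KEYS if key in props}
    return {}


# Hypothetical, hand-written eventlog lines (not taken from a real run).
sample_log = [
    json.dumps({"Event": "SparkListenerLogStart", "Spark Version": "3.4.0"}),
    json.dumps({
        "Event": "SparkListenerEnvironmentUpdate",
        "Spark Properties": {
            "spark.executor.instances": "2",
            "spark.executor.cores": "4",
            "spark.executor.memory": "9600m",
            "spark.app.name": "example",
        },
    }),
]

shape = extract_shape_properties(sample_log)
print(shape)
```

For a real eventlog file, the same function can be fed an open file handle, for example `with open(path) as f: extract_shape_properties(f)`. Compressed or rolling eventlogs would need extra handling that this sketch leaves out.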

Resources:

- Spark properties for Dataproc Serverless: https://cloud.google.com/dataproc-serverless/docs/concepts/properties
- Serverless pricing estimation: https://cloud.google.com/dataproc-serverless/pricing