NVIDIA / spark-rapids-tools

User tools for Spark RAPIDS
Apache License 2.0
49 stars 36 forks source link

Refactor DB AWS qual tool to cache and process pricing info from DB website #1139

Closed cindyyuanjiang closed 3 months ago

cindyyuanjiang commented 3 months ago

Currently, DB AWS qual tool extracts pricing information from https://github.com/NVIDIA/spark-rapids-tools/blob/dev/user_tools/src/spark_rapids_pytools/resources/databricks-premium-catalog.json. However, this file could be outdated and has not been actively maintained.

We recently found Databricks saves AWS pricing info under: https://www.databricks.com/en-website-assets/data/pricing/AWS.json. Qual tool should utilize this website to maintain DB AWS pricing info.