issues
search
NVIDIA
/
spark-rapids-tools
User tools for Spark RAPIDS
Apache License 2.0
43
stars
34
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[FEA] Include commit id in build properties
#1155
wjxiz1992
opened
3 hours ago
0
[FEA] Revisit/Update eventlogs of scala unit tests
#1154
nartal1
opened
8 hours ago
0
[FEA] Set user-tools environment to drop python-3.8
#1153
amahussein
opened
2 days ago
0
[FEA] Auto tuning/node recommendation missing features
#1152
tgravescs
opened
2 days ago
0
Add all stage metrics to profiling tool output
#1151
nartal1
opened
3 days ago
2
Unsupported op logic should read action column from qual's output
#1150
amahussein
closed
3 days ago
0
[BUG] Revisit QualX code to include it in Tox
#1149
amahussein
opened
3 days ago
0
Add plugin mechanism for dataset-specific preprocessing in qualx
#1148
leewyang
closed
2 days ago
2
[BUG] user tools Qualification tunings and node recommendation when --cluster specified can be wrong
#1147
tgravescs
opened
3 days ago
1
Follow-up 1142: remove TODO line
#1146
amahussein
closed
3 days ago
0
Disable pylint-unreachable code in tox.ini
#1145
amahussein
closed
4 days ago
0
[DOC] Documentation and user tools --help should clarify what type of cluster is used with --cluster option
#1144
tgravescs
opened
4 days ago
0
[FEA] user tools qualification node recommendation should include the number of GPUs to use
#1143
tgravescs
opened
4 days ago
0
Mark wholestageCodeGen as shouldRemove when child nodes are removed
#1142
amahussein
closed
3 days ago
1
Refactor Databricks-AWS Qual tool to cache and process pricing info from DB website
#1141
cindyyuanjiang
closed
2 days ago
2
Update qualx readme for training
#1140
leewyang
closed
4 days ago
0
Refactor DB AWS qual tool to cache and process pricing info from DB website
#1139
cindyyuanjiang
closed
2 days ago
0
Recommended cluster should use executors_per_node and cores_per_executor
#1138
amahussein
opened
5 days ago
5
Add internal CLI to generate instance descriptions for CSPs
#1137
cindyyuanjiang
opened
6 days ago
6
[FEA] Revisit string formats in tools
#1136
amahussein
opened
6 days ago
0
[FEA] Display full failure messages in failed CSV files
#1135
amahussein
closed
5 days ago
0
[FEA] Support custom XGBoost model file via user tools CLI
#1134
mattahrens
opened
6 days ago
0
[DOC] Fix User-tools README with absolute links and valid links
#1133
amahussein
opened
6 days ago
0
[BUG] Error mesasges in failed_job.csv and failed_stages.csv are not fully displayed
#1132
wjxiz1992
closed
5 days ago
2
[BUG] unsupported operator handling logic should use action column and not override table
#1131
eordentlich
closed
3 days ago
0
Fix Python runtime error caused by numpy 2.0.0 release
#1130
amahussein
closed
1 week ago
0
Bump urllib3 from 1.26.18 to 1.26.19 in /data_validation
#1129
dependabot[bot]
opened
1 week ago
0
Fix Python runtime error caused by numpy 2.0.0 release
#1128
amahussein
closed
1 week ago
2
[BUG] Python runtime failure due to incompatibe numpy
#1127
amahussein
closed
1 week ago
1
[BUG] python user tools should always display processed apps - even if passed GPU event logs
#1126
tgravescs
opened
1 week ago
0
[FEA] Be able to recommend specific GPU SKU according to SQL nature
#1125
wjxiz1992
opened
2 weeks ago
2
Handle different exception thrown by incomplete eventlogs
#1124
amahussein
closed
1 week ago
0
Add an internal CLI to generate instance type descriptions for CSPs
#1123
cindyyuanjiang
opened
2 weeks ago
0
[BUG] Handle different exception thrown by incomplete eventlogs
#1122
amahussein
closed
1 week ago
0
Qual tool cluster info - calculating the number of exec nodes on yarn should look at all executors over time when dynamic allocation on
#1121
tgravescs
opened
2 weeks ago
0
[FEA] Add Benchmarking to evaluate the core tools performance
#1120
amahussein
opened
2 weeks ago
0
Include number of executors per node in cluster information
#1119
parthosa
closed
1 week ago
2
[BUG] Platform should be initialized after parsing the eventlogs
#1118
amahussein
opened
2 weeks ago
0
[BUG] Cluster Information: Include number of executors per node
#1117
parthosa
closed
1 week ago
1
[BUG] Revisit the categorization of unsupported ops in Qual tool output
#1116
amahussein
opened
2 weeks ago
0
[BUG] Qualification CLI does not generate AutoTuning for onPrem
#1115
amahussein
opened
2 weeks ago
0
Disable the spark_rapids bootstrap command
#1114
amahussein
closed
2 weeks ago
2
Fix typo in Profiler class using qual instead of prof
#1113
amahussein
closed
2 weeks ago
0
[BUG] Qualification estimate is not generated when `SparklistenerApplicationStart` is missing from the eventlog
#1112
kuhushukla
opened
2 weeks ago
5
Add support to Python 3.12
#1111
amahussein
closed
2 weeks ago
0
user-tools: Update log messages
#1110
nartal1
closed
2 weeks ago
0
[FEA] Qualification tool should recommend the cluster shape based on the best TCO according to our internal benchmark
#1109
viadea
opened
2 weeks ago
3
Enable xgboost prediction model by default
#1108
amahussein
closed
2 weeks ago
0
[FEA] Enable xgboost prediction model by default
#1107
amahussein
closed
2 weeks ago
0
[FEA] Qualification tool should print Kryo related recommendations
#1106
viadea
opened
2 weeks ago
0
Next