issues
search
NVIDIA
/
spark-rapids-tools
User tools for Spark RAPIDS
Apache License 2.0
44
stars
34
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Disable UI-HTML report by default in Qualification tool
#1168
amahussein
opened
1 hour ago
0
Fix parsing App IDs inside metrics directory in QualX
#1167
parthosa
closed
27 minutes ago
0
[FEA] Disable UI-HTML report by default in Qualification CLI
#1166
amahussein
opened
1 day ago
0
[BUG] Qualification tool may show negative numbers in GPU estimates
#1165
amahussein
opened
2 days ago
0
[BUG] Profiling Tool does not contain status info for failed event log
#1164
cindyyuanjiang
opened
2 days ago
1
[BUG] QualX fails to detect profiling metrics for non-standard App IDs
#1163
parthosa
closed
27 minutes ago
0
[audit 4.0] [SPARK-48479] Investigate new UDF support - make sure tools can recognize
#1162
tgravescs
opened
2 days ago
0
[Bug] Fix java Qual tool handling of `--platform` argument
#1161
cindyyuanjiang
opened
2 days ago
3
[FEA] Qual tool tuning rec based on CPU event log coherently recommend tunings and node setup and infer cluster from eventlog
#1160
tgravescs
opened
2 days ago
0
[BUG] Qualification tool cluster recommendation on Dataproc defaults numGpus recommended is 2
#1159
tgravescs
closed
2 days ago
1
[BUG] Update the scala style and code formatter
#1158
amahussein
opened
3 days ago
0
[FEA] Qualification tool: Add operators stats output csv file
#1157
nartal1
opened
3 days ago
0
[BUG] unsupportedoperators.csv shows stageID=-1 for certain unsupported operator
#1156
viadea
opened
4 days ago
1
[FEA] Include commit id in build properties
#1155
wjxiz1992
opened
4 days ago
0
[FEA] Revisit/Update eventlogs of scala unit tests
#1154
nartal1
opened
4 days ago
1
[FEA] Set user-tools environment to drop python-3.8
#1153
amahussein
opened
1 week ago
0
[FEA] Auto tuning/node recommendation missing features
#1152
tgravescs
opened
1 week ago
1
Add all stage metrics to tools output
#1151
nartal1
closed
2 days ago
5
Unsupported op logic should read action column from qual's output
#1150
amahussein
closed
1 week ago
0
[BUG] Revisit QualX code to include it in Tox
#1149
amahussein
closed
3 days ago
1
Add plugin mechanism for dataset-specific preprocessing in qualx
#1148
leewyang
closed
6 days ago
2
[BUG] user tools Qualification tunings and node recommendation when --cluster specified can be wrong
#1147
tgravescs
opened
1 week ago
1
Follow-up 1142: remove TODO line
#1146
amahussein
closed
1 week ago
0
Disable pylint-unreachable code in tox.ini
#1145
amahussein
closed
1 week ago
0
[DOC] Documentation and user tools --help should clarify what type of cluster is used with --cluster option
#1144
tgravescs
opened
1 week ago
0
[FEA] user tools qualification node recommendation should include the number of GPUs to use
#1143
tgravescs
opened
1 week ago
0
Mark wholestageCodeGen as shouldRemove when child nodes are removed
#1142
amahussein
closed
1 week ago
1
Refactor Databricks-AWS Qual tool to cache and process pricing info from DB website
#1141
cindyyuanjiang
closed
6 days ago
2
Update qualx readme for training
#1140
leewyang
closed
1 week ago
0
Refactor DB AWS qual tool to cache and process pricing info from DB website
#1139
cindyyuanjiang
closed
6 days ago
0
Recommended cluster should use executors_per_node and cores_per_executor
#1138
amahussein
opened
1 week ago
5
Add internal CLI to generate instance descriptions for CSPs
#1137
cindyyuanjiang
opened
1 week ago
7
[FEA] Revisit string formats in tools
#1136
amahussein
opened
1 week ago
0
[FEA] Display full failure messages in failed CSV files
#1135
amahussein
closed
1 week ago
0
[FEA] Support custom XGBoost model file via user tools CLI
#1134
mattahrens
opened
1 week ago
0
[DOC] Fix User-tools README with absolute links and valid links
#1133
amahussein
opened
1 week ago
0
[BUG] Error mesasges in failed_job.csv and failed_stages.csv are not fully displayed
#1132
wjxiz1992
closed
1 week ago
2
[BUG] unsupported operator handling logic should use action column and not override table
#1131
eordentlich
closed
1 week ago
0
Fix Python runtime error caused by numpy 2.0.0 release
#1130
amahussein
closed
2 weeks ago
0
Bump urllib3 from 1.26.18 to 1.26.19 in /data_validation
#1129
dependabot[bot]
opened
2 weeks ago
0
Fix Python runtime error caused by numpy 2.0.0 release
#1128
amahussein
closed
2 weeks ago
2
[BUG] Python runtime failure due to incompatibe numpy
#1127
amahussein
closed
2 weeks ago
1
[BUG] python user tools should always display processed apps - even if passed GPU event logs
#1126
tgravescs
opened
2 weeks ago
0
[FEA] Be able to recommend specific GPU SKU according to SQL nature
#1125
wjxiz1992
opened
2 weeks ago
2
Handle different exception thrown by incomplete eventlogs
#1124
amahussein
closed
2 weeks ago
0
Add an internal CLI to generate instance type descriptions for CSPs
#1123
cindyyuanjiang
opened
2 weeks ago
0
[BUG] Handle different exception thrown by incomplete eventlogs
#1122
amahussein
closed
2 weeks ago
0
Qual tool cluster info - calculating the number of exec nodes on yarn should look at all executors over time when dynamic allocation on
#1121
tgravescs
opened
3 weeks ago
0
[FEA] Add Benchmarking to evaluate the core tools performance
#1120
amahussein
opened
3 weeks ago
0
Include number of executors per node in cluster information
#1119
parthosa
closed
2 weeks ago
2
Next