Performance models for autotuning

triton-lang / triton

Development repository for the Triton language and compiler

MIT License

"‘perf_model’: performance model used to predicate running time with different configs, returns running time."

I saw in the code that there is a matrix multiplication performance estimator. Are there any others? For example, it is increasingly common in Neural Architecture Search papers to use a function approximator (e.g. a random forest or a neural network) as a performance estimator. Training multiple independent instances of such an estimator also gives an uncertainty estimate, since the spread of their predictions shows how much they disagree.

For a particular GPU, one could imagine maintaining a library of such estimators that is updated periodically (or online) as more configurations are tried out.
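To make the idea concrete, here is a minimal sketch of such an estimator in plain Python. Nothing in it is Triton API: `EnsemblePerfModel`, `observe`, `predict` and `toy_runtime_ms` are hypothetical names, the config keys `BLOCK_M` and `num_warps` are only illustrative, and a small ensemble of k-nearest-neighbour regressors stands in for a random forest or neural network. The runtimes come from a synthetic formula, not from real measurements.

```python
import math
import random
import statistics


class EnsemblePerfModel:
    """Hypothetical learned performance estimator (not part of Triton).

    Each ensemble member is a k-nearest-neighbour regressor over its own
    random subsample of the observed (config, runtime) pairs. The spread of
    the members' predictions is used as a rough uncertainty estimate.
    """

    def __init__(self, n_members=8, k=2, keep_prob=0.7, seed=0):
        self.k = k
        self.keep_prob = keep_prob
        self.rng = random.Random(seed)
        self.members = [[] for _ in range(n_members)]  # one training set per member

    @staticmethod
    def _features(config):
        # Log2 features, assuming every config value is a positive power-of-two-like size.
        return tuple(math.log2(config[name]) for name in sorted(config))

    def observe(self, config, runtime_ms):
        """Online update: record one measured runtime for one config."""
        x = self._features(config)
        for member in self.members:
            # Every member keeps its first sample; later samples are kept at random,
            # so the members end up trained on different subsets.
            if not member or self.rng.random() < self.keep_prob:
                member.append((x, runtime_ms))

    def predict(self, config):
        """Return (mean predicted runtime, spread across ensemble members)."""
        if not self.members[0]:
            raise ValueError("no observations yet")
        x = self._features(config)
        estimates = []
        for member in self.members:
            nearest = sorted(member, key=lambda obs: math.dist(x, obs[0]))[: self.k]
            estimates.append(statistics.fmean([rt for _, rt in nearest]))
        return statistics.fmean(estimates), statistics.pstdev(estimates)


def toy_runtime_ms(config):
    # Synthetic stand-in for a real benchmark; these are not measured numbers.
    return (1.0
            + abs(math.log2(config["BLOCK_M"]) - 6)
            + 0.5 * abs(math.log2(config["num_warps"]) - 2))


model = EnsemblePerfModel()
tried = [{"BLOCK_M": m, "num_warps": w}
         for m in (16, 32, 64, 128, 256) for w in (1, 2, 4, 8)]
for cfg in tried:
    model.observe(cfg, toy_runtime_ms(cfg))

mean_ms, spread_ms = model.predict({"BLOCK_M": 64, "num_warps": 4})
print(f"predicted {mean_ms:.2f} ms, ensemble spread {spread_ms:.2f} ms")
```

The mean returned by `predict` is the kind of value a `perf_model` callable would return, while the spread could be used to decide which untried configs are worth benchmarking next. How such a model would be wired into the autotuner is left open here, since that depends on how the autotuner calls `perf_model`.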

Performance models for autotuning #1659