Execution times - Githubissues

mithril-security / blindai

Confidential AI deployment with secure enclaves :lock:

Apache License 2.0

500 stars 35 forks source link

Description

In depth info about execution plans This is more of a meta-issue (/roadmap) focusing everything about execution times.

Plans I have in mind:

first audit of execution time, map possible improvement, low hanging fruits improvements, see if there are design flaws Will be using flamegraphs and run our current e2e tests with that
Work on benchmarks
CI integration to see regressions / improvements with each versions
probably an auto generated page with these execution times of well known models
Add execution info (or links to) on the readme for known models

I am not sure whether all of this is overkill or not since we're just using tract and not really touching the perf sensitive parts. We'll see.

None