mithril-security / blindai

Confidential AI deployment with secure enclaves :lock:
https://www.mithrilsecurity.io/
Apache License 2.0
500 stars 35 forks source link

Execution times #52

Closed cchudant closed 2 years ago

cchudant commented 2 years ago

Description

In depth info about execution plans This is more of a meta-issue (/roadmap) focusing everything about execution times.

Plans I have in mind:

I am not sure whether all of this is overkill or not since we're just using tract and not really touching the perf sensitive parts. We'll see.

Additional Information

None

Checklist

JoFrost commented 2 years ago

"I am not sure whether all of this is overkill or not since we're just using tract and not really touching the perf sensitive parts. We'll see."

We are not only using Tract actually, and because we are executing the code inside an enclave, the performance sensitive part is affected. We cannot rely on the same way than the original library to load a model in memory, because of the enclave context we have to load the model entierly in one-go, and the memory management inside an enclave is different than on a normal application. This PR can be an opportunity to also test the performance with hardware optimizations we can use inside an enclave.