hikettei / Caten

[wip] Deep Learning Compiler based on Polyhedral Compiler and Light-weight IRs based on Optimizing Pattern Matcher
https://hikettei.github.io/Caten/

O(n) and Fast JIT Compiler Workload (Transformer > 70 layers) #231

Open hikettei opened 1 week ago

hikettei commented 1 week ago

Currently (with JIT=1, including gcc time)

What we should optimize is not the algorithm but N, the number of input nodes, by caching the aasm per module.
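A rough sketch of the idea, written in Python for illustration only and not taken from Caten's code: if every layer with the same kind, attributes, and input shapes shares one lowered result, a 70-layer Transformer lowers its block once instead of 70 times. `ModuleKey`, `LoweringCache`, and `fake_lower` are hypothetical names, and the string returned by `fake_lower` only stands in for the real aasm.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, List, Tuple


@dataclass(frozen=True)
class ModuleKey:
    """Hypothetical cache key: modules that agree on all fields share one lowering."""
    kind: str
    attrs: Tuple[Tuple[str, Hashable], ...]
    input_shapes: Tuple[Tuple[int, ...], ...]


class LoweringCache:
    """Memoizes the (assumed expensive) lowering step per distinct module key."""

    def __init__(self, lower: Callable[[ModuleKey], str]) -> None:
        self._lower = lower
        self._cache: Dict[ModuleKey, str] = {}
        self.misses = 0  # how many times the real lowering actually ran

    def get(self, key: ModuleKey) -> str:
        if key not in self._cache:
            self.misses += 1
            self._cache[key] = self._lower(key)
        return self._cache[key]


def fake_lower(key: ModuleKey) -> str:
    # Stand-in for the real lowering; returns a placeholder string.
    return f"lowered<{key.kind}>"


def compile_model(layers: List[ModuleKey], cache: LoweringCache) -> List[str]:
    # Each layer asks the cache; identical layers reuse the same result.
    return [cache.get(key) for key in layers]


block = ModuleKey("TransformerBlock", (("n_heads", 8),), ((1, 128, 512),))
layers = [block] * 70
cache = LoweringCache(fake_lower)
lowered = compile_model(layers, cache)
```

With this arrangement the work per distinct module is paid once, so the number of nodes reaching the later compiler stages depends on the number of distinct modules rather than on the layer count.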

image