bclarkson-code / Tricycle

Autograd to GPT-2 completely from scratch
104 stars 7 forks

Speed + memory optimisations #47

Closed bclarkson-code closed 3 months ago

bclarkson-code commented 5 months ago

Currently, Tricycle is slow and its memory usage is pretty high. It should be properly profiled and optimised.
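As a rough starting point for the profiling step, something like the sketch below could measure where time goes and how much memory a single step peaks at, using only the standard library (`cProfile`, `pstats`, `tracemalloc`). `train_step` and `profile_step` are hypothetical names used for illustration, not part of Tricycle; the stand-in workload would need to be swapped for a real forward/backward pass. Note that `tracemalloc` only sees allocations made through Python's allocator, so it would not capture GPU memory.

```python
import cProfile
import io
import pstats
import tracemalloc


def train_step(n: int = 200) -> float:
    """Hypothetical stand-in for one training step (not Tricycle's API)."""
    data = [[float(i * j) for j in range(n)] for i in range(n)]
    return sum(sum(row) for row in data)


def profile_step(step=train_step, top: int = 5) -> tuple[str, int]:
    """Run `step` once; return a cProfile report and peak traced memory in bytes."""
    tracemalloc.start()
    profiler = cProfile.Profile()
    profiler.enable()
    step()
    profiler.disable()
    # get_traced_memory() returns (current, peak) in bytes
    _, peak = tracemalloc.get_traced_memory()
    tracemalloc.stop()

    # Render the `top` most expensive calls by cumulative time into a string
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    stats.sort_stats("cumulative").print_stats(top)
    return buffer.getvalue(), peak


if __name__ == "__main__":
    report, peak_bytes = profile_step()
    print(report)
    print(f"peak traced memory: {peak_bytes / 1e6:.2f} MB")
```

Sorting by cumulative time tends to surface the high-level operations worth optimising first. Timings taken with `tracemalloc` running are inflated, so it may be better to measure speed and memory in separate runs.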