bclarkson-code / Tricycle

Autograd to GPT-2 completely from scratch
107 stars 9 forks source link

Build layer objects #3

Closed bclarkson-code closed 9 months ago

bclarkson-code commented 10 months ago

Some wrappers should be made around tensors and their multiplication to abstract them as layers instead of separate parameters and operations