bclarkson-code / Tricycle

Autograd to GPT-2 completely from scratch
104 stars 7 forks source link

36 add embedding layer #44

Closed bclarkson-code closed 5 months ago

bclarkson-code commented 5 months ago

Added an embedding layer that is much more efficient that the original implementation