PAIR-code / tiny-transformers

Apache License 2.0
14 stars 2 forks source link

Add dropout to Transformer implementation #1

Open iislucas opened 1 year ago

iislucas commented 1 year ago

Transformer implementation: https://github.com/PAIR-code/tiny-transformers/blob/main/animated-transformer/src/lib/transformer/transformer_gtensor.ts

Idea: provide a boolean configuration variable to enable/disable dropout. e.g. like positional encodings: https://github.com/PAIR-code/tiny-transformers/blob/main/animated-transformer/src/lib/transformer/transformer_gtensor.ts#LL59C8-L59C27

TODOs:

Context:

1wheel commented 1 year ago

fwiw here's the haiku implementation that Asma and I have been working off of.

We've mostly been turning it off — I think it leads to redundancy/backup circuits.