Open iislucas opened 1 year ago
Transformer implementation: https://github.com/PAIR-code/tiny-transformers/blob/main/animated-transformer/src/lib/transformer/transformer_gtensor.ts
Idea: provide a boolean configuration variable to enable/disable dropout. e.g. like positional encodings: https://github.com/PAIR-code/tiny-transformers/blob/main/animated-transformer/src/lib/transformer/transformer_gtensor.ts#LL59C8-L59C27
TODOs:
Context:
fwiw here's the haiku implementation that Asma and I have been working off of.
We've mostly been turning it off — I think it leads to redundancy/backup circuits.
Transformer implementation: https://github.com/PAIR-code/tiny-transformers/blob/main/animated-transformer/src/lib/transformer/transformer_gtensor.ts
Idea: provide a boolean configuration variable to enable/disable dropout. e.g. like positional encodings: https://github.com/PAIR-code/tiny-transformers/blob/main/animated-transformer/src/lib/transformer/transformer_gtensor.ts#LL59C8-L59C27
TODOs:
Context: