Closed DimitriTimoz closed 2 weeks ago
Implement the following todo to be able to train the bert model with real dropout layers. https://github.com/huggingface/candle/blob/e2b6b367fa852ed30ac532f8d77cd8479c7ed092/candle-transformers/src/models/bert.rs#L126-L132
Implement the following todo to be able to train the bert model with real dropout layers. https://github.com/huggingface/candle/blob/e2b6b367fa852ed30ac532f8d77cd8479c7ed092/candle-transformers/src/models/bert.rs#L126-L132