Please refer to the FAQ in doc and search for the related issues before you ask the question.
Describe the question
In the `BaseModel` class, why is the L2-norm penalty on the model parameters computed and scaled manually? Why not let the optimizer handle this? Is it set up this way on purpose?
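To make the question concrete, here is a minimal sketch of the two options being compared. It is plain Python on a single weight, not the library's code, and every name in it (`step_manual_l2`, `step_weight_decay`, `l2`, `weight_decay`) is hypothetical. It assumes plain SGD and a penalty of the form `l2 * w**2`.

```python
def grad_data_loss(w, x, y):
    # Gradient of the data loss 0.5 * (w * x - y) ** 2 with respect to w.
    return (w * x - y) * x


def step_manual_l2(w, x, y, lr, l2):
    # Manual approach: the penalty l2 * w ** 2 is added to the loss,
    # so its gradient 2 * l2 * w is added to the data gradient.
    grad = grad_data_loss(w, x, y) + 2.0 * l2 * w
    return w - lr * grad


def step_weight_decay(w, x, y, lr, weight_decay):
    # Optimizer-side approach: the update rule itself adds weight_decay * w
    # to the gradient; the loss contains no penalty term.
    grad = grad_data_loss(w, x, y) + weight_decay * w
    return w - lr * grad


w0, x, y, lr, l2 = 1.5, 2.0, 1.0, 0.1, 0.01
manual = step_manual_l2(w0, x, y, lr, l2)
decayed = step_weight_decay(w0, x, y, lr, weight_decay=2.0 * l2)
```

Under these assumptions the two steps produce the same updated weight when `weight_decay` equals `2 * l2`, which is why the manual handling looks redundant at first glance.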
Operating environment: