lgarithm / crystalnet

crystalnet -- a mini core AI library (being refactored, see https://github.com/lgarithm/stdnn-ops)
MIT License

Split forward model and backward model #50

Open lgarithm opened 6 years ago

lgarithm commented 6 years ago

Currently, model_t (the physical model instantiated from s_model_t) preallocates memory for both the output tensor and the gradient tensor. The gradient tensor is not needed for inference, so we should refactor model_t to hold only one tensor and use two model_t instances during training.
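
A rough sketch of what the split could look like, to make the proposal concrete. This is not crystalnet's actual code: the fields of `s_model_t` and `model_t`, the `inference_t` and `trainer_t` wrappers, and `allocated_floats` are all hypothetical names made up for illustration, and tensors are reduced to flat `std::vector<float>` buffers. It also assumes the two training instances are one for outputs and one for gradients.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the symbolic model: shapes only, no storage.
struct s_model_t {
    std::vector<std::size_t> layer_sizes;  // element count of each layer's tensor
};

// Hypothetical stand-in for the physical model after the refactor:
// exactly one preallocated tensor per layer, nothing else.
struct model_t {
    std::vector<std::vector<float>> tensors;

    explicit model_t(const s_model_t &s)
    {
        for (std::size_t n : s.layer_sizes) {
            tensors.emplace_back(n, 0.0f);  // preallocate, zero-initialized
        }
    }

    std::size_t allocated_floats() const
    {
        std::size_t total = 0;
        for (const auto &t : tensors) { total += t.size(); }
        return total;
    }
};

// Inference uses a single instance, so no gradient memory is allocated.
struct inference_t {
    model_t forward;

    explicit inference_t(const s_model_t &s) : forward(s) {}

    std::size_t allocated_floats() const { return forward.allocated_floats(); }
};

// Training pairs two instances built from the same symbolic model:
// one holds the outputs, the other holds the gradients.
struct trainer_t {
    model_t forward;   // output tensors
    model_t backward;  // gradient tensors, same shapes as the outputs

    explicit trainer_t(const s_model_t &s) : forward(s), backward(s)
    {
        assert(forward.tensors.size() == backward.tensors.size());
    }

    std::size_t allocated_floats() const
    {
        return forward.allocated_floats() + backward.allocated_floats();
    }
};
```

With a symbolic model of layer sizes 4, 3 and 2, `inference_t` allocates 9 floats and `trainer_t` allocates 18, so the inference path pays for only half of the memory that training needs.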