Open danieldk opened 1 year ago
In many applications we only need the output of the last layer, and letting go of references to the intermediate layer outputs can save some memory during inference.
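To make the idea concrete, here is a minimal sketch in plain Python. It is not the library's actual API: `run_layers` and the `keep_all_layers` flag are hypothetical names used only for illustration, and the "layers" are ordinary callables standing in for model layers.

```python
from typing import Callable, List, Sequence

# Hypothetical stand-in for a model layer: maps a hidden state to a new one.
Layer = Callable[[List[float]], List[float]]


def run_layers(
    layers: Sequence[Layer],
    inputs: List[float],
    keep_all_layers: bool = False,
) -> List[List[float]]:
    """Apply the layers in order and return a list of layer outputs.

    With keep_all_layers=False only the final output is returned, so no
    reference to an intermediate output outlives the loop iteration
    that consumed it.
    """
    outputs: List[List[float]] = []
    hidden = inputs
    for layer in layers:
        # Rebinding `hidden` drops our reference to the previous output,
        # which can then be freed unless it was stored in `outputs`.
        hidden = layer(hidden)
        if keep_all_layers:
            outputs.append(hidden)
    if not keep_all_layers:
        outputs.append(hidden)
    return outputs


layers = [
    lambda xs: [x + 1.0 for x in xs],
    lambda xs: [x * 2.0 for x in xs],
]
last_only = run_layers(layers, [1.0, 2.0])
all_layers = run_layers(layers, [1.0, 2.0], keep_all_layers=True)
```

In this sketch the saving comes purely from not holding references. With real tensors the effect would depend on the framework actually releasing the memory, for example when no autograd graph keeps the intermediates alive during inference.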