Closed tada0347 closed 7 months ago
In sparsegpt.py, inside def add_batch (line 42), there is: inp = inp.t()
As far as I can tell, this makes the Hessian X^TX rather than XX^T when pruning nn.Linear or nn.Conv1d layers.
Why are the inputs transposed?
Is there something I am missing?
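
One way to check the question above is to track shapes by hand. The sketch below is dependency-free and is not the repository's code. It assumes the activations reach add_batch laid out as (n_samples, in_features), which is how nn.Linear receives its input, and that the Hessian is accumulated as inp.matmul(inp.t()) after the transpose. The helper names (transpose, matmul, shape) are made up for illustration. Under those assumptions, the transpose turns the input into X with one sample per column, so the product is XX^T of size (in_features, in_features), which in terms of the original row-major input is inp^T inp.

```python
def transpose(m):
    """Return the transpose of a list-of-rows matrix (stand-in for inp.t())."""
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    """Plain matrix product of two list-of-rows matrices."""
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]


def shape(m):
    """Return (rows, cols) of a list-of-rows matrix."""
    return (len(m), len(m[0]))


# Assumed layout: 3 samples (rows), 2 input features (columns).
inp = [[1.0, 2.0],
       [3.0, 4.0],
       [5.0, 6.0]]

# After the transpose, X has one sample per column: (in_features, n_samples).
X = transpose(inp)

# X X^T is (in_features, in_features): one row/column per weight column.
H = matmul(X, transpose(X))

# Without the transpose, the same product would be (n_samples, n_samples).
gram = matmul(inp, transpose(inp))

print(shape(X), shape(H), shape(gram))
```

If this reading is right, the two notations describe the same matrix: XX^T with samples as columns of X equals X^TX with samples as rows of X, and only the first layout gives a matrix sized by the layer's input dimension.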