rfeinman / pytorch-minimize

Newton and Quasi-Newton optimization with PyTorch
https://pytorch-minimize.readthedocs.io
MIT License

Using _vmap in PyTorch to compute the Hessian-vector product (hvp) encounters a runtime error #33

Open bfialkoff opened 7 months ago

bfialkoff commented 7 months ago

Trying to use the minimize function with some of the available methods fails with a runtime error, but it succeeds with the rest. Presumably, the methods that succeed aren't computing Hessians.
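
For reference, a minimal call that exercises the Hessian path might look like the sketch below; the Rosenbrock objective and the 'newton-exact' method are illustrative assumptions, not details from the original report.

import torch
from torchmin import minimize

def rosen(x):
    # classic Rosenbrock test objective (assumed for illustration)
    return torch.sum(100 * (x[1:] - x[:-1] ** 2) ** 2 + (1 - x[:-1]) ** 2)

x0 = torch.zeros(5)
# full-Hessian methods such as 'newton-exact' go through the _vmap hvp path
res = minimize(rosen, x0, method='newton-exact')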

I tracked the error to here.

I can't quite figure out what is really responsible for the error, but I suspect it's _vmap failing to batch properly, because my debugger indicates that something is wrong with the tensors that are yielded. If I look at the batched_inputs[0] variable in _vmap_internals._vmap and try to print it, view it, or add 1 to it, I get RuntimeError: Batching rule not implemented for aten::is_nonzero. We could not generate a fallback.
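
As a hedged illustration of that failure mode (a standalone sketch, not the exact pytorch-minimize call stack): the legacy _vmap can only generate fallbacks for ops that return tensors, so an op that reduces a batched tensor to a Python bool, such as aten::is_nonzero (invoked by bool(t) or t.is_nonzero(), and easily hit by debugger inspection), fails outright.

import torch
from torch._vmap_internals import _vmap  # private PyTorch 1.x utility

def check(t):
    # is_nonzero returns a Python bool, so no batching rule exists and no
    # per-example fallback can be generated -> the RuntimeError quoted above
    return t.is_nonzero()

_vmap(check)(torch.randn(3))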

Computing the Hessian in a loop works, but it is hideous and slow:

# workaround: one reverse-mode hvp (H @ v) per basis vector in self._I,
# stacked along dim 0 to assemble the full Hessian
hvp_map = lambda V: torch.stack(
    [autograd.grad(grad, x, v, retain_graph=True)[0] for v in V], dim=0)
hess = hvp_map(self._I)
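
On PyTorch >= 1.11, one hedged alternative is to let autograd batch the products itself via is_grads_batched; it still relies on vmap internally, so it may run into the same batching-rule limits, but it avoids the Python loop (same grad, x, and identity matrix self._I as above).

# treat the leading dim of self._I as a batch of grad_outputs, producing
# one hvp per basis vector in a single call
hess = autograd.grad(grad, x, grad_outputs=self._I,
                     is_grads_batched=True, retain_graph=True)[0]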

Is this a real issue, or am I missing something?

rfeinman commented 6 months ago

Now that PyTorch 2 is out, we should remove all uses of the _vmap utility and replace them with forward-mode automatic differentiation. Jacobian & Hessian computation will be much more efficient.
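
A minimal sketch of that replacement using the torch.func API from PyTorch 2 (the quartic objective and 5-dimensional input are placeholders):

import torch
from torch.func import grad, jvp, vmap

def f(x):
    return (x ** 4).sum()  # placeholder objective

def hvp(x, v):
    # forward-over-reverse: the jvp of the gradient gives H @ v
    return jvp(grad(f), (x,), (v,))[1]

x = torch.randn(5)
# vmap over the identity assembles the full Hessian in one batched pass
hess = vmap(hvp, in_dims=(None, 0))(x, torch.eye(5))

torch.func.hessian(f) composes jacfwd over jacrev and is a one-call equivalent of the same forward-over-reverse strategy.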

I have added your ticket to the PyTorch 2 milestone. Help appreciated!