NVIDIA / TensorRT-Incubator

Improve behavior of evaluation during compile #409

Open · opened by pranavm-nvidia 21 hours ago

pranavm-nvidia commented 21 hours ago

When a tensor is evaluated during compile, we currently raise an error or print a warning. However, we could make Tripy functionally correct even in the case of evaluation by simply not updating the frontend tensor's op to Storage. This would preserve the computation graph and make compilation work correctly.

This might be as simple as adding one condition to Tensor.eval():

# Bake the evaluated data back into the frontend tensor only when we are
# not tracing for compilation; otherwise leave the original op intact so
# the computation graph is preserved.
if not self.trace_tensor.is_compile_tracer:
    Storage.build_internal([], [self.trace_tensor], data)
    ...

We will still need the warnings, though, since there may be cases where the evaluated result is erroneously used later in the graph, e.g.:

batch = int(x.shape[0]) # Eval happens here: `batch` is frozen to a Python int
tp.ones((batch, ...)) # Dynamic shapes broken here: `batch` no longer tracks the runtime shape of `x`

We could also suppress warnings in some cases, e.g. if the tensor is only being printed. A simple way to achieve that would be to add a suppress_warnings parameter to eval and set it to True when calling it from __repr__. Since __repr__ reaches eval through tolist, we will need to pass the parameter through; that likely means adding a private _tolist_helper, since the public tolist method should not expose this option.
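A minimal sketch of how that plumbing might look (the warning text and the _run_trace helper are placeholders, not the actual Tripy internals):

import warnings

class Tensor:
    def eval(self, suppress_warnings: bool = False):
        # Warn about mid-compile evaluation unless the caller has signalled
        # that the value cannot leak back into the graph.
        if self.trace_tensor.is_compile_tracer and not suppress_warnings:
            warnings.warn("Tensor was evaluated while compiling.")
        data = self._run_trace()  # placeholder for executing the trace
        if not self.trace_tensor.is_compile_tracer:
            Storage.build_internal([], [self.trace_tensor], data)
        return data

    def _tolist_helper(self, suppress_warnings: bool = False):
        return self.eval(suppress_warnings=suppress_warnings)

    def tolist(self):
        # Public API: the suppression knob is deliberately not exposed.
        return self._tolist_helper()

    def __repr__(self):
        # Printing cannot feed the value back into the graph, so skipping
        # the warning here is safe.
        return str(self._tolist_helper(suppress_warnings=True))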

We can, however, drop the errors that are thrown in compile, which means we can also drop the eval_stack_info field of TraceTensor.

Finally, to test it, we should verify the following are true when evaluating while compiling (a rough test sketch follows the list):

  1. We never raise an error
  2. The frontend tensor op is never updated in-place (i.e. it should not be turned into a Storage tensor)
  3. We do not emit warnings when the evaluation is triggered by __repr__ (i.e. we can safely assume the output is unused later in the graph)
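A rough pytest-style sketch of those checks, asserting the proposed behavior rather than what Tripy currently does (the tp.compile/tp.InputInfo usage follows the public API; the warning and constant-folding checks are behavioral proxies):

import warnings
import tripy as tp

def test_eval_during_compile():
    def func(x):
        y = x + 1.0
        print(y)  # mid-trace evaluation via __repr__
        return y * 2.0

    # (1) compilation must not raise, and (3) the print must not warn:
    # escalating warnings to errors makes either failure abort the test.
    with warnings.catch_warnings():
        warnings.simplefilter("error")
        compiled = tp.compile(func, args=[tp.InputInfo((2, 2), dtype=tp.float32)])

    # (2) behavioral proxy for "the op was not replaced by Storage": the
    # compiled function must still depend on its input rather than on a
    # constant baked in when y was printed.
    a = compiled(tp.ones((2, 2), dtype=tp.float32))
    b = compiled(tp.zeros((2, 2), dtype=tp.float32))
    assert a.tolist() != b.tolist()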
pranavm-nvidia commented 18 hours ago

We may actually still want warnings in all cases, since the extra evaluations will trigger compilation, which could make tracing extremely slow. One way to mitigate this could be to cache evaluated values so that only other mid-trace evaluations consume them, while the compiler keeps tracing the corresponding ops as if they had never been evaluated (a sketch of this bookkeeping follows the example below).

For example, if we have a graph like:

A -> B -> C -> D

and we print B and C, we should only need to compile and run the A->B subgraph once (when printing B), and then compile just the B->C step (when printing C, reusing B's cached value). However, in the final compiled executable, we still want the full A->B->C->D graph, not just the C->D remainder.
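One possible shape for that cache (trace_until_cached and compile_and_run are hypothetical helpers, not existing Tripy functions):

# Hypothetical cache from trace tensor id -> evaluated value. Evaluating B
# compiles and runs A->B once; evaluating C then treats B's cached value as
# a constant input and compiles only the B->C step. The trace ops are never
# rewritten, so compiling the full function still sees A->B->C->D.
_eval_cache = {}

def eval_for_inspection(tensor):
    key = id(tensor.trace_tensor)
    if key not in _eval_cache:
        # Walk producers backwards, stopping at tensors already in the
        # cache; those become constant inputs to this small subgraph.
        subgraph = trace_until_cached(tensor, _eval_cache)
        _eval_cache[key] = compile_and_run(subgraph, _eval_cache)
    # The frontend tensor's op is left untouched; the cached value only
    # serves other inspection-time evaluations.
    return _eval_cache[key]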