-
### System Info
- tensorrt-llm:0.9.0.dev2024022700
- GPU:L40S
- tensorrtl-llm docker
- driver:535.129.03
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ …
-
When having non commutative coefficients in a TensMul object, the result may get corrupted by TensMul not checking the is_commutative assumption. Looking through the source code it looks like this cou…
-
As brought up in reddit thread, a OpenCL device would be useful to support folks with AMD gpus.
Here are roughly the tasks that need to happen:
1. [ ] Add `opencl` feature flag
2. [ ] Create an `…
-
Maybe this comes from generating the bindings automatically, but is there a use case for expecting `&[i64]` in `Tensor::reshape` and similar methods? Maybe it would make sense to change this to `&[usi…
grtlr updated
5 years ago
-
**Related issues:**
https://github.com/jacobgil/pytorch-grad-cam/issues/254
https://github.com/jacobgil/pytorch-grad-cam/issues/394 (maybe)
Model is `resnest14d` from `timm` and I am regress…
-
Why does this repository appear empty?
I have some questions regarding the paper titled "Optimizing Deep Learning Inference via Global Analysis and Tensor Expressions," and I was hoping to clarify …
-
I tried to integrate mii into tritonserver, but encountered some problems
Below is part of my code
```
class TritonPythonModel:
def initialize(self, args):
import mii
from …
-
In addition to SGD and EM, we should support spectral learning methods. See for example,
> Anandkumar, Animashree, et al. "Tensor decompositions for learning latent variable models." Journal of machi…
-
Current implementation of symmetries has limited handling of symmetries that exclude for instance the symmetries of the Riemann tensor.
One would naturally expect:
* To be able to (anti)symmetrie…
-
The Ollama model hub still has the default quant type of Q4_0 which is a legacy format that under-performs compared to K-quants (Qn_K, e.g. Q4_K_M, Q6_K, Q5_K_L etc...).
- Would it perhaps make sen…