-
**Link to the notebook**
[Notebook](https://github.com/aws/amazon-sagemaker-examples/blob/main/sagemaker-triton/business_logic_scripting/stable_diffusion/sm-triton-bls-stablediff.ipynb)
**Describe…
-
### Discussed in https://github.com/PyTorchLightning/pytorch-lightning/discussions/8363
Originally posted by **MohammedAljahdali** July 10, 2021
Hi, I have a script that does the following log…
-
## 🐛 Bug
I am running into the following RuntimeError when pruning the parameters of a `torch.nn.MultiheadAttention` module:
```
RuntimeError: Trying to backward through the graph a second time (or dir…
-
### 🐛 Describe the bug
# Bug Description
According to the official [tutorial](https://pytorch.org/tutorials/intermediate/pruning_tutorial.html#iterative-pruning), pruning should work if applied fo…
-
### Is your feature request related to a problem? Please describe.
I would like to request the implementation of a compressed tiled matrix multiply operator for use in large language model inferenc…
-
Great framework!
I have some questions about EvoPress:
1. Is this a valid config to run EvoPress (pruning) properly?
```bash
#!/bin/bash
export CUDA_VISIBLE_DEVICES=0,1
export SSL_CERT_DIR='/…
-
For Feature-request:
* describe your feature in as much detail as possible
YOLObile, a framework for real-time object detection on mobile devices via compression-compilation co-design.
* provide li…
-
Hi @VainF,
Thank you for the nice work. It is a clean and very helpful pruning framework, especially for unstructured pruning.
However, I am having trouble pruning the ResNet20 model with the CIFAR-10…
-
When using TensorFlow/Keras, I compress my models as in this [guide](https://www.tensorflow.org/model_optimization/guide/pruning/pruning_with_keras#create_a_10x_smaller_model_from_combining_prunin…