-
Loading Model: {'checkpoint_info': {'filename': 'D:\\neuron\\webui_forge_cu121_torch21\\webui\\models\\Stable-diffusion\\enjoyXLSuperRealistic_v30ModifiedVersion.safetensors', 'hash': '16bb3dda'}, 'ad…
-
### System Info
```shell
I'm compiling a fine-tuned Llama 3.1 70B model with the below system info on an inf2.48xlarge machine. I'm using neuronX TGI 0.0.25 with AWS Sagemaker. I get the below err…
-
### System Info
```shell
using Huggingface AMI from AWS marketplace with Ubuntu 22.04
optimum-neuron 0.0.25
transformers 4.45.2
peft 0.13.0
trl 0.11.4
accelerate 0.29.2
torch 2.1.2
```
…
-
### 🚀 The feature, motivation and pitch
We know that language models exhibit activation sparsity during inference, meaning that they can perform lossless inference even when only a portion of the act…
-
The refractory time after a spike at time $t^*$ is defined for `iaf_psc_delta` and many other neurons via
$$V_m(t) = V_{\text{reset}} \quad \text{for} \quad t^* \leq t < t^* + t_{\text{refract}}$$
…
-
I'm trying to quantize a Mistral (7B) model to run with `aws-neuron` on an `Inf2` instance
It seems like the [`int8 weight storage`](https://awsdocs-neuron.readthedocs-hosted.com/en/latest/librarie…
jpaye updated
2 weeks ago
-
Hi,
Could you please help me with the followings related to the code?
1. you have mentioned 2 files ( train_SpikeYOLO.py and test_SpikeYOLO.py) to train and test respectively. But those two fil…
-
The same bug that was present in AWS Neuron SDK 2.19 and fixed in 2.19.1 (#91) is back in AWS Neuron SDK 2.20.
With AWS Neuron SDK 2.19, when exporting a model and saving the compiled artifacts, it…
-
Hi, I find that neuron compiler will crash when compiling my customized model running on a Neuron device. Could you help check this issue? **Thanks so much!**
#### Below is my model sample and the…
-
I’m encountering issues when trying to convert my YOLOv8x model from torchscript to torch_neuron on Kaggle. Here are the details:
1. YOLOv8x Model (Single Class):
- Trained model file: '.pt'
…