-
`#find query FFN neurons activating attn neurons
curfile_ffn_score_dict = {}
for l_h_n_p, increase_score in cur_file_attn_neuron_list_sort[:30]:
attn_layer, attn_head, attn_neuron, attn_pos = l…
-
### Summary
Currently, the [linear model](https://docs.deepmodeling.com/projects/deepmd/en/master/model/linear.html) is implemented to combine multiple frozen models. However, this architecture can b…
-
Fails for both `Mistral-7B-Instruct-v0.2` and `intfloat/e5-mistral-7b-instruct`.
Only fails with `tp_degree=1`, good for `2
-
**server:** inf2.8xlarge
**vllm version**: 0.6.3.post2.dev77+g2394962d.neuron215
_Description_
Hello! I am trying to run the code below (the code was taken from [here](https://docs.vllm.ai/en/v0.4.1/…
-
### System Info
```shell
accelerate 1.1.1
neuronx-cc 2.14.227.0+2d4f85be
neuronx-distributed 0.8.0
neuronx-distributed-training 1.0.0
optimum …
-
Hi,
I am trying to fine-tune `llama3.2 1B` on AWS Trn1 and am running into the following error.
Error in eager mode (without jax.jit):
```
2024-11-21 04:44:13.000699: 3926 ERROR ||NEU…
-
Hi, I tried compiling the UNet (`torch.float16`) that is part of `StableDiffusionXLPipeline` on Inferentia2.8xlarge, and it failed.
When the latent size of unet is (64, 64), it did not fai…
-
Hi!
I am trying to use transformers-neuronx to compile a customized Hugging Face llama-3.1-8b model.
I use the model with beam search, and I know that this creates a dynamic graph during generation.
Bu…
-
Hi! I'm interested in your project [Nissl-neuron-Fiji-Measurement-Macro](https://github.com/aidan-mcG/Nissl-neuron-Fiji-Measurement-Macro).
I'm asking whether this model can measure the stain which is f…
-
Taking the calculated decoders, implement them in Nengo as transformations between the spike data and the output -- simulate this model on a small subset of the data. This implementation should allow …