-
Hi all,
Is it possible to do inference on the aforementioned machines as we are facing so many issues in Inf2 with Falcon model?
Context:
We are facing issues while using Falcon/Falcoder on t…
-
### Feature request
Under network isolation, SageMaker endpoint will not have access to the `aws-neuron/optimum-neuron-cache` to fetch cache.
Instead, we need pre-download the caches and model w…
-
https://github.com/aws-neuron/neuronx-distributed/blob/a80091de6c9d8eb75f96a7367e143a81d586fbbc/examples/inference/llama2/neuron_modeling_llama.py#L36
The llama inference example needs to be update…
-
直接运行DSQN项目时(未做任何修改)如题报错
虽然自己可以手动强制转化为tensor,但我不确定是否存在底层函数的bug
感谢作者查看,期待回复
**SpikingJelly version**
`0.0.0.0.14`
**Description**
spikingjelly/activation_based/neuron.py", line 830
Ru…
-
I am following the steps (https://github.com/aws-neuron/aws-neuron-samples/blob/master/torch-neuronx/transformers-neuronx/inference/meta-llama-2-13b-sampling.ipynb) to run a Llama2 quantized model (ht…
-
llava multimodel would be huge to be supported for aws neuron chips
https://huggingface.co/llava-hf/llava-v1.6-mistral-7b-hf
This in particular is trending
I'm not sure if this is the correct…
-
Hi.
I have used your code, for the past year, without issues.
Now I'm trying to make a small scale network using PyNeuron-Toolbox to load the morphology of a model. I can correctly simulate, sti…
-
Source: https://forum.nengo.ai/t/recurrent-bcm-connection-problems/817/
Minimal reproducer:
```python
import nengo
import numpy as np
with nengo.Network() as model:
mem = nengo.Ensemble(…
-
```
from transformers import AutoTokenizer
from optimum.neuron import NeuronModelForCausalLM
```
results in
```
RuntimeError: Failed to import optimum.neuron.modeling because of the fol…
-
Hi,
I was interested in using fit_neuron's evaluate functions to calculate some spike metrics. When I installed it using pip it did not include the models folder, which makes import fit_neuron fail. …