mlp-architecture Search Results

1000+ results
for mlp-architecture

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

NVIDIA/TensorRT-LLM #1906

Ada `FP8xint4` performance issue

Since Ada GPUs like 4090 limit the FP8 arithmetic into `fp32` accumulation, it only achieve the same max `TFLOPs` compared to `fp16xfp16` with `fp16` accumulation. Further more, according to my test,…

jcao-ai updated 3 months ago
6
openai/multi-agent-emergence-environments #4

Missing architectures jsonnet files

ma_policy/graph_construct.py specifies that file mas/ppo/base-architectures.jsonnet contains example architectures, to the best of my ability I can't find that file in the repository.

jeremiahvpratt updated 3 years ago
2
pytorch/pytorch #97749

Bug on Minified repro example

### 🐛 Describe the bug I created a minified repro to examine the cause of the runtime error (as the compiler seems to have no error report). The card used to generate the repro is cuda:7. Th…

allanchan339 updated 1 year ago
1
Lareina2441/LLaVA-Med #1

作者的自言自语。。。

UserWarning: Failed to initialize NumPy: _ARRAY_API not found (Triggered internally at ../torch/csrc/utils/tensor_numpy.cpp:84.) device: torch.device = torch.device("cpu"), Models: ['llavamed']

Lareina2441 updated 1 month ago
35
coqui-ai/TTS #3612

[Bug] Error building extension 'transformer_inference' when …

### Describe the bug I am using manual streaming mode in colab, and it shows the error ``` CalledProcessError Traceback (most recent call last) [/usr/local/lib/python3.10/…

weijia-yu updated 3 weeks ago
4
aws-neuron/transformers-neuronx #28

Discrepancies Between GPU and Neuron-based Outputs for GPTJ …

I attempted to use [this model](https://huggingface.co/PygmalionAI/pygmalion-6b) through inf2.24xlarge. This model is based on the GPTJ architecture, but when I run this model based on Neuron, the res…

ho4040 updated 1 year ago
2
5g4s/paper #42

LORA: LOW-RANK ADAPTATION OF LARGE LANGUAGE MODELS

https://arxiv.org/abs/2106.09685

5g4s updated 1 year ago
6
vllm-project/llm-compressor #870

How to load compressed model with vllm?

I utilized LLMCompressor to quantize our model using the FP8-dynamic recipe. The quantized model was successfully tested using the SparseAutoModelForCausalLM method. ![image](https://github.com/use…

IEI-mjx updated 1 week ago
9
haotian-liu/LLaVA #551

[Question] Questions about pretrained LLM

### Question Hi Haotian, Your job is great, well done. I have a some issues that after I use my pruned vicuna LLM as the base model, I was succeed in the phase 1--pretraining. ![8423f0bfebba…

HanyangZhong updated 1 year ago
2
matomatical/jaxgmg #17

37 Implementation details of PPO

Our baselines use a PPO algorithm that is adapted from PureJaxRL. But it doesn't appear to stick to all of the relevant implementation details from [Huang et al., 2022](https://iclr-blog-track.github.…

matomatical updated 3 months ago
3

上一页 1...31 32 33 34 35 36 37...100 下一页

1000+ results for mlp-architecture

1000+ results
for mlp-architecture