mlp-architecture Search Results

1000+ results
for mlp-architecture


Xilinx/finn-base #65

Issue with inferring shapes in example model

If I create an onnx file with this sample script and [input.txt](https://github.com/Xilinx/finn-base/files/8809937/input.txt): ```python import torch import torch.nn as nn import torch.nn.functi…

jmitrevs updated 2 years ago
7
axolotl-ai-cloud/axolotl #1038

[BOUNTY] Optimized Triton Kernels for full fine tunes

### 🔖 Feature description We've seen marketing from Unsloth that optimized triton kernels for various operations can significantly improve both the speed and memory efficiency of fine-tuning LoRA a…

winglian updated 2 days ago
13
NVIDIA/TransformerEngine #407

Expected margin of error versus a typical pytorch implementa…

I am currently attempting to port a llama-like model architecture from pure pytorch to TransformerEngine's pytorch classes. However, I have been unable to obtain identical results in certain cases.…

152334H updated 1 year ago
2
CompVis/stable-diffusion #838

omegaconf.errors.ConfigAttributeError: Missing key data

I wrote the code in the terminal: `CUDA_VISIBLE_DEVICES=6 python main.py --base configs/stable-diffusion/v1-inference.yaml --gpus=1` but this script was printed: ``` Global seed set to 23 Runni…

parkjjoe updated 4 months ago
2
nupurkmr9/concept-ablation #10

A bug

When I'm using the train.py in compvis, this bug comes up, and I don't know how to solve it. Can anyone help me? Thanks! angogh_painting" --train_size 200 Global seed set to 23 Running on GPUs 0…

LemonC19 updated 4 months ago
1
zyushun/Adam-mini #25

Qwen2-0.5B cannot be Adam-mini-optimized in 4 shards (Deepsp…

Hi all, I found that Adam-mini 1.0.1 cannot run in 4 shards; it throws an exception related to Tensor reshaping: ``` File "/opt/conda/lib/python3.10/site-packages/adam_mini/adam_m…

xiningnlp updated 3 weeks ago
3
facebookresearch/detectron2 #4641

Loading checkpoint for ViTDet raises shape mismatch warning

## Instructions To Reproduce the Issue: I installed Detectron2 and attempted to train the ViTDet base model from the documentation provided here: https://github.com/facebookresearch/detectron2/tree…

layadas updated 1 month ago
2
NVIDIA/TensorRT-LLM #1676

quantize.py fails to export important data to config.json (e…

### System Info 4x NVIDIA H100, TensorRT-LLM backend 0.9.0 ### Who can help? @Tracin ### Information - [X] The official example scripts - [ ] My own modified scripts ### Tasks -…

janpetrov updated 2 months ago
23
laiviet/ed-gated-gcn #2

Question about the performance

Hi, thanks for sharing your code. In the paper, you implemented a baseline called "BERT+MLP", reaching a **76.2** F1 score. But when I use the same architecture, I cannot get the same result. Di…

alderpaw updated 2 years ago
1
NVIDIA/TensorRT-LLM #1247

v0.8.0 KeyError: 'builder_config' when benchmarking with n…

### System Info - CPU: 4090 * 4 - TensorRT-LLM: v0.8.0 - CUDA Version: 12.3 - NVIDIA-SMI 545.29.06 ### Who can help? _No response_ ### Information - [X] The official example scripts …

plt12138 updated 4 months ago
13
