mlp-architecture Search Results

1000+ results for mlp-architecture

openai/baselines #1119

Where to find the default neural network architecture for ml…

wangyixu14 updated 4 years ago
1
caopuzheng/261-team-6-1-repo #18

Model 5 - MLP Neural Networks 5 YR - Build

caopuzheng updated 6 months ago
1
arcee-ai/mergekit #361

There are still some problems with moe merge qwen with other…

Here is a piece of code in the file mergekit/mergekit/moe/qwen.py: `for model_ref in ( [config.base_model] + [e.source_model for e in config.experts] + [e…

aoyinke updated 1 month ago
3
NVlabs/tiny-cuda-nn #392

Fully fused MLPs do not support GPU architectures of 70 or l…

Hi, I use an RTX 3090 and got this warning, which is not supposed to appear. When I use tiny-cuda-nn on another project, I get the warning "tinycudann was built for lower compute capability ({cc}) than the sy…

Orange-Ctrl updated 8 months ago
1
LLaVA-VL/LLaVA-NeXT #115

LLaVA-NeXT demo code froze while running

Trying to deploy and run demo on a 4 A6000 cluster but it seemed that the runtime froze without any exceptions... Could there be any possible problems? Sorry for asking a naive question and thanks for…

FrankFcc updated 1 month ago
3
NVIDIA/TensorRT-LLM #1946

How to use Medusa to support non llama models?

### System Info Hardware: L20 Version: 0.11.0.dev20240625 Model: Bloom7b1 ### Who can help? @ncomly-nvidia @byshiue I have obtained the Medusa head for Bloom according to the official M…

skyCreateXian updated 1 month ago
8
turboderp/exllamav2 #545

I am trying to add Qwen2Moe support

qiyuxinlin updated 2 months ago
2
facebookresearch/nocturne #9

Add examples of permutation invariant architectures

Our current architecture in sample-factory is just an MLP encoder; I suspect a permutation invariant or GNN-based architecture would be better

eugenevinitsky updated 1 year ago
5
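
The permutation-invariant alternative suggested in the issue above can be illustrated with a Deep Sets style encoder: a shared MLP is applied to each entity, the results are sum-pooled, and a second MLP maps the pooled vector to the final embedding. This is a minimal NumPy sketch under stated assumptions, not the architecture used in nocturne or sample-factory; the names `mlp`, `init_mlp`, and `deep_sets_encoder` and all layer sizes are hypothetical.

```python
import numpy as np


def init_mlp(rng, sizes):
    """Create (weight, bias) pairs for an MLP with the given layer sizes."""
    return [
        (rng.normal(scale=0.1, size=(m, n)), np.zeros(n))
        for m, n in zip(sizes[:-1], sizes[1:])
    ]


def mlp(x, weights):
    """Apply the layers in order, with ReLU on every layer except the last."""
    for i, (w, b) in enumerate(weights):
        x = x @ w + b
        if i < len(weights) - 1:
            x = np.maximum(x, 0.0)
    return x


def deep_sets_encoder(entities, phi, rho):
    """Encode a set of entities of shape (num_entities, features).

    phi is applied to each entity independently; summing over the entity
    axis discards ordering, so the output is permutation invariant.
    """
    per_entity = mlp(entities, phi)      # (num_entities, hidden)
    pooled = per_entity.sum(axis=0)      # (hidden,)
    return mlp(pooled, rho)              # (embedding,)


# Illustrative sizes only: 5 entities with 6 features each.
rng = np.random.default_rng(0)
phi = init_mlp(rng, [6, 16, 16])
rho = init_mlp(rng, [16, 16, 8])
entities = rng.normal(size=(5, 6))
shuffled = entities[rng.permutation(5)]

embedding = deep_sets_encoder(entities, phi, rho)
embedding_shuffled = deep_sets_encoder(shuffled, phi, rho)
```

Reordering the entities leaves the embedding unchanged up to floating-point rounding, which a plain MLP over concatenated entity features does not guarantee.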
KindXiaoming/pykan #204

Feature Request: KAN Model Does Not Support Tensor Input wit…

## Description The KAN (Kolmogorov-Arnold Network) model from the pykan library currently only supports two-dimensional input tensors (batch_size x hid_dim). A `RuntimeError` is raised when att…

linkedlist771 updated 4 months ago
5
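
A common workaround for a model that only accepts two-dimensional input, as described in the issue above, is to flatten all leading dimensions into the batch axis, run the model, and restore the leading dimensions afterwards. The sketch below uses NumPy and a hypothetical stand-in `toy_model`; it does not call pykan, and `apply_over_last_dim` is an illustrative helper, not part of any library.

```python
import numpy as np


def apply_over_last_dim(model_2d, x):
    """Run a (batch, features)-only model on an array with extra leading dims."""
    x = np.asarray(x)
    lead_shape = x.shape[:-1]
    flat = x.reshape(-1, x.shape[-1])          # (prod(lead_shape), features)
    out = np.asarray(model_2d(flat))           # (prod(lead_shape), out_features)
    return out.reshape(*lead_shape, out.shape[-1])


def toy_model(x2d):
    """Hypothetical stand-in that, like the model in the issue, rejects non-2D input."""
    if x2d.ndim != 2:
        raise RuntimeError("expected input of shape (batch_size, hid_dim)")
    weights = np.ones((x2d.shape[1], 3))
    return x2d @ weights


# Assumed example shape: (batch, sequence, hid_dim).
x = np.zeros((4, 5, 8))
y = apply_over_last_dim(toy_model, x)
```

Here `y` has shape `(4, 5, 3)`, whereas calling `toy_model(x)` directly raises a `RuntimeError`.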
keras-team/tf-keras #109

Add MLP-Mixer models `keras.applications`

If you open a GitHub issue, here is our policy: It must be a bug, a feature request, or a significant problem with the documentation (for small docs fixes please send a PR instead). The form below…

sayakpaul updated 12 months ago
4
