mlp-architecture Search Results

1000+ results
for mlp-architecture

tenstorrent/tt-metal #13277

[Feature Request] Support model Qwen2-7B in the model demos

* Goal: Run model [Qwen2-7B](https://huggingface.co/Qwen/Qwen2-7B) on the TT Wormhole device. * Changes: Add this directory `models/demos/wormhole/qwen2_7b`. ## Approach We will leverage the ex…

cthsieh updated 1 month ago
2
HazyResearch/m2 #34

What category does the M2 model belong to

Hello, thank you for your great work! M2bert paper mentioned that "Monarch Mixer is part of a new class of architectures called state-space models (SSMs), which include S4, Mamba, and BiGS". Is Monar…

41924076 updated 5 months ago
2
facebookresearch/nocturne #9

Add examples of permutation invariant architectures

Our current architecture in sample-factory is just an MLP encoder; I suspect a permutation invariant or GNN-based architecture would be better

eugenevinitsky updated 2 years ago
5
xinntao/Real-ESRGAN #13

Improvement Idea.

I think it would be very useful to add more discriminators. From the tests I have done with conditional GANs, it seems that having several discriminators with different levels of receptive fields incr…

QLaHPD updated 2 years ago
5
keras-team/tf-keras #109

Add MLP-Mixer models `keras.applications`

If you open a GitHub issue, here is our policy: It must be a bug, a feature request, or a significant problem with the documentation (for small docs fixes please send a PR instead). The form below…

sayakpaul updated 1 year ago
4
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion #17

Need config file and cnn3 model

Hello, can I have the config files for the other datasets and for the cnn3 and MLP architectures? I would like to generate the pretrained weights for all datasets; I mean only the training of the models on all tasks. I…

sorobedio updated 5 months ago
1
KindXiaoming/pykan #204

Feature Request: KAN Model Does Not Support Tensor Input wit…

## Description The KAN (Kolmogorov-Arnold Network) model from the pykan library currently only supports two-dimensional input tensors (batch_size x hid_dim). A `RuntimeError` is raised when att…

linkedlist771 updated 5 months ago
5
caopuzheng/261-team-6-1-repo #18

Model 5 - MLP Neural Networks 5 YR - Build

caopuzheng updated 7 months ago
1
HazyResearch/fly #11

Monarch & PixelFly based MLP layer efficiency testing

Here I post some efficiency testing numbers for Monarch-based MLP vs. vanilla nn.Linear-based MLP. I found that Monarch is best suited to MLPs in Transformer architectures, which generally have la…

zhujiem updated 9 months ago
3
pyg-team/pytorch_geometric #553

Update DynamicEdgeConv Classification Example to match paper…

I was reviewing the dynamic edge conv example. I'm not sure whether the aim of this example is to reproduce the results of the paper. However, I found what I think is a discrepancy between the implem…

dhorka updated 5 years ago
14
