mlp-architecture Search Results

1000+ results for mlp-architecture

TencentARC/ST-LLM #12

Some weights of the model checkpoint at stllm/output/instruc…

dear author: when loading pretrained ckpt, some weights are not used, is it normal?? ``` Loading checkpoint shards: 100%|██████████████████████████████████████████████████████████████████████████…

dragen1860 updated 4 months ago
1
pytorch/rl #1771

[Feature Request] Information of RNNs expected inputs and ou…

## Motivation When RNN’s are used in isolation, creating a TensorDictPrimer Transform for the environment to populate the TensorDicts with the expected tensors is pretty straightforward: ```pyth…

albertbou92 updated 5 months ago
2
karpathy/nanoGPT #272

Error in importing custom weights

I would like to use a GPT-2-like model in nanoGPT. I downloaded pytorch_model.bin, renamed it to ckpt.pt, and put it in the directory, but I get the following error: ` gptconf = GPTConfig(**checkpoint['mode…

Maniues updated 3 months ago
4
pyg-team/pytorch_geometric #8330

Add `channel_list` to `BasicGNN`

### 🛠 Proposed Refactor Today, all pre-defined GNN models (`GCN`, `GAT`, etc.) can only have a constant hidden size. The base class for these models, which is [`BasicGNN`](https://github.com/pyg-tea…

binkjakub updated 9 months ago
3
constantinpape/training-deep-learning-models-for-vison #2

Exercises Day1

Summary of issues with the exercises of day 1: - 1e-3 as a learning rate is too high for the LogReg and the MLP. A good learning rate is 5e-4 or 1e-4. - In exercise 2: trying just some new filters is to…

constantinpape updated 3 years ago
1
erikwijmans/Pointnet2_PyTorch #66

why is network architecture different from original?

Hi Erik, Regarding the network architecture in your pytorch implementation. I noticed that in the SA and FP modules, the mlp / conv2d channel input and output dimensions differ from the dimensions …

triple-tam updated 5 years ago
1
fani-lab/OpeNTF #211

Implementing Sigmoid-Outputting Models

I aim to implement the following sigmoid-outputting models for team formation: 1. Neural Collaborative Filtering (NCF) with MLP/FNN: - NCF has demonstrated success in recommendation systems and collaborat…

MarcoKurepa updated 1 year ago
1
microsoft/CSWin-Transformer #14

It's was hard to fine-tuning on other dataset.

I trained this network on an image defect classification task, and it was very hard to train and got low accuracy, but other models, like a VIP model based on an MLP architecture, or pure resnet50, those model …

BboyHanat updated 1 year ago
5
aws-neuron/aws-neuron-sdk #881

Internal tensorizer error when trying to compile and train a…

I replaced the MLP from this example with a CNN and I'm getting an `Internal tensorizer error` when trying to run it. Here are the scripts: `model.py`: ```python import torch import torch.nn as n…

sgaseretto updated 4 months ago
3
mir-group/allegro #76

How can I modify Training and loss forces of my system speci…

Dear all, I have recently used the Nequip-Allegro framework to retrain DFT data in the LGPO systems. I have used 30000 configurations for retraining DFT data. By using the ASE calculator generated ML for…

ElhamPisheh updated 6 months ago
6
