-
dear author:
when loading pretrained ckpt, some weights are not used, is it normal??
```
Loading checkpoint shards: 100%|██████████████████████████████████████████████████████████████████████████…
-
## Motivation
When RNN’s are used in isolation, creating a TensorDictPrimer Transform for the environment to populate the TensorDicts with the expected tensors is pretty straightforward:
```pyth…
-
I would like to use GPT-2-like model in nanoGPT.
I downloaded pytorch_model.bin, renamed it into ckpt.pt and put in directory, but I get the following error:
` gptconf = GPTConfig(**checkpoint['mode…
-
### 🛠 Proposed Refactor
Today, all pre-defined GNN models (`GCN`, `GAT`, etc.) can only have a constant hidden size. The base class for these models, which is [`BasicGNN`](https://github.com/pyg-tea…
-
Summary of issues with exercises of day 1:
- 1e-3 as learning rate is too high for the LogReg and the MLP. A good learning rate is 5.e-4 or 1.e-4.
- In exercise 2: trying just some new filters is to…
-
Hi Erik,
Regarding the network architecture in your pytorch implementation. I noticed that in the SA and FP modules, the mlp / conv2d channel input and output dimensions differ from the dimensions …
-
I aim to the following sigmoid-outputting models for team formation:
1. Neural Collaborative Filtering (NCF) with MLP/FNN:
- NCF has demonstrated success in recommendation systems and collaborat…
-
I was use this network trained on image defect classification task, and it was very hard train, and get low acc, but other model, like VIP model based on mlp architecture,or pure resnet50,those model …
-
I replaced the MLP from this example with a CNN and I'm getting a `Internal tensorizer error` when trying to run it. Here are the scripts:
`model.py`:
```python
import torch
import torch.nn as n…
-
Dear all,
I have recently Used Nequip-Allegro Framework to retrain DFT data in the LGPO systems. I have used 30000 configurations for retraining DFT data.By using the ASE calculator generated ML for…