-
I used this network on an image defect classification task, but it was very hard to train and achieved low accuracy. Other models, however, such as the VIP model based on an MLP architecture or a plain ResNet-50, those models …
-
-
Hi, thanks for sharing your code. In the paper you implement a baseline called "BERT+MLP" that reaches a **76.2** F1 score, but when I use the same architecture I cannot reproduce that result. Di…
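For comparison, here is a minimal sketch of what a "BERT+MLP" baseline typically looks like; the [CLS] pooling choice, MLP width, and dropout are assumptions, not the paper's exact configuration. The encoder is any Hugging Face model (e.g. `AutoModel.from_pretrained("bert-base-uncased")`) passed in by the caller:

```python
import torch
import torch.nn as nn

class BertMLPClassifier(nn.Module):
    """A BERT encoder followed by a two-layer MLP classification head.

    `encoder` is any Hugging Face model whose output exposes `last_hidden_state`.
    Pooling via the [CLS] token and the MLP width are assumptions here.
    """
    def __init__(self, encoder, num_labels, mlp_hidden=256, dropout=0.1):
        super().__init__()
        self.encoder = encoder
        dim = encoder.config.hidden_size
        self.mlp = nn.Sequential(
            nn.Linear(dim, mlp_hidden),
            nn.ReLU(),
            nn.Dropout(dropout),
            nn.Linear(mlp_hidden, num_labels),
        )

    def forward(self, input_ids, attention_mask):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        cls = out.last_hidden_state[:, 0]  # [CLS]-token pooling (an assumption)
        return self.mlp(cls)
```

Small differences in pooling (CLS token vs. mean pooling vs. the `pooler_output`) can move F1 by a point or more, so that detail is worth confirming with the authors.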
-
I am currently attempting to port a Llama-like model architecture from pure PyTorch to TransformerEngine's PyTorch classes.
However, I have been unable to obtain identical results in certain cases.…
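When debugging this kind of mismatch, it usually helps to compare the two implementations module by module under a tolerance rather than expecting bit-identical outputs (fused kernels and different reduction orders change low-order bits). A sketch, where `ref_module` and `te_module` are placeholders for the corresponding PyTorch and TransformerEngine layers:

```python
import torch

@torch.no_grad()
def compare_outputs(ref_module, te_module, x, rtol=1e-3, atol=1e-5):
    """Run both modules on the same input and report the difference.

    Returns (allclose_ok, max_abs_diff, max_rel_diff); tolerances are
    typical starting points, not TransformerEngine-prescribed values.
    """
    ref = ref_module(x).float()
    out = te_module(x).float()
    abs_diff = (ref - out).abs().max().item()
    rel_diff = ((ref - out).abs() / (ref.abs() + 1e-12)).max().item()
    ok = torch.allclose(ref, out, rtol=rtol, atol=atol)
    return ok, abs_diff, rel_diff
```

Walking this check layer by layer (embedding, attention, MLP, norm) usually isolates which ported component diverges first.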
-
### System Info
Hardware: L20
Version: 0.11.0.dev20240625
Model: Bloom7b1
### Who can help?
@ncomly-nvidia @byshiue
I have obtained the Medusa head for Bloom according to the official M…
-
**Describe the bug**
The provided pre-trained Swin-UNETR weights do not load into a newly instantiated SSLHead model object. The naming scheme for the model state_dict keys is different between the p…
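A common workaround for mismatched checkpoint keys is to remap key prefixes before calling `load_state_dict`. A sketch; the prefixes in the usage comment are hypothetical and depend on how the checkpoint was actually saved:

```python
def remap_state_dict(state_dict, prefix_map):
    """Rename state_dict keys by prefix so they match the target model.

    `prefix_map` maps old prefixes to new ones, e.g. {"module.": ""};
    the actual prefixes must be read off the real checkpoint and model.
    """
    remapped = {}
    for key, value in state_dict.items():
        for old, new in prefix_map.items():
            if key.startswith(old):
                remapped[new + key[len(old):]] = value
                break
        else:
            remapped[key] = value
    return remapped

# Hypothetical usage:
# ckpt = torch.load("model_swinvit.pt", map_location="cpu")
# model.load_state_dict(remap_state_dict(ckpt["state_dict"], {"module.": ""}),
#                       strict=False)
```

Printing a few keys from both `ckpt["state_dict"]` and `model.state_dict()` side by side makes the required `prefix_map` obvious.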
-
```python
# shape reconstruction loss
rebuild_points = nbr_groups[0] + center_groups[0].unsqueeze(-2)
idx = pointops.knn(center_groups[0], pred, int(self.nbr_ratio * self.group_size))[0]
…
```
-
I aim to use the following sigmoid-outputting models for team formation:
1. Neural Collaborative Filtering (NCF) with MLP/FNN:
- NCF has demonstrated success in recommendation systems and collaborat…
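For reference, a minimal sketch of the MLP-branch NCF scorer described in point 1: user and item embeddings are concatenated and passed through an MLP to a sigmoid match score. Embedding and hidden sizes here are placeholders:

```python
import torch
import torch.nn as nn

class NCFScorer(nn.Module):
    """MLP-based NCF: concatenated user/item embeddings scored in (0, 1)."""
    def __init__(self, num_users, num_items, emb_dim=32, hidden=(64, 32)):
        super().__init__()
        self.user_emb = nn.Embedding(num_users, emb_dim)
        self.item_emb = nn.Embedding(num_items, emb_dim)
        layers, dim = [], 2 * emb_dim
        for h in hidden:
            layers += [nn.Linear(dim, h), nn.ReLU()]
            dim = h
        layers.append(nn.Linear(dim, 1))
        self.mlp = nn.Sequential(*layers)

    def forward(self, user_ids, item_ids):
        x = torch.cat([self.user_emb(user_ids), self.item_emb(item_ids)], dim=-1)
        return torch.sigmoid(self.mlp(x)).squeeze(-1)  # match score in (0, 1)
```

For team formation, "user" and "item" would map to whatever pair is being matched (e.g. candidate and team), which is an adaptation beyond the original recommendation setting.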
-
How can I use different language models from Hugging Face for knowledge distillation in this setup?
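Assuming the setup pairs a teacher and a student model that share a label or vocabulary space, the usual recipe combines a softened KL term with the hard-label loss. A sketch with typical (not prescribed) temperature and mixing values; distilling across Hugging Face models with *different* tokenizers would additionally require aligning the vocabularies first:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Standard KD loss: softened KL to the teacher plus hard cross-entropy.

    T and alpha are common defaults; both models must produce logits over
    the same label/vocabulary space for the KL term to be meaningful.
    """
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale gradients to the hard-loss magnitude
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```

In practice the teacher runs under `torch.no_grad()` and only the student is optimized.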
-
### Question
Hello, I was trying to get a sense of the number of parameters in LLaVA 1.5. I understand that the LLM used is Vicuna 1.5 (either 7B or 13B) and that the vision encoder is CLIP ViT-L/14 336…
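One way to verify such a count is to load the checkpoint and sum parameter sizes directly; the helper below works for any `torch.nn.Module` (the model id in the usage comment is an assumption):

```python
def count_params(model):
    """Return (total, trainable) parameter counts for a torch.nn.Module."""
    total = sum(p.numel() for p in model.parameters())
    trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
    return total, trainable

# Hypothetical usage (downloads weights):
# from transformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained("llava-hf/llava-1.5-7b-hf")
# print(count_params(model))
```

The total should roughly decompose into the Vicuna LLM, the CLIP vision tower, and the small multimodal projector.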