mlp-architecture Search Results

1000+ results
for mlp-architecture

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

mlverse/torch #1167

torch fails on new Mac M3 architecture

Dear @dfalbel I have bought a new MacBook Air with the M3 chip which has 8 CPUs, 10 GPUs and 16GB integrated memory. My R `torch` apps are crashing. I have put together a MWE which works on all other …

gilbertocamara updated 3 months ago
13
5g4s/paper #37

LEARNING FROM PROTEIN STRUCTURE WITH GEOMETRIC VECTOR PERCEP…

https://arxiv.org/abs/2009.01411

5g4s updated 1 year ago
6
arcee-ai/mergekit #284

Merging fails with RuntimeError: weight required but not pre…

I'm trying to merge some embedding models with this config file. the architectures are similar but I think it is erroring out on some names of layers? Would love some suggestions on how to change the …

w601sxs updated 4 months ago
7
pratyushasharma/laser #4

Mistral Support

Hi, Great work on this! Is Mistral supported? Right now I only see GPT-J and Llama 2. Thank you!

fakerybakery updated 8 months ago
16
pytorch/pytorch #121067

RuntimeError

### 🐛 Describe the bug ###################################### # # # Retrieving MEMIT hyperparameters # # # ################…

daisysong76 updated 6 months ago
1
unslothai/unsloth #502

Models not pushing to specified username (organisation)

Running: ``` hf_username="Trelis" new_model_name="Meta-Llama-3-8B-Instruct-Gaeilge" if True: model.push_to_hub_merged(f"{hf_username}/{new_model_name}", tokenizer, save_method = "merged_16bit") `…

RonanKMcGovern updated 4 months ago
1
sail-sg/poolformer #10

Can I say PoolFormer is just a non-trainable MLP-like module…

Hi! Thanks for sharing the great work! I have some questions about PoolFormer. If I explain PoolFormer like the following attachments, can I say PoolFormer is just a non-trainable MLP-like model? …

072jiajia updated 1 year ago
8
MKLab-ITI/JGNN #2

Graph Neural Network tutorial

This is a very interesting library and I want to try this for my project. I wanted to know if it's possible to have a Graph Neural Network example in the tutorials?

gavalian updated 1 year ago
9
TheLastBen/fast-stable-diffusion #1345

Training on a custom (huggingface) model is broken

I tried several different base models based on 1.5. Pasted the following in `Path_to_HuggingFace`, no path or link. `1.5` selected as custom model version: - darkstorm2150/Protogen_v5.3_Official_Rele…

flesler updated 1 year ago
40
facebookresearch/vissl #562

Incosistent results when running inference on trained resnet…

## Instructions To Reproduce the Issue: I have trained a pretrained ImageNet resnet on a custom dataset with 12 classes. For training I used following yaml file: [training_yaml_file](https://git…

BartvanMarrewijk updated 1 year ago
1

上一页 1...27 28 29 30 31 32 33...100 下一页

1000+ results for mlp-architecture

1000+ results
for mlp-architecture