layer1 Search Results - Githubissues

1000+ results
for layer1

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

tenstorrent/tt-metal #11664

Fuse residual add with attention norm in Llama

Presently in transformer decoder, we do ``` h = x + self.attention.forward(self.attention_norm(x), start_pos, freqs_cis, mask) out = h + self.feed_forward.forward(self.ffn_norm(h)) ``` We have c…

kpaigwar updated 3 months ago
1
open-mmlab/mmrotate #1069

[Bug] MMDataParallel loads very slowly

### Prerequisite - [X] I have searched [Issues](https://github.com/open-mmlab/mmrotate/issues) and [Discussions](https://github.com/open-mmlab/mmrotate/discussions) but cannot get the expected help. …

1shenhui updated 3 weeks ago
1
xinntao/Real-ESRGAN #707

RuntimeError: Error(s) in loading state_dict

I have trained the net and am trying to now train the gan. The net worked and finished and the gan works until interrupted (Colab). When I try and resume net training it works well. When trying to res…

ajeema updated 4 months ago
1
microsoft/DeepSpeed #6714

[BUG] pipeline parallelism+fp16+moe isn't working

**Describe the bug** My model use deepspeed `PipelineModule(num_stages=4)` split into 4 parts, and my `deepspeed.moe.layer.MoE` is only set in the pipeline stage1 layer. When my model `train_batch`, t…

NeferpitouS3 updated 1 day ago
5
li554/resnet18-cifar10-classification #1

权重

为啥我用您这个模型权重的时候报错RuntimeError: Error(s) in loading state_dict for CustomResNet18: size mismatch for conv1.weight: copying a param with shape torch.Size([64, 3, 7, 7]) from checkpoint, the shape in cu…

maoshanwen updated 3 months ago
2
google-research/simclr #190

Error when Finetuning: WARNING:absl:Importing a function (__…

I work with TF version 2.4.1, Here is how I finetune the saved checkpoint of ResNet : ``` ###Model ############################## def create_model(): baseModel = tf.keras.models.load_model(…

aynesss updated 7 months ago
2
TUI-NICR/EMSANet #15

Evaluation error "RuntimeError: Error(s) in loading state_di…

RuntimeError: Error(s) in loading state_dict for EMSANet: Missing key(s) in state_dict: "encoder.backbone_rgb.conv1.weight", "encoder.backbone_rgb.norm1.weight", "encoder.backbone_rgb.norm1.bias", "…

AvidahRai updated 1 year ago
1
ZhengJianwei2/DMINet #3

demo.py

你好，我在跑demo.py时报错 initialize network with normal Traceback (most recent call last): File "D:\DMINet-main\demo.py", line 64, in model.load_checkpoint(args.checkpoint_name) File "D:\DMINet-…

JackLiu-97 updated 6 months ago
7
origo-map/origo #2058

Option to add multiple layers in the `featureinfoLayer` laye…

**Description** It would be cool to have the possibility to let a feature info click on a single feature show the results from multiple layers at the same time. **Describe the solution you'd like*…

MattiasSp updated 1 month ago
1
huggingface/transformers #28292

DPT normalization causes contouring when there are significa…

### System Info Python 3.10.12 transformers-4.36.2 ### Who can help? @stevhliu @NielsRogge ### Information - [X] The official example scripts - [x] My own modified scripts ### Task…

CyrusVorwald updated 3 days ago
10

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for layer1

1000+ results
for layer1