transformer-networks Search Results

1000+ results
for transformer-networks

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

apache/mxnet #18922

[Random] Expose the random state

Exposing the RandomState is quite crucial for implementing many techniques like gradient check-pointing and reversible networks, e.g., https://github.com/lucidrains/routing-transformer/blob/master/rou…

sxjscience updated 4 years ago
3
lucidrains/vit-pytorch #164

Removing FC layer at the top of the transformer model

Hi all, I would like to ask you how I can remove the fully connected layer at the top of the transformer layer. I want to get the output of the transformer networks and not the final prediction. Mo…

loukasilias updated 1 year ago
2
SakanaAI/AI-Scientist #116

The process has been stuck at the retrieval phase for about …

(AI_Scientist) root@intern-studio-50102651:~/AI-Scientist# python launch_scientist.py --model "gpt-4o-2024-05-13" --experiment nanoGPT --num-ideas 1 Using GPUs: [0] Using OpenAI API with model gpt-4…

Wuyuhang11 updated 2 months ago
2
yoxu515/aot-benchmark #40

No module named spatial_correlation_Sampler

Hi @z-x-yang , It was mentioned in the readme that demo script will run even without spatial correlation sampler.But, in the attention.py ,it was being imported and the error is as follows: Build …

sivaji123256 updated 1 year ago
2
MinZHANG-WHU/Change-Detection-Review #3

Please add our paper

Hi, Can you please add our recent CD paper with Transformers ("A Transformer-Based Siamese Network for Change Detection") to your collection? arxiv link: https://arxiv.org/abs/2201.01293 Code: …

wgcban updated 2 years ago
1
openai/CLIP #281

Any plan to add `Swin` transformer?

`Swin transformer` achieves higher accuracy in model size and computational amount similar to `ViT`. I think that using clip's method and dataset will show higher performance. - ViT-B/16, 384x384, 8…

klae01 updated 1 year ago
2
kohya-ss/sd-webui-additional-networks #139

Additional Network extension not installed, Only hijack buil…

Hello, i get this thing when i run stable diffusion webui and don't know what it means... Is it bad? Did something break? All my loras work... well now, first all of the gave errors Error running pr…

randomuser11956 updated 7 months ago
2
pytorch/pytorch #72253

Transformer Initialization

While you took care of this in the tutorial on Transformers and `nn.Transformer`. I just used `nn.TransformerEncoder` and realized that this won't initialize parameters in a sensible way on its own. O…

SamuelGabriel updated 9 months ago
4
arcee-ai/mergekit #198

Idea: Downscaling the K and/or Q matrices for repeated layer…

Has anyone tried downscaling the K and/or Q matrices for repeated layers in franken-merges? This should act like changing the temperature of the softmax and effectively smooth the distribution: **H…

jukofyork updated 2 months ago
63
e2nIEE/pandapower #2352

Merge two networks as DSO/TSO network?

Hi I was wondering if and how you can use the merge function to merge two networks such that one functions as a dso and one as TSO?

samabu2011 updated 3 months ago
2

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for transformer-networks

1000+ results
for transformer-networks