linear-transformer Search Results

1000+ results
for linear-transformer

Best match


time-series-foundation-models/lag-llama #79

forecasts = list(forecast_it) not performed

Trying to test the prediction with the minimal code from https://github.com/marcopeix/time-series-analysis/blob/master/lag_llama.ipynb https://medium.com/@odhitom09/lag-llama-an-open-source-base-m…

kenadianu updated 1 week ago
7
pytorch/vision #7871

compute flops for scaled_dot_product_flash_attention

https://github.com/pytorch/vision/actions/runs/5941974400/job/16117254380 Failures start with 9c4f7389d0db7cfe7e8591ea920459673344aaa8, which is the first commit that used yesterday's (20230822) PyT…

pmeier updated 7 months ago
6
UKPLab/sentence-transformers #238

Simple way of producing two independent embeddings

I would like to finetune BERT (or similar) models for an asymmetric task using two different embeddings. There will be two inputs (1 and 2), and I would use an embedding in 1 and an embedding in 2 to …

fjhheras updated 3 years ago
8
pytorch/xla #5464

FSDP flatten_parameter=True causing excessive memory consump…

## ❓ Questions and Help I have noticed during testing that enabling FSDP's flatten_parameter=True results in a significant increase in GPU Peak Memory. In fact, the memory usage is several times la…

Seventeen17 updated 10 months ago
5
fudan-generative-vision/champ #29

Inference time

Hi, I'm grateful for your excellent work! I've implemented the code as per the instructions, and it runs without errors. However, the inference time is slow, approximately 176 seconds per iteration. I…

puckikk1202 updated 2 months ago
4
AkihikoWatanabe/paper_notes #765

RWKV: Reinventing RNNs for the Transformer Era, Bo Peng+, N/…

# URL - https://arxiv.org/abs//2305.13048 # Affiliations - Bo Peng, N/A - Eric Alcaide, N/A - Quentin Anthony, N/A - Alon Albalak, N/A - Samuel Arcadinho, N/A - Huanqi Cao, N/A - Xin Che…

AkihikoWatanabe updated 1 year ago
1
microsoft/GLUECoS #7

Token level task for transliteration

As the title says, is there any way to add the evaluation script for the transliteration task. I am currently working on creation of a transliteration dataset and training a neural model on the extrac…

TrigonaMinima updated 3 years ago
3
bonlime/pytorch-tools #104

Transformers

I'll keep a dump here of papers about transformers that I'm reading or want to read. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale - the paper that proposed ViT, …

bonlime updated 11 months ago
3
AUTOMATIC1111/stable-diffusion-webui #13201

[Feature Request]: Set cutoff tokens in prompt at will using…

### What would your feature do ? I'm developing an extension and discovered something strange in the configuration with the CLIPTokenizer on Automatic1111 for an SDXL model i downloaded from civita…

Nekos4Lyfe updated 9 months ago
3
pytorch/pytorch #18182

Update weight initialisations to current best practices

## 🚀 Feature Update weight initialisations to current best practices. ## Motivation The current weight initialisations for a lot of modules (e.g. `nn.Linear`) may be ad-hoc/carried over from Torc…

Kaixhin updated 4 months ago
47

