-
Hi all,
I want to use an EBM as a GAM to replace the fully connected layer at the end of a large CNN/Transformer and get interpretable outputs. However, I need to train the EBM like a deep learning model,…
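For context, the additive structure a GAM head imposes can be sketched as a tiny NAM-style (neural additive model) layer in plain numpy. This is an illustrative assumption on my part, not InterpretML's EBM (which is tree-based and not gradient-trainable): each feature gets its own small differentiable shape function, so the head could in principle be trained end-to-end with the backbone while staying additively interpretable.

```python
import numpy as np

# Sketch of a GAM-style head: one tiny MLP f_i per feature,
# prediction = sum_i f_i(x_i). Differentiable, unlike a boosted-tree EBM,
# so it could sit on top of a CNN/Transformer and train jointly.
rng = np.random.default_rng(0)

def make_shape_function(hidden=8):
    """Parameters of one small MLP: scalar in -> scalar out."""
    return {
        "w1": rng.normal(size=(1, hidden)),
        "b1": np.zeros(hidden),
        "w2": rng.normal(size=(hidden, 1)),
    }

def shape_fn(params, x_col):
    h = np.tanh(x_col[:, None] @ params["w1"] + params["b1"])
    return (h @ params["w2"])[:, 0]

n_features = 4
fs = [make_shape_function() for _ in range(n_features)]

def gam_head(X):
    # Per-feature contributions are additive and individually inspectable.
    contribs = np.stack(
        [shape_fn(fs[i], X[:, i]) for i in range(n_features)], axis=1
    )
    return contribs.sum(axis=1), contribs

X = rng.normal(size=(5, n_features))
y_hat, contribs = gam_head(X)
# Interpretability property: prediction equals the sum of contributions.
assert np.allclose(y_hat, contribs.sum(axis=1))
```

The `contribs` matrix is what makes the head interpretable: each column is one feature's standalone effect on the prediction.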
-
I ran these commands in Google Colab with a GPU runtime:
```
!wget http://boston.lti.cs.cmu.edu/luyug/coil/msmarco-psg/psg-train.tar.gz
!tar xfz psg-train.tar.gz
!git clone https://github.com/luyug/COIL
!…
-
### System Info
Linux
### Who can help?
@pacman100 @younesbelkada @BenjaminBossan
When I used prefix tuning to fine-tune codebert for sequence classification, it showed the following erro…
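For comparison, a minimal PEFT prefix-tuning setup for sequence classification usually looks like the config sketch below. The checkpoint name and `num_virtual_tokens` value are illustrative placeholders, not taken from the report above.

```python
from peft import PrefixTuningConfig, TaskType, get_peft_model
from transformers import AutoModelForSequenceClassification

# Placeholder checkpoint; substitute the CodeBERT variant actually used.
base = AutoModelForSequenceClassification.from_pretrained(
    "microsoft/codebert-base", num_labels=2
)
peft_config = PrefixTuningConfig(
    task_type=TaskType.SEQ_CLS,   # sequence classification task type
    num_virtual_tokens=20,        # length of the learned prefix
)
model = get_peft_model(base, peft_config)
model.print_trainable_parameters()
```

If the error differs from what this baseline produces, the mismatch is likely in the model class or `task_type` choice rather than the prefix length.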
-
Hi,
While running the training code with the m4c_captioner model, I am getting the following error:
/home/root1/anaconda3/envs/mmf/lib/python3.7/site-packages/omegaconf/grammar_visitor.py:257: U…
-
Thanks for the great work.
I tried to train the ldm model on ImageNet with 8 V100s, but got a bad result. The loss was normal at first, but it soon collapsed:
![image](https://user-images.githu…
-
Similar to the issue I posted here: https://github.com/openai/gpt-2/issues/148
-- Is it possible to use the intermediate layer outputs and generate text ignoring the layers on top? Basically, I want t…
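As a toy illustration of the idea (not GPT-2's actual API), "early exit" means applying the output head to a hidden state from an intermediate layer instead of the final one. The sketch below assumes a tied unembedding matrix shared across exit points; all names and sizes are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
d, vocab, n_layers = 16, 50, 6

# Toy "transformer": each layer is just a nonlinear map over hidden states.
layers = [rng.normal(size=(d, d)) / np.sqrt(d) for _ in range(n_layers)]
W_out = rng.normal(size=(d, vocab))  # shared output (unembedding) head

def hidden_states(x):
    """Return the hidden state after every layer."""
    states = []
    for W in layers:
        x = np.tanh(x @ W)
        states.append(x)
    return states

def logits_from_layer(x, k):
    """Early exit: decode from layer k's output, ignoring layers above it."""
    return hidden_states(x)[k] @ W_out

x = rng.normal(size=(1, d))
early = logits_from_layer(x, 2)            # exit after the 3rd of 6 layers
full = logits_from_layer(x, n_layers - 1)  # normal full-depth decoding
assert early.shape == full.shape == (1, vocab)
```

The early-exit logits have the right shape to sample from, but they come from a representation the upper layers never refined, so quality typically degrades the earlier you exit.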
-
AttributeError: 'BaichuanTokenizer' object has no attribute 'sp_model'
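A likely cause (an assumption on my part, based on how `PreTrainedTokenizer.__init__` behaves in newer transformers releases, roughly 4.34 and later) is an attribute-ordering problem: the base class `__init__` calls a subclass method that reads `self.sp_model` before the subclass has assigned it. The pattern can be reproduced with toy classes, not the real tokenizer:

```python
# Minimal reproduction of the attribute-ordering pattern behind
# "AttributeError: ... object has no attribute 'sp_model'".

class Base:
    def __init__(self):
        # The base __init__ calls a method the subclass overrides...
        self.size = self.vocab_size()

    def vocab_size(self):
        return 0

class BrokenTokenizer(Base):
    def __init__(self):
        super().__init__()        # vocab_size() runs here, but...
        self.sp_model = object()  # ...sp_model is set only afterwards

    def vocab_size(self):
        return 1 if self.sp_model else 0  # AttributeError at init time

class FixedTokenizer(Base):
    def __init__(self):
        self.sp_model = object()  # assign *before* super().__init__()
        super().__init__()

    def vocab_size(self):
        return 1 if self.sp_model else 0

ok = FixedTokenizer()
assert ok.size == 1
# BrokenTokenizer() raises AttributeError, mirroring the reported error.
```

If that is indeed the cause here, either pin an older transformers release or move the `sp_model` assignment above the `super().__init__()` call in the custom tokenizer code.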
-
Hi,
When running `finetune-mbart-on-transaltion_embed+xattn.sh` I get the error `TypeError: forward() missing 1 required positional argument: 'prev_output_tokens'` at the beginning of epoch 1. When…
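For what it's worth, this TypeError pattern usually means the caller invokes the model without the decoder inputs that its `forward` signature requires. A toy reproduction with a hypothetical class, not the actual mBART code:

```python
# Seq2seq-style forward that requires decoder inputs (prev_output_tokens);
# a wrapper that calls forward(src_tokens) alone triggers the TypeError.
class ToySeq2Seq:
    def forward(self, src_tokens, prev_output_tokens):
        return len(src_tokens) + len(prev_output_tokens)

m = ToySeq2Seq()
# m.forward([1, 2, 3])  # TypeError: missing 'prev_output_tokens'
out = m.forward([1, 2, 3], [0, 1, 2])  # -> 6
assert out == 6
```

So the place to look is whichever training wrapper builds the `forward` call at the start of the epoch, and whether it forwards the decoder (teacher-forcing) tokens.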
-
Environment
---
```shell
(base) [root@localhost ~]# nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2023 NVIDIA Corporation
Built on Mon_Apr__3_17:16:06_PDT_2023
Cuda compilation tool…
-
I've been trying to make the combination `deepspeed + qlora + falcon` work, but for unknown reasons I've been stuck in an error maze.
## Setup
- Docker image: `winglian/axolotl-runpod:main-py3.9-cu…