adaptive-label-smoothing Search Results

310 results
for adaptive-label-smoothing

Best match

LiyuanLucasLiu/Transformer-Clinic #14

wmt_en_de admin: Function 'SoftmaxBackward' returned nan val…

I was wondering if you ever encountered nan-gradients during admin training. I'm in torch 1.6/CUDA 10.1 with no modifications to the code: #### Command ```bash export dd=data-bin/wmt14_en_de_joi…

sshleifer updated 3 years ago
8
facebookresearch/fairseq #2710

pip error error: unknown file type '.pyx' (from 'fairseq/dat…

## ❓ Questions and Help #### What is your question? When I pip install fairseq while building a Dockerfile, an error is raised. Dockerfile: FROM pytorch/pytorch:1.3-cuda10.1-cudnn7-devel RUN pip install redis fairse…

luoqishuai updated 3 years ago
6
facebookresearch/fairseq #3215

Exception: process 0 terminated with signal SIGKILL when tra…

## 🐛 Bug I followed up almost literally the example provided in https://github.com/pytorch/fairseq/blob/master/examples/backtranslation/README.md, but when I run fairseq-train I get "Exception: p…

davidepatrucco updated 3 years ago
3
XuezheMax/apollo #4

Apollo applied to NMT

@XuezheMax Hello, I replaced Adam with Apollo in the machine translation based on the transformer structure of the fairseq framework, but the effect decreased. I have a partner who does reading comp…

BUAAers updated 3 years ago
3
LiyuanLucasLiu/Transformer-Clinic #9

Post-LN with 12-12 is trained ok, but 12-3 diverge

Hi, As we expect, the model with more transformer layers is easier to diverge during training. However, we find that the model with 12 encoder layers and 12 decoder layers is trained ok, but the model…

ZhenYangIACAS updated 4 years ago
9
IBM/transition-amr-parser #9

problem with fairseq-preprocess

I was able to install everything as per your setup instructions. I run the training script `bash scripts/stack-transformer/experiment.sh configs/amr2_o5+Word100_roberta.large.top24_stnp6x6.sh` The da…

PolKul updated 3 years ago
13
bert-nmt/bert-nmt #56

Training throws pytorch Runtime error

When I use the training script `train.sh`, the following error is thrown - ``` + nvidia-smi NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest …

ishanisri updated 3 years ago
1
MarlinFirmware/Marlin #18480

[BUG] Probe Constantly Fails

### Bug Description Probe constantly fails whether it is homing Z or leveling. Occasionally when it actually starts leveling it will "skip" a spot, immediately moving upwards and going to a new s…

RNewDeal updated 4 years ago
20
LiyuanLucasLiu/Transformer-Clinic #13

tmp_weight is not defined

Hi, In [this line](https://github.com/LiyuanLucasLiu/Transformer-Clinic/blob/master/fairseq/fairseq/modules/transformer_layer.py#L178), the variable `tmp_weight` is not defined. How should it be s…

sshleifer updated 3 years ago
4
facebookresearch/fairseq #2831

FileNotFoundError: Dict not found: /zjw/testproject/data/mus…

When I run speech-to-text on MuST-C following the instructions, the error "FileNotFoundError: Dict not found: /zjw/testproject/data/mustc/en-fr/dict.txt" occurs. I followed the preprocess(st) an…

zjw1990 updated 4 years ago
2

