-
## 🐛 Bug
Model I am using: BERT
Language I am using the model on (English, Chinese....): English
```
from transformers import BertTokenizer, BertModel
tokenizer2 = BertTokenizer.from_pret…
-
## ❓ Questions and Help
I'm trying to implement beam search for the autoregressive predictions of a transformer that I have implemented. Although I've tried my best to ensure everything has constan…
-
Any plans to introduce mixed precision training as an analog to opt_level O2 in Nvidia/Apex? I'm training GPT-2 model right now. It's not training well with XLA_USE_BF16=1. I can get perplexity 120 wi…
-
I am willing to fine-tune GPT2-large which simply does not fit into GPU memory. I wanted to run the script `run_lm_finetuning.py` with GPT2-large having two Nvidia Tesla P100, but I suppose model spli…
-
Hi, I am currently using this code to research the transferability of those pre-trained models and I wonder how could I apply the fine-tuned parameter of a model to another model. For example, I fine-…
-
**Is your feature request related to a problem? Please describe.**
This feature request is not related to a problem, _per se_. I believe that users would benefit from an autogenerated methods section…
tsalo updated
5 years ago
-
## ❓ Questions and Help
I want to run pythia on my Ubuntu 18.04 machine (which I'm remotely accessing - also it has 3 2080s), but the docs are unclear on how to run pythia on a specific image in the …
inafi updated
4 years ago
-
We may or may not decide to look into this before the 2019 data release, but I see 8 lakes that have lots of temperature observations and yet lack entries in [pb0_config.json](https://www.sciencebase.…
-
As a result of a thorough bug report from @Shotgunosine, in ANTsX/ANTs#822 several recommendations we should adopt for fMRIPrep are given:
- Rescale the input if negative values are found (https:…
-
Dear Kevin,
I'm a big fan of renv and I'm using it in combination with drake and Docker to ensure reproducibility for my scientific projects and foster collaboration. I'm at the moment experimenting …