-
Hi,
Thanks for the great work!
Firstly I wanted to ask about the performance of the code-clippy models. It seems that the 125M parameter models are quite weak and perform quite poorly on human-eval…
-
## Environment info
- `transformers` version:4.12.5
- Platform:pytorch
- Python version:3.8
- PyTorch version (GPU?):1.12.0-cuda11.3
- Tensorflow version (GPU?):
- Using GPU in script?:yes
…
-
I have been successfully using the GPT2LMHeadModel module for text generation for some time and I recently tried to reuse the code to generate with GPTNeoForCausalLM. Though the documentations appear …
-
## Environment info
- `transformers` version: 4.11.0.dev0
- Platform: Linux-4.15.0-156-generic-x86_64-with-glibc2.17
- Python version: 3.8.11
- PyTorch version (GPU?): 1.8.1+cu111 (True)
- Te…
-
I would like to fine tune this model to my custom QA dataset where the input is a question and a data table, and the output is an answer to the question. How can I have this setup for this model? I ca…
-
I'm trying to get this to work on a Jetson nano rather than cloud deployment but I'm stuck in python module dependency gridlock. Did you test this with a certain docker container perhaps that would ac…
-
## Environment info
This hotfix: https://github.com/huggingface/transformers/pull/14204 could be superseeded with superior fix that would keep `inference_mode` by hiding the tensor inplace modifica…
-
## Environment info
- `transformers` version: 4.9.2
- Platform: Linux
- Python version: 3.8.10
- PyTorch version (GPU?): 1.9.0+cu111 (True)
- Tensorflow version (GPU?): not installed (NA)
- …
-
## Environment info
- `transformers` version: 4.15.0
- Platform: Linux-5.4.144+-x86_64-with-Ubuntu-18.04-bionic
- Python version: 3.7.12
- PyTorch version (GPU?): 1.10.0+cu111 (False)
- Tensorf…
-
@patrickvonplaten, @LysandreJik @sgugger
GPT-Neo implements the class `GPTNeoForCausalLM` and GPT-2 implements the class `GPT2LMHeadModel`. These look like they're supposed to do roughly the same t…