-
**Describe the bug**
When I want to convert a generate function to a concrete function for tf lite, it errors out with `AttributeError: 'Tensor' object has no attribute 'numpy'`, the stack trace di…
-
I am able to finetune gpt2-xl model on AWS c5.9xlarge - CPU (72 GB memory) ( slow but doable)
It looks like 1x A100 40GB is capable of achieving this at much better speed.
I don't have access t…
-
Thanks for your nice work.
I am try to replicate result on webNLG, but the finnal epochs of checkpoint is only 11270, different from 20000. This results in a significant difference in the accuracy …
vvhj updated
5 months ago
-
Hi Jürgen,
this is an awesome exploration work. I have 2 questions and 1 suggestion (sorry for the long issue):
1) How do we know results are corect? I am running with the `tornado-multi-tg` b…
-
```
necktwi@CheapFellow:~/workspace/llm.c$ make train_gpt2cu USE_CUDNN=1 CUDNN_FRONTEND_PATH="/home/necktwi/workspace/cudnn-frontend/include"
necktwi@CheapFellow:~/workspace/llm.c$ ./train_gpt2cu …
-
I have only just tried loading GPT2, havent tried to score a sentence yet. Here is the code:
```
import torch
from lm_scorer.models.auto import AutoLMScorer as LMScorer
device = "cuda:0" if to…
-
Reference - https://github.com/jaymody/picoGPT/blob/main/gpt2.py
Will work on this after https://github.com/lfortran/lfortran/pull/1213 and some improvements in `arr_slice.cpp`.
-
### System Info
lorax:latest
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officially supported command
- [ ] My own modifications
### Reproduction
@tga…
-
Platforms: linux
This test was disabled because it is failing in CI. See [recent examples](https://hud.pytorch.org/flakytest?name=test_fake_tensor_mode_huggingface_gpt2&suite=TORCH_NN_MODULE&limit=10…
-
### Question
I'm looking to build a AI Journaling app that helps you reflect from your journal entries
I'm looking for a model like (GPT or Claude) that will take the selected text and provide ins…