-
Hi, first, the book is amazing!
I have question about the NLP chapter (16). In the 11th question you asked to generate shakespearean text with "one of the recent language models (e.g. BERT) to gene…
-
I have downloaded the chitchat_pretrain_model from the google drive. while running the GPT2LMHeadModel.from_pretrained, raise that "RuntimeError: unexpected EOF, expected 5249182 more bytes. The file …
-
http://paiza.hatenablog.com/entry/javascript_intro
ES6 ES2015 (2017年JS Update)
https://goo.gl/DtborX
vs code debugger
http://jsstudy.hatenablog.com/entry/javascript-stepwise-execution-with-vis…
-
@yufenglee I'm not sure this issue is fixed (tried using the latest onnxruntime-1.12.0 GPU). While disabling shape inference works for quantization, optimization is still broken for saving models such…
-
Hi Rose,
The recent commit changed x_tp1=x_tp to x_tp=x_tp1 (https://github.com/rosewang2008/language_modeling_via_stochastic_processes/commit/cb3d3454433d821c606bc224d42ee81b7cd3754f?diff=split?),…
-
**Is your feature request related to a problem? Please describe.**
If I want to experiment with zero-3 to train 345m GPT model, how to set the relevant configuration of zero-3? At present, I use th…
-
Python 3.6.6 |Anaconda custom (64-bit)| (default, Jun 28 2018, 11:27:44) [MSC v.1900 64 bit (AMD64)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> import gpt_2_…
-
**Describe the bug**
Hi -- I'm trying to build an example to demonstrate expert parallelism feature as described [here](https://arxiv.org/abs/2201.05596). I'm getting an error when initiating the inf…
-
### 🐛 Describe the bug
- 运行sh examples/train_sft.sh
![image](https://user-images.githubusercontent.com/22451062/233003591-6f777795-54cf-4f57-8f1b-184a7bdfde7d.png)
- 报错信息如下:
[04/19/23 15:2…
-
installed via conda env with py3.6.9 and new pytorch / transformers on ubuntu v18
change to model = GPT2LMHeadModel.from_pretrained("gpt2")
but got this error:
`(poemGen) root314@sr-02631:~/p…