-
Possible modification suggestion?: Sentence boundary disambiguation, and sentence segmentation (each sentence on a new line) – “period” “space” with “period” "new line”
Apologies in advance, as this …
-
### 描述该错误
lmdeploy lite auto_awq internlm/internlm2_5-20b-chat --work-dir /home/ma/work/models/internlm2_5-20b-chat-4bit --batch-size 8 --search-scale True
Move model.layers.45 to CPU.
Move mod…
-
### Checklist
- [x] 1. I have searched related issues but cannot get the expected help.
- [x] 2. The bug has not been fixed in the latest version.
- [x] 3. Please note that if the bug-related issue y…
-
At least most of the small open-source corpora should be available on https://teanga.io/
This includes at least:
* Gutenberg
* Brown
* CESS
* Chat-80
* CoNLL 2000, 2002, 2007
* All of UD?
…
-
100%|██████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 667/667…
-
## Motivation and summary of the current issues in torchtext
Based on the feedback from users, there are several issues existing in torchtext, including
- Several components and functionals are un…
-
Thank you for your excellent work.
Why am I unable to reproduce your perplexity metrics on Penn Treebank (PTB) ?
In OWQ Paper: 12.46
Our reproduce: 56.033756256103516
![image](https://github.…
hopef updated
8 months ago
-
Hello, I was wondering what are the training times for the demonstrations.
I just tried the english seq labeler, and it took 1 hour to process 10% of the corpus! (is this normal?)
It's known Deep Le…
-
I'm trying to package your module as an rpm package. So I'm using the typical PEP517 based build, install and test cycle used on building packages from non-root account.
- `python3 -sBm build -w --no…
-
## 論文リンク
https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
## 公開日(yyyy/mm/dd)
2019/02/14
## 概要
GPT-2 の論文。
15 億というパラメタ数の言語モデルである GPT-2 をウェブ…