-
https://aclanthology.org/2021.wnut-1.31.pdf
Venue: ACL 2021
-
Hi there!
I looked through the corpora and found that sometimes they are not fully downloaded. I am not sure whether the issue is with the download scripts. Below are some examples grepped from bookcorp…
-
Hi Deepspeed team,
The BERT performance results from the blog post [Microsoft DeepSpeed achieves the fastest BERT training time](https://www.deepspeed.ai/news/2020/05/27/fastest-bert-training.html) are very impres…
-
# My question
Why does catastrophic forgetting occur when I perform continued pre-training on Llama 3? I used open-source data from BookCorpus, iterated for 100,000 steps, and then after testing the trai…
-
Hu et al. (2019)'s paper is summarised in #35. Their code is [here](https://github.com/iris2hu/diachronic-sense-modeling).
@kasparvonbeelen started implementing their method in #18.
Currently n…
-
## Description
- I want to train a BERT model on a GPU, but I am having some problems. My configuration:
* Software environment: Python: 3.7.7, Cuda: 10.2
* Install MXNet: `pip install mxnet-cu102` , ve…
-
- https://arxiv.org/abs/1911.05507v1
- 2019
We present the Compressive Transformer, an attentive sequence model that compresses past memories for long-range sequence learning.
The Compressive Transformer achieves 17.1 ppl and 0.97 bpc on the WikiText-103 and Enwik8 benchmarks, respectively…
e4exp, updated 3 years ago
-
I think there is a bug in the standard sentence tokenizer `sent_tokenize`. The problem is that it does not split text into sentences in a certain case. Here is the case where the tokenizer fails to …
-
### Link to the paper
[[arXiv:2004.02984] MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices](https://arxiv.org/abs/2004.02984)
### Authors and affiliations
Zhiqing Sun, Hongkun Yu, Xiaodan Song, Ren…
-
BERT for Joint Intent Classification and Slot Filling
Qian Chen, Zhu Zhuo, Wen Wang
4 pages, 1 figure
https://arxiv.org/abs/1902.10909
## Summary
Using BERT, a new language-representation model, the authors perform joint intent classification and slot filling.
…