-
# My question
Why does catastrophic forgetting occur when I perform continued pre-training on Llama 3? I used open source data from BookCorpus, iterated 100,000 steps, and then after testing the trai…
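For reference, a minimal sketch of the kind of continued pre-training setup described here, using Hugging Face `transformers` and `datasets`; the model id, step count, and hyperparameters below are placeholders rather than the exact configuration of the original run.

```python
# Minimal sketch of continued pre-training on BookCorpus.
# Hyperparameters and the base checkpoint are illustrative assumptions.
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "meta-llama/Meta-Llama-3-8B"   # assumed base checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# Newer versions of `datasets` may require the namespaced id "bookcorpus/bookcorpus".
dataset = load_dataset("bookcorpus", split="train")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = dataset.map(tokenize, batched=True, remove_columns=["text"])

args = TrainingArguments(
    output_dir="llama3-bookcorpus-cpt",
    max_steps=100_000,               # matches the step count mentioned above
    per_device_train_batch_size=4,
    learning_rate=2e-5,
    save_steps=10_000,
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```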
-
I tried running the code on a small dataset and found that pred_loss decreases quickly while avg_acc stays at 50%. This is strange to me, since a decrease in pred_loss should indicate an increase in accuracy.
![im…
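One way this can happen (a toy illustration with made-up numbers, not the poster's code): binary cross-entropy can keep falling while the thresholded predictions never change, so `avg_acc` sits at chance level even though `pred_loss` improves.

```python
# Toy example: cross-entropy loss falls while thresholded accuracy stays
# at 50%, because the probabilities never cross the 0.5 decision boundary
# for the misclassified examples. Names and numbers are illustrative only.
import numpy as np

labels = np.array([1, 0, 1, 0])

def metrics(probs, labels):
    pred_loss = -np.mean(labels * np.log(probs) + (1 - labels) * np.log(1 - probs))
    avg_acc = np.mean((probs > 0.5).astype(int) == labels)
    return pred_loss, avg_acc

probs_early = np.array([0.45, 0.30, 0.60, 0.70])
probs_late  = np.array([0.49, 0.05, 0.95, 0.51])

print(metrics(probs_early, labels))  # ~ (0.72, 0.5)
print(metrics(probs_late, labels))   # ~ (0.38, 0.5) -- lower loss, same accuracy
```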
-
I think there is a bug in the standard sentence tokenizer `sent_tokenize`. The problem is that it does not split text into sentences in a certain case. Here is the case where the tokenizer fails to …
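The failing input is cut off above, so as a hedge, here is a sketch of one well-known class of cases where NLTK's `sent_tokenize` does not split: a sentence boundary with no whitespace after the period (the text below is made up).

```python
# Sketch reproducing one known class of sent_tokenize failures
# (missing whitespace after the period). The text is made up; the
# original issue's failing input is truncated above.
import nltk
from nltk.tokenize import sent_tokenize

nltk.download("punkt", quiet=True)

text = "The model finished training.It was then evaluated on the test set."
print(sent_tokenize(text))
# Typically prints a single sentence instead of two, because Punkt does
# not treat a period with no following whitespace as a sentence boundary.
```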
-
I am trying to train the ssd_inception_v2 model.
The training breaks with the following error:
ResourceExhaustedError (see above for traceback): OOM when allocating tensor with shape[1917,1] …
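This is a GPU out-of-memory error rather than a model bug; the usual first step is to lower `batch_size` in the TF Object Detection API `pipeline.config`. A hypothetical excerpt (the value shown is illustrative, not the original configuration):

```
# Hypothetical excerpt of pipeline.config for ssd_inception_v2.
# The batch_size value is illustrative; reduce it until the OOM disappears.
train_config {
  batch_size: 8
  fine_tune_checkpoint: "PATH_TO_BE_CONFIGURED/model.ckpt"
}
```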
-
How do I download the dolly or RoBERTa Corpus dataset?
Please give the URL. Thanks~
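A minimal sketch of how one might fetch these with the Hugging Face `datasets` library; the Hub id `databricks/databricks-dolly-15k` is the published Dolly instruction dataset, while the full RoBERTa pre-training corpus was never released as a single dataset, so only public components such as BookCorpus are loadable.

```python
# Sketch of downloading these corpora with the `datasets` library.
# Hub ids are assumptions: Dolly is published by Databricks, and the
# RoBERTa pre-training corpus exists only as its public components
# (e.g. BookCorpus, Wikipedia), not as one downloadable dataset.
from datasets import load_dataset

dolly = load_dataset("databricks/databricks-dolly-15k", split="train")
bookcorpus = load_dataset("bookcorpus", split="train")

print(dolly[0]["instruction"])
print(bookcorpus[0]["text"])
```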
-
### Describe the bug
This bug is triggered under the following conditions:
- datasets repo ids without organization names trigger errors, such as `bookcorpus`, `gsm8k`, `wikipedia`, rather than …
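For illustration (the full error output is truncated above), a sketch contrasting the bare, organization-less ids from the report with namespaced ids; the namespaced ids below are assumptions about where these canonical datasets currently live on the Hub.

```python
# Sketch of the two id styles; the issue's full error output is truncated
# above. Namespaced ids are assumptions, not taken from the report.
from datasets import load_dataset

# Bare, organization-less ids as used in the report:
# load_dataset("bookcorpus")
# load_dataset("gsm8k", "main")

# A namespaced equivalent:
gsm8k = load_dataset("openai/gsm8k", "main", split="test")
print(gsm8k[0]["question"])
```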
-
I'm learning how to train a language model from scratch, and I was training a 120M tinyLlama model with bookcorpus. I wonder how I can evaluate the checkpoints using GLUE. I have read EVAL.md which s…
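EVAL.md is specific to that repo, but as a generic sketch of scoring a checkpoint on one GLUE task: attach a sequence-classification head, fine-tune briefly on SST-2, and call `Trainer.evaluate()`. Paths and hyperparameters below are placeholders, not the repo's recipe.

```python
# Generic sketch (not the repo's EVAL.md recipe) for scoring a causal-LM
# checkpoint on one GLUE task. The checkpoint path and hyperparameters
# are placeholders.
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

ckpt = "path/to/tinyllama-120m-checkpoint"   # placeholder
tokenizer = AutoTokenizer.from_pretrained(ckpt)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForSequenceClassification.from_pretrained(ckpt, num_labels=2)
model.config.pad_token_id = tokenizer.pad_token_id

sst2 = load_dataset("glue", "sst2")
encoded = sst2.map(lambda b: tokenizer(b["sentence"], truncation=True), batched=True)

args = TrainingArguments(output_dir="glue-sst2-eval",
                         per_device_train_batch_size=16,
                         per_device_eval_batch_size=32,
                         num_train_epochs=1)
trainer = Trainer(model=model, args=args,
                  train_dataset=encoded["train"],
                  eval_dataset=encoded["validation"],
                  tokenizer=tokenizer)
trainer.train()
print(trainer.evaluate())   # reports eval_loss; add a compute_metrics fn for accuracy
```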
-
Hi, I'm running into the following error when attempting to train bert with ds_train_bert_bsz64k_seq128_m.sh. I printed out all tensor shapes in the batch and it looks fine since I used train_micro_ba…
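Without the full traceback this is a guess, but batch-shape problems in the DeepSpeed BERT scripts are often a batch-size bookkeeping issue: DeepSpeed requires `train_batch_size` to equal the per-GPU micro batch size times `gradient_accumulation_steps` times the number of GPUs. A hypothetical config excerpt illustrating the relation (values are examples, not the script's defaults):

```python
# Hypothetical DeepSpeed config dict illustrating the required relation
# train_batch_size == train_micro_batch_size_per_gpu * gradient_accumulation_steps * n_gpus.
# Values are examples only, not the defaults of ds_train_bert_bsz64k_seq128_m.sh.
n_gpus = 128                 # assumed world size
micro_batch_per_gpu = 64
grad_accum_steps = 8

ds_config = {
    "train_micro_batch_size_per_gpu": micro_batch_per_gpu,
    "gradient_accumulation_steps": grad_accum_steps,
    "train_batch_size": micro_batch_per_gpu * grad_accum_steps * n_gpus,  # 65536, the "bsz64k" in the script name
}
print(ds_config["train_batch_size"])
```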
-
The Pile and its BookCorpus subset are not available. I am also unable to download this dataset for pre-training GPT. Is there any other dataset to replace it, or is there a backup of the previous dataset…
-
The link (https://the-eye.eu/public/AI/pile_neox/data/BookCorpusDataset_text_document.bin) has expired.
![image](https://github.com/microsoft/Megatron-DeepSpeed/assets/41630003/916ca98b-0324-44de-928…