-
# ❓ Questions & Help
## Details
Hey, I want to load the cnn-dailymail dataset for fine-tuning.
I wrote the code like this:
from datasets import load_dataset
test_dataset = load_dataset("cn…
AI678 updated 3 years ago
-
## Environment info
- `transformers` version: 4.7.0.dev0
- Platform: Windows-10-10.0.19041-SP0
- Python version: 3.8.0
- PyTorch version (GPU?): 1.8.1 (True)
- Tensorflow version (GPU?): not inst…
-
- AI News
- EMNLP 2021 - rebuttal period over: great work, everyone. Good luck!
- NeurIPS 2021 - review period over
- NVidia Jetson developer meetup (July 22, 2021)
- Google ML Bootcamp applications open (through Aug 2): https://events.withg…
-
## ❓ Questions and Help
#### What is your question?
Hi,
What corpus is the [publicly available BART-base](https://huggingface.co/facebook/bart-base) pre-trained on?
It was not explicitly …
-
Anyone know where to get them?
Thank you!
-
```python
# benchmark_filter.py
import logging
import sys
import time

from datasets import load_dataset, set_caching_enabled

if __name__ == "__main__":
    set_caching_enabled(False)
    …
```
-
# 🚀 Feature request
Hello, I am trying to pretrain a custom model from scratch on bookcorpus + wikipedia + openwebtext, but I only have a 1TB disk. I tried to merge 20% of each one and then reload t…
-
I've been very excited about this amazing datasets project. However, I've noticed that the performance can be substantially slower than using an in-memory dataset.
Now, this is expected I guess, du…
-
**Describe the bug**
I run the tutorial on https://pytorch.org/hub/nvidia_deeplearningexamples_waveglow/
and I got errors
`AttributeError: 'Tacotron2' object has no attribute 'text_to_sequence'`
…
-
I computed the sentence embedding of each sentence of the bookcorpus data using BERT-base and saved them to disk. I used 20M sentences, and the resulting Arrow file is about 59GB while the original text fil…
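That file size is roughly what dense float32 vectors cost; a back-of-the-envelope check, assuming BERT-base's 768-dimensional embeddings stored as float32:

```python
# 20M sentences x 768 dims x 4 bytes per value (float32)
n_sentences = 20_000_000
dim = 768                # bert-base hidden size
bytes_per_value = 4      # float32
total_gb = n_sentences * dim * bytes_per_value / 1e9
print(round(total_gb, 1))  # 61.4
```

So a ~59GB Arrow file is in line with raw vector storage; switching to float16 would roughly halve it.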