bookcorpus Search Results

211 results
for bookcorpus

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/datasets #2829

Optimize streaming from TAR archives

Hi ! As you know TAR has some constraints for data streaming. While it is optimized for buffering, the files in the TAR archive **need to be streamed in order**. It means that we can't choose which fi…

lhoestq updated 2 years ago
1
huggingface/transformers #9881

DeBERTa pretraining using MLM: model gradients become NAN

## Environment info - `transformers` version: 4.3.0.dev0 - Platform: Ubuntu - Python version: 3.6.12 - PyTorch version : 1.7.1 - Using GPU in script?: Y - Using distributed or parallel set-up …

mansimane updated 1 year ago
7
NorbertZheng/read-papers #71

Sik-Ho Tsang | Review: Representation Learning with Contrast…

Sik-Ho Tsang. [Review: Representation Learning with Contrastive Predictive Coding (CPC/CPCv1)](https://sh-tsang.medium.com/review-representation-learning-with-contrastive-predictive-coding-cpc-cpcv1-8…

NorbertZheng updated 1 year ago
11
xplip/pixel #5

rendering instructions

Thanks for your wonderful work! I am very interested in your work and try to extend the ideas to other tasks. I wonder when the rendering instructions will be available?

mzhaoshuai updated 1 year ago
3
apollographql/apollo-client #2303

IntrospectionFragmentMatcher does not seem to function in v2

Initialization of the client: ```js // @flow import { ApolloClient } from 'apollo-client'; import { HttpLink } from 'apollo-link-http'; import { InMemoryCache, IntrospectionFrag…

gajus updated 1 year ago
3
huggingface/transformers #2534

DistilBERT accuracies on the glue test set.

## ❓ Questions & Help I need to compare my research against distilBERT as a baseline for a paper in progress. I went through your publication and found that you don't report accuracies on the glu…

smr97 updated 2 years ago
3
huggingface/datasets #847

multiprocessing in dataset map "can only test a child proces…

Using a dataset with a single 'text' field and a fast tokenizer in a jupyter notebook. ``` def tokenizer_fn(example): return tokenizer.batch_encode_plus(example['text']) ds_tokenized = te…

timothyjlaurent updated 1 year ago
9
deep-spin/sparse_text_generation #3

BookCorpus

Hi authors, Thanks for sharing the repo. It seems that the original version of the dataset is unavailable online. If you have a copy, can you make it available for download? Otherwise, please sugge…

kgarg8 updated 2 years ago
1
IntelLabs/academic-budget-bert #20

How to combine wiki and bookcorpus into one file?

I found that in the dataset description, we can `Use process_data.py for pre-processing wikipedia/bookcorpus datasets into a single text file.` What if I want to process these two datasets at the s…

shizhediao updated 2 years ago
4
facebookresearch/fairseq #3936

The ratio between train set (trainpref) and valid set (valid…

Hello, I am very insterested in trying to pre-train the langage model from scratch. But I am not sure about the ratio between validpref and trainpref in actual pretraining? For example, BERT, the co…

WangJiexin updated 2 years ago
2

上一页 1...10 11 12 13 14 15 16...22 下一页

211 results for bookcorpus

211 results
for bookcorpus