-
I tried to download the BookCorpus data. So far I have only downloaded around 5,000 books. Has anyone managed to get all the books? I ran into a lot of `HTTP Error: 403 Forbidden` responses. How can I fix this? Or can I get all the…
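One way to cope with intermittent 403s is to retry with an increasing delay and a browser-like User-Agent. A minimal sketch, assuming the books are fetched over HTTP with `requests`; the URL list and output layout here are hypothetical, not from the repo:

```python
import time
import requests

# Hypothetical list of (book_id, url) pairs to fetch; not part of the repo.
BOOKS = [("example-id", "https://example.com/book.epub")]

# Some hosts return 403 for the default requests User-Agent.
HEADERS = {"User-Agent": "Mozilla/5.0"}

def fetch(url, retries=5, backoff=10.0):
    """Retry with growing delays; give up after `retries` attempts."""
    for attempt in range(retries):
        resp = requests.get(url, headers=HEADERS, timeout=30)
        if resp.status_code == 200:
            return resp.content
        # 403/429 often mean rate limiting; wait longer each time.
        time.sleep(backoff * (attempt + 1))
    return None

for book_id, url in BOOKS:
    data = fetch(url)
    if data is None:
        print(f"skipped {book_id}: still forbidden after retries")
    else:
        with open(f"{book_id}.epub", "wb") as f:
            f.write(data)
```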
-
With issues like #9, could training be improved by starting at a lower resolution, possibly also with simpler documents (i.e. generated or large-font docs, as in the Pix2Struct pretraining), before moving to training…
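For what it's worth, a resolution curriculum can be as simple as resizing batches according to the epoch. A minimal sketch assuming a PyTorch-style loop; the schedule and helper names are illustrative, not from this repo:

```python
import torch
import torch.nn.functional as F

# Illustrative schedule: (start_epoch, input side length in pixels).
RESOLUTION_SCHEDULE = [(0, 224), (5, 448), (10, 896)]

def resolution_for_epoch(epoch):
    """Pick the largest resolution whose start epoch has been reached."""
    side = RESOLUTION_SCHEDULE[0][1]
    for start, s in RESOLUTION_SCHEDULE:
        if epoch >= start:
            side = s
    return side

def resize_batch(images, side):
    """Resize a batch of page images (N, C, H, W) to the current resolution."""
    return F.interpolate(images, size=(side, side), mode="bilinear", align_corners=False)

# Inside the training loop, each batch would be resized before the forward pass:
# images = resize_batch(images, resolution_for_epoch(epoch))
```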
-
In the article "BERT: Pre-training of Deep..", it mentions that the Wikipedia and BookCorpus datasets are used for pretraining. When I try to generate my own data from Wikipedia, I get about 5.5 million artic…
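For reference, BERT's create_pretraining_data.py expects plain text with one sentence per line and a blank line between documents, so the article count is just the number of blank-line-separated blocks. A minimal counting sketch (the file path is hypothetical):

```python
def count_documents(path):
    """Count documents in a BERT-style corpus file:
    one sentence per line, blank line between documents."""
    docs, in_doc = 0, False
    with open(path, encoding="utf-8") as f:
        for line in f:
            if line.strip():
                in_doc = True
            elif in_doc:
                docs += 1
                in_doc = False
    return docs + (1 if in_doc else 0)

print(count_documents("wiki_corpus.txt"))  # hypothetical path
```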
-
I wonder whether this code reproduces the same results as described in the paper. Also, when do you plan to release your pre-trained model? Thanks a lot.
-
Hi. Thank you very much for your skip-thought vectors.
I can use the encode() function with the downloaded data (utable.npy, btable.npy, uni_skip.npz, etc.).
Now, I want to decode my encoded sentences…
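For context, encoding with the pretrained models usually looks roughly like the sketch below (from memory of the repo's README, so treat it as a hedged example); decoding is not part of this interface and would need the separate decoder training code in the repo:

```python
import skipthoughts

# Load the pretrained tables/models (utable.npy, btable.npy, uni_skip.npz, etc.
# must already be downloaded and their paths configured in skipthoughts.py).
model = skipthoughts.load_model()
encoder = skipthoughts.Encoder(model)

sentences = ["the quick brown fox", "jumped over the lazy dog"]
vectors = encoder.encode(sentences)  # one combine-skip vector per sentence
print(vectors.shape)
```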
-
Hey author!
I want to load the word properties file in "Pulication/ano3 word frequency VS l2_norm.ipynb/Dataset preparation", but I have a problem. Could you provide the word properties file …
-
A model has been trained according to the instructions [here](https://github.com/ryankiros/skip-thoughts/tree/master/training). I can load the model using the following commands:
```
import tools
embe…
```
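From memory of the training README, the loading step continues roughly like this (hedged; the paths inside tools.py must point at the trained model and at the word2vec vectors used for vocabulary expansion):

```python
import tools

# Word embeddings used for vocabulary expansion (path configured in tools.py).
embed_map = tools.load_googlenews_vectors()
model = tools.load_model(embed_map)

# Encode raw sentences with the freshly trained model.
vectors = tools.encode(model, ["an example sentence to embed"])
```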
-
Currently, this is the main bottleneck in the program and takes more than 80% of the running time.
We expect that as the size of the data grows, this share will become even larger.
Try to think a little about how t…
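Before optimizing, it may help to confirm where that 80% actually goes. A minimal profiling sketch using only the standard library; the profiled function is a hypothetical stand-in for the slow step:

```python
import cProfile
import pstats

def process_data():
    """Hypothetical stand-in for the step being discussed."""
    total = 0
    for i in range(10**6):
        total += i * i
    return total

cProfile.run("process_data()", "profile.out")
stats = pstats.Stats("profile.out")
stats.sort_stats("cumulative").print_stats(10)  # show the top 10 offenders
```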
-
Hello, author. I have a few questions and hope you can kindly advise:
1. Pre-training starts right away with a pile of hyperparameters (which file are these hyperparameters in?); the last sentence of the pre-train section seems to be the training command, followed by a pile of arguments. What command exactly do I need to run to proceed?
2. My server has only one GPU. To run your code, do I need to change some configuration? If so, which parameters exactly (a single-GPU sketch follows below)?
3. Which file contains the dataset path? I couldn't find it.
4. "we use the English Wiki…
-
Here we combine all the datasets we can collect (a minimal merging sketch follows the list):
- [OSCAR's CommonCrawl Dataset](https://traces1.inria.fr/oscar/)
- [Arabic BERT Corpus](https://www.kaggle.com/abedkhooli/arabic-bert-corpus)
- [Hi…
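A minimal merging sketch, assuming each source corpus has already been extracted to plain-text files under a hypothetical corpora/<name>/ layout:

```python
from pathlib import Path

# Merge every extracted text file into one training corpus,
# dropping empty lines and exact duplicate lines across sources.
seen = set()
with open("combined_corpus.txt", "w", encoding="utf-8") as out:
    for path in sorted(Path("corpora").glob("*/*.txt")):
        for line in path.open(encoding="utf-8"):
            line = line.strip()
            if line and line not in seen:
                seen.add(line)
                out.write(line + "\n")
```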