-
Hi, thanks for your wonderful work, and congratulations on your paper being accepted at ICLR 2023.
I have some questions about the expression (i.e., "A sentence with a large norm is usually not vi…
-
I have set up the environment and downloaded the dataset using the Dockerfile provided in the repo, and I have already modified the data locations in the config files. When I execute `python driver.py --conf…
-
I'm getting an `IsADirectoryError` when I run the CLI demo.
```bash
python cli_demo.py
____ _ ____ __ __ …
```
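For context, `IsADirectoryError` is the `OSError` subclass Python raises when a file operation such as `open()` is handed a path that points to a directory, typically because a config points at a checkpoint folder rather than a file inside it. A minimal sketch of a guard that surfaces the offending path; `MODEL_PATH` is a hypothetical placeholder, not the demo's actual config key:

```python
import os

# Hypothetical placeholder: substitute the path the demo actually loads.
MODEL_PATH = "checkpoints/model"

# IsADirectoryError means open() received a directory path; checking
# first makes the failure message self-explanatory.
if os.path.isdir(MODEL_PATH):
    raise SystemExit(
        f"{MODEL_PATH!r} is a directory; point the config at a file "
        "inside it (e.g., a specific checkpoint file)."
    )

with open(MODEL_PATH, "rb") as f:
    data = f.read()
```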
-
Hi,
1. What is the pretraining corpus of the released `GLM-Large-Chinese`/`GLM-10B-Chinese`? Is it `Wiki+BookCorpus` (per the README) or `wudao baike zhihu` (per `config/ds_block_large_chinese.sh`)?
2. Besides, how …
-
Model I am using: VLMO. I found that the text-only data is loaded from `wikibk.{index}.txt` where index = 0, 1, ..., 49. How can I get these .txt files?
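For reference, a minimal sketch of how 50 shards with that naming scheme would be enumerated and read; `data_dir` is a hypothetical location, not one taken from the repo:

```python
from pathlib import Path

# Hypothetical location of the text-only shards; adjust as needed.
data_dir = Path("data/wikibk")

# The loader expects 50 plain-text shards named wikibk.0.txt ... wikibk.49.txt.
for index in range(50):
    shard = data_dir / f"wikibk.{index}.txt"
    if not shard.exists():
        print(f"missing shard: {shard}")
        continue
    with shard.open(encoding="utf-8") as f:
        for line in f:
            ...  # one text sample per line (assumed)
```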
-
Hello!
I am wondering what the correct data preprocessing command is for the final recipe. Could you add this information to the README?
Also, is there a straightforward way to restrict memory …
-
It seems that the BookCorpus data downloaded through the library was pretokenized with NLTK's Treebank tokenizer, which changes the text in ways incompatible with how, for instance, BERT's wordpiece toke…
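To illustrate the mismatch (a sketch, assuming the standard NLTK API): the Treebank tokenizer rewrites the surface text, e.g. splitting contractions and mapping quote characters, so a WordPiece tokenizer no longer sees the strings it was trained on.

```python
from nltk.tokenize import TreebankWordTokenizer

text = 'He said, "Don\'t worry."'
tokens = TreebankWordTokenizer().tokenize(text)
print(tokens)
# ['He', 'said', ',', '``', 'Do', "n't", 'worry', '.', "''"]
# Contractions are split ("Do", "n't") and quotes become `` / '',
# so joining these tokens back does not reproduce the original text.
```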
-
1. Download the Vicuna model from here: [vicuna model](https://huggingface.co/lmsys/vicuna-7b-v1.3).
2. Because of a network problem, I downloaded `book corpus.tar.bz2` and uncompressed it (a sketch follows below):
and …
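For step 2, a minimal sketch of extracting the archive with Python's standard library; the archive name is taken from the report above, and the destination directory is a hypothetical choice:

```python
import tarfile

# Archive name as given above; destination is a hypothetical choice.
archive = "book corpus.tar.bz2"
dest = "bookcorpus"

# "r:bz2" opens a bzip2-compressed tar; extractall unpacks into dest.
with tarfile.open(archive, "r:bz2") as tar:
    tar.extractall(path=dest)
```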
-
Hi, thanks for your excellent work.
As described in the Experiment Settings section, the pretraining of RetroMAE with enhanced decoding was finished on 8×A100 GPUs. Could you please tell us how man…
-
Hi! Thank you for sharing the code for the LogiGAN paper.
I'm having trouble creating the training set. In particular:
1. [Here](https://github.com/microsoft/ContextualSP/blob/master/logigan/corpus_constru…