-
Hey,
Great to see LISA implemented here.
As for the background, I am trying to finetune models with LORA other techniques on domain data but the Task i am doing is Causal LM is Next word Predict…
-
Hi, thank you for providing results from such a valuable research.
I'm trying to build word prediction model using your RNN code.
I successfully finished installation and built the first model!! Than…
-
@saiprasadb01 There seems to be a problem/bug when trying to select a word from the "predicted word list" in the Talk (Alt), Talk (ABC Alt), App (Alt) & App (ABC Alt) applications using the keyboard o…
-
### Steps to reproduce
1. Think of a sentence for example "how are you"
2. Type how
3. It doesn't show any prediction at all or if you type ar (since suggesting only appear after 2) it just show ra…
-
Hello,
I am working with the VRDU dataset, and I am attempting to normalize the bounding boxes for use with LayoutLMv3. In your [paper](https://arxiv.org/abs/2211.15421), I see that OCR is used, an…
-
Named after the scene from American Psycho... because this is probably going to be random, weird, and a little violent.
My program will generate a novel based on source material fed into it, using …
-
Hi, first, the book is amazing!
I have question about the NLP chapter (16). In the 11th question you asked to generate shakespearean text with "one of the recent language models (e.g. BERT) to gene…
-
For applications other than MT (e.g., image captioning, conversation modeling, etc.) it's crucial to prevent UNK from being generated at inference time. For greedy decoding, I could fix this by modify…
-
I am pre-training language model in Hindi of approx 9gb data and vocab of 32k approx. My parameters for training is
learning rate = 1e-4
warmup_steps=40k
max seq length = 128
training steps = 5000…
-
Hi M. H. Kwon,
Your tokenization script is really helpful.
I trained a bert model with custom corpus using Google's Scripts like create_pretraining_data.py, run_pretraining.py ,extract_features.py…