-
Hello, I have a problem with the following lines about the MLM/span mask:
https://github.com/pkusjh/LASS/blob/43f4c5e76525120ead3097fc3952f719db6dc695/model/data_collator.py#L24
https://github.com/p…
-
I have some questions regarding language models.
1. I want to build a character-level LM. Previously I built a word-level LM using KenLM, and for that I created a text file with one sentence per line…
-
as a way of introducing modeling?
There's more than one way to do it (C sharp vs. D flat, synonyms, phrasing, denotation vs. connotation)
Multiple referents: "There's a dog" - is that about the …
-
The implementation of the SML (static modeling language) is currently broken down into stages:
- an intermediate representation (IR) based on a DAG (https://github.com/probcomp/Gen/blob/master/src/…
-
When displaying language modeling results, color-code each token based on its probability.
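One possible way to do this, as a minimal sketch rather than a proposal for the actual implementation: map each token's probability to a red-to-green background color and emit HTML spans. The function names `prob_to_color` and `render_tokens` are hypothetical, and the sketch assumes the per-token probabilities are already available as floats in [0, 1].

```python
import html


def prob_to_color(p: float) -> str:
    """Map a probability in [0, 1] to a hex color (red = unlikely, green = likely)."""
    p = min(max(p, 0.0), 1.0)  # clamp out-of-range values
    red = round(255 * (1.0 - p))
    green = round(255 * p)
    return f"#{red:02x}{green:02x}00"


def render_tokens(tokens, probs) -> str:
    """Return an HTML string with one colored <span> per token (hypothetical helper)."""
    if len(tokens) != len(probs):
        raise ValueError("tokens and probs must have the same length")
    spans = []
    for tok, p in zip(tokens, probs):
        spans.append(
            f'<span style="background-color:{prob_to_color(p)}" '
            f'title="p={p:.3f}">{html.escape(tok)}</span>'
        )
    return "".join(spans)


if __name__ == "__main__":
    # Illustrative values only; real probabilities would come from the model.
    print(render_tokens(["The", " cat", " sat"], [0.9, 0.4, 0.05]))
```

A linear red-to-green ramp is only one choice; a log-probability scale or a colorblind-friendly palette may be preferable when most tokens have very small probabilities.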
-
**Describe what problem your feature request solves**
The Risk Analysis and Assessment Modeling Language (RAAML) specification is a SysML-compliant format that would allow integration with other …
-
I have multiple checkpoints produced after running `examples/nlp/language_modeling/megatron_gpt_continue_training.py`.
However, I am unable to use `examples/nlp/language_modeling/megatron_ckpt_to_…
-
From the Community forums at https://community.software.sil.org/t/keyman-roadmap-march-2020/822/29:
> Being able to identify in the tsv file what types of words can take what types of prefixes and …
-
link text: [VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs](https://arxiv.org/pdf/2306.02858)
Actual title of the paper: Video-LLaMA An Instruction-tuned Audi…
-
Right now, we have:
* `streaming_language_modeling` (which we use mainly for pre-training - requires data to be streamed in as text / tokenized-on-the-fly as opposed to being tokenized ahead of time)…