-
Hello,
I'm having an issue trying to load a model (base) on TensorFlow 2.0.
When loading checkpoints across devices (e.g. restoring on CPU a checkpoint saved on GPU) in TensorFlow, we usually use the following:
…
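For reference, a minimal TensorFlow 2 sketch of one common approach (the model, layer size, and checkpoint directory below are placeholders, not the original poster's setup): build the variables under a CPU device scope, then restore values from a checkpoint that was written on a GPU machine.

```python
import tensorflow as tf

# Placeholder model; build its variables on CPU so the restored
# weights end up there, even if the checkpoint came from a GPU run.
with tf.device('/CPU:0'):
    model = tf.keras.Sequential([tf.keras.layers.Dense(10)])
    model.build(input_shape=(None, 128))

ckpt = tf.train.Checkpoint(model=model)
status = ckpt.restore(tf.train.latest_checkpoint('checkpoint_dir'))
status.expect_partial()  # fine if optimizer slots are absent from the checkpoint
```

Since `tf.train.Checkpoint` restores by matching object structure rather than device, where the variables were created before calling `restore` is what determines where the weights land.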
-
# prompt
Calibrate Before Use: Improving Few-Shot Performance of Language Models (https://arxiv.org/abs/2102.09690)
Prompt tuning: The Power of Scale for Parameter-Efficient Prompt Tuning (https://arxiv.org/abs/2104.08691)
Do Prompt-Based Models Really Underst…
-
- [LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day](https://arxiv.org/abs/2306.00890)
- [MEDITRON-70B: Scaling Medical Pretraining for Large Language Models](http…
-
I have just checked the encodings that AutoTokenizer produces. It seems that for the words "wuhan", "ncov", "coronavirus", "covid", or "sars-cov-2" it produces more than one token, while the tokenizer produces on…
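A quick way to inspect this, using a placeholder checkpoint name (substitute whichever model is actually being loaded):

```python
from transformers import AutoTokenizer

# "bert-base-uncased" is only a placeholder checkpoint here.
tok = AutoTokenizer.from_pretrained("bert-base-uncased")

for word in ["wuhan", "ncov", "coronavirus", "covid", "sars-cov-2"]:
    # Words absent from the pretraining vocabulary are split into
    # several subword pieces instead of a single token.
    print(word, "->", tok.tokenize(word))
```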
-
BERT is pre-trained on Wikipedia and other sources of ordinary text, but my problem domain has a very specific vocabulary and grammar. Is there an easy way to train BERT completely from domain-specific…
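One route, sketched here with the Hugging Face libraries and a hypothetical corpus file, is to train a fresh WordPiece vocabulary on the domain text and initialize an untrained BERT sized to match, then run masked-LM pre-training:

```python
from tokenizers import BertWordPieceTokenizer
from transformers import BertConfig, BertForMaskedLM

# "domain_corpus.txt" is a hypothetical file of raw domain text.
tokenizer = BertWordPieceTokenizer(lowercase=True)
tokenizer.train(files=["domain_corpus.txt"], vocab_size=30522)
tokenizer.save_model("domain-tokenizer")

# A fresh, randomly initialized BERT whose embedding table
# matches the newly trained vocabulary.
config = BertConfig(vocab_size=tokenizer.get_vocab_size())
model = BertForMaskedLM(config)
# Pre-train with the masked-LM objective, e.g. via transformers'
# Trainer plus DataCollatorForLanguageModeling.
```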
-
**Case: SQuAD task, sequence length > 512**
Does your script utilize cached memory / extended context across segments, so that predictions are inferred from sequences longer than 512 tokens?
If…
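As an aside: when no segment-level cache (Transformer-XL style) is available, the usual SQuAD workaround is a sliding window with overlap (doc stride). A sketch with a Hugging Face fast tokenizer, using a placeholder checkpoint, question, and context:

```python
from transformers import AutoTokenizer

# Placeholder checkpoint; swap in the model actually used.
tok = AutoTokenizer.from_pretrained("bert-base-uncased")

enc = tok(
    "What was asked?",             # question
    "a very long context " * 500,  # context longer than the 512-token window
    max_length=384,
    stride=128,                     # overlap kept between adjacent windows
    truncation="only_second",       # only the context gets chunked
    return_overflowing_tokens=True,
)
print(len(enc["input_ids"]), "overlapping windows")
```

Predictions from the overlapping windows are then aggregated (e.g. by taking the highest-scoring span), which approximates, but does not equal, true long-context inference.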
-
## Problem statement
1. performance bottleneck in knowledge-based VQA due to a two-phase architecture consisting of knowledge retrieval from external sources and training the question answering task in super…
-
We could augment the BERT training data with English text, or text in other languages, machine-translated into Irish, and/or with automatic paraphrases of Irish text.
Is there previous work adding syn…
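As a sketch of the machine-translation route, assuming the Helsinki-NLP OPUS-MT English-to-Irish checkpoint is an acceptable choice (any MT system could stand in):

```python
from transformers import pipeline

# Assumed checkpoint: Helsinki-NLP's OPUS-MT English->Irish model.
translate = pipeline("translation", model="Helsinki-NLP/opus-mt-en-ga")

english = ["The weather is fine today."]
synthetic_irish = [out["translation_text"] for out in translate(english)]
print(synthetic_irish)  # synthetic Irish text for augmenting the corpus
```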
-
### 📦 Environment
Vercel
### 📌 Version
v1.26.11
### 💻 Operating System
Windows
### 🌐 Browser
Chrome
### 🐛 Bug Description
When "Get Model List" is pressed on Github, it reports "0 models avai…
-
[8.22-8.30] I want to spend this period researching this sub-direction.