knowledge-pretraining Search Results

322 results for knowledge-pretraining


davidberenstein1957/concise-concepts #30

Question: How to use (external) transformer-based embeddings…

Hi, your idea of "concise concepts" sounds really intriguing! However, I would like to use transformer-based embeddings - as far as I can see it from the source code, you rely on `(word, vector)` t…

repodiac updated 1 year ago
3
ggerganov/llama.cpp #3475

Tokenizer not picking the right tokens ( mistral openorca )

Tested with 019ba1dcd0c7775a5ac0f7442634a330eb0173cc Model https://huggingface.co/Open-Orca/Mistral-7B-OpenOrca/tree/main converted and quantized to q8_0 from scratch. In case of mistral openorc…

staviq updated 1 year ago
40
microsoft/unilm #878

Some questions about VQ-KD

Hello, when I read and reproduce your work, there is a consistent question about VQ-KD. When training MIM, it can be regarded as an offline teacher or Tokenizer, but can't it perform Imagenet classifi…

Basums updated 1 year ago
9
ersilia-os/ersilia #828

✍️ Contribution period: Sarima Chiorlu

### Week 1 - Get to know the community - [x] Join the communication channels - [x] Open a GitHub issue (this one!) - [x] Install the Ersilia Model Hub and test the simplest model - [x] Write a motiva…

Richiio updated 1 year ago
37
microsoft/Cream #141

Model architecture search in TinyViT framework

I have tried finding the search algorithm to find tinier versions of the parent model, using "constrained local search" as mentioned in the paper for reproducing your work. Could you release the s…

NKSagarReddy updated 1 year ago
3
TencentAILabHealthcare/scBERT #11

What is the output of pre-trained model, and how it shows th…

What is the output of pre-trained model, and how it shows the information from the dataset? Is the output still kind of expression matrix which shows the information between Genes. I don't really get …

cfjiang123 updated 1 year ago
1
seoulsky-field/CXRAIL-dev #11

Features: Append More Options on CXR dataloader

### What - The more I looked at previous work on CheXpert, such as Issue #9, I saw that some options needed to be added. 1. Label Smoothing 2. Conditional Training ### Why - Lank 2 pape…

juppak updated 1 year ago
8
baaivision/EVA #28

Some questions about EVA pretraining

1. In your opinion, is EVA a method of both model scaling and data scaling? Does pretraining with more data (such as the data used in CLIP finetuning) yield better results than using only the 30M data…

conicoco1993 updated 1 year ago
13
Significant-Gravitas/AutoGPT #346

How about let AutoGPT to access a virtual machine like Virtu…

### Duplicates - [X] I have searched the existing issues ### Summary 💡 1. Attach to a VirtualBox instance, give AI a default OS like ubuntu 2. if AI decide to use computer: enter "screenshot-mouse…

artheru updated 1 year ago
26
PaddlePaddle/RocketQA #78

Training with the provided example does not output a model

The log is as follows: ``` E:\IdeaProjects\knowledge-model\rocketqa_es>python example.py RocketQA model [zh_dureader_de] WARNING:root:paddle.fluid.layers.py_reader() may be deprecated in the near future. Please use…

wenlincheng updated 1 year ago
2
