-
Hi,
your idea of "concise concepts" sounds really intriguing! However, I would like to use transformer-based embeddings. As far as I can see from the source code, you rely on `(word, vector)` t…
-
Tested with 019ba1dcd0c7775a5ac0f7442634a330eb0173cc
Model https://huggingface.co/Open-Orca/Mistral-7B-OpenOrca/tree/main converted and quantized to q8_0 from scratch.
In case of mistral openorc…
-
Hello, while reading and reproducing your work, I keep coming back to one question about VQ-KD. When training MIM, it can be regarded as an offline teacher or tokenizer, but can't it perform ImageNet classifi…
-
### Week 1 - Get to know the community
- [x] Join the communication channels
- [x] Open a GitHub issue (this one!)
- [x] Install the Ersilia Model Hub and test the simplest model
- [x] Write a motiva…
-
To reproduce your work, I have been looking for the search algorithm used to find tinier versions of the parent model (the "constrained local search" mentioned in the paper).
Could you release the s…
-
What is the output of the pre-trained model, and how does it show the information from the dataset? Is the output still a kind of expression matrix that shows the information between genes? I don't really get …
-
### What
- After looking at previous work on CheXpert, such as Issue #9, I saw that some options need to be added.
1. Label Smoothing
2. Conditional Training
### Why
- Rank 2 pape…
-
1. In your opinion, is EVA a method of both model scaling and data scaling? Does pretraining with more data (such as the data used in CLIP finetuning) yield better results than using only the 30M data…
-
### Duplicates
- [X] I have searched the existing issues
### Summary 💡
1. Attach to a VirtualBox instance and give the AI a default OS such as Ubuntu
2. If the AI decides to use the computer: enter "screenshot-mouse…
-
The log is as follows:
```
E:\IdeaProjects\knowledge-model\rocketqa_es>python example.py
RocketQA model [zh_dureader_de]
WARNING:root:paddle.fluid.layers.py_reader() may be deprecated in the near future. Please use…