-
Your project is great. If I want to use knowledge distillation to teach your ViT-Adapter-S model to reproduce ViT-Adapter-L's human semantic segmentation performance, what should I be mindful of?
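For reference, the usual response-based recipe for segmentation is applied per pixel: soften both models' class distributions with a temperature and add a KL term to the normal cross-entropy. A minimal sketch, assuming the ViT-Adapter-L teacher and ViT-Adapter-S student both expose raw (N, C, H, W) logits; the temperature and loss weight are placeholder values, not settings from this repo.

```python
import torch.nn.functional as F

def seg_distillation_loss(student_logits, teacher_logits, labels,
                          temperature=2.0, alpha=0.5, ignore_index=255):
    """Pixel-wise KD for semantic segmentation: KL between softened
    per-pixel class distributions plus cross-entropy on the labels."""
    # The large teacher may predict at a different resolution than the
    # small student; resize its logits before comparing distributions.
    if teacher_logits.shape[-2:] != student_logits.shape[-2:]:
        teacher_logits = F.interpolate(teacher_logits,
                                       size=student_logits.shape[-2:],
                                       mode="bilinear", align_corners=False)

    # Flatten to (N*H*W, C) so each pixel is one soft-target example.
    c = student_logits.shape[1]
    s = student_logits.permute(0, 2, 3, 1).reshape(-1, c)
    t_logits = teacher_logits.permute(0, 2, 3, 1).reshape(-1, c)

    t = temperature
    kd = F.kl_div(F.log_softmax(s / t, dim=1),
                  F.softmax(t_logits.detach() / t, dim=1),
                  reduction="batchmean") * (t * t)
    ce = F.cross_entropy(student_logits, labels, ignore_index=ignore_index)
    return alpha * kd + (1.0 - alpha) * ce
```

The usual cautions apply: keep the teacher in eval mode and under `torch.no_grad()`, and sweep the temperature and `alpha` on a validation split, since the best values tend to be dataset-dependent.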
-
"[Object detection at 200 Frames Per Second] (https://arxiv.org/pdf/1805.06361.pdf)" In this paper, you can see a significant improvement in the performance of "tiny-yolov2".
Is there a way to use th…
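For what it's worth, the distillation in that paper is objectness-scaled: the teacher's objectness score gates how strongly each cell's class and box predictions are distilled, so background cells contribute little. A rough sketch under assumed tensor layouts (the dict keys and shapes below are illustrative, not this repo's actual head format):

```python
import torch
import torch.nn.functional as F

def objectness_scaled_distill(student_out, teacher_out):
    """Objectness-scaled distillation for a YOLO-style detection head.

    Both outputs are assumed to be dicts of per-cell predictions:
      'obj': (N, A) objectness logits
      'cls': (N, A, C) class logits
      'box': (N, A, 4) box parameters
    where A is the number of anchor/grid positions."""
    # The teacher's objectness acts as a soft gate: cells the teacher is
    # confident contain an object dominate the distillation signal.
    gate = torch.sigmoid(teacher_out["obj"]).detach()            # (N, A)

    obj_loss = F.mse_loss(torch.sigmoid(student_out["obj"]), gate)

    cls_loss = (gate.unsqueeze(-1) *
                (torch.softmax(teacher_out["cls"], dim=-1).detach() -
                 torch.softmax(student_out["cls"], dim=-1)) ** 2).mean()

    box_loss = (gate.unsqueeze(-1) *
                (teacher_out["box"].detach() - student_out["box"]) ** 2).mean()

    return obj_loss + cls_loss + box_loss
```

In practice this term is added to the regular detection loss on the labeled data rather than replacing it.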
-
We aim to implement a system that leverages distillation and quantization to create a "child" neural network by combining parameters from two "parent" neural networks. The child network should inherit…
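A minimal sketch of one way to wire this up, assuming both parents share the same architecture: average their parameters to initialize the child, train the child against the parents' averaged soft targets, then apply post-training quantization. The function and variable names and the simple averaging/quantization choices are illustrative assumptions, not a prescribed design.

```python
import copy
import torch
import torch.nn.functional as F

def init_child_from_parents(parent_a, parent_b):
    """Initialize the child by averaging the two parents' parameters
    (any more principled merging scheme could replace the plain average)."""
    child = copy.deepcopy(parent_a)
    sd_a, sd_b = parent_a.state_dict(), parent_b.state_dict()
    merged = {}
    for k in sd_a:
        if sd_a[k].is_floating_point():
            merged[k] = (sd_a[k] + sd_b[k]) / 2
        else:
            merged[k] = sd_a[k]          # keep integer buffers unchanged
    child.load_state_dict(merged)
    return child

def two_parent_kd_loss(child_logits, logits_a, logits_b, labels,
                       temperature=2.0, alpha=0.5):
    """Distill from the averaged soft targets of both parents, with a
    cross-entropy term so the child also fits the hard labels."""
    t = temperature
    soft_targets = (F.softmax(logits_a / t, dim=1) +
                    F.softmax(logits_b / t, dim=1)).detach() / 2
    kd = F.kl_div(F.log_softmax(child_logits / t, dim=1),
                  soft_targets, reduction="batchmean") * (t * t)
    ce = F.cross_entropy(child_logits, labels)
    return alpha * kd + (1.0 - alpha) * ce

# After KD training, a post-training step such as dynamic quantization
# shrinks the child for deployment, e.g.:
#   child_int8 = torch.ao.quantization.quantize_dynamic(
#       child, {torch.nn.Linear}, dtype=torch.qint8)
```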
-
Hi!
I came across this library very recently and I am loving it! In my current research I am trying to implement knowledge distillation, which requires multiple datasets to be passed in, here a singl…
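Without knowing this library's exact API, a common workaround is to merge the datasets into the single dataset object it already accepts; a sketch assuming standard PyTorch `Dataset`s, where `dataset_a`/`dataset_b` and the `(x, y)` sample format are placeholders:

```python
from torch.utils.data import ConcatDataset, DataLoader, Dataset

class TaggedDataset(Dataset):
    """Hypothetical wrapper that tags each sample with its source index,
    so the distillation loop can tell which dataset a sample came from."""
    def __init__(self, base, source_id):
        self.base, self.source_id = base, source_id

    def __len__(self):
        return len(self.base)

    def __getitem__(self, idx):
        x, y = self.base[idx]          # assumes (input, label) samples
        return x, y, self.source_id

# Merge the per-task datasets into the one dataset the library expects:
# merged = ConcatDataset([TaggedDataset(d, i)
#                         for i, d in enumerate([dataset_a, dataset_b])])
# loader = DataLoader(merged, batch_size=32, shuffle=True)
```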
-
["Distilling the Knowledge in a Neural Network"](https://link.zhihu.com/?target=https%3A//arxiv.org/abs/1503.02531)
[Prakhar Ganesh. "Knowledge Distillation : Simplified"](https://towardsdatascience…
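For context, the loss in the first reference comes down to a KL term between temperature-softened distributions plus a smaller hard-label term; a minimal PyTorch sketch (the temperature and weighting here are illustrative, not canonical values):

```python
import torch.nn.functional as F

def hinton_kd_loss(student_logits, teacher_logits, labels,
                   temperature=4.0, alpha=0.9):
    """Response-based KD: match the teacher's softened class distribution
    and keep a small cross-entropy term on the ground-truth labels."""
    t = temperature
    soft = F.kl_div(F.log_softmax(student_logits / t, dim=-1),
                    F.softmax(teacher_logits.detach() / t, dim=-1),
                    reduction="batchmean") * (t * t)   # T^2 keeps the gradient scale comparable
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```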
-
Hi Team,
I have attempted Knowledge Distillation using Torchtune for the 8B and 1B Instruct models. However, I still need to apply KD to the Vision Instruct model. I followed the same steps and cre…
-
Hi, thank you for your great work!
I have a question about the experimental part of your paper that confuses me. You compared "VPN++" and "VPN++ +3D pose". But if I understand correctly, …
-
I am interested in an implementation of knowledge distillation for this specific model. This technique would allow us to transfer the valuable knowledge and performance of a larger, resource-intensi…
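For reference, a minimal response-based distillation loop looks roughly like the sketch below: the large teacher stays frozen and only supplies soft targets, while the smaller student trains on a mix of distillation and hard-label loss. The model, loader, and hyperparameter names are placeholders rather than parts of this repository.

```python
import torch
import torch.nn.functional as F

def distill_one_epoch(student, teacher, loader, optimizer,
                      device="cuda", temperature=2.0, alpha=0.5):
    """One epoch of logit distillation with a frozen teacher."""
    teacher.eval()                         # teacher is inference-only
    student.train()
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        with torch.no_grad():              # no gradients through the teacher
            teacher_logits = teacher(x)
        student_logits = student(x)

        t = temperature
        kd = F.kl_div(F.log_softmax(student_logits / t, dim=-1),
                      F.softmax(teacher_logits / t, dim=-1),
                      reduction="batchmean") * (t * t)
        ce = F.cross_entropy(student_logits, y)
        loss = alpha * kd + (1.0 - alpha) * ce

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```

Feature-level variants (matching intermediate representations instead of, or in addition to, the logits) are also common when the capacity gap between teacher and student is large.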
-
![image](https://github.com/user-attachments/assets/e7f250b2-95e1-46ba-8a9e-a0b6c18e82c6)
torchrun --nproc_per_node 1 \
-m FlagEmbedding.finetune.reranker.encoder_only.base \
--model_name_or_path…
-
I noticed the conclusion in your paper: "In contrast, Single -> Multi knowledge distillation improves or matches the performance of the other methods on all tasks except STS, the only regression task…