knowledge-distillation Search Results

1000+ results
for knowledge-distillation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

UKPLab/sentence-transformers #695

Multilingual Information Retrieval (MS-Marco Bi-Encoders)

Hey everyone! First of all, congratulations for your new [Information Retrieval models](https://www.sbert.net/docs/pretrained-models/msmarco-v2.html#performance). They are absolutely amazing. **My…

janandreschweiger updated 2 years ago
62
iopleke/Minechem #818

BioMineChem.

So, I am taking Biology in high school right now, and I find it really fun. The biochemistry was really interesting to learn about. And then I was playing around with some MineChem. I realized that…

ElectroRedstoner updated 7 years ago
2
hunto/image_classification_sota #14

about the dimension

Hi, thanks for opening the source code. I read the paper, I find you use logits and features before pooling to perform diffusion. but for the logits, I guess the dimension is [B, C] B is the batch siz…

JINzezhong7 updated 7 months ago
3
tanluren/yolov3-channel-and-layer-pruning #98

YOLO知识蒸馏损失函数设计

你好，我最近在学习YOLO知识蒸馏算法，我想问下，你的代码里软目标损失计算时为什么除以了batch_size，在最后加上硬目标损失（这部分好像代码没有除以batch_size），而且最后的loss，又乘以了batch_size/64。这部分我看的有点蒙。能给我解释一下吗？ 1、loss_st = criterion_st(nn.functional.log_softmax(output_s…

ghang0 updated 4 years ago
4
open-mmlab/mmyolo #136

Roadmap of MMYOLO

We keep this issue open to collect feature requests from users and hear your voice. Our monthly release plan is also available here. You can either: 1. Suggest a new feature by leaving a comment…

hhaAndroid updated 4 months ago
36
AkihikoWatanabe/paper_notes #1565

Reverse Thinking Makes LLMs Stronger Reasoners, Justin Chih-…

# URL - https://arxiv.org/abs/2411.19865 # Authors - Justin Chih-Yao Chen - Zifeng Wang - Hamid Palangi - Rujun Han - Sayna Ebrahimi - Long Le - Vincent Perot - Swaroop Mishra - Mohi…

AkihikoWatanabe updated 1 day ago
1
peabody124/PosePipeline #5

Algorithms to produce wrappers for

This issue is to maintain a list of some algorithms that seem particularly useful to implement. Currently up on the docket are: - [x] [Deep High-Resolution Representation Learning for Human Pose Es…

peabody124 updated 2 years ago
1
lxa9867/ImageFolder #4

On the global batch size of dino contrastive loss

Hi, thanks for your great work! It is known that CLIP style contrast loss requires huge global batch sizes (e.g. 32k). I'd like to know if this is a critical issue in your training and the global bat…

jiachunjin updated 19 hours ago
2
ali-chr/Semantic-aware-Knowledge-Distillation-for-Few-ShotClass-Incremental-Learning #3

Detailed data of the experimental results on the other two d…

Would you please release the detailed data of the experimental results on the other two datasets, including miniImageNet and CIFAR100?

Veagau updated 3 years ago
1
samsucik/knowledge-distil-bert #3

an inquiry about your knowledge-distil-bert talk

Hello, I'm Hady an ECE student at cairo university school of engineering, I've been working on a distilled version of a text summarization model called pegasus, I found your L3-AI talk on YouTube and …

hadywalied updated 3 years ago
4

上一页 1...29 30 31 32 33 34 35...100 下一页

1000+ results for knowledge-distillation

1000+ results
for knowledge-distillation