-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
Hello,
Imagen-Video states that they use model distillation to iteratively train student diffusion models that require half the sampling steps of their teacher diffusion model. This seems to be an …
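The iterative halving described there (in the spirit of progressive distillation) can be sketched as a simple step schedule. This is an illustrative helper, not code from Imagen-Video; the function name and round count are made up:

```python
def distillation_schedule(teacher_steps: int, rounds: int) -> list[int]:
    """Sampling-step counts over successive distillation rounds.

    Each round trains a student that needs half the sampling steps
    of its (now frozen) teacher, so the count halves every round.
    """
    steps = [teacher_steps]
    for _ in range(rounds):
        steps.append(steps[-1] // 2)
    return steps

# Starting from a 256-step teacher, four rounds of distillation
# yield students needing 128, 64, 32, and finally 16 steps.
print(distillation_schedule(256, 4))  # [256, 128, 64, 32, 16]
```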
-
hi,
can I use "knowledge distillation" and "dimension reduction" for BERT-large?
And if it is possible, for knowledge distillation, how many layers should remain in option2?
and for dimension …
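For context, the usual knowledge-distillation objective matches the student's temperature-softened output distribution to the teacher's. A minimal pure-Python sketch (illustrative, not tied to any particular BERT implementation; the temperature value is an assumption):

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    The T**2 factor keeps gradient magnitudes comparable across
    temperatures (the convention from Hinton et al.'s KD paper).
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl
```

When the student's logits match the teacher's exactly, the loss is zero; any mismatch makes it positive.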
-
Thanks for your work! I have some questions about model distillation.
"we leverage the same training loop with a few exceptions: we use a larger
model as a frozen teacher, keep a spare EMA of the st…
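Keeping a separate EMA of the student, as the quoted passage mentions, usually means maintaining a shadow copy of the weights that is nudged toward the current student after every optimizer step, and evaluating/sampling with that copy. A minimal sketch over flat parameter lists (the decay value is a common default, not taken from the paper):

```python
def ema_update(ema_params, student_params, decay=0.999):
    """In-place exponential moving average of the student's parameters.

    After each optimizer step, each EMA weight is pulled a small
    fraction (1 - decay) toward the current student weight.
    """
    for i, (e, s) in enumerate(zip(ema_params, student_params)):
        ema_params[i] = decay * e + (1.0 - decay) * s
    return ema_params
```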
-
### Describe the bug
**It seems wandb crashes when I run another program using DDP.**
I have two separate Python programs, A and B. Program A uses `torch.nn.DataParallel` to run a neural network…
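A common workaround when wandb and multi-process training interfere (not a confirmed fix for this specific bug) is to initialize wandb only on the main process. Under `torchrun`/`torch.distributed`, each worker gets a `RANK` environment variable; a sketch of the guard (the project name in the comment is hypothetical):

```python
import os

def is_main_process() -> bool:
    """True on the rank-0 worker, or in a plain single-process run.

    torchrun exports RANK for each distributed worker; a
    non-distributed run (e.g. a DataParallel script) has no RANK.
    """
    return int(os.environ.get("RANK", "0")) == 0

# Guard logging so only one process talks to wandb:
# if is_main_process():
#     import wandb
#     wandb.init(project="my-project")  # hypothetical project name
```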
-
Hi.
I have a question about a part of the method that confuses me.
In the paper, Figure 3 shows the generated adversarial images being fed to the old model, which estimates a logit vector that is then used in …
-
### Request for Release of Pretrained NLLB-LLM2Vec Model
Hello Team,
Could you please release the pretrained NLLB-LLM2Vec models mentioned in your paper on "Self-Distillation for Model Stacking…
-
I checked the log, and pytorch_model_distill.pt is picked up during processing. But the latency is the same as with the EMA ckpt: 51 s on an A100. Is this normal? Is there an argument I haven't set correctly to unloc…
-
### Describe the issue
Hi,
I'm wondering how to use `ipex.optimize(...)` when I have two models, for example a teacher and a student in model distillation, but only one optimizer. Would calls like…
-
### Model/Pipeline/Scheduler description
ConsistencyTTA, introduced in the paper [_Accelerating Diffusion-Based Text-to-Audio Generation
with Consistency Distillation_](https://arxiv.org/abs/2309.…