distillation-model Search Results

1000+ results
for distillation-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Hao840/OFAKD #34

dataset

Hello, can this distillation model be used for time series models, the dataset I want to process is related to weather prediction, can this be used

yibeibingcaomei updated 3 months ago
1
huggingface/diffusers #8414

[🌟 New Model] ConsistencyTTA: Accelerating Diffusion-Based T…

### Model/Pipeline/Scheduler description ConsistencyTTA, introduced in the paper [_Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation_](https://arxiv.org/abs/2309.…

Bai-YT updated 2 weeks ago
7
rwth-i6/returnn #1625

rf.BatchNorm keeps updating statistics when used in eval mod…

Currently `rf.BatchNorm` decides whether to update the running statistics based on the `rf.get_run_ctx().train_flag` as in [this line](https://github.com/rwth-i6/returnn/blob/master/returnn/frontend/n…

mnghiap updated 2 weeks ago
1
OpenGVLab/InternVideo #155

如何使用InternVideo2-CLIP-S14等小模型

首先，感谢internvideo小组出色的工作。 @yinanhe 从 [readme](https://github.com/OpenGVLab/InternVideo/blob/main/InternVideo2/multi_modality/MODEL_ZOO.md) 中可以看到InternVideo2-CIP-S14/B14等小模型的下载链接，但似乎模型只有十几M的大小，好像…

rTrQqgH74lc2PT5k updated 1 month ago
4
xorbitsai/inference #2372

[Feature] Support for Llama 3.2 Multi-modal and Lightweight …

### Feature request / 功能建议 This feature request proposes adding support for Meta's newly released Llama 3.2 models to lmdeploy. Llama 3.2 introduces exciting capabilities, including vision LLMs (11…

vikrantrathore updated 5 days ago
2
luopeixiang/awesome-text-summarization #1

Introduce two summarization papers

Hello, Thank you for your comprehensive and wonderful survey. Would you mind adding 2 papers about text summarization? Paper 1: Enriching and Controlling Global Semantics for Text Summarizati…

nguyentthong updated 1 month ago
3
mozilla/firefox-translations-training #771

Reduce monolingual data for da-en to investigate distillatio…

An experiment for #231 da-en is one of our best models from the spring-2024 run. The teacher ensemble had a COMET score of 0.9013. The student COMET was 0.8950, with a tiny -0.0063 gap. In order to…

gregtatum updated 2 weeks ago
2
jxiw/MambaInLlama #9

Training Slowdown for Llama3-Mamba2

Hello! I am training the first two knowledge distillation stages of Mamba 2 on one DGX-H100x8 node, and I am experiencing train times of ~8 hours for the first stage, and ~13 hours for the second stag…

Codys12 updated 2 weeks ago
13
G-U-N/AnimateLCM #33

Asking about SVDSolver

when you train LCM_svd, you set svd_solver like, svd_solver = SVDSolver(args.N, noise_scheduler.config.sigma_min, noise_scheduler.config.sigma_max, 7,0.7, 1.6) why you change training timestep t…

dreamyou070 updated 2 months ago
1
watertap-org/watertap #1302

Applying Unit Test Harness

### Description To be completed in the December release: - [ ] Ultraviolet Advanced Oxidation Process @luohezhiming - [ ] Dewatering Unit @adam-a-a - [ ] Electrodialysis 0D & 1D @lbibl - [ ]…

MarcusHolly updated 1 day ago
12

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for distillation-model

1000+ results
for distillation-model