self-knowledge-distillation Search Results

241 results
for self-knowledge-distillation

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

keras-team/keras #18468

Keras.io examples conversion gameplan

We need to convert keras.io examples to work with Keras 3. This involves two stages: ## Stage 1: tf.keras backwards compatibility check Keras 3 is intended as a drop-in replacement for tf.ker…

fchollet updated 7 months ago
21
lightonai/pylate #51

distilbert raises an error

Hi, thank you for building this amazing repo. My purpose is to train on msmarco a ColBERT model using `distilbert` as backbone. I took your script [`knowledge_distillation.py`](https://github.com/…

CosimoRulli updated 2 months ago
6
ocaml/ocaml #11548

Marshal + Domain leads to a memory leak

The code below leaks memory when running on OCaml 5 trunk, but by not starting the domain it stops leaking memory. ```ocaml let _domain = Domain.spawn (fun () -> let rec loop () = loop (…

EduardoRFS updated 1 year ago
7
kweonwooj/papers #123

Insertion Transformer: Flexible Sequence Generation via Inse…

## Abstract - present the `Insertion Transformer`, an iterative and partially autoregressive model for sequence generation based on insertion operations - can generate with an arbitrary ordering …

kweonwooj updated 5 years ago
4
open-mmlab/mmrazor #335

How to make this code base support mmrotate model? I want to…

### Checklist - I have searched related issues but cannot get the expected help. - I have read related documents and don't know what to do. ### Describe the question you meet I want to…

JinqingZhengTju updated 11 months ago
8
FlagOpen/FlagEmbedding #955

关于BGE-M3在微调时报：pyarrow.lib.ArrowInvalid: offset overflow whil…

**场景**：使用BGE-M3进行finetune，数据文件.jsonl 含有158000行记录，每行记录一个query，pos列表的长度为1，neg列表的长度为15。 **异常报错**： WARNING:torch.distributed.run: ***************************************** Setting OMP_NUM_THREADS envi…

MarcusEddie updated 4 months ago
1
meta-introspector/meta-meme #155

Claude Embedding Guile Scheme

Embedding Guile Scheme M gnu guile snarf Edit I'm afraid I don't have enough context to fully understand your query "gnu guile snarf". However, let me provide some potentially relevant informatio…

jmikedupont2 updated 5 months ago
2
FlagOpen/FlagEmbedding #1218

Cannot finetune due to GPU OOM error

I got GPU OOM error when trying to finetune embedder model on Kaggle (using GPU T4 x 2) This is my run command (already reduce query_max_len and passage_max_len): ``` !WANDB_DISABLED=True WANDB_M…

dviettu134 updated 4 days ago
5
larq/zoo #176

About RealToBinaryNet model

I read from https://docs.larq.dev/zoo/ that the RealToBinaryNet reach 65% accuracy and reach the SOTA. I really appreciate this and want to train the model to learn about it. I also read the code a…

appleleaves updated 4 years ago
14
microsoft/DeepSpeed #2894

[REQUEST] Add more device-agnostic compression algorithms

## **Summary** This is a design discussion RFC for contributing some device-agnostic compression algorithms, like the post training quantization(QDQ quant format) and structural sparsity supported …

ftian1 updated 1 year ago
8

上一页 1...2 3 4 5 6 7 8...25 下一页

241 results for self-knowledge-distillation

241 results
for self-knowledge-distillation