score-distillation Search Results

380 results for score-distillation

Best match


FlagOpen/FlagEmbedding #701

I need help about bge-m3 training

I'm having trouble training bge-m3 and would like to ask a few questions. 1. m3 train code: I am training the bge-m3 model on a server with 8 × H100 (80GB) GPUs. Below is the learning s…

jhyeom1545 updated 2 months ago
6
PaddlePaddle/PaddleOCR #14124

[Infererence] AttributeError: ‘ParallelEnv‘ object has no at…

### 🔎 Search before asking - [X] I have searched the PaddleOCR [Docs](https://paddlepaddle.github.io/PaddleOCR/) and found no similar bug report. - [X] I have searched the PaddleOCR [Issues](https…

Cupcc updated 2 weeks ago
3
open-mmlab/mmrazor #276

The effect was not improved after distillation

### Describe the question you meet I use the CWD method. When resnet50 is used to distill resnet18, the training accuracy of the teacher's network is 80%, but the network accuracy after distillation…

xuhao-anhe updated 1 year ago
4
pentium3/sys_reading #348

Efficiently Scaling Transformer Inference

https://proceedings.mlsys.org/paper_files/paper/2023/file/523f87e9d08e6071a3bbd150e6da40fb-Paper-mlsys2023.pdf

pentium3 updated 8 months ago
2
irthomasthomas/undecidability #654

Representation engineering:

- [ ] [I'm the author of the GPT-2 work. This is a nice post, thanks for making it more... | Hacker News](https://news.ycombinator.com/item?id=39436215) # TITLE I'm the author of the GPT-2 work. Thi…

irthomasthomas updated 8 months ago
1
irthomasthomas/undecidability #645

LoRA Land: Fine-Tuned Open-Source LLMs that Outperform GPT-4…

- [ ] [LoRA Land: Fine-Tuned Open-Source LLMs that Outperform GPT-4 - Predibase - Predibase](https://predibase.com/blog/lora-land-fine-tuned-open-source-llms-that-outperform-gpt-4) # LoRA Land: Fine…

irthomasthomas updated 8 months ago
1
irthomasthomas/undecidability #751

sentence-transformers/README.md at master · liuyukid/sentenc…

- [ ] [sentence-transformers/README.md at master · liuyukid/sentence-transformers](https://github.com/liuyukid/sentence-transformers/blob/master/README.md?plain=1) # sentence-transformers/README.md a…

irthomasthomas updated 8 months ago
1
zchen0420/nn_papers #6

Humanlike behaviors

# ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models 2023 Workshop on Computational Approaches to Subjectivity, Sentiment “oxymoron” Despite being fun to interact …

zchen0420 updated 4 months ago
8
UKPLab/sentence-transformers #1011

EN-DE MS-Marco

Hi @nreimers, Hi Sentence-transformers community, First of all, I want to thank you for your continued support throughout the years. I have been following this repository for three years now and I'…

nero-nazok updated 2 years ago
43
huggingface/setfit #254

Why are the models fine-tuned with CosineSimilarity between …

Hi everyone, This is a small question related to how models are fine-tuned during the first step of training. I see that the default loss function is `losses.CosineSimilarityLoss`. But when generat…

EdouardVilain-Git updated 1 year ago
10

