-
### 💡 Your Question
I saw in this repo that there is knowledge distillation for different models. However, I got nothing when I tried to find a tutorial for object detection distillation. There is on…
-
It seems that ```VisionTransformer``` doesn't support feature extraction of all outputs in the ```forward_features``` method. Only returning of the cls token or [cls_token, distillation_token] is avai…
-
Hello! Recently we've merged PR https://github.com/Rikorose/DeepFilterNet/pull/452 with WASM conversion support. I've now uploaded a couple of raw examples using DeepFilterNet3 on the web. One of them…
-
Thanks for your excellent work! And I have a question about the training strategy. In the [report](https://arxiv.org/abs/2302.02651) you mentioned, it uses self-distillation with soft labels. I wonder…
-
Hello, I am trying to train on the UCF101 dataset, but I have encountered this problem in the test stage. Do you have any solutions to this problem? I am looking forward to your reply very much.
Trac…
wjj-w updated
3 months ago
-
https://github.com/intel/intel-extension-for-transformers/tree/main/intel_extension_for_transformers/llm/runtime/graph#2-run-llm-with-python-api
I'd like to use this guide. However, it is not in the…
-
I have prompt tuned the ``Falcon-7B-Instruct model``. Now, I want to perform inference using prompt tuned model in multi-gpu settings using ``accelerate``. I am using 2 A100 gpus and batch size of 1 o…
-
Hello, Thanks for your excellent work!
I have some questions about transition function:
1. I noticed tvalue was not used to optimize searching policy (block_assembly task) in backward fine-tuning, a…
-
- 环境:
Ubuntu 22.04.2, python3.8
paddlepaddle-gpu 2.5.2
paddleslim 从release/2.5安装的
- PaddleOCR
git checkout release/2.7
git checkout 52198722e932a3e09c8…
-
I'm now making a modpack with Destroy and find it not so easy to play with it, especially for those in survival mode. Here I have some suggestions that could enhance our survival experience with Destr…