Closed SleepEarlyLiveLong closed 11 months ago
@microsoft-github-policy-service agree
@microsoft-github-policy-service agree I have sole ownership of intellectual property rights to my Submissions and I am not making Submissions in the course of work for my employer.
Integrate the open-source Qwen model (https://huggingface.co/Qwen) into the MiniLLM distillation algorithm, supporting both non-parallel and parallel training. This PR mainly adds two folders: transformers/src/transformers/models/qwen/ and transformers/src/transformers/models/qwen_parallel/
Tips: