-
In megatron_moe, route_logits is implemented as top-k first, then softmax; in your team's modeling_qwen2_moe.py, route_logits is computed as softmax first, then top-k, with a parameter norm_topk_prob controlling whether the selected probabilities are renormalized.
In qwen2-moe, norm_topk_prob is false, which causes the router_logits converted from Megatron to have the wrong magnitude (m…
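The difference between the two routing orders can be sketched in a few lines of plain Python (an illustrative sketch only, not Megatron's or Qwen2-MoE's actual code):

```python
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def topk_then_softmax(logits, k):
    # Megatron-style order: pick the top-k logits first, then softmax
    # over only those. The resulting weights always sum to 1.
    idx = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    return idx, softmax([logits[i] for i in idx])

def softmax_then_topk(logits, k, norm_topk_prob=False):
    # Qwen2-MoE-style order: softmax over all experts, then keep the
    # top-k probabilities. With norm_topk_prob=False the kept weights
    # sum to less than 1, which is the magnitude mismatch described above.
    probs = softmax(logits)
    idx = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    w = [probs[i] for i in idx]
    if norm_topk_prob:
        s = sum(w)
        w = [x / s for x in w]
    return idx, w
```

Note that softmax-then-top-k with norm_topk_prob=True is mathematically identical to top-k-then-softmax (the full-softmax denominator cancels under renormalization), which is why flipping that flag reconciles the two conventions.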
-
I am using a vLLM server to deploy a MoE model. However, this model has a large number of experts and the number of activated experts is very small, so it is very well suited to the expert offload…
-
Dear professors and experts, I have a question I would like to consult you about: I found that the memory required for single-threaded SBAS processing was too large to satisfy when I was conducting SBAS-…
-
Fix the title of exo1
-
Assignment 3
ELEM DAVID OBIAHU
2022/HND/35291/CS
Question: How does an expert system resolve rule-base conflicts?
Answer
Expert systems resolve rule-base conflicts in several ways, including:
A.…
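Common conflict-resolution strategies in rule-based systems include explicit rule priority (salience), specificity of the rule's conditions, and recency of the matched facts. A minimal sketch of priority-plus-specificity resolution (all names here are illustrative, not from any particular expert-system shell):

```python
# Minimal sketch of conflict resolution in a rule-based expert system.
# When several rules match the same facts, pick one by (a) explicit
# priority (salience), breaking ties by (b) specificity, i.e. how many
# conditions the rule requires.

class Rule:
    def __init__(self, name, conditions, salience=0):
        self.name = name
        self.conditions = conditions  # set of facts that must all hold
        self.salience = salience      # higher fires first

    def matches(self, facts):
        return self.conditions <= facts

def resolve_conflict(rules, facts):
    """Return the single rule to fire from the conflict set, or None."""
    conflict_set = [r for r in rules if r.matches(facts)]
    if not conflict_set:
        return None
    # Priority first, then specificity (more conditions = more specific).
    return max(conflict_set, key=lambda r: (r.salience, len(r.conditions)))
```

For example, a rule requiring both "fever" and "cough" beats a rule requiring only "fever" when both facts are present, because it is more specific.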
-
Hi there, thanks for mergoo, an amazing code base for MoE model construction.
A crucial feature that may need to be implemented is that mergoo should let the user select the basic routing policy when c…
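One way to expose such a choice is a registry of routing policies that the MoE layer looks up by name at construction time. A hedged sketch (the class and policy names are hypothetical, not the mergoo API; only the idea of selecting a policy at construction time comes from the request):

```python
# Sketch of a construction-time routing-policy switch: policies are
# registered by name, and the MoE layer looks one up when it is built.
# All names here are illustrative placeholders.

ROUTING_POLICIES = {}

def register_policy(name):
    def deco(fn):
        ROUTING_POLICIES[name] = fn
        return fn
    return deco

@register_policy("round_robin")
def round_robin_routing(num_experts, token_id, k):
    # Trivial baseline policy: cycle over experts deterministically.
    return [(token_id + i) % num_experts for i in range(k)]

@register_policy("hash")
def hash_routing(num_experts, token_id, k):
    # Another simple deterministic policy based on hashing.
    return [hash((token_id, i)) % num_experts for i in range(k)]

class MoELayer:
    def __init__(self, num_experts, k, routing_policy="round_robin"):
        if routing_policy not in ROUTING_POLICIES:
            raise ValueError(f"unknown routing policy: {routing_policy}")
        self.num_experts = num_experts
        self.k = k
        self.route = ROUTING_POLICIES[routing_policy]

    def experts_for(self, token_id):
        return self.route(self.num_experts, token_id, self.k)
```

The registry keeps policy implementations decoupled from the layer itself, so a user could plug in a learned top-k router under the same interface.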
-
Can we implement an expert-parallel strategy for MoE to fully exploit the sparse-activation property? Ideally, MoE should only use compute on the order of its active parameters, but the current implement…
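The sparse-activation argument can be made concrete with a toy dispatch loop: each expert processes only the tokens routed to it, so the work done scales with tokens × k, not tokens × num_experts. A sketch under that assumption (names are illustrative, not the project's API):

```python
# Toy illustration of sparse expert dispatch: compute scales with the
# number of *active* expert-token pairs (tokens * k), independent of
# the total expert count.

def dispatch(token_assignments, num_experts):
    """Group token ids by the expert each one was routed to."""
    buckets = [[] for _ in range(num_experts)]
    for token_id, expert_id in token_assignments:
        buckets[expert_id].append(token_id)
    return buckets

def moe_forward(tokens, assignments, num_experts, expert_fn):
    """Run each expert only on its own bucket; also count expert calls."""
    buckets = dispatch(assignments, num_experts)
    out = {}
    calls = 0
    for expert_id, bucket in enumerate(buckets):
        for token_id in bucket:
            out[token_id] = expert_fn(expert_id, tokens[token_id])
            calls += 1
    return out, calls
```

With 4 tokens routed to one expert each (k = 1), the loop performs 4 expert calls regardless of whether the layer holds 8 or 8,000 experts, which is the property expert parallelism should preserve across devices.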
-
# _Assignees_
###### Since our private repository is on the free plan, we cannot use the full GitHub team functionality (such as adding multiple Assignees to an issue). After evaluation, we have decide…
-
```
assert not args.model_parallel.fp16, \
"Expert parallelism is not supported with fp16 training."
```
from https://github.com/NVIDIA/Megatron-LM/blob/db3a3f79d1cda60ea4b3db0ceffcf…
-
The expert needs a way to assign keywords to the papers he is responsible for.
## Expected Behavior
Under the assumption that the papers are in his basket, we need a special editor where the expert…