TorchMoE/MoE-Infinity
PyTorch library for cost-effective, fast and easy serving of MoE models.
Apache License 2.0
76 stars, 5 forks
issues
Readme Example not working (MemoryError: std::bad_alloc)
#26
akhauriyash
closed
3 weeks ago
1
Question: Support for Continuous Batching and Asynchronous Requests
#25
Msiavashi
opened
1 month ago
1
CUDA extension not installed Error while running readme_example.py
#24
Msiavashi
closed
1 month ago
4
Can the MoE-Infinity framework be used in conjunction with the vLLM framework?
#23
alphabewitch
opened
2 months ago
1
Dev
#22
drunkcoding
opened
2 months ago
0
Add explicit resource release
#21
lausannel
closed
2 months ago
0
Fix Wrong Output Stream Emplace Back
#20
lausannel
closed
2 months ago
0
Fix Wrong Stream Emplace Back
#19
lausannel
closed
2 months ago
0
Arctic Support
#18
drunkcoding
closed
1 month ago
0
Support Consecutive Tasks in Open MoE LLM Leaderboard
#17
lausannel
closed
2 months ago
0
Output of Mixtral-8x7B is strange
#16
JustQJ
closed
3 months ago
2
Run on multiple GPUs
#15
YLSnowy
opened
3 months ago
1
Support Open MoE LLM Leaderboard
#14
lausannel
closed
3 months ago
0
add readme example & fix peer access
#13
drunkcoding
closed
3 months ago
0
Add CI and Automated Release Process
#12
lausannel
closed
3 months ago
0
Install from pip failed
#11
future-xy
closed
3 months ago
2
How to Install it?
#10
MSGitt
closed
3 months ago
2
Feature/expert parallel
#9
drunkcoding
closed
2 months ago
0
Grok-1 Support
#8
drunkcoding
closed
3 months ago
0
add forward and call
#7
future-xy
closed
4 months ago
2
Release dev to main
#6
drunkcoding
closed
3 months ago
1
Support Constrained Server Memory
#5
drunkcoding
opened
4 months ago
0
Introduce Local Server for OpenAI-Compatible APIs (Beta)
#4
future-xy
opened
4 months ago
2
First Release of README
#3
drunkcoding
closed
4 months ago
1
MoE-Infinity API Proposal
#2
drunkcoding
closed
4 months ago
1
TODO for first release
#1
drunkcoding
opened
6 months ago
0