issues
search
PygmalionAI
/
aphrodite-engine
PygmalionAI's large-scale inference engine
https://pygmalion.chat
GNU Affero General Public License v3.0
606
stars
78
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[WIP] feat: Mamba support
#419
AlpinDale
closed
5 days ago
1
[Feature]: Support hqq quantize method.
#418
Minami-su
opened
1 month ago
0
[Bug]: gguf loading failed. config.json?
#417
juud79
opened
1 month ago
4
[Bug]: manually setting --max-model-len flag always leads to OOM, even if it is set very low
#414
SalomonKisters
closed
4 weeks ago
2
[Feature]: Is there a reason CUDA 6.1 is the minimum? Would CUDA 6.0 on the P100 not work?
#413
Nero10578
opened
1 month ago
5
Fix max_num_batched_tokens for chunked_prefill.
#412
sgsdxzy
closed
1 month ago
0
[Bug]: Converting gguf to state_dict
#411
heungson
opened
1 month ago
8
Allow setting config-path when converting ggufs.
#410
sgsdxzy
closed
1 month ago
0
Support twe lm_head for quantized weights.
#409
sgsdxzy
closed
1 month ago
0
feat: EETQ
#408
AlpinDale
closed
1 month ago
0
feat: Triton flash attention backend for ROCm
#407
AlpinDale
closed
1 month ago
0
feat: add chunked prefill scheduler
#406
AlpinDale
closed
1 month ago
0
feat: FP8 E4M3 KV Cache
#405
AlpinDale
closed
1 month ago
0
Improve cohere model.
#404
sgsdxzy
closed
1 month ago
0
feat: Intel CPU support
#403
AlpinDale
closed
1 month ago
0
Speculative Decoding Part 4: Lookahead scheduling
#402
AlpinDale
closed
1 month ago
0
[Crash]: Program gets terminated
#401
DuckY-Y
opened
1 month ago
1
[Installation]: No module named 'aphrodite._C'
#400
DuckY-Y
closed
1 month ago
2
[Bug]: served-model-name is unused
#399
mrseeker
opened
1 month ago
1
feat: optimized layernorm kernels
#398
AlpinDale
closed
1 month ago
0
Missed .items() and assert
#397
50h100a
closed
1 month ago
0
Fix memory pinning conditional
#396
50h100a
closed
1 month ago
0
CMake build system
#395
AlpinDale
closed
1 month ago
0
Fix cohere for command-r+
#394
sgsdxzy
closed
1 month ago
0
Feat/small fixes
#393
sgsdxzy
closed
1 month ago
0
[Feature]: any workarounds for cc 6.0?
#392
Fuckingnameless
opened
1 month ago
2
[Feature]: actual working health endpoint
#390
mrseeker
closed
5 days ago
2
[Feature]: Add support for Command-r
#389
ccdv-ai
closed
1 month ago
2
[v0.5.3] Release Candidate
#388
AlpinDale
closed
6 days ago
1
[Feature]: Add support for Qwen2MoE
#387
StableFluffy
closed
5 days ago
1
[Bug]: Exllama v2 not working
#386
SalomonKisters
closed
3 weeks ago
11
[Feature]: Add support for DBRX model
#385
BlairSadewitz
closed
5 days ago
2
Chunked Prefill Part 1
#384
AlpinDale
closed
1 month ago
1
feat: Triton kernels for sampling
#383
AlpinDale
closed
1 month ago
0
fix: remove event and stream, add typing
#382
AlpinDale
closed
1 month ago
0
Support arbitrary model in GGUF.
#381
sgsdxzy
closed
1 month ago
0
fix: optimize context shift performance
#380
AlpinDale
closed
1 month ago
0
fix: cache neuron checks
#379
AlpinDale
closed
1 month ago
0
fix: display error in ray before deadlock
#378
AlpinDale
closed
1 month ago
0
chore: make metadata a dataclass
#377
AlpinDale
closed
1 month ago
1
feat: add context-free grammars
#376
AlpinDale
closed
1 month ago
0
feat: tensor parallelism for exllamav2 quantization
#375
AlpinDale
closed
1 month ago
1
feat: async tokenization
#374
AlpinDale
closed
1 month ago
0
fix: explicitly disallow installation on non-linux platforms
#373
AlpinDale
closed
1 month ago
0
feat: dynamic shared memory allocation for moe align block size
#372
AlpinDale
closed
1 month ago
0
feat: add batched RoPE kernels
#371
AlpinDale
closed
1 month ago
0
feat: add approximate gelu activation kernels
#370
AlpinDale
closed
1 month ago
0
fix: double free with sliding window
#369
AlpinDale
closed
1 month ago
0
feat: mistral neuron support
#368
AlpinDale
closed
1 month ago
0
feat: model execution refactor
#367
AlpinDale
closed
1 month ago
0
Previous
Next