issues
search
PygmalionAI
/
aphrodite-engine
PygmalionAI's large-scale inference engine
https://pygmalion.chat
GNU Affero General Public License v3.0
606
stars
78
forks
source link
issues
Least commented
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[sparsetral and Qwen2idae]: support for mixtral of lora
#330
sorasoras
opened
2 months ago
27
[Installation]: ValueError: 17 is not a valid GGMLQuantizationType
#448
Abulhanan
closed
2 weeks ago
21
Problem with dockerfile and compiled image in 0.5.0
#310
puppetm4st3r
closed
2 months ago
20
Fix+feat: docker compose
#264
StefanDanielSchwarz
closed
2 months ago
20
Device Side Assertion, Assertion `index >= -sizes[i] && index < sizes[i] && "index out of bounds"` failed.
#199
StableFluffy
closed
4 months ago
19
Initial fetch for `config.json` ignores `--revision`?
#318
josephrocca
opened
2 months ago
13
Overcomplicated and unexplained usage for beginners
#64
Avroboros
opened
7 months ago
13
[Bug]: Exllama v2 not working
#386
SalomonKisters
closed
3 weeks ago
11
[Bug]: Mixtral-8x22b-instruct not running with AWQ
#421
SalomonKisters
closed
3 weeks ago
10
[Usage]: nccl and cupy problem "no cupy" and "NCCL_ERROR_UNHANDLED_CUDA_ERROR" when use TP in wsl
#336
yamosin
closed
2 months ago
10
[Bug]: Issue when trying to load a AWQ model with --load-in-4bits for mixtral flavors
#342
puppetm4st3r
opened
2 months ago
9
Is GGUF support broken?
#281
davideuler
opened
2 months ago
9
fix: correct auto ntk scaling_factor for 4k ctx case
#101
sandwichdoge
closed
6 months ago
9
chore: KoboldAI/koboldcpp updates
#48
g4rg
closed
5 months ago
9
Force torch with cuda 11.8
#41
AWAS666
closed
7 months ago
9
[Installation]: Installing from source does not work. undefined symbol: _ZN3c104cuda14ExchangeDeviceEa
#453
Nero10578
closed
5 days ago
8
[Bug]: Converting gguf to state_dict
#411
heungson
opened
1 month ago
8
[Bug]: WSL Cuda out of Memory when Trying to Load GGUF Model
#360
Lirikana
opened
1 month ago
8
CUDA illegal memory access when loading 70b AWQ with RoPE
#50
g4rg
closed
6 months ago
8
AttributeError: 'NoneType' object has no attribute 'fs' at fresh install
#8
AWAS666
closed
9 months ago
8
[Bug]: Cannot load 70b exl2 5bpw model across 4 GPUs.
#471
Ph0rk0z
opened
3 days ago
7
[Misc]: Building docker container requires insane amount of memory
#350
mrseeker
opened
1 month ago
7
Revert license back to AGPLv3
#38
AlpinDale
closed
7 months ago
7
[Bug]: Flash attention cannot be used on v0.5.3
#468
Nero10578
opened
5 days ago
6
[Bug]: multi GPU crashes backend
#359
mrseeker
opened
1 month ago
6
fix: Missing GPU KV cache blocks
#263
AlpinDale
closed
2 months ago
6
feat:Enable banning tokens
#80
StefanGliga
closed
6 months ago
6
Dockerfile: permission update, configurable build jobs, torch 2.3.0
#465
theobjectivedad
closed
6 days ago
5
[Feature]: Is there a reason CUDA 6.1 is the minimum? Would CUDA 6.0 on the P100 not work?
#413
Nero10578
opened
1 month ago
5
feat: add embeddings endpoint to openai rest-api server.
#363
IggoOnCode
closed
1 month ago
5
[Feature]: Support YiForCausalLM
#348
gsuhm
closed
1 month ago
5
Add logit_bias support for OpenAI API endpoint
#108
miku448
closed
6 months ago
5
ModuleNotFoundError: No module named 'aphrodite.common.logits'
#84
yixuantt
closed
6 months ago
5
Adds a copy of embedded Kobold Lite Web UI
#42
LostRuins
closed
7 months ago
5
Set ooba API Key as argument
#30
miku448
closed
6 months ago
5
[Bug]: gguf loading failed. config.json?
#417
juud79
opened
1 month ago
4
feat: typical_p threshold sampling
#343
AlpinDale
opened
2 months ago
4
AsyncEngineDeadError with koboldai api server
#208
ycros
opened
4 months ago
4
Error when `top_logprobs` value is `-inf`
#183
miku448
closed
3 months ago
4
chore: tensor parallel refactors part 2
#116
AlpinDale
closed
5 months ago
4
Classifier-Free Guidance support
#36
bdashore3
opened
7 months ago
4
KoboldAI endpoint
#31
g4rg
closed
7 months ago
4
No logits_processors in sampling params
#18
AWAS666
closed
7 months ago
4
Fix recursion errors with large amounts of blocks
#459
thomas-xin
closed
5 days ago
3
[Bug]: PermissionError: [Errno 13] Permission denied: '/app/aphrodite-engine/.triton'
#458
theobjectivedad
closed
6 days ago
3
Fix to https://github.com/PygmalionAI/aphrodite-engine/issues/318
#455
houmie
closed
1 week ago
3
[Installation]: Upload Aphrodite v0.5.2 On Pypi.org
#451
Abulhanan
closed
2 days ago
3
[Performance]: Memory Usage Fix for gguf.
#447
Abulhanan
closed
2 weeks ago
3
Mirostat v2 Rewrite
#334
50h100a
closed
1 month ago
3
Configuration of the internal port of the docker container
#300
puppetm4st3r
closed
2 months ago
3
Next