issues
search
PygmalionAI
/
aphrodite-engine
PygmalionAI's large-scale inference engine
https://pygmalion.chat
GNU Affero General Public License v3.0
606
stars
78
forks
source link
issues
Oldest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[Bug]: torch._dynamo.exc.BackendCompilerFailed with command-r-plus
#472
heungson
opened
1 day ago
0
[Bug]: Cannot load 70b exl2 5bpw model across 4 GPUs.
#471
Ph0rk0z
opened
3 days ago
7
[Bug]: GPUExecutor throwing 'TypeError: 'type' object is not subscriptable' on 0.5.3
#470
xyzkpsf
closed
2 days ago
2
Fix quants installation on ROCM
#469
Naomiusearch
closed
5 days ago
1
[Bug]: Flash attention cannot be used on v0.5.3
#468
Nero10578
opened
5 days ago
6
Bump `torch` to 2.3.0
#467
AlpinDale
closed
1 week ago
0
Fix Navi support
#466
Naomiusearch
closed
6 days ago
1
Dockerfile: permission update, configurable build jobs, torch 2.3.0
#465
theobjectivedad
closed
6 days ago
5
[Usage]: Lora Adapter Parameter while inferencing
#464
alokgupta1996
closed
1 week ago
1
[Feature]: Exllamav2 Q4 cache
#463
Anthonyg5005
opened
1 week ago
2
fix: lora errors
#462
AlpinDale
closed
1 week ago
0
[Bug]: LoRA fails to load
#461
kubernetes-bad
closed
1 week ago
1
[Bug]: LoRA broken when TP>1
#460
kubernetes-bad
opened
1 week ago
0
Fix recursion errors with large amounts of blocks
#459
thomas-xin
closed
5 days ago
3
[Bug]: PermissionError: [Errno 13] Permission denied: '/app/aphrodite-engine/.triton'
#458
theobjectivedad
closed
6 days ago
3
Fixed REVISION variable not being passed on.
#456
houmie
closed
1 week ago
0
Fix to https://github.com/PygmalionAI/aphrodite-engine/issues/318
#455
houmie
closed
1 week ago
3
Refactor: Quantization
#454
AlpinDale
closed
1 week ago
1
[Installation]: Installing from source does not work. undefined symbol: _ZN3c104cuda14ExchangeDeviceEa
#453
Nero10578
closed
5 days ago
8
[Usage]: What to set to get acceptable performance on Pascal GPUs? (Non-P100)
#452
Nero10578
closed
5 days ago
2
[Installation]: Upload Aphrodite v0.5.2 On Pypi.org
#451
Abulhanan
closed
2 days ago
3
feat: add batch tokenization endpoint along with option for no token ids
#450
ahme-dev
closed
5 days ago
0
Fix minor bugs in outlines and lmfe.
#449
sgsdxzy
closed
1 week ago
1
[Installation]: ValueError: 17 is not a valid GGMLQuantizationType
#448
Abulhanan
closed
2 weeks ago
21
[Performance]: Memory Usage Fix for gguf.
#447
Abulhanan
closed
2 weeks ago
3
[Usage]: Please provide the environment variable that closes the KoboldAI Lite page.
#445
online2311
opened
2 weeks ago
0
fix: restore backwards compatibility with sm_60 (P100 and GP100)
#444
AlpinDale
closed
2 weeks ago
1
Fix out-of-range token crash in OpenAI endpoint
#443
50h100a
closed
2 weeks ago
0
Fix non-batched implementation of logit bias
#442
50h100a
closed
2 weeks ago
0
Tiny fixes
#441
sgsdxzy
closed
2 weeks ago
1
chore: clean up the quants
#440
AlpinDale
closed
2 weeks ago
0
fix: OPTIONS requests in the API
#439
AlpinDale
closed
3 weeks ago
0
feat: ngram prompt lookup decoding
#438
AlpinDale
closed
3 weeks ago
0
Fix/exl2 split
#437
sgsdxzy
closed
3 weeks ago
0
Fix/cohere
#436
sgsdxzy
closed
3 weeks ago
0
[Bug]:
#435
someoneexistsontheinternet
opened
3 weeks ago
1
[Bug]: Unable to use OpenAI API with an auth key via a web browser due to OPTIONS preflight request returning 401.
#434
LostRuins
closed
5 days ago
1
Update Kobold Lite Embed
#433
Pyroserenus
closed
3 weeks ago
0
feat: Speculative Decoding using a draft model
#432
AlpinDale
closed
3 weeks ago
0
Abort requests when the connection to /v1/completions is interrupted.
#431
sgsdxzy
closed
3 weeks ago
0
Fix linear bias of qkv layers in models.
#430
sgsdxzy
closed
3 weeks ago
0
[Installation]: Cannot install the library
#429
uysalfurkan
closed
3 weeks ago
0
feat: LM Format Enforcer support
#428
AlpinDale
closed
3 weeks ago
0
Port sampler+metadata changes from main to dev
#427
50h100a
closed
3 weeks ago
0
[Usage]: odd use of GPUS number and tensor parallelism
#426
puppetm4st3r
closed
3 weeks ago
2
[Feature]: Provide configuration via env vars or a configuration file
#425
alexandreteles
opened
3 weeks ago
0
Fix: kobold api /tokencount
#424
Krovius
closed
3 weeks ago
0
Split the exl2 weight ASAP.
#423
sgsdxzy
closed
3 weeks ago
2
[Bug]: Mixtral-8x22b-instruct not running with AWQ
#421
SalomonKisters
closed
3 weeks ago
10
Support sharded ggufs.
#420
sgsdxzy
closed
3 weeks ago
0
Next