issues
search
stikkireddy
/
mlflow-extensions
Deploy models quickly to databricks via mlflow based serving infra.
https://stikkireddy.github.io/mlflow-extensions/
Apache License 2.0
19
stars
11
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[MODEL] stabilityai/stable-diffusion-xl-base-1.0
#74
jessiewen-databricks
opened
3 weeks ago
0
[MODEL] glm4-9b-chat
#73
jessiewen-databricks
opened
3 weeks ago
0
Added openai dependency and refactored
#72
edurdevic
opened
1 month ago
0
[MODEL] CosyVoice
#71
changshilim-db
opened
1 month ago
0
Fix GCP workspace issues
#70
changshilim-db
opened
1 month ago
0
WIP support models in local disk
#69
stikkireddy
opened
1 month ago
0
fix imports causing issue
#68
stikkireddy
closed
1 month ago
0
fix formatting
#67
stikkireddy
closed
1 month ago
0
Support infinity optimized embeddings server
#66
stikkireddy
opened
2 months ago
0
Adding Ray Serving support on Databricks with Autoscaling
#64
puneet-jain159
closed
1 month ago
0
Feature/new models added pixtral and qwen 2.5 models
#63
stikkireddy
closed
2 months ago
0
[FEATURE] show base url when deploying on job cluster with EZDeployLight
#62
juanlamadrid20
closed
2 months ago
1
fix pip installs
#61
stikkireddy
closed
2 months ago
0
commit for docs site
#60
stikkireddy
closed
2 months ago
0
set deployed model name to default
#59
stikkireddy
closed
2 months ago
0
[FEATURE] Integrate with Mosaic AI gateway
#58
sathishgang-db
closed
2 months ago
1
Feature/ezdeploy lite client
#57
stikkireddy
closed
2 months ago
0
Feature/improved health check
#55
stikkireddy
closed
2 months ago
0
fixed typo contributors
#53
natefleming
closed
2 months ago
0
Fix/issue 40
#52
stikkireddy
closed
2 months ago
0
Docs/developer guide
#51
stikkireddy
closed
2 months ago
0
fix passing context to compute diagnostics.
#50
stikkireddy
closed
2 months ago
0
[FEATURE] Better process killing and response when custom engine server might be restarting
#49
stikkireddy
closed
2 months ago
0
Fix/pin qwen to newer vllm
#48
stikkireddy
closed
2 months ago
0
added support for qwen2 vl and contributors page
#47
stikkireddy
closed
2 months ago
0
add unit tests for overriding libraries in the engines
#46
stikkireddy
closed
2 months ago
0
decouple max_num_batched_tokens from chunked prefill
#45
stikkireddy
closed
2 months ago
0
skip tokenizer check if the mode is not auto or slow for pixtral
#44
stikkireddy
closed
2 months ago
0
fix small issues (tokenizer mode and port being sent to config engine)
#43
stikkireddy
closed
2 months ago
0
Feature/custom logging module
#42
natefleming
closed
2 months ago
5
[MODEL] Support clip-ViT-B-32
#41
edurdevic
opened
2 months ago
0
Databricks MLR 15.4 Issues
#40
stikkireddy
closed
2 months ago
0
Feature/support cohere aya 23 35b
#39
stikkireddy
closed
2 months ago
0
[FEATURE] better use of local disk for registering models
#38
stikkireddy
opened
2 months ago
0
[FEATURE] GPU Config or Cloud Specific flags/arguments
#37
stikkireddy
opened
2 months ago
0
[MODEL] Support CohereForAI/aya-23-35B
#36
stikkireddy
closed
2 months ago
1
[FEATURE] Support ray serve engine
#35
stikkireddy
opened
2 months ago
1
updated formatting for rest of files
#34
stikkireddy
closed
2 months ago
0
Feature/unit test framework
#33
natefleming
closed
2 months ago
1
This PR adds a framework for testing
#32
natefleming
closed
2 months ago
0
add local model cache support for vllm engine
#31
FMurray
opened
2 months ago
1
[FEATURE] Support for offline registration for pre downloaded model artifacts
#30
stikkireddy
opened
2 months ago
0
[FEATURE] Support for existing transformer model registered in mlflow or system.ai
#29
stikkireddy
opened
2 months ago
0
[FEATURE] Huggingface Download Improvements
#28
stikkireddy
opened
2 months ago
0
[MODEL] Llama 3.1 Instruct Support for INT8 W8A16 and W4A16 for 50% and 75% weight memory reduction
#27
stikkireddy
opened
2 months ago
0
[MODEL] Qwen 2 VLM 2B instruct and Qwen 2 VLM 7B instruct
#26
stikkireddy
closed
2 months ago
1
incremental merge
#25
natefleming
closed
2 months ago
0
[BUG] Issue with microsoft/Phi-3.5-vision-instruct for specific image sizes
#24
stikkireddy
closed
2 months ago
1
Evaluate quantization (bitsandbytes)
#23
stikkireddy
closed
2 months ago
1
Support InternLM 2 / InternVL 2
#22
stikkireddy
opened
2 months ago
0
Next