awslabs/data-on-eks
DoEKS is a tool to build, deploy and scale Data & ML Platforms on Amazon EKS
https://awslabs.github.io/data-on-eks/
Apache License 2.0
550 stars
180 forks
issues
feat: Add blueprint for using RayServe with vLLM on gpus
#563
ratnopamc
opened
2 days ago
0
[Feature]: New pattern to deploy and use Unity Catalog
#562
vara-bonthu
opened
1 week ago
0
fix: Dependabot Security Alert Fix for Docusaurus Packages
#561
vara-bonthu
closed
1 week ago
1
NVIDIA NIM LLM Hosting Pattern
#560
hustshawn
opened
1 week ago
2
Enhance pull speed for Large ML container Images with Bottlerocket
#559
ratnopamc
opened
1 week ago
3
feat: Upgrade Spark operator blueprint to EKS 1.30 and fluent-bit changes
#558
vara-bonthu
closed
1 week ago
0
Error: failed to create containerd task: failed to create shim task: OCI runtime create failed
#557
pythonking6
opened
2 weeks ago
1
[jupyterhub] Minimum core managed nodes must be >= 4 ?
#556
asmacdo
opened
2 weeks ago
1
feat: Add Ray head Pod high availability with Redis
#555
ratnopamc
closed
2 weeks ago
0
feat: Jupyterhub blueprint upgrade
#554
vara-bonthu
closed
2 weeks ago
1
Support S3 gateway endpoint for EMR on EKS
#553
hitsub2
opened
3 weeks ago
1
Ray Logging and Dashboard Metrics Export to S3 with Custom Dashboard for Historical Clusters
#552
vara-bonthu
opened
3 weeks ago
0
Ray Observability with Prometheus and AMP
#551
vara-bonthu
opened
3 weeks ago
0
feat: Upgrade ray version on mistral7b on inf2 blueprint
#550
ratnopamc
closed
3 weeks ago
0
fix: Triton deployment path fix
#549
vara-bonthu
closed
3 weeks ago
0
feat: tags + security group and karpenter helm config updates
#548
sanjeevrg89
closed
3 weeks ago
0
vLLM with RayServe pattern
#547
shivam-dubey-1
opened
3 weeks ago
0
Fix minor typo in path
#546
guikcd
closed
3 weeks ago
0
Propose a request to add a feature to Self managed Airflow
#545
sayakin0519
opened
3 weeks ago
0
Llama-3 on Inferentia generates infinite and meaningless output
#544
yubingjiaocn
opened
1 month ago
0
Incorrect POD name "aws-cli-cmd-shell" given in the instructions.
#543
AbrahamArellano
opened
1 month ago
1
fix: Incorrect command to provide Linux permission on the AWS Trainium on EKS Blueprint #533
#542
AbrahamArellano
closed
1 month ago
1
feat: Updates for Inferentia-Trainium workshop
#541
sanjeevrg89
closed
1 month ago
0
feat: Llama2 inf2 Ray inference upgrade and bug fix
#540
vara-bonthu
closed
1 month ago
1
How to run Data EKS Gen AI models with limited EC2 vCPUs service quota?
#539
Gall-oDrone
opened
1 month ago
1
fix: RayServe script name mismatch in Dockerfile
#538
charan-amzn
closed
1 month ago
0
JARK Stack - Error while launching training step in the dogbooth Jupyter notebook
#537
rivasdam
opened
1 month ago
2
feat: New Gen AI pattern - Llama2 Distributed Pre-training on Trn1 with RayTrain and KubeRay Operator
#536
vara-bonthu
closed
1 month ago
1
feat: NVIDIA Triton server Blueprint with vLLM
#535
vara-bonthu
closed
3 weeks ago
1
Make it possible to disable kuberay-operator
#534
askulkarni2
closed
1 day ago
3
Incorrect command to provide Linux permission on the AWS Trainium on EKS Blueprint
#533
AbrahamArellano
opened
1 month ago
2
[Website] Add Scalability Best Practices & Considerations for DoEKS Workloads
#532
brianhammons
closed
4 days ago
2
fix: Update installation script for Stable diffusion GPU blueprint
#531
ratnopamc
closed
1 month ago
0
docs: Improve Superset on EKS doc
#530
sguruvar
closed
1 month ago
5
Failing to schedule pod with default configuration
#529
JM322
closed
1 week ago
5
feat: Add a pattern for llama3 on inferentia2
#528
askulkarni2
closed
1 month ago
0
Add kuberay/trainium llama training example - DRAFT
#527
5cp
closed
2 weeks ago
1
fix: Updated sec fix to Github workflows to pr commit sha
#526
vara-bonthu
closed
1 month ago
1
Re-introduce plan-examples.yml with a proper fix
#525
askulkarni2
opened
1 month ago
0
chore: Remove pull_request_target from plan examples workflow
#524
askulkarni2
closed
1 month ago
0
chore: Modified the eks version to 1.29 as defined in the issue #520
#523
manjarisri
opened
1 month ago
3
docs: Adding Clickhouse blueprint doc
#522
linesa-dot
closed
1 month ago
0
feat: Triton server vllm blueprint enhancements
#521
ratnopamc
closed
1 month ago
0
Chore: Kubernetes cluster version upgrades
#520
raykrueger
closed
1 week ago
3
feat: Observability Tooling
#519
omrishiv
opened
1 month ago
1
[Inference]: NVIDIA Triton Server with vLLM gen-ai pattern
#518
vara-bonthu
closed
1 week ago
2
[Inference]: Llama3 on Inf2 with Trainium-inferentia blueprint with RayServe
#517
askulkarni2
closed
1 month ago
0
feat: Spark Streaming using Spark Operator
#516
lusoal
closed
1 month ago
1
Update documentation for JupyterHub on EKS solution
#515
petrokashlikov
opened
1 month ago
2
fix: Remove trailing curly from wildcard bucket arn
#514
dacort
closed
1 month ago
0