issues
search
GoogleCloudPlatform
/
ai-on-gke
AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine
Apache License 2.0
194
stars
143
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Configure TPU Provisioner tests to run in CI
#659
danielvegamyhre
closed
2 months ago
0
Provision custom service accounts for node pools with minimum roles
#658
gtsorbo
closed
2 months ago
3
feat: add custom service accounts for node pools
#657
gtsorbo
closed
2 months ago
0
Allow provisioner to be configured to force on-demand nodes & disable auto-upgrade
#656
nstogner
closed
2 months ago
1
Bump jinja2 from 3.1.2 to 3.1.4 in /tutorials-and-examples/genAI-LLM/e2e-genai-langchain-app/src/backend
#655
dependabot[bot]
opened
2 months ago
0
Bump werkzeug from 3.0.1 to 3.0.3 in /tutorials-and-examples/genAI-LLM/e2e-genai-langchain-app/src/backend
#654
dependabot[bot]
opened
2 months ago
0
Bump jinja2 from 3.1.3 to 3.1.4 in /benchmarks/benchmark/tools/locust-load-inference/locust-docker/locust-tasks
#653
dependabot[bot]
closed
2 months ago
1
Bump werkzeug from 2.3.8 to 3.0.3 in /benchmarks/benchmark/tools/locust-load-inference/locust-docker/locust-tasks
#652
dependabot[bot]
closed
2 months ago
1
Bump werkzeug from 3.0.1 to 3.0.3 in /applications/rag/frontend/container
#651
dependabot[bot]
opened
2 months ago
0
Metrics support for Average Time To First Token
#650
kfswain
closed
2 months ago
0
RAG Application - release-1.1 - Failing running Terraform
#649
vmasilva
opened
2 months ago
1
Adding chat history to RAG app and refactor to better utilize LangChain
#648
alpha-amundson
opened
2 months ago
4
Bump tqdm from 4.66.1 to 4.66.3 in /tutorials-and-examples/genAI-LLM/e2e-genai-langchain-app/src/backend
#647
dependabot[bot]
opened
2 months ago
0
Bump tqdm from 4.66.1 to 4.66.3 in /best-practices/gke-batch-refarch/06_jobset
#646
dependabot[bot]
opened
2 months ago
0
TPU Provisioner: JobSet related fixes
#645
nstogner
closed
2 months ago
1
Cherry-pick #643 to release-1.1 branch
#644
roberthbailey
closed
3 months ago
1
Add extra debugging information to the assert statements in the jupyter hub tests.
#643
roberthbailey
closed
3 months ago
2
[TPU Provisioner] Fix dockerfile and bump go to 1.22
#642
danielvegamyhre
closed
3 months ago
1
Document GMP error in RAG README troubleshooting section
#641
artemvmin
closed
3 months ago
1
Make the references to the namespace consistent in the README
#640
soorena776
closed
3 months ago
0
Pin the terraform import paths on the release branch to the v1.1.2 release tag
#639
roberthbailey
closed
2 months ago
10
Update README.md
#638
soorena776
closed
3 months ago
0
Cherry-pick #635 to release-1.1 branch
#637
roberthbailey
closed
3 months ago
1
Add Stable Diffusion XL inference using MaxDiffusion
#636
rick-c-goog
closed
3 months ago
2
Update RAG to use Autopilot by default
#635
artemvmin
closed
3 months ago
1
oauth2/google: invalid token JSON from metadata: EOF
#634
kzos
opened
3 months ago
4
Set the namespace in the sample configurations for the rag, ray, & jupyter applications to `ai-on-gke`
#633
roberthbailey
closed
3 months ago
3
Cherry-pick #631 to release-1.1 branch
#632
roberthbailey
closed
3 months ago
1
Run QSS for Ray & Jupyter on autopilot.
#631
roberthbailey
closed
3 months ago
1
Update README.md
#630
soorena776
closed
3 months ago
0
Creating User for models that support gRPC requests. (Currently bespo…
#629
kfswain
closed
3 months ago
2
Error 400: Autopilot clusters must be regional clusters
#628
kzos
closed
3 months ago
2
Cherry-pick #599 and #618 to release-1.1
#627
roberthbailey
closed
3 months ago
1
[TPU Provisioner] Delete node pool when JobSet is completed, failed, or deleted
#626
danielvegamyhre
closed
3 months ago
2
Add soruce files for Jestream Pytorch on GKE single host user guide
#625
vivianrwu
closed
2 months ago
8
Add private_cluster_configuration instead of enable_private_endpoint.…
#624
hubatish
closed
2 months ago
1
Add source files for single host Jetstream Pytorch on GKE user guide
#623
vivianrwu
closed
3 months ago
0
Change refs
#622
gongmax
closed
3 months ago
0
Fetch the cached weights for Mistral-7B-Instruct-v0.1 from GCS bucket…
#621
gongmax
closed
3 months ago
1
Parallel all the applications within cloudbuild test
#620
chiayi
closed
2 months ago
62
QSS default to AP cluster and Mistral fix cherry pick
#619
gongmax
closed
3 months ago
2
Set the default GKE cluster type for ray to GKE Autopilot.
#618
roberthbailey
closed
3 months ago
1
Fixed redirect
#617
arueth
closed
3 months ago
3
Symlink for gke-batch-refarch
#616
alizaidis
closed
3 months ago
0
Pass ICI Resiliency label to Node Pool Creation Request
#615
SidneyShen
closed
3 months ago
3
TPU Provisioner reliability improvements
#614
danielvegamyhre
closed
3 months ago
5
Add notebook file and readme
#613
bkauf
closed
3 months ago
0
quick fix or rag prompt test output
#612
chiayi
closed
3 months ago
0
quick fix or rag prompt test output
#611
chiayi
closed
3 months ago
0
Update README.md
#610
elfinhe
opened
3 months ago
0
Previous
Next