issues
search
NVIDIA
/
nim-deploy
A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.
https://build.nvidia.com/
Apache License 2.0
141
stars
64
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add instructions to deploy NIMOperator on Azure AKS
#106
edemiraydin
opened
2 days ago
0
added improvements to nim on vertex notebook
#105
juanpabloguerra16
opened
4 days ago
0
update inference ami version in sagemaker endpoint config to fix nvml driver issue
#104
kshitizgupta21
closed
4 days ago
0
add aws marketplace notebooks for LLama 8b, 70B, mixtral and nemotron 15B NIM
#103
kshitizgupta21
closed
1 week ago
1
Deploy NIMs on AzureML (Jupyter notebook method) - Signed commit
#102
vikalluru
closed
1 week ago
0
[CLOSED] Deploy NIMs on AzureML (Jupyter notebook method)
#101
vikalluru
closed
1 week ago
1
add aws marketplace sample notebook for nemotron 15B NIM for sagemaker deployment
#100
kshitizgupta21
closed
1 week ago
0
UpdatingCLIAzureMLNIMDeployment_for_demo
#99
mreyesgomez
opened
1 month ago
0
nvml error when deploying NIM to AWS sagemaker
#98
nkumaraws
opened
1 month ago
1
Deploying on Google Cloud run
#97
jruokola
opened
1 month ago
0
Add AWS EKS deployment files and documentation for K8s NIM Operator
#96
edemiraydin
closed
1 month ago
1
Link to NIM on GKE added in README
#95
abhisheksawarkar
closed
1 month ago
0
Add NIM K8s Operator instructions for EKS
#94
edemiraydin
closed
1 month ago
0
Add support to pull NIM profiles from GCS cache
#93
pwschuurman
opened
1 month ago
1
deploying nim on k8s. With this custom-value.yaml, 3.1 8b model can be deployed but 70b failed.
#92
SarielMa
opened
1 month ago
0
Issue with Pod Termination in InferenceService using nvcr.io/nim/meta/llama-3.1-8b-instruct:1.1.2 - No Terminate Signal Sent to Nim Server
#91
test-1pro
opened
2 months ago
1
GKE / GCS Caching and NGC API Key update #82 #83
#90
sujituk
closed
2 months ago
1
GKE GCS Caching #82 #83
#89
sujituk
closed
2 months ago
2
Fix invalid secret key name in GKE TF
#88
liveaverage
closed
2 months ago
1
The default command with persistance.enabled results in fail to deploy state with no automatic PV provisioner
#87
kjw3
opened
2 months ago
0
Switch git clone command to use https
#86
kjw3
opened
2 months ago
0
Submitting PR for DigitalOcean Kubernetes (DOKS)
#85
bikram20
opened
2 months ago
0
Add oci support
#84
adinadiana1234
opened
2 months ago
1
fix: rename ngc api key;issue #82
#83
sujituk
closed
2 months ago
7
Variable name "NGC_CLI_API_KEY" needs to be updated
#82
tfrantzen
opened
2 months ago
1
Add basic example of NIM with Run.ai inference
#81
mlsorensen
opened
2 months ago
1
Add Test NIM example to cloudrun README
#80
dfisk
closed
1 month ago
0
Docs/runai additions
#79
resker
closed
2 months ago
3
helm: update the chart to version 1.1.2
#78
crookedstorm
closed
2 months ago
0
Add Embedding NIM example for NVCF
#77
kylehh
closed
1 month ago
0
Update NIM on CloudRun README
#76
FortunaZhang
closed
3 months ago
0
Add NVCF deployment of embedding model example
#75
kylehh
closed
3 months ago
0
Add GCP CloudRun example
#74
supertetelman
closed
3 months ago
1
GCP CloudRun NIM example L4 llama3-8b-instruct:1.0.0
#73
dfisk
closed
3 months ago
1
NGC_CLI_API_KEY changes to NGC_API_KEY
#72
edemiraydin
closed
1 month ago
0
AWS EKS deployment parameterization and documentation improvements
#71
edemiraydin
closed
3 months ago
0
Python test to try out function calling support
#70
vikalluru
closed
1 week ago
1
Add a reference to the existing Hugging Face deployment guide
#69
supertetelman
closed
3 months ago
0
Add detect-nv-keys precommit hook
#68
supertetelman
closed
3 months ago
0
Docs updates for disclaimer update and discoverability
#67
supertetelman
closed
3 months ago
0
gke updates; ReadMe.md, tfvars
#66
sujituk
closed
3 months ago
0
NIM on GCP VertexAI Python First Release
#65
FortunaZhang
closed
3 months ago
1
Error to deploy `llama-3.1-8b-instruct:1.1.1` using downloaded model repository with modelcar and kserve
#64
xieshenzh
opened
3 months ago
2
NIM_SERVED_MODEL_NAME does not work for certain models
#63
xieshenzh
opened
3 months ago
0
InvalidHeaderValue
#62
nemerna
opened
3 months ago
0
RuntimeError: CUDA error: no kernel image is available for execution on the device
#61
JIA-HONG-CHU
opened
3 months ago
0
Unknown RoPE scaling type {scaling_type}
#60
test-1pro
closed
2 months ago
1
Llm helm update
#59
supertetelman
closed
3 months ago
1
NIM on GCP Vertex AI Python
#58
FortunaZhang
closed
3 months ago
3
ReadMe Update to use storage class
#57
angudadevops
closed
3 months ago
1
Next