issues
search
nerc-project
/
operations
Issues related to the operation of the NERC OpenShift environment
2
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Onboard vLLM upstream CI project to MOC OpenShift Clusters
#779
dystewart
opened
1 month ago
8
Users are unable to download the EC2 credential from the NERC OpenStack Web Console.
#778
Milstein
opened
1 month ago
3
Update Resource Usage Alerts Requirements
#777
joachimweyl
closed
1 week ago
1
OpenStack Test Cluster New Hardware
#776
joachimweyl
opened
1 month ago
2
Grant Jason Schlessman access to obs cluster
#775
tssala23
closed
1 month ago
9
IBM Autopilot Dashboard - Service Account with Access to Read Nodes
#774
Anish701
closed
4 weeks ago
8
fix: KRUIZE - apply a machine config for a security patch
#773
schwesig
closed
1 month ago
2
Find and run some AI/ML related workload to ensure RHOAI upgrade success
#772
dystewart
opened
1 month ago
1
fix: kruize GPU project: add shreyabiradar07 to kruize-admin
#771
schwesig
closed
1 month ago
3
Create deployment for rhods-notebooks that gathers pod logs
#770
IsaiahStapleton
closed
2 weeks ago
2
Add IBM Autopilot metrics, alerts, and dashboards to GPU clusters
#769
computate
closed
1 month ago
0
NVIDIA GPU Operator gpu-cluster-policy in OperandNotReady state in multiple clusters
#768
computate
opened
1 month ago
24
Create unique RHOAI Accelerator Profiles per GPU Node Type
#767
dystewart
opened
1 month ago
2
OpenShift Virtualization Testing—test non admin users with the Virtualization developer console
#766
computate
closed
1 month ago
2
Issue related to using OpenTofu (Terraform) IaC to create OCP Routes
#765
Milstein
opened
1 month ago
2
OpenShift Virtualization testing - Consider if it's advantageous to build a separate cluster to provide VMs
#764
joachimweyl
opened
1 month ago
1
https://regapp.mss.mghpcc.org/
#763
hpdempsey
opened
1 month ago
6
Ensure AI4DD workloads land on A100 GPU nodes(s) in nerc-ocp-prod cluster
#762
dystewart
closed
2 weeks ago
5
bug: nerc-ocp-obs: Node wrk-0 Low Memory and Not Ready Status
#761
schwesig
closed
2 weeks ago
7
Replace A100 in Beta-Test cluster
#760
tssala23
closed
1 month ago
7
Determine if RHOAI allows distinguishing between different types of GPUs
#759
dystewart
closed
1 month ago
1
Implement mechanism to tie custom resources to coldfront allocations
#758
larsks
opened
1 month ago
2
Add EldritchJS to nerc-logs-metrics team
#757
computate
closed
1 month ago
1
Custom Rbac for Jason Schlessman
#756
tssala23
closed
1 month ago
0
Help prepare notebook workbenches for AI4DD demo
#755
DanNiESh
closed
2 weeks ago
2
OpenShift Virtualization testing - Creating Snapshots & Reverting Back and Forth to Those Snapshots
#754
schwesig
closed
1 month ago
1
Copy of Notify professors of addition of webhooks and how to notify us if they identify any problems
#753
msdisme
closed
1 month ago
1
Find out what courses will be using NERC Spring 2025
#752
msdisme
opened
1 month ago
3
InvalidBackingStoreScaling Error on infra, acm-metrics-backing-store (follow up)
#751
schwesig
opened
1 month ago
1
obs cluster API has expired certificate
#750
computate
closed
1 month ago
16
Updated as Not Needed - OpenShift Virtualization testing - Use Ansible Automation Platform
#749
schwesig
closed
1 month ago
1
Thor: Working On this sprint (09/11to10/02) not (all) tracked here
#748
schwesig
closed
1 month ago
0
Notify professors of addition of webhooks and how to notify us if they identify any problems
#747
msdisme
closed
1 month ago
5
obs cluster degraded state due to worker no able to be drain to apply machineconfig
#746
RH-csaggin
closed
1 week ago
17
InvalidBackingStoreScaling Error on infra, acm-metrics-backing-store
#745
schwesig
closed
1 month ago
1
Add keykloak/github client secrets to vault for rhoai-test cluster
#744
dystewart
closed
1 month ago
1
Allocate dns records and route53 credentials for hypershift cluster
#743
larsks
closed
1 month ago
5
Configure vips and load balancer for hypershift cluster
#742
larsks
closed
4 weeks ago
1
OpenShift Virtualization testing - Consider if it's advantageous to build a separate cluster to provide VMs
#741
aabaris
closed
1 month ago
1
OpenShift Virtualization testing - using custom cloud images to build VMs
#740
aabaris
opened
2 months ago
0
Notify NERC OpenShift production users of planned downtime.
#739
msdisme
opened
2 months ago
4
Disable automatic networks blocks on MIT network(s)
#738
larsks
opened
2 months ago
10
Implement solution for RWX storage for Red Hat ET/InstructLab group
#737
larsks
closed
1 month ago
6
Create POV Document for RHDH Integration into NERC
#736
schwesig
closed
1 month ago
0
Update UDEV rules for 5 V100's in OpenShift Prod
#735
joachimweyl
opened
2 months ago
9
Issue with users' jobs withing a pod isolation
#734
Milstein
closed
2 months ago
2
Create ESI BM Billing sheet
#733
joachimweyl
closed
2 months ago
0
Prepaid - Notify user when their prepaid bucket is running low - discussion
#732
joachimweyl
closed
2 weeks ago
3
Create GitHub team for AI4DD team
#731
dystewart
closed
2 months ago
4
Remove CustomContainerMemoryUsage and CustomContainerCpuUsage Alerts in #alerts-nerc-ocp to Reduce Noise
#730
schwesig
closed
2 months ago
0
Previous
Next