issues
search
nerc-project
/
operations
Issues related to the operation of the NERC OpenShift environment
1
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Google Sheet updates for July Invoice Data
#636
joachimweyl
opened
5 hours ago
0
Track down non billable Lenovo Loaned GPU usage
#635
joachimweyl
opened
8 hours ago
0
fix: Improve ArgoCD stability by adjusting resource allocation
#634
schwesig
opened
1 day ago
0
Infra cluster externalsecrets are failing to sync
#633
dystewart
opened
4 days ago
8
Configure NERC TORS switches in MOCA with new Core Switches
#632
hakasapl
opened
4 days ago
0
Schedule downtime to move existing NERC TORS switches to MOCA
#631
hakasapl
opened
4 days ago
0
Write a switch driver for Cisco NXOS for ansible-switches
#630
hakasapl
opened
4 days ago
0
Order 2x Aggregation Core Switches for NERC Core
#629
hakasapl
opened
4 days ago
0
Brainstorming for InstructLab on MOC
#628
joachimweyl
opened
6 days ago
0
Details on usage for ope classes Spring 2024 to verify or correct the cost allocation and calculation
#627
schwesig
opened
6 days ago
0
kruize project 4 - Closing the project (followup #580)
#626
schwesig
opened
6 days ago
0
kruize project 3 - Support the tests (followup #580)
#625
schwesig
opened
6 days ago
0
kruize project 2 - Dedicated project cluster (followup #580)
#624
schwesig
opened
6 days ago
10
kruize project 1 - Interim solution on prod cluster (followup #580)
#623
schwesig
opened
6 days ago
5
Add AWS Route 53 and GH OAuth secrets to vault
#622
tssala23
opened
1 week ago
3
Ensure H100s can be moved around (most likely through ESI)
#621
joachimweyl
opened
1 week ago
3
Delegate nerc.mghpcc.org to nerc route53 nameservers
#620
larsks
opened
1 week ago
3
Add records for nerc-ocp-test-2 cluster to nerc.mghpcc.org
#619
larsks
closed
1 week ago
1
Monitor effects on timeouts and performance. (follow up #473 & #596)
#618
schwesig
opened
1 week ago
3
Access to nerc-DNS repo
#617
tssala23
closed
1 week ago
6
Update base ope image with newer package versions
#616
DanNiESh
opened
1 week ago
0
Test and run updated ope image
#615
DanNiESh
opened
1 week ago
0
Connect WEKA system to OpenShift Test cluster
#614
joachimweyl
opened
2 weeks ago
0
cancelled - Allocate 4 GPU nodes for the test cluster NERC Project 408 KruizeOptimization
#613
schwesig
closed
6 days ago
5
Access for James Kunstle to GPUs
#612
hpdempsey
closed
3 weeks ago
1
Red Hat access needed to GPU
#611
hpdempsey
closed
3 weeks ago
2
Clarify on nerc.mghpcc.org and documentation that NERC is not limited to New England organizations
#610
msdisme
opened
3 weeks ago
0
Google Sheet updates for June Invoice Data
#609
joachimweyl
closed
5 hours ago
0
NERC RHOAI failing to mount volume
#608
Milstein
closed
2 weeks ago
4
Create a new project in test cluster for Orran and his student to try out their book
#607
DanNiESh
closed
2 weeks ago
3
Disable ASLR in ope-test namespace in test cluster
#606
DanNiESh
closed
3 weeks ago
2
NERC OpenShift Infra Upgrade to 4.15
#605
joachimweyl
opened
3 weeks ago
3
ArgoCD cluster-scope-test and cluster-scope-prod apps are reported unknown
#604
dystewart
closed
3 weeks ago
1
nerc-ocp-infra clustersecretstore is offline
#603
larsks
closed
3 weeks ago
2
What should the minimum cpu or memory request be?
#602
naved001
opened
4 weeks ago
3
Manual Invoicing for RHELAI GPU usage
#601
joachimweyl
closed
5 hours ago
4
Create scaffolding of how to allow PIs to set max dollar value they wish their project to use in a single month
#600
joachimweyl
opened
1 month ago
0
NFD Operator not working on Infra and OBS
#599
tssala23
closed
1 month ago
2
openstack controller ctl-0 fails to boot due to dead onboard battery
#598
aabaris
closed
1 month ago
3
KNative Deployment Pipelines
#597
cbolles
opened
1 month ago
11
Follow Up #473 Timeouts between pods: Adding 3 Nodes to the infra-cluster to follow RH support recommendation
#596
schwesig
closed
1 week ago
10
Move 4 A100SXM4 GPU nodes out of prod and available for RHELAI
#595
joachimweyl
closed
3 weeks ago
38
Get details about requirements for RHEL AI machines
#594
larsks
closed
2 weeks ago
3
Test Cluster still down after maintainance
#593
schwesig
closed
1 month ago
2
Google Sheet updates for May Invoice Data
#592
joachimweyl
closed
3 weeks ago
0
Manually add a prepaid row to the MGHPCC invoice for Prepaid Group
#591
joachimweyl
opened
1 month ago
0
Finalize documentation for Prepaid Invoicing
#590
joachimweyl
opened
1 month ago
0
Roadmap for gpu testing with rhoai in the nerc-ocp-test cluster
#589
dystewart
closed
1 week ago
10
Update ColdFront text "Expired" to "Active (Needs Renewal)"
#588
joachimweyl
closed
3 weeks ago
3
Update NERC Documentation (MB/GB/TB -> MiB/GiB/TiB)
#587
joachimweyl
closed
2 weeks ago
8
Next