Outage and reconfiguration news about ACCESS integrated RP resources and ACCESS' online services

Subject Type When Affected Infrastructure
FASTER Maintenance, October 15 Outage Full
Texas A&M University Dell Cluster with Intel Ice Lake and LIQID (FASTER)
Upgrade registry.access-ci.org Reconfiguration
ACCESS User Identity and OAuth Client Registry
ACES Partial Maintenance, October 9-10 Outage Partial
Texas A&M University Accelerated Computing for Emerging Sciences (ACES) Cluster
Bridges-2 Maintenance October 8-10, 2024 Outage Full
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
PSC Bridges-2 Storage (Bridges-2 Ocean)
Testing Broadcast of Infrastructure News Outage Full
ACCESS Support Portal
ACCESS XDMoD Downtime Outage Partial
ACCESS Metrics on Demand Service (XDMoD)
Reconfiguration of registry.access-ci.org to use DynamoDB instead of LDAP Reconfiguration
ACCESS User Identity and OAuth Client Registry
DUO Authentication Maintenance Saturday September 7 2024 05:00am EDT Outage Partial
ACCESS Allocations Portal
ACCESS Identity Management Service
ACCESS User Identity and OAuth Client Registry
ACCESS Website
ACCESS Support Portal
ACCESS DUO Multi-factor Authentication
idp.access-ci.org Upgrade Reconfiguration
ACCESS Identity Management Service
Bridges-2 Ocean Filesystem Issues Persist Degraded
PSC Bridges-2 Storage (Bridges-2 Ocean)
Expanse maintenance - 5AM (PT) 07/24/2024 to 5AM (PT) 07/25/2024 Outage Full
SDSC Dell Cluster with AMD Rome HDR IB (Expanse)
SDSC Dell Cluster with NVIDIA V100 GPUs NVLINK and HDR IB (Expanse GPU)
SDSC Expanse Project Storage
Bridges-2 Ocean Filesystem Issues Degraded
PSC Bridges-2 Storage (Bridges-2 Ocean)
Kerberos Replica Potential Outage Degraded
ACCESS Kerberos Authentication Service
Upgrade registry.access-ci.org Degraded
ACCESS User Identity and OAuth Client Registry
CILogon logins are failing for some users Outage Partial
ACCESS Identity Management Service
Bridges-2 Jet Maintenance Monday July 15 Outage Full
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 Storage (Bridges-2 Ocean)
Jira Service Management Incident - Some products are hard down - 3 July 2024 Degraded
ACCESS Ticket System (JSM or RT)
Bridges-2 Continued Maintenance Degraded
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 Storage (Bridges-2 Ocean)
Bridges-2 Maintenance Monday, July 1 - Tuesday, July 2 Outage Full
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
PSC Bridges-2 Storage (Bridges-2 Ocean)
Important Update: Changes to Ticket Automation and Status Updates Reconfiguration
ACCESS Ticket System (JSM or RT)
Bridges-2 Degradation Outage Full
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 Storage (Bridges-2 Ocean)
Ticketing System Automation Rules Currently Not Functioning Outage Partial
ACCESS Ticket System (JSM or RT)
Bridges-2 Scheduling Disruption Degraded
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 Storage (Bridges-2 Ocean)
Bridges-2 Globus GridFTP Interruption Degraded
PSC Bridges-2 Storage (Bridges-2 Ocean)
Test System Status Reconfiguration
ACCESS Support Portal
Bridges-2 Unscheduled Maintenance Outage Full
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 Storage (Bridges-2 Ocean)
ACES Partial Unavailability, May 29-30 Outage Partial
Texas A&M University Accelerated Computing for Emerging Sciences (ACES) Cluster
ACCESS XDMoD Downtime Outage Full
ACCESS Metrics on Demand Service (XDMoD)
Bridges-2 Extended Maintenance May 21-24 Outage Full
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 Storage (Bridges-2 Ocean)
SDSC Expanse Maintenance 8AM-8PM (PT), Monday, May 20, 2024 Outage Full
SDSC Dell Cluster with AMD Rome HDR IB (Expanse)
SDSC Dell Cluster with NVIDIA V100 GPUs NVLINK and HDR IB (Expanse GPU)
SDSC Expanse Project Storage
Delta Notice: Delta maintenance 05-16-2024 from 6:30 AM to 6:30 PM Central Outage Full
NCSA Delta CPU (Delta CPU)
NCSA Delta GPU (Delta GPU)
NCSA Delta Storage (Delta Storage)
Anvil Cluster Maintenance - Partial Outage Partial
Purdue Anvil CPU
Bridges-2 VM Maintenance Monday, May 6 Outage Partial
PSC Bridges-2 Storage (Bridges-2 Ocean)
ACCESS XDMoD Downtime Outage Partial
ACCESS Metrics on Demand Service (XDMoD)
ACES Partial Availability, April 22-26 Reconfiguration
Texas A&M University Accelerated Computing for Emerging Sciences (ACES) Cluster
Delta compute and login nodes to be rebooted Outage Full
NCSA Delta CPU (Delta CPU)
NCSA Delta GPU (Delta GPU)
NCSA Delta /scratch file system exhibiting intermittent availability and performance issues Degraded
NCSA Delta Storage (Delta Storage)
Ookami has two new NVIDIA Grace CPUs (144 cores each) Reconfiguration
IACS at Stony Brook Ookami
Bridges-2 Maintenance Friday April 12 Outage Full
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
Delta Post-Maintenance Status Update Outage Partial
NCSA Delta CPU (Delta CPU)
NCSA Delta GPU (Delta GPU)
Ticket System (JSM) Issues with creating issues and transitions Outage Partial
ACCESS Ticket System (JSM or RT)
Delta Projects file system maintenance Thursday March 14th, 2024 Outage Partial
NCSA Delta CPU (Delta CPU)
NCSA Delta GPU (Delta GPU)
Bridges-2 Maintenance Wednesday March 13, 2024 Outage Full
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
PSC Bridges-2 Storage (Bridges-2 Ocean)
Anvil Cluster Maintenance Outage Full
Purdue Anvil CPU
Purdue Anvil GPU
Bridges-2 Maintenance Tuesday, March 5, 2024 Outage Full
PSC Bridges-2 Storage (Bridges-2 Ocean)
PSC Bridges-2 Regular Memory (Bridges-2 RM)
PSC Bridges-2 GPU (Bridges-2 GPU)
PSC Bridges-2 Extreme Memory (Bridges-2 EM)
Kerberos Outage Outage Full
ACCESS Kerberos Authentication Service
ACCESS Ticket System degraded Feb. 28, 2024 through Mar. 1, 2024 Outage Partial
ACCESS Ticket System (JSM or RT)
Anvil SLURM intermittent issues - Issue isolated, Outage resolved Outage Partial
Purdue Anvil CPU
Purdue Anvil GPU
SDSC Expanse maintenance, 8AM-4PM (PT), Monday, 02/12/2024 [Completed] Outage Full
SDSC Dell Cluster with AMD Rome HDR IB (Expanse)
SDSC Dell Cluster with NVIDIA V100 GPUs NVLINK and HDR IB (Expanse GPU)
SDSC Expanse Project Storage
Anvil Scheduled Maintenance Wednesday, February 7th, 2024 Outage Full
Purdue Anvil CPU
Purdue Anvil GPU