Submitted by Gregory Bauer on

DeltaAI Resource Users,

This is a reminder that the DeltaAI system will undergo planned maintenance on Wednesday, January 7th, from approximately 6 AM to 10 PM (Central Time). Changes will include:

  • Upgrade of Slingshot FMS from 2.2.0 to 2.3.1.
  • Upgrade of Slurm from 23.02 to 25.11.
  • Maintenance on facility power feed for DeltaAI.

Available environment modules will NOT change. Nevertheless, please check your jobs once the scheduler resumes scheduling.

During the maintenance period:

  • All compute and login nodes will be unavailable.
  • DeltaAI Open OnDemand will be unavailable.
  • Globus endpoints will remain available. The work and projects filesystems will also be accessible from Delta.
  • A reservation will be in place to prevent jobs from running into the maintenance period.

As January 7th approaches, please check the time limits of your jobs. While the reservation drains the available nodes, a job can start only if its requested time limit lets it finish before the reservation begins. Shortening the time limit so that the job ends before the start of the reservation will allow it to run.
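As an illustration of the time-limit adjustment, the following minimal sketch computes the longest time limit that still ends before the maintenance reservation and formats it as a Slurm `days-hours:minutes:seconds` string. The function name `slurm_time_limit`, the 15-minute safety margin, and the year 2026 are assumptions made for this example and are not part of the announcement. Both timestamps are treated as Central Time.

```python
from datetime import datetime, timedelta

# Assumption: the announcement gives no year; 2026 is used for illustration.
# Wednesday, January 7th, 6 AM, expressed as a naive Central Time timestamp.
MAINTENANCE_START = datetime(2026, 1, 7, 6, 0)


def slurm_time_limit(now, start=MAINTENANCE_START, margin=timedelta(minutes=15)):
    """Return the longest Slurm time limit (D-HH:MM:SS) that ends before `start`.

    `now` and `start` must be in the same time zone. `margin` is a
    hypothetical safety buffer, not a site requirement. Returns None when
    no time is left before the reservation begins.
    """
    remaining = start - now - margin
    if remaining <= timedelta(0):
        return None

    total_seconds = int(remaining.total_seconds())
    days, rem = divmod(total_seconds, 86400)
    hours, rem = divmod(rem, 3600)
    minutes, seconds = divmod(rem, 60)
    return f"{days}-{hours:02d}:{minutes:02d}:{seconds:02d}"


if __name__ == "__main__":
    limit = slurm_time_limit(datetime.now())
    if limit is None:
        print("No time remains before the maintenance reservation.")
    else:
        print(f"Longest time limit that ends before maintenance: {limit}")
```

The resulting string can be passed to `sbatch --time=<limit>` when submitting a job. The actual reservation window on the system can be checked with `scontrol show reservation`, and that output should take precedence over the hard-coded start time used in this sketch.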

The resource scheduler will resume once the maintenance is complete.

The other previously announced upgrades have been deferred:

  • OS upgrade from COS 24.8.0 / SLES 15 SP5 to SLES 15 SP6.
  • Upgrade of the HPE Cray Programming Environment from 24.07 to 25.09.
  • Upgrade of NVIDIA HPC SDK from 24.03 (CUDA 11.8 + 12.3) to 25.5 (CUDA 11.8 + 12.9).
  • Upgrade of NVIDIA driver from 550 series to 570 series.
  • Upgrade of Slingshot SHS from 11.0.0 to 13.0.0.

We invite users to test their codes on nodes where these deferred upgrades have been applied. Please create a ticket for additional information. Available environment modules in the test environment WILL change.

Thank you!

Infrastructure News Type: Outage Full