UPDATE - 4pm - All of the affected nodes (see list below) are back online and operational. Unfortunately, due to the nature of the problem, all jobs running on the affected nodes were killed.
We apologize for the inconvenience, and if we can do anything, please let us know (firstname.lastname@example.org).
We are experiencing an issue with a power distribution unit for several racks in the HPC. Running jobs are affected on the following racks:
Jobs in the following partitions are affected:
The Systems Team has been deployed and we hope to have this issue resolved soon. In the meantime, we'll post updates to this page as soon as we have them.