ARC Cluster Status: Difference between revisions

From RCSWiki
Jump to navigation Jump to search
No edit summary
No edit summary
Line 33: Line 33:


{{Message of the day item
{{Message of the day item
| title = ⚠️ Filesystem Issues
| title = Filesystem Issues
| date = 2023/02/28
| date = 2023/02/28
| message =
| message =
Line 45: Line 45:


{{Message of the day item
{{Message of the day item
| title = ⚠️ Filesystem Issues
| title = Filesystem Issues
| date = 2023/03/1
| date = 2023/03/1
| message =
| message =
Line 53: Line 53:


Thank you for your patience.
Thank you for your patience.
}}
{{Message of the day item
| title = ⚠️ ARC Login node reboot
| date = 2023/03/2
| message =
The ARC login node will be rebooted this afternoon for an emergency maintenance. This downtime is needed to help mitigate the filesystem slowdowns experienced on the login node.
All logins to the ARC login node will be terminated at 3:00PM and will remain unavailable until 4:00PM.
We apologize for the inconvenience and thank you for your patience.
}}
}}

Revision as of 20:32, 2 March 2023

ARC status: Cluster operational


Open OnDemand will be rebooted Oct 17, 2023.

See the ARC Cluster Status page for system notices.

System Messages

January System Updates - 2023/01/01

Beginning January 16, 2023, the ARC cluster will undergo operating system updates. We shall do our utmost to minimize disruption and allow ongoing jobs to be completed. New jobs may be temporarily held from scheduling.

The ARC login node will reboot on the morning of January 16. Please save your work and log out if possible.

The upgrade is planned to be fully complete by January 20.

If you encounter any system issues, do not hesitate to let us know.

Thank you for your cooperation.

System Updates Completed - 2023/01/24

The upgrade has been completed. The following has been changed:
  • OS Updated to Rocky Linux 8.7
  • Slurm updated to 22.05.7
  • Apptainer replaces Singularity
  • Each job will have its own /tmp, /dev/shm, /run/user/$uid mounted

If you encounter any system issues, do not hesitate to let us know.

Thank you for your cooperation.

Filesystem Issues - 2023/02/28

We are currently investigating a filesystem issue that is causing filesystem slowdowns across ARC.

We will update you with more information as it becomes available.

Thank you for your patience.


Filesystem Issues - 2023/03/1

We are still currently investigating a filesystem issue that is causing filesystem slowdowns across ARC. Some jobs on ARC have been paused to help us find the root cause of the slowdowns.

We will update you with more information as it becomes available.

Thank you for your patience.


⚠️ ARC Login node reboot - 2023/03/2

The ARC login node will be rebooted this afternoon for an emergency maintenance. This downtime is needed to help mitigate the filesystem slowdowns experienced on the login node.

All logins to the ARC login node will be terminated at 3:00PM and will remain unavailable until 4:00PM.

We apologize for the inconvenience and thank you for your patience.