Think Login Node Status: Difference between revisions

From RCSWiki
Line 11:
  {{Message of the day item
  | title = Notice of Upcoming Partial Outage
- | date = 2024/08/27
+ | date = 2024/08/23
  | message = Several compute nodes from the ARC cluster will be unavailable
  from Sept 23 to Sept 27 inclusive (subject to change). Some Think GPU nodes will be affected during this maintenance window. These nodes will return to service as soon as the work is complete.
  }}
+ {{Message of the day item
+ | title = Partial Outage Update I
+ | date = 2024/08/25
+ | message = Due to hardware issues blocking our original maintenance window, most compute nodes that were taken offline on Monday have been brought back online today. An additional partial outage will begin next Tuesday for the same nodes.
+ On Tuesday, October 1, 2024, the compute nodes in cpu2019, cpu2021, cpu2022, gpu-v100, gpu-a100, and most nodes from bigmem will be unavailable until Friday, October 4, 2024.
+ We apologise for the inconvenience.
+ }}


[[Category:ARC]]
{{Navbox ARC}}

Revision as of 20:40, 25 September 2024

ARC status: Cluster operational


System is operational. No updates are planned.

See the ARC Cluster Status page for system notices.

System Messages

Systems Operating Normally - 2024/09/03

The ARC Cluster and the Think login node are operational. No upcoming upgrades are planned.

Notice of Upcoming Partial Outage - 2024/08/23

Several compute nodes from the ARC cluster will be unavailable from Sept 23 to Sept 27 inclusive (subject to change). Some Think GPU nodes will be affected during this maintenance window. These nodes will return to service as soon as the work is complete.

Partial Outage Update I - 2024/08/25

Due to hardware issues blocking our original maintenance window, most compute nodes that were taken offline on Monday have been brought back online today. An additional partial outage will begin next Tuesday for the same nodes.

On Tuesday, October 1, 2024, the compute nodes in cpu2019, cpu2021, cpu2022, gpu-v100, gpu-a100, and most nodes from bigmem will be unavailable until Friday, October 4, 2024.

We apologise for the inconvenience.