Altis Login Node Status: Difference between revisions
No edit summary |
No edit summary |
||
(4 intermediate revisions by 2 users not shown) | |||
Line 16: | Line 16: | ||
}} | }} | ||
{{Message of the day item | |||
| title = Partial Outage Update I | |||
| date = 2024/09/25 | |||
| message = Due to hardware issues that is blocking our original maintenance window, most compute nodes that were taken offline on Monday has been brought back online today. An additional partial outage will occur again starting next Tuesday for the same nodes. | |||
On Tuesday, October 1, 2024, the compute nodes in cpu2019, cpu2021, cpu2022, gpu-v100, gpu-a100, and most nodes from bigmem will be unavailable until Friday October 4, 2024. Affected WDF-Altis GPU nodes include: wdfgpu[1-2,6,8-12]. | |||
We apologise for the inconvenience. | |||
}} | |||
{{Message of the day item | |||
| title = Partial Outage Update II | |||
| date = 2024/10/04 | |||
| message = The maintenance window will be extended until at least Monday, October 7, 2024 due to a power distribution issue in our renovated data centre. | |||
Currently, the compute nodes in cpu2019, cpu2021, cpu2022, gpu-v100, gpu-a100, and most nodes from bigmem will be unavailable until at least Monday, October 7, 2024. Affected WDF-Altis GPU nodes include: wdfgpu[1-2,6,8-12]. | |||
We apologize for the extended downtime and will update you as soon as we have additional information from our operations team. | |||
}} | |||
{{Message of the day item | |||
| title = Partial Outage Update III | |||
| date = 2024/10/07 | |||
| message = Due to technical issues beyond our control the maintenance window will be extended until at least Tuesday, October 15, 2024. | |||
Currently, the compute nodes in cpu2019, cpu2021, cpu2022, gpu-v100, gpu-a100, and most nodes from bigmem will be unavailable until at least Tuesday, October 15, 2024. Affected WDF-Altis GPU nodes include: wdfgpu[1-2,6,8-12]. | |||
We apologize for the extended downtime and will update you as soon as we have additional information from our operations team. | |||
}} | |||
{{Message of the day item | |||
| title = Normal Scheduling has resumed. | |||
| date = 2024/10/08 | |||
| message = The ARC cluster has been successfully brought online and nodes are running jobs normally. We apologize for the extended downtime. | |||
Please reach out to support@hpc.ucalgary.ca with any issues or concerns. | |||
}} | |||
[[Category:ARC]] | [[Category:ARC]] | ||
{{Navbox ARC}} | {{Navbox ARC}} |
Latest revision as of 19:07, 8 October 2024
|
ARC status: Cluster operational System is operational. No updates are planned. See the ARC Cluster Status page for system notices. |
System Messages
Systems Operating Normally - 2024/09/03
Notice of Upcoming Partial Outage - 2024/08/27
Partial Outage Update I - 2024/09/25
On Tuesday, October 1, 2024, the compute nodes in cpu2019, cpu2021, cpu2022, gpu-v100, gpu-a100, and most nodes from bigmem will be unavailable until Friday October 4, 2024. Affected WDF-Altis GPU nodes include: wdfgpu[1-2,6,8-12].
We apologise for the inconvenience.Partial Outage Update II - 2024/10/04
Currently, the compute nodes in cpu2019, cpu2021, cpu2022, gpu-v100, gpu-a100, and most nodes from bigmem will be unavailable until at least Monday, October 7, 2024. Affected WDF-Altis GPU nodes include: wdfgpu[1-2,6,8-12].
We apologize for the extended downtime and will update you as soon as we have additional information from our operations team.Partial Outage Update III - 2024/10/07
Currently, the compute nodes in cpu2019, cpu2021, cpu2022, gpu-v100, gpu-a100, and most nodes from bigmem will be unavailable until at least Tuesday, October 15, 2024. Affected WDF-Altis GPU nodes include: wdfgpu[1-2,6,8-12].
We apologize for the extended downtime and will update you as soon as we have additional information from our operations team.