GLaDOS Cluster Status: Difference between revisions
Created page with "{{GLaDOS Cluster Status}} == System Messages == {{Message of the day item | title = ⚠️ January System Updates | date = 2023/01/01 | message = Beginning January 30, 2023, the GLaDOS cluster will undergo operating system updates. We shall do our utmost to minimize disruption and allow ongoing jobs to be completed. New jobs may be temporarily held from scheduling. The GLaDOS login node will reboot on the morning of January 30. Please save your work and log out if poss..." |
mNo edit summary |
||
| (7 intermediate revisions by 2 users not shown) | |||
| Line 31: | Line 31: | ||
Thank you for your cooperation. | Thank you for your cooperation. | ||
}} | }} | ||
{{Message of the day item | |||
| title = Apptainer (Singularity) on GLaDOS Login Node | |||
| date = 2023/06/23 | |||
| message = | |||
Apptainer (Singularity) containers may experience an error when | |||
running on the GLaDOS login node. If apptainer complains that a system | |||
administrator needs to enable user namespaces, simply run your | |||
containers inside a job. | |||
This is a temporary measure due to security vulnerability that will be | |||
patched soon. | |||
}} | |||
{{Message of the day item | |||
| title = GLaDOS Scheduled Temporary Shutdown for Move | |||
| date = 2023/06/26 | |||
| message = | |||
GLaDOS is scheduled to be shut down temporarily to allow for the | |||
cluster to be physically moved beginning Tuesday September 5, 2023 | |||
The cluster is expected to be down the rest of the week and back | |||
online on or before Monday the 11th. | |||
Please send any questions or concerns to support@hpc.ucalgary.ca | |||
}} | |||
{{Message of the day item | |||
| title = GLaDOS Move Complete | |||
| date = 2023/09/11 | |||
| message = | |||
GLaDOS has been moved and jobs can be submitted for scheduling. | |||
Please send any questions or concerns to support@hpc.ucalgary.ca | |||
}} | |||
{{Message of the day item | |||
| title = Power Interruption | |||
| date = 2024/05/07 | |||
| message = Glados Experienced an brief power outage around 11AM May 7, 2024. | |||
Most compute nodes have or are rebooting. Most jobs running at this time | |||
were lost. Administrators are actively working on restarting compute | |||
nodes. Sorry for the inconvenience. | |||
}} | |||
{{Message of the day item | |||
| title = OS updates complete | |||
| date = 2024/05/07 | |||
| message = Glados has been updated to Rocky Linux 8.10 and is operating normally | |||
Please reach out with any questions or concerns to support@hpc.ucalgary.ca | |||
}} | |||
{{Message of the day item | |||
| title = Support email address down | |||
| date = 2025/03/07 | |||
| message = support@hpc.ucalgary.ca Unavailable | |||
Please be informed that our support email address (support@hpc.ucalgary.ca) for RCS is currently not working. We are working to bring it back as soon as possible. Please keep an eye on this space for updates. The clusters are working normally, but support will not receive your messages at this time. We will begin responding as soon as we can get it back. | |||
Apologies for the inconvenience. | |||
}} | |||
{{Message of the day item | |||
| title = Support email address functional | |||
| date = 2025/03/07 | |||
| message = support@hpc.ucalgary.ca is back | |||
support@hpc.ucalgary.ca has been repaired and RCS can be contacted there. If you had reached out for assistance in recent days without response please follow up as we may not have received your initial email. | |||
Apologies for the inconvenience. | |||
}} | |||
[[Category:GLaDOS]] | |||
{{Navbox GLaDOS}} | |||
Latest revision as of 20:49, 10 March 2025
|
|
GLaDOS status: Cluster operational No upgrades planned. Please contact us if you experience system issues. See the GLaDOS Cluster Status page for system notices. |
System Messages
⚠️ January System Updates - 2023/01/01
The GLaDOS login node will reboot on the morning of January 30. Please save your work and log out if possible.
The upgrade is planned to be fully complete by February 3.
If you encounter any system issues, do not hesitate to let us know.
Thank you for your cooperation.System Updates Completed - 2023/01/31
- OS Updated to Rocky Linux 8.7
- Slurm updated to 22.05.7
- Apptainer replaces Singularity
- Each job will have its own /tmp, /dev/shm, /run/user/$uid mounted
If you encounter any system issues, do not hesitate to let us know.
Thank you for your cooperation.Apptainer (Singularity) on GLaDOS Login Node - 2023/06/23
running on the GLaDOS login node. If apptainer complains that a system administrator needs to enable user namespaces, simply run your containers inside a job.
This is a temporary measure due to security vulnerability that will be
patched soon.GLaDOS Scheduled Temporary Shutdown for Move - 2023/06/26
cluster to be physically moved beginning Tuesday September 5, 2023 The cluster is expected to be down the rest of the week and back online on or before Monday the 11th.
Please send any questions or concerns to support@hpc.ucalgary.caGLaDOS Move Complete - 2023/09/11
Power Interruption - 2024/05/07
Most compute nodes have or are rebooting. Most jobs running at this time were lost. Administrators are actively working on restarting compute
nodes. Sorry for the inconvenience.OS updates complete - 2024/05/07
Support email address down - 2025/03/07
Please be informed that our support email address (support@hpc.ucalgary.ca) for RCS is currently not working. We are working to bring it back as soon as possible. Please keep an eye on this space for updates. The clusters are working normally, but support will not receive your messages at this time. We will begin responding as soon as we can get it back.
Apologies for the inconvenience.Support email address functional - 2025/03/07
support@hpc.ucalgary.ca has been repaired and RCS can be contacted there. If you had reached out for assistance in recent days without response please follow up as we may not have received your initial email.
Apologies for the inconvenience.
| ||||||