|
|
(One intermediate revision by one other user not shown) |
Line 65: |
Line 65: |
| Please send any questions or concerns to support@hpc.ucalgary.ca | | Please send any questions or concerns to support@hpc.ucalgary.ca |
| }} | | }} |
| | |
| | {{Message of the day item |
| | | title = Power Interruption |
| | | date = 2024/05/07 |
| | | message = Glados Experienced an brief power outage around 11AM May 7, 2024. |
| | Most compute nodes have or are rebooting. Most jobs running at this time |
| | were lost. Administrators are actively working on restarting compute |
| | nodes. Sorry for the inconvenience. |
| | }} |
| | |
| | [[Category:GLaDOS]] |
| | {{Navbox GLaDOS}} |
|
GLaDOS status:
Cluster operational
No upgrades planned. Please contact us if you experience system issues.
See the GLaDOS Cluster Status page for system notices.
|
System Messages
⚠️ January System Updates - 2023/01/01
Beginning January 30, 2023, the GLaDOS cluster will undergo operating system updates. We shall do our utmost to minimize disruption and allow ongoing jobs to be completed. New jobs may be temporarily held from scheduling.
The GLaDOS login node will reboot on the morning of January 30. Please save your work and log out if possible.
The upgrade is planned to be fully complete by February 3.
If you encounter any system issues, do not hesitate to let us know.
Thank you for your cooperation.
************************************************************************
2023/01/01
--- ⚠️ January System Updates
Beginning January 30, 2023, the GLaDOS cluster will undergo operating system updates. We shall do our utmost to minimize disruption and allow ongoing jobs to be completed. New jobs may be temporarily held from scheduling.
The GLaDOS login node will reboot on the morning of January 30. Please save your work and log out if possible.
The upgrade is planned to be fully complete by February 3.
If you encounter any system issues, do not hesitate to let us know.
Thank you for your cooperation.
System Updates Completed - 2023/01/31
The upgrade has been completed. The following has been changed:
- OS Updated to Rocky Linux 8.7
- Slurm updated to 22.05.7
- Apptainer replaces Singularity
- Each job will have its own /tmp, /dev/shm, /run/user/$uid mounted
If you encounter any system issues, do not hesitate to let us know.
Thank you for your cooperation.
************************************************************************
2023/01/31
--- System Updates Completed
The upgrade has been completed. The following has been changed:
- OS Updated to Rocky Linux 8.7
- Slurm updated to 22.05.7
- Apptainer replaces Singularity
- Each job will have its own /tmp, /dev/shm, /run/user/$uid mounted
If you encounter any system issues, do not hesitate to let us know.
Thank you for your cooperation.
Apptainer (Singularity) on GLaDOS Login Node - 2023/06/23
Apptainer (Singularity) containers may experience an error when
running on the GLaDOS login node. If apptainer complains that a system
administrator needs to enable user namespaces, simply run your
containers inside a job.
This is a temporary measure due to security vulnerability that will be
patched soon.
************************************************************************
2023/06/23
--- Apptainer (Singularity) on GLaDOS Login Node
Apptainer (Singularity) containers may experience an error when
running on the GLaDOS login node. If apptainer complains that a system
administrator needs to enable user namespaces, simply run your
containers inside a job.
This is a temporary measure due to security vulnerability that will be
patched soon.
GLaDOS Scheduled Temporary Shutdown for Move - 2023/06/26
GLaDOS is scheduled to be shut down temporarily to allow for the
cluster to be physically moved beginning Tuesday September 5, 2023
The cluster is expected to be down the rest of the week and back
online on or before Monday the 11th.
Please send any questions or concerns to support@hpc.ucalgary.ca
************************************************************************
2023/06/26
--- GLaDOS Scheduled Temporary Shutdown for Move
GLaDOS is scheduled to be shut down temporarily to allow for the
cluster to be physically moved beginning Tuesday September 5, 2023
The cluster is expected to be down the rest of the week and back
online on or before Monday the 11th.
Please send any questions or concerns to support@hpc.ucalgary.ca
GLaDOS Move Complete - 2023/09/11
GLaDOS has been moved and jobs can be submitted for scheduling.
Please send any questions or concerns to support@hpc.ucalgary.ca
************************************************************************
2023/09/11
--- GLaDOS Move Complete
GLaDOS has been moved and jobs can be submitted for scheduling.
Please send any questions or concerns to support@hpc.ucalgary.ca
Power Interruption - 2024/05/07
Glados Experienced an brief power outage around 11AM May 7, 2024.
Most compute nodes have or are rebooting. Most jobs running at this time
were lost. Administrators are actively working on restarting compute
nodes. Sorry for the inconvenience.
************************************************************************
2024/05/07
--- Power Interruption
Glados Experienced an brief power outage around 11AM May 7, 2024.
Most compute nodes have or are rebooting. Most jobs running at this time
were lost. Administrators are actively working on restarting compute
nodes. Sorry for the inconvenience.