MARC Cluster Status: Difference between revisions
mNo edit summary |
|||
| (7 intermediate revisions by 2 users not shown) | |||
| Line 1: | Line 1: | ||
{{MARC Cluster Status}} | {{Cluster Status | ||
| cluster = MARC | |||
| status = green | |||
| title = Cluster operational | |||
| message = Upgrades are planned for Jan 20 2025. Please contact us if you experience system issues. | |||
See the [[MARC Cluster Status]] page for system notices. | |||
}} | |||
== System Messages == | == System Messages == | ||
| Line 30: | Line 37: | ||
Thank you for your cooperation. | Thank you for your cooperation. | ||
}} | |||
{{Message of the day item | |||
| title = Apptainer (Singularity) on MARC Login Node | |||
| date = 2023/06/23 | |||
| message = | |||
Apptainer (Singularity) containers may experience an error when | |||
running on the MARC login node. If apptainer complains that a system | |||
administrator needs to enable user namespaces, simply run your | |||
containers inside a job. | |||
This is a temporary measure due to security vulnerability that will be | |||
patched soon. | |||
}} | |||
{{Message of the day item | |||
| title = Storage Upgrade MARC/ARC cluster | |||
| date = 2023/10/23 | |||
| message = | |||
We will be performing storage upgrades on the MARC/ARC cluster on | |||
November 16 and 17, 2023. To facilitate this, we will be throttling | |||
down the number of jobs on both clusters while the upgrades are | |||
performed | |||
}} | |||
{{Message of the day item | |||
| title = OS Upgrade MARC/ARC cluster | |||
| date = 2024/09/11 | |||
| message = | |||
MARC will be going down for OS upgrades on 2024/Sep/16. The cluster | |||
will be unavailable temporarily to complete this work. Please contact | |||
support@hpc.ucalgary.ca if you have any questions or concerns. | |||
}} | |||
{{Message of the day item | |||
| title = Scheduled Maintenance | |||
| date = 2024/12/11 | |||
| message = The MARC login node will be rebooted on Tuesday December 17 for scheduled maintenance. It will be down for a few minutes and return shortly. Job scheduling and jobs running on the cluster will not be affected. Thank you for understanding. | |||
Please reach out to support@hpc.ucalgary.ca with any issues or concerns. | |||
}} | |||
{{Message of the day item | |||
| title = Scheduled Maintenance and OS Update | |||
| date = 2025/01/07 | |||
| message = The MARC cluster will be rebooted for OS updates on Monday January 20, 2025. Please make sure to save your work and log out before the reboot happens. Scheduling will be paused until the cluster is back, but queued jobs will remain in the queue and nodes will start scheduling when the cluster is ready. Thank you for understanding. | |||
Please reach out to support@hpc.ucalgary.ca with any issues or concerns. | |||
}} | |||
{{Message of the day item | |||
| title = Support email address down | |||
| date = 2025/03/07 | |||
| message = support@hpc.ucalgary.ca Unavailable | |||
Please be informed that our support email address (support@hpc.ucalgary.ca) for RCS is currently not working. We are working to bring it back as soon as possible. Please keep an eye on this space for updates. The clusters are working normally, but support will not receive your messages at this time. We will begin responding as soon as we can get it back. | |||
Apologies for the inconvenience. | |||
}} | |||
{{Message of the day item | |||
| title = Support email address functional | |||
| date = 2025/03/07 | |||
| message = support@hpc.ucalgary.ca is back | |||
support@hpc.ucalgary.ca has been repaired and RCS can be contacted there. If you had reached out for assistance in recent days without response please follow up as we may not have received your initial email. | |||
Apologies for the inconvenience. | |||
}} | }} | ||
Revision as of 20:49, 10 March 2025
|
|
MARC status: Cluster operational Upgrades are planned for Jan 20 2025. Please contact us if you experience system issues. See the MARC Cluster Status page for system notices. |
System Messages
⚠️ January System Updates - 2023/01/01
The MARC login node will reboot on the morning of January 23. Please save your work and log out if possible.
The upgrade is planned to be fully complete by January 27.
If you encounter any system issues, do not hesitate to let us know.
Thank you for your cooperation.System Updates Completed - 2023/01/24
- OS Updated to Rocky Linux 8.7
- Slurm updated to 22.05.7
- Apptainer replaces Singularity
- Each job will have its own /tmp, /dev/shm, /run/user/$uid mounted
If you encounter any system issues, do not hesitate to let us know.
Thank you for your cooperation.
Apptainer (Singularity) on MARC Login Node - 2023/06/23
running on the MARC login node. If apptainer complains that a system administrator needs to enable user namespaces, simply run your containers inside a job.
This is a temporary measure due to security vulnerability that will be
patched soon.Storage Upgrade MARC/ARC cluster - 2023/10/23
November 16 and 17, 2023. To facilitate this, we will be throttling down the number of jobs on both clusters while the upgrades are
performedOS Upgrade MARC/ARC cluster - 2024/09/11
will be unavailable temporarily to complete this work. Please contact
support@hpc.ucalgary.ca if you have any questions or concerns.Scheduled Maintenance - 2024/12/11
Scheduled Maintenance and OS Update - 2025/01/07
Support email address down - 2025/03/07
Please be informed that our support email address (support@hpc.ucalgary.ca) for RCS is currently not working. We are working to bring it back as soon as possible. Please keep an eye on this space for updates. The clusters are working normally, but support will not receive your messages at this time. We will begin responding as soon as we can get it back.
Apologies for the inconvenience.
Support email address functional - 2025/03/07
support@hpc.ucalgary.ca has been repaired and RCS can be contacted there. If you had reached out for assistance in recent days without response please follow up as we may not have received your initial email.
Apologies for the inconvenience.