How to run an interactive job on ARC: Difference between revisions

From RCSWiki
Jump to navigation Jump to search
(Added ARCX navbox, formatting)
(Review and editing)
Line 1: Line 1:
== Background ==
Submitting and running jobs is the only way to run computations on ARC. Normally, a user creates a '''job script''' and submit it to SLURM on ARC using the <code>sbatch</code> command to run a computation. This type of jobs are non-interactive batch jobs. In some circumstances, this type of workflow may be difficult or incompatible with workflows that require interactive inputs by the user such as when you need to apply data transformation, result collection and review from large computations, or to test new methods or software. For this kind of work, the best approach is to request an '''interactive job''' from ARC's.


Running jobs is the main and the only way to run computations on ARC.
== Why not the login node? ==
Normally, a user creates a '''job script''' and submit it to SLURM on ARC using the <code>sbatch</code> command to run a computation.
The login node is designed for users to interactively use ARC. However, this usage is intended for only '''management work''', such as:
The '''login node''', ARC's gateway, is only intended to do '''management work''', such as
* data management,
* data management,
* software management, and
* software management, and
* job management.  
* job management.  


The '''login node''' could also be used to '''software development''', if you write scripts, or if you work on some new code for your research.
The '''login node''' could also be used to '''software development''', if you write scripts, or if you work on some new code for your research. The '''login node''', however, '''cannot be used''' to run anything that is CPU heavy and relates to '''research''', that is '''not management'''. See our[[General Cluster Guidelines and Policies#Login Node | guidelines and policies]].
The '''login node''', however, '''cannot be used''' to run anything that is CPU heavy and relates to '''research''', that is '''not management'''.
(See our [[General Cluster Guidelines and Policies#Login Node | guidelines and policies]]).


=== Interactive work ===
== Submitting an interactive job ==
Sometimes, there is a need to run some research work related commands that are CPU heavy,
To '''request an interactive job''' on ARC via SLURM, you have to use the <code>salloc</code> command and to '''specify resources''' that your interactive work will need, similar to the resource request in a normal job script. After that, SLURM will try to '''allocate resources''' for your interactive job, and then it will '''transfer your command line''' session from ARC's login node '''to a new command line session on the allocated compute node'''.  
but which can be '''difficult or tedious''' to run as '''normal non-interactive batch jobs'''.
In this interactive job you can run any command on the command line without interrupting any other user's work. The resources you are using are allocated to your only and there is nobody else who can use them.
Possibly, you may want to do some '''interactive work''' using the command line,
=== Downsides to interactive jobs ===
'''transform your data''', '''collect results''' of a large set of computations, try or '''test new methods or software'''.
For this kind of work, the best approach is to request an '''interactive job''' from ARC's SLURM.
 
 
To '''request an interactive job''' on ARC via SLURM,  
you have to use the <code>salloc</code> command and to '''specify resources''' that your interactive work will need,  
similar to the resource request in a normal job script.
After that, SLURM will try to '''allocate resources''' for your interactive job,
and then it will '''transfer your command line''' session from ARC's login node '''to a new command line session on the allocated compute node'''.  
In this interactive job you can run any command on the command line without interrupting any other user's work.  
The resources you are using are allocated to your only and there is nobody else who can use them.
 
=== Downsides ===
However, one has to remember that there are some '''negatives''' of using '''interactive jobs''':
However, one has to remember that there are some '''negatives''' of using '''interactive jobs''':
* '''resource utilization''' is typically '''quite low''' when compared with non-interactive jobs.  
* '''Resource utilization''' is typically '''quite low''' when compared with non-interactive jobs.  
: The resources are not used when you are reading the outputs, type new commands, or think about what to do next. This is inefficient.
: The resources are not used when you are reading the outputs, type new commands, or think about what to do next. This is inefficient


* There is '''no amount of work''' that '''have to be done''', so that the resources can be freed for other users.  
* There is '''no amount of work''' that '''have to be done''', so that the resources can be freed for other users.  

Revision as of 19:12, 21 September 2023

Submitting and running jobs is the only way to run computations on ARC. Normally, a user creates a job script and submit it to SLURM on ARC using the sbatch command to run a computation. This type of jobs are non-interactive batch jobs. In some circumstances, this type of workflow may be difficult or incompatible with workflows that require interactive inputs by the user such as when you need to apply data transformation, result collection and review from large computations, or to test new methods or software. For this kind of work, the best approach is to request an interactive job from ARC's.

Why not the login node?

The login node is designed for users to interactively use ARC. However, this usage is intended for only management work, such as:

  • data management,
  • software management, and
  • job management.

The login node could also be used to software development, if you write scripts, or if you work on some new code for your research. The login node, however, cannot be used to run anything that is CPU heavy and relates to research, that is not management. See our guidelines and policies.

Submitting an interactive job

To request an interactive job on ARC via SLURM, you have to use the salloc command and to specify resources that your interactive work will need, similar to the resource request in a normal job script. After that, SLURM will try to allocate resources for your interactive job, and then it will transfer your command line session from ARC's login node to a new command line session on the allocated compute node. In this interactive job you can run any command on the command line without interrupting any other user's work. The resources you are using are allocated to your only and there is nobody else who can use them.

Downsides to interactive jobs

However, one has to remember that there are some negatives of using interactive jobs:

  • Resource utilization is typically quite low when compared with non-interactive jobs.
The resources are not used when you are reading the outputs, type new commands, or think about what to do next. This is inefficient
  • There is no amount of work that have to be done, so that the resources can be freed for other users.
The only way an interactive job can end is if it runs out of time, or you manually end it.
  • Typically, you can only interact with one job at a time.
In contrast, you can run multiple non-interactive jobs when running normal batch jobs.
  • If the resources you want for your interactive work are not available, you may be waiting for a long time before your interactive job starts.

Because of these reasons, interactive jobs on ARC is limited to up to 5 hours of run time.

Please remember that interactive jobs is not intended to be the primary way of using the ARC cluster.