RCS Summer School 2024: Difference between revisions

From RCSWiki
Jump to navigation Jump to search
Line 53: Line 53:
| rowspan="2" |'''[[RCS Summer School 2024#Introduction to HPC resources|Introduction to HPC resources]]'''<br>ICT 102, 9:30AM - 10:20AM<br>Robert Fridman, Dave Schulz
| rowspan="2" |'''[[RCS Summer School 2024#Introduction to HPC resources|Introduction to HPC resources]]'''<br>ICT 102, 9:30AM - 10:20AM<br>Robert Fridman, Dave Schulz
|'''[[RCS Summer School 2024#Data Management with Datalad|Data Management with Datalad]]<br>'''ICT 114, 9:30AM - 10:20AM<br>David Deepwell, Pedro Martinez
|'''[[RCS Summer School 2024#Data Management with Datalad|Data Management with Datalad]]<br>'''ICT 114, 9:30AM - 10:20AM<br>David Deepwell, Pedro Martinez
| rowspan="6" |'''[[RCS Summer School 2024#NVIDIA: Workflow Optimization with NVIDIA GPUs|NVIDIA: Workflow Optimization with NVIDIA GPUs]]'''<br>ICT 102, 9:30AM - 12:20AM<br>Artur Rodrigues from NVIDIA
| rowspan="6" |'''[[RCS Summer School 2024#NVIDIA|NVIDIA: Workflow Optimization with NVIDIA GPUs]]'''<br>ICT 102, 9:30AM - 12:20AM<br>Jonathan Dursi from NVIDIA
|-
|-
!10:00 AM
!10:00 AM
Line 79: Line 79:
| rowspan="3" |'''[[RCS Summer School 2024#Digital File Management and File Management|Digital File Management and File Management]]'''<br>ICT 102, 1:00PM - 2:20PM<br>Jennifer Abel, Alex Thistlewood, Ingrid Reiche
| rowspan="3" |'''[[RCS Summer School 2024#Digital File Management and File Management|Digital File Management and File Management]]'''<br>ICT 102, 1:00PM - 2:20PM<br>Jennifer Abel, Alex Thistlewood, Ingrid Reiche
| rowspan="7" |Refreshments<br>ICT 114
| rowspan="7" |Refreshments<br>ICT 114
| rowspan="3" |'''[[RCS Summer School 2024#Dell & AMD: Machine learning with Dell & AMD|Dell & AMD: Machine learning with Dell & AMD]]'''<br>ICT 102, 1:00PM - 1:50PM
| rowspan="3" |'''[[RCS Summer School 2024#Dell & AMD|Dell & AMD: Machine learning with Dell & AMD]]'''<br>ICT 102, 1:00PM - 1:50PM
|-
|-
!1:30 PM
!1:30 PM
| rowspan="2" |'''[[RCS Summer School 2024#AWS|AWS: ML in the Cloud]]'''<br>ICT 102
|'''[[RCS Summer School 2024#AWS|AWS: Inspiring the art of the possible]]'''<br>ICT 102, 1:30PM - 1:50PM
| rowspan="7" |Refreshments<br>ICT 114
| rowspan="7" |Refreshments<br>ICT 114
|-
|-
!2:00 PM
!2:00 PM
|[[RCS Summer School 2024#AWS|'''AWS: How AWS works with Researchers''']]ICT 102, 2:00PM - 2:20PM
|-
|-
!2:30 PM
!2:30 PM
Line 179: Line 180:
Modernize your research workflows using Prefect, an open source workflow orchestration tool.  We will show how you can build and deploy resilient workflows and workflow pipelines.
Modernize your research workflows using Prefect, an open source workflow orchestration tool.  We will show how you can build and deploy resilient workflows and workflow pipelines.


=== AWS ===
==== AWS: Inspiring the art of the possible ====
ICT 102, 1:30PM - 1:50PM by Artur Rodrigues
Learn what is possible on AWS Cloud for research.


==== AWS: How AWS works with Researchers ====
ICT 102, 1:30PM - 1:50PM by Artur Rodrigues


=== AWS ===
AWS has many programs to support researchers such as credits, letter of supports, immersion days, working on proof of concepts. In this session, we will cover how we engage with researchers and what programs are out there to help accelerate your research with the AWS Cloud.
==== AWS: ML in the Cloud ====
ICT 102, 1:30 PM - 4:45 PM


==== AWS: Machine learning with low-code workshop ====
==== AWS: Machine learning with low-code workshop ====
ICT 102, 1:30 PM - 4:45 PM
ICT 102, 1:30 PM - 4:45 PM by AWS


=== NVIDIA: Workflow Optimization with NVIDIA GPUs ===
The Machine Learning (ML) journey requires continuous experimentation and rapid prototyping to be successful. In order to create highly accurate and performant models, data scientists have to first experiment with feature engineering, model selection and  optimization techniques. These processes are traditionally time consuming and expensive. In this workshop attendees will learn the following:
ICT 102, 9:30AM - 12:20AM by Artur Rodrigues from NVIDIA
 
* How the Low-Code ML capabilities found in Amazon SageMaker Data Wrangler, Autopilot and Jumpstart, make it easier to experiment faster and bring highly accurate models to production more quickly and efficiently
* How to simplify the process of data preparation and feature engineering, and complete each step of the data preparation workflow
* Understand how to automatically build, train, and tune the best machine learning models based on your data, while allowing you to maintain full control and visibility.
* Get started with ML easily and quickly using pre-built solutions for common financial use cases and open source models from popular model zoos.
 
=== NVIDIA ===
 
==== Workflow Optimization with NVIDIA GPUs ====
ICT 102, 9:30AM - 12:20AM by Jonathan Dursi from NVIDIA


We will discuss how to optimizing workflows with NVIDIA powered GPUs to help accelerate your research.
We will discuss how to optimizing workflows with NVIDIA powered GPUs to help accelerate your research.


=== Dell & AMD: Machine learning with Dell & AMD ===
=== Dell & AMD ===
 
==== Machine learning with Dell & AMD ====
ICT 102, 1:00PM - 1:50PM
ICT 102, 1:00PM - 1:50PM


To be announced.
To be announced.

Revision as of 00:13, 10 May 2024

Research Computing Services' 3rd annual summer school will run from Monday, June 10 through to Wednesday, June 12, 2024 from 9AM to 5PM. This summer school consists of various sessions throughout these 3 days and is completely free to University of Calgary members.

Our goal for this year's summer school is to Empower our researchers: Inspiring what is possible on HPC infrastructure.

RCS Summer School 2024 Poster

Registration

Registration is required to attend the RCS Summer School sessions. Registration is free to all members of the University of Calgary. A link will be available here soon.

There will be a limit of approximately 100 seats. If you are unable to attend after registering, please notify us.

Topics

  • Introduction to RCS services and HPC resources
  • Introduction to Linux & Bash command line
  • Using Linux utilities for large datasets
  • Hands on with Linux & Slurm: Workshop
  • Using Open OnDemand on ARC
  • Develop a research data management plan
  • Reproducible data management with Datalad
  • Digital File Management
  • Using containers in HPC with Apptainer
  • Managing scientific software with Conda
  • Research workflow development with Prefect
  • AWS: ML in the Cloud, a walkthrough followed by a workshop
  • NVIDIA: Workflow optimization using NVIDIA GPUs
  • Dell & AMD: Machine learning with Dell and AMD

Schedule

The summer school sessions will be held in ICT 102 and ICT 114. Refreshments will be available in ICT 114 on all 3 days.

Time June 10 June 11 June 12
8:30 AM Registration & check-in
ICT 102
Refreshments
ICT 114
Registration & check-in
ICT 102
Refreshments
ICT 114
Registration & check-in
ICT 102
Refreshments
ICT 114
9:00 AM Introduction to RCS
ICT 102, 9:00AM - 9:20AM
Jill Kowalchuk
The Alliance: Introduction
ICT 102
Brock Kahanyshyn
TBD

ICT 102

9:30 AM Introduction to Linux, Bash,
and the command line

ICT 102, 9:30AM - 10:30AM
Robert Fridman
Developing a Research Data Management Plan with technical storage requirements
ICT 114, 9:30AM - 11:20AM
Ian Percel
Introduction to HPC resources
ICT 102, 9:30AM - 10:20AM
Robert Fridman, Dave Schulz
Data Management with Datalad
ICT 114, 9:30AM - 10:20AM
David Deepwell, Pedro Martinez
NVIDIA: Workflow Optimization with NVIDIA GPUs
ICT 102, 9:30AM - 12:20AM
Jonathan Dursi from NVIDIA
10:00 AM Refreshments
ICT 114
10:30 AM Workshop: Hands on with Linux & Slurm
ICT 102, 10:30AM - 11:50 AM
Robert Fridman
Linux tools & utilities for working with large data sets
ICT 102, 10:30AM - 11:20AM
Leo Leung
11:00 AM
11:30 AM Reproducible Data Management with Datalad
ICT 114, 10:30AM - 11:20AM
David Deepwell, Pedro Martinez
RCS Q&A period: Ask RCS anything
ICT 102, 11:30AM - 12:00PM
RCS Team
12:00 PM Open OnDemand on ARC
ICT 102, 12:00 AM - 12:20 AM
Leo Leung
Lunch break
12:30PM - 1:30PM
12:30 PM Lunch break
12:30PM - 1:30PM
Lunch break
12:30PM - 1:30PM
1:00 PM Digital File Management and File Management
ICT 102, 1:00PM - 2:20PM
Jennifer Abel, Alex Thistlewood, Ingrid Reiche
Refreshments
ICT 114
Dell & AMD: Machine learning with Dell & AMD
ICT 102, 1:00PM - 1:50PM
1:30 PM AWS: Inspiring the art of the possible
ICT 102, 1:30PM - 1:50PM
Refreshments
ICT 114
2:00 PM AWS: How AWS works with ResearchersICT 102, 2:00PM - 2:20PM
2:30 PM AWS: Machine Learning with low-code workshop
ICT 102, 2:30PM - 4:50PM
AWS
Introduction to containers with Apptainer
ICT 102, 2:30PM - 3:20PM
Tannistha Nandi
Prefect for Research Workflow Development
ICT 102, 2:30PM - 3:50PM
David Deepwell, Pedro Martinez
3:00 PM
3:30 PM Managing scientific software with Conda
ICT 102, 3:30PM - 4:20PM
Dmitri Rozmanov
4:00 PM End of day: 4:00PM
4:30 PM End of day: 4:30PM
5:00 PM End of day: 5:00PM

Sessions

Introduction to RCS

ICT 102, 9:00AM - 9:20AM by Jill Kowalchuk

We will begin the summer school with a quick introduction by Jill Kowalchuk, the Interim director of Research Computing Services. We'll go through who RCS is and the services that we offer.

Introduction to Linux, Bash, and the command line

ICT 102, 9:30AM - 10:30AM by Robert Fridman

A quick crash course on how to use Linux, bash shell, and the command line in general. This beginner friendly session requires no prior experience to Linux. We recommend bringing your own device to follow along.

Workshop: Hands on with Linux & Slurm

ICT 102, 10:30AM - 11:50 AM by Robert Fridman

A follow-up workshop that builds on the basics covered in the Linux introduction session and goes into depth on how to use Slurm, the scheduler that RCS uses in their high performance computing clusters. We recommend bringing your own device to follow along.

Open OnDemand on ARC

ICT 102, 12:00 AM - 12:20 AM by Leo Leung

Did you know you can run a Linux desktop on ARC? In this session, we will do a quick demo of ARC Open OnDemand, a web interface that allows users to submit jobs that need graphical user interfaces. We will also cover how to monitor your jobs through Open OnDemand.

Developing a Research Data Management Plan with technical storage requirements

ICT 114, 9:30AM - 11:20AM by Ian Percel

Effective management of your research data is paramount. Join us as we delve into crafting robust data management plans tailored to your specific research needs.

Reproducible Data Management with Datalad

ICT 114, 10:30AM - 11:20AM by David Deepwell and Pedro Martinez

An introduction to Datalad, a data management tool designed to streamline workflows and ensure the integrity and reproducibility of your research data.

Introduction to HPC resources

ICT 102, 9:30AM - 10:20AM by Robert Fridman, Dave Schulz

An introduction to high performance computing resources offered by RCS. We will go over how our infrastructure ties in to your research and how to make the most out of Slurm. How to download and transfer data with other institutions.

Linux tools & utilities for working with large data sets

ICT 102, 10:30AM - 11:20AM by Leo Leung

As researchers use larger and larger datasets, it is imperative to effectively handle and manage these datasets. In this session, we will go through some common methods to work with datasets using standard Linux tools and utilities. We will cover common use cases on how to download large datasets from the Internet, parsing text-based data using tools such as sed, awk, grep, and will then tie everything together with pipes.

RCS Q&A period: Ask RCS anything

ICT 102, 11:30AM - 12:00PM by the RCS team

A general question and answers period where you can ask us anything related to RCS and HPC.

Digital File Management and File Management

ICT 102, 1:00PM - 2:20PM by Jennifer Abel, Alex Thistlewood, Ingrid Reiche (The University of Calgary Libraries and Cultural Resources)

Managing your digital files and research materials is critical for keeping yourself organized, collaborating, and communicating with colleagues. Digital file management can range from simple to complex depending on individual and organizational needs. This presentation will discuss best practices, versioning, and how to document and share your file and folder convention using a README file.

Introduction to containers with Apptainer

ICT 102, 2:30PM - 3:20PM by Tannistha Nandi

Make your research workflows reproducible through the power of containers. We will go through in detail how to run containers on ARC using Apptainer.

Managing scientific software with Conda

ICT 102, 3:30PM - 4:20PM by Dmitri Rozmanov

Running customized scientific software on a shared HPC environment may be challenging. This session, we will go over how to set up customized software environments using Conda.

Data Management with Datalad

ICT 114, 9:30AM - 10:20AM by David Deepwell and Pedro Martinez

Digital data management is important to keep your research organized and efficient. In this session, we will show how to use Datalad to help keep track of your data, create structure, ensure reproducibility, and support collaboration with other researchers.

Prefect for Research Workflow Development

ICT 102, 2:30PM - 3:50PM by David Deepwell and Pedro Martinez

Modernize your research workflows using Prefect, an open source workflow orchestration tool. We will show how you can build and deploy resilient workflows and workflow pipelines.

AWS

AWS: Inspiring the art of the possible

ICT 102, 1:30PM - 1:50PM by Artur Rodrigues

Learn what is possible on AWS Cloud for research.

AWS: How AWS works with Researchers

ICT 102, 1:30PM - 1:50PM by Artur Rodrigues

AWS has many programs to support researchers such as credits, letter of supports, immersion days, working on proof of concepts. In this session, we will cover how we engage with researchers and what programs are out there to help accelerate your research with the AWS Cloud.

AWS: Machine learning with low-code workshop

ICT 102, 1:30 PM - 4:45 PM by AWS

The Machine Learning (ML) journey requires continuous experimentation and rapid prototyping to be successful. In order to create highly accurate and performant models, data scientists have to first experiment with feature engineering, model selection and  optimization techniques. These processes are traditionally time consuming and expensive. In this workshop attendees will learn the following:

  • How the Low-Code ML capabilities found in Amazon SageMaker Data Wrangler, Autopilot and Jumpstart, make it easier to experiment faster and bring highly accurate models to production more quickly and efficiently
  • How to simplify the process of data preparation and feature engineering, and complete each step of the data preparation workflow
  • Understand how to automatically build, train, and tune the best machine learning models based on your data, while allowing you to maintain full control and visibility.
  • Get started with ML easily and quickly using pre-built solutions for common financial use cases and open source models from popular model zoos.

NVIDIA

Workflow Optimization with NVIDIA GPUs

ICT 102, 9:30AM - 12:20AM by Jonathan Dursi from NVIDIA

We will discuss how to optimizing workflows with NVIDIA powered GPUs to help accelerate your research.

Dell & AMD

Machine learning with Dell & AMD

ICT 102, 1:00PM - 1:50PM

To be announced.