Archived HPC Training Recordings¶
2025¶
-
Intro to Wulver: Focus on Job Efficiency¶
This webinar covers SLURM features in more depth, including how to use job dependencies and job arrays to run work efficiently on the cluster.
Key Highlights:
- Sbatch: Some Examples
- salloc command
- Job Dependencies
- Job Arrays
- Checkpointing
-
Conda for Shared Environments¶
This webinar provides an introductory understanding of using Python for HPC and of managing Python environments effectively.
Key Highlights:
- Access Python on Wulver
- Introduction to Conda environments
- Install, uninstall and upgrade packages
- Best Practices for managing conda environments
- Common Python libraries for scientific computing
-
HPC User Meeting - Introduction to MIG¶
This in-person and virtual session provides an introductory understanding of using Multi-Instance GPUs (MIGs) on Wulver.
Key Highlights:
- What is MIG?
- Why MIG on Wulver?
- MIG Configuration Example
- Submitting Jobs (srun & sbatch)
- New Billing Model
-
Introduction to Linux¶
This is the fourth webinar of the 2025 Spring semester, introducing the basics of the Linux operating system. This session is designed to help new users become familiar with Linux, an essential skill for working in High-Performance Computing (HPC) environments.
Key Highlights:¶
- Basics of the Linux operating system
- Common commands and file system navigation
- Managing files, directories, and permissions
- Introduction to shell scripting for automation
- Connecting to remote systems and working with HPC clusters
-
Python and Conda Environments in HPC: From Basics to Best Practices¶
This is the third webinar of the 2025 Spring semester, focusing on an introductory understanding of using Python for HPC and on managing Python environments effectively with Conda.
Key Highlights:¶
- Learn how to manage Python environments using Conda.
- How to create Conda environments in different locations and install Python packages.
- Become familiar with common tools and libraries for scientific computing in Python.
- Import a Conda environment into a different location.
2024¶
-
Job Arrays and Advanced Submission Techniques for HPC¶
This is the final in a series of three webinars in the fall semester for researchers, scientists, and HPC users. This session aims to provide useful information on submitting SLURM jobs efficiently by covering job arrays, job dependencies, checkpointing, and common SLURM job issues.
Key Highlights:¶
- Understanding the concept and benefits of job arrays
- Syntax for submitting and managing job arrays
- Best practices for efficient array job design
- Dependency chains and complex workflows
- Resource optimization strategies
- Using SLURM's advanced options for improved job control
- Checkpointing jobs and using third-party checkpointing tools
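Two of the topics listed above, job arrays and dependency chains, come down to specific `sbatch` options. The following is a minimal Python sketch, not taken from the webinar, that only builds the command lines and does not submit anything. The script names (`simulate.sh`, `postprocess.sh`), the job ID `12345`, and the helper function names are hypothetical; the `--array`, `--dependency`, and `--parsable` options are standard `sbatch` flags.

```python
import shlex


def array_submit_cmd(script, first, last, max_running=None):
    """Build an sbatch command for a job array over indices first..last."""
    spec = f"{first}-{last}"
    if max_running is not None:
        # "%N" limits how many array tasks may run at the same time.
        spec += f"%{max_running}"
    # --parsable makes sbatch print the job ID in a machine-readable form.
    return ["sbatch", "--parsable", f"--array={spec}", script]


def dependent_submit_cmd(script, parent_job_id, condition="afterok"):
    """Build an sbatch command that waits on another job.

    "afterok" starts the job only after the parent finished successfully.
    """
    return [
        "sbatch",
        "--parsable",
        f"--dependency={condition}:{parent_job_id}",
        script,
    ]


if __name__ == "__main__":
    # Hypothetical scripts and job ID, for illustration only.
    print(shlex.join(array_submit_cmd("simulate.sh", 0, 9, max_running=2)))
    print(shlex.join(dependent_submit_cmd("postprocess.sh", 12345)))
```

On a cluster, each list could be passed to `subprocess.run` to submit the job, and the job ID printed by the first submission could be supplied as `parent_job_id` for the second. Inside an array task, SLURM sets the `SLURM_ARRAY_TASK_ID` environment variable, which the job script can use to select its input.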
-
SLURM Batch System Basics¶
This is the first in a series of three webinars in the fall semester, designed to introduce researchers, scientists, and HPC users to the fundamentals of the SLURM (Simple Linux Utility for Resource Management) workload manager. This virtual session will equip you with essential skills to effectively utilize HPC resources through SLURM.
Key Highlights:¶
- Introduction to SLURM and its role in HPC environments
- Basic SLURM commands for job submission, monitoring, and management
- How to write effective job scripts for various application types
- Understanding SLURM partitions, quality of service, and job priorities
- Best practices for resource requests and job optimization
- Troubleshooting common issues in job submission and execution
-
Introduction to Python and Conda¶
Participants will gain an introductory understanding of using Python for HPC and effectively managing their Python environments using Conda. This knowledge will empower them to leverage the power of Python for their scientific computing needs on HPC systems.
Key Highlights:¶
- Learn how to manage Python environments for HPC using Conda.
- Become familiar with common tools and libraries for scientific computing in Python.