sstat is a command-line tool that is part of the Slurm workload manager suite. It allows users and administrators to view information about running jobs on a Slurm-managed cluster.
The sstat command reports resource usage for running jobs and job steps, including CPU time, memory usage (resident set size and virtual memory size), and task and node details. It is particularly useful for monitoring and troubleshooting running jobs, because it reports these metrics while the job is still executing.
sstat Command Examples
1. Display status information for one or more jobs (pass several job IDs as a comma-separated list):
# sstat --jobs=job_id
2. Display the job ID, average CPU time, and average virtual memory size for one or more jobs (comma-separated), with pipes as column delimiters:
# sstat --parsable --jobs=job_id --format=JobID,AveCPU,AveVMSize
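With --parsable, every output line also ends with a trailing pipe, which can get in the way when the output is passed to other tools. The following is a minimal sketch of how that output could be turned into comma-separated values. The function name parsable_to_csv is hypothetical and not part of Slurm, and the sketch assumes that none of the selected fields contain commas.

```shell
#!/bin/sh
# Hypothetical helper (not part of Slurm): reads pipe-delimited text on
# standard input, drops the trailing '|' on each line, and turns the
# remaining '|' separators into commas.
parsable_to_csv() {
    sed -e 's/|$//' -e 's/|/,/g'
}

# Usage, assuming job_id is a running job that you are allowed to view:
#   sstat --parsable --jobs=job_id --format=JobID,AveCPU,AveVMSize | parsable_to_csv
```

Keeping the conversion in a separate function means the sstat call itself stays exactly as shown in the example above.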
3. Display the list of available fields:
# sstat --helpformat
Summary
In short, sstat reports resource usage for jobs that are still running, which makes it a practical tool for monitoring job performance and troubleshooting issues on a Slurm-managed cluster.