sinfo - view information about Slurm nodes and partitions.
sinfo [OPTIONS...]
sinfo is used to view partition and node information for a system running Slurm.
   -a, --all
          Display  information   about   all   partitions.   This   causes
          information to be displayed about partitions that are configured
          as hidden and partitions that are unavailable to user's group.
   -b, --bgl
          Display information about bglblocks (on Blue Gene systems only).
   -d, --dead
          If set only report state information for  non-responding  (dead)
          nodes.
   -e, --exact
          If  set,  do not group node information on multiple nodes unless
          their configurations to be reported are identical. Otherwise cpu
          count, memory size, and disk space for nodes will be listed with
          the minimum value followed by a "+"  for  nodes  with  the  same
          partition and state (e.g., "250+").
   -h, --noheader
          Do not print a header on the output.
   --help Print a message describing all sinfo options.
   --hide Do  not display information about hidden partitions. By default,
          partitions that are configured as hidden or are not available to
          the user's group will not be displayed (i.e. this is the default
          behavior).
   -i <seconds>, --iterate=<seconds>
          Print the state on a periodic basis.  Sleep  for  the  indicated
          number  of  seconds  between reports.  By default, prints a time
          stamp with the header.
   -l, --long
          Print  more  detailed  information.   This  is  ignored  if  the
          --format option is specified.
   -M, --clusters=<string>
          Clusters  to  issue  commands to.  Multiple cluster names may be
          comma separated.  A value of of 'all' will query to run  on  all
          clusters.
   -n <nodes>, --nodes=<nodes>
          Print  information  only  about the specified node(s).  Multiple
          nodes may be comma separated or expressed  using  a  node  range
          expression.  For  example  "linux[00-07]"  would  indicate eight
          nodes, "linux00" through "linux07."  Performance of the  command
          can  be  measurably  improved  for systems with large numbers of
          nodes when a single node name is specified.
   --noconvert
          Don't convert units from their original type (e.g.  2048M  won't
          be converted to 2G).
   -N, --Node
          Print  information  in  a node-oriented format with one line per
          node.    The   default   is   to   print   information   in    a
          partition-oriented  format.   This  is  ignored  if the --format
          option is specified.
   -o <output_format>, --format=<output_format>
          Specify the information to be displayed using  an  sinfo  format
          string.  Format strings transparently used by sinfo when running
          with various options are
          default        "%#P %.5a %.10l %.6D %.6t %N"
          --summarize    "%#P %.5a %.10l %.16F  %N"
          --long         "%#P %.5a %.10l %.10s %.4r %.8h %.10g %.6D  %.11T
                         %N"
          --Node         "%#N %.6D %#P %6t"
          --long --Node  "%#N %.6D %#P %.11T %.4c %.8z %.6m %.8d %.6w %.8f
                         %20E"
          --list-reasons "%20E %9u %19H %N"
          --long --list-reasons
                         "%20E %12U %19H %6t %N"
          In the above format strings,  the  use  of  "#"  represents  the
          maximum length of any partition name or node list to be printed.
          A pass is made over the records to be printed to  establish  the
          size  in  order to align the sinfo output, then a second pass is
          made over the records to print  them.   Note  that  the  literal
          character  "#" itself is not a valid field length specification,
          but is only used to document this behaviour.
          The field specifications available include:
          %all  Print all fields available  for  this  data  type  with  a
                vertical bar separating each field.
          %a    State/availability of a partition
          %A    Number  of  nodes by state in the format "allocated/idle".
                Do not use this with a node state option ("%t" or "%T") or
                the  different  node  states  will  be  placed on separate
                lines.
          %b    Features currently active on the nodes, also see %f
          %B    The max number of CPUs per node available to jobs  in  the
                partition.
          %c    Number of CPUs per node
          %C    Number    of    CPUs    by    state    in    the    format
                "allocated/idle/other/total". Do not use this with a  node
                state  option  ("%t" or "%T") or the different node states
                will be placed on separate lines.
          %d    Size of temporary disk space per node in megabytes
          %D    Number of nodes
          %e    Free memory of a node
          %E    The reason  a  node  is  unavailable  (down,  drained,  or
                draining states).
          %f    Features available the nodes, also see %b
          %F    Number    of    nodes    by    state    in    the   format
                "allocated/idle/other/total".  Do not use this with a node
                state  option  ("%t" or "%T") or the different node states
                will be placed on separate lines.
          %g    Groups which may use the nodes
          %G    Generic resources (gres) associated with the nodes
          %h    Jobs may  oversubscribe  compute  resources  (i.e.  CPUs),
                "yes", "no", "exclusive" or "force"
          %H    Print the timestamp of the reason a node is unavailable.
          %I    Partition job priority weighting factor.
          %l    Maximum    time    for    any    job    in    the   format
                "days-hours:minutes:seconds"
          %L    Default   time    for    any    job    in    the    format
                "days-hours:minutes:seconds"
          %m    Size of memory per node in megabytes
          %M    PreemptionMode
          %n    List of node hostnames
          %N    List of node names
          %o    List of node communication addresses
          %O    CPU load of a node
          %p    Partition scheduling tier priority.
          %P    Partition  name followed by "*" for the default partition,
                also see %R
          %r    Only user root may initiate jobs, "yes" or "no"
          %R    Partition name, also see %P
          %s    Maximum job size in nodes
          %S    Allowed allocating nodes
          %t    State of nodes, compact form
          %T    State of nodes, extended form
          %u    Print the user name of  who  set  the  reason  a  node  is
                unavailable.
          %U    Print  the  user name and uid of who set the reason a node
                is unavailable.
          %v    Print the version of the running slurmd daemon.
          %w    Scheduling weight of the nodes
          %X    Number of sockets per node
          %Y    Number of cores per socket
          %Z    Number of threads per core
          %z    Extended processor information: number of sockets,  cores,
                threads (S:C:T) per node
          %.<*> right justification of the field
          %<Number><*>
                size of field
   -O <output_format>, --Format=<output_format>
          Specify  the  information  to  be  displayed.   Also  see the -o
          <output_format>, --format=<output_format> option described below
          (which  supports greater flexibility in formatting, but does not
          support access to all fields because we  ran  out  of  letters).
          Requests  a  comma  separated  list  of  job  information  to be
          displayed.
          The format of each field is "type[:[.]size]"
          size    is the minimum field size.  If no size is specified,  20
                  characters will be allocated to print the information.
           .      indicates  the output should be right justified and size
                  must  be  specified.   By  default,   output   is   left
                  justified.
          Valid type specifications include:
          all   Print  all fields available in the -o format for this data
                type with a vertical bar separating each field.
          allocmem
                Prints the amount of allocated memory on a node.
          allocnodes
                Allowed allocating nodes.
          available
                State/availability of a partition.
          cpus  Number of CPUs per node.
          cpusload
                CPU load of a node.
          freemem
                Free memory of a node.
          cpusstate
                Number    of    CPUs    by    state    in    the    format
                "allocated/idle/other/total".  Do not use this with a node
                state option ("%t" or "%T") or the different  node  states
                will be placed on separate lines.
          cores Number of cores per socket.
          defaulttime
                Default    time    for    any    job    in    the   format
                "days-hours:minutes:seconds".
          disk  Size of temporary disk space per node in megabytes.
          features
                Features available on the nodes. Also see features_act.
          features_act
                Features currently active on the nodes. Also see features.
          groups
                Groups which may use the nodes.
          gres  Generic resources (gres) associated with the nodes.
          maxcpuspernode
                The max number of CPUs per node available to jobs  in  the
                partition.
          memory
                Size of memory per node in megabytes.
          nodes Number of nodes.
          nodeaddr
                List of node communication addresses.
          nodeai
                Number  of  nodes by state in the format "allocated/idle".
                Do not use this with a node state option ("%t" or "%T") or
                the  different  node  states  will  be  placed on separate
                lines.
          nodeaiot
                Number   of    nodes    by    state    in    the    format
                "allocated/idle/other/total".  Do not use this with a node
                state option ("%t" or "%T") or the different  node  states
                will be placed on separate lines.
          nodehost
                List of node hostnames.
          nodelist
                List of node names.
          oversubscribe
                Jobs  may  oversubscribe  compute  resources  (i.e. CPUs),
                "yes", "no", "exclusive" or "force".
          partition
                Partition name followed by "*" for the default  partition,
                also see %R.
          partitionname
                Partition name, also see %P.
          preemptmode
                PreemptionMode.
          priorityjobfactor
                Partition  factor  used  by priority/multifactor plugin in
                calculating job priority.
          prioritytier or priority
                Partition scheduling tier priority.
          reason
                The reason  a  node  is  unavailable  (down,  drained,  or
                draining states).
          root  Only user root may initiate jobs, "yes" or "no".
          size  Maximum job size in nodes.
          statecompact
                State of nodes, compact form.
          statelong
                State of nodes, extended form.
          sockets
                Number of sockets per node.
          socketcorethread
                Extended  processor information: number of sockets, cores,
                threads (S:C:T) per node.
          time  Maximum   time    for    any    job    in    the    format
                "days-hours:minutes:seconds".
          timestamp
                Print the timestamp of the reason a node is unavailable.
          threads
                Number of threads per core.
          user  Print  the  user  name  of  who  set  the reason a node is
                unavailable.
          userlong
                Print the user name and uid of who set the reason  a  node
                is unavailable.
          version
                Print the version of the running slurmd daemon.
          weight
                Scheduling weight of the nodes.
   -p <partition>, --partition=<partition>
          Print   information   only  about  the  specified  partition(s).
          Multiple partitions are separated by commas.
   -r, --responding
          If set only report state information for responding nodes.
   -R, --list-reasons
          List reasons nodes are in the down,  drained,  fail  or  failing
          state.   When  nodes are in these states Slurm supports optional
          inclusion of a "reason" string by an administrator.  This option
          will  display  the  first  35 characters of the reason field and
          list of nodes with that  reason  for  all  nodes  that  are,  by
          default, down, drained, draining or failing.  This option may be
          used with other node filtering options (e.g. -r,  -d,  -t,  -n),
          however,  combinations of these options that result in a list of
          nodes that are not down or drained or failing will  not  produce
          any  output.  When used with -l the output additionally includes
          the current node state.
   -s, --summarize
          List only a partition state summary with no node state  details.
          This is ignored if the --format option is specified.
   -S <sort_list>, --sort=<sort_list>
          Specification  of the order in which records should be reported.
          This uses the same field specification as  the  <output_format>.
          Multiple  sorts may be performed by listing multiple sort fields
          separated by commas.  The field specifications may  be  preceded
          by  "+"  or  "-"  for  ascending  (default) and descending order
          respectively.  The partition field specification,  "P",  may  be
          preceded  by  a  "#" to report partitions in the same order that
          they appear in Slurm's   configuration  file,  slurm.conf.   For
          example,  a  sort  value  of  "+P,-m"  requests  that records be
          printed in order of  increasing  partition  name  and  within  a
          partition  by decreasing memory size.  The default value of sort
          is "#P,-t" (partitions ordered  as  configured  then  decreasing
          node state).  If the --Node option is selected, the default sort
          value is "N" (increasing node name).
   -t <states> , --states=<states>
          List nodes only having the given state(s).  Multiple states  may
          be  comma  separated  and  the  comparison  is case insensitive.
          Possible values include (case  insensitive):  ALLOC,  ALLOCATED,
          COMP,  COMPLETING,  DOWN, DRAIN (for node in DRAINING or DRAINED
          states), DRAINED, DRAINING,  ERR,  ERROR,  FAIL,  FUTURE,  FUTR,
          IDLE,  MAINT, MIX, MIXED, NO_RESPOND, NPC, PERFCTRS, POWER_DOWN,
          POWER_UP, RESV, RESERVED, UNK, and UNKNOWN.  By default nodes in
          the  specified state are reported whether they are responding or
          not.  The  --dead  and  --responding  options  may  be  used  to
          filtering nodes by the responding flag.
   -T, --reservation
          Only display information about Slurm reservations.
   --usage
          Print a brief message listing the sinfo options.
   -v, --verbose
          Provide detailed event logging through program execution.
   -V, --version
          Print version information and exit.
   AVAIL  Partition state: up or down.
   CPUS   Count of CPUs (processors) on each node.
   S:C:T  Count of sockets (S), cores (C), and threads (T) on these nodes.
   SOCKETS
          Count of sockets on these nodes.
   CORES  Count of cores on these nodes.
   THREADS
          Count of threads on these nodes.
   GROUPS Resource  allocations  in  this  partition are restricted to the
          named groups.  all  indicates  that  all  groups  may  use  this
          partition.
   JOB_SIZE
          Minimum and maximum node count that can be allocated to any user
          job.  A single number indicates the  minimum  and  maximum  node
          count  are  the  same.   infinite is used to identify partitions
          without a maximum node count.
   TIMELIMIT
          Maximum    time    limit     for     any     user     job     in
          days-hours:minutes:seconds.    infinite   is  used  to  identify
          partitions without a job time limit.
   MEMORY Size of real memory in megabytes on these nodes.
   NODELIST or BP_LIST (BlueGene systems only)
          Names of nodes associated with this configuration/partition.
   NODES  Count of nodes with this particular configuration.
   NODES(A/I)
          Count of nodes with this particular configuration by node  state
          in the form "available/idle".
   NODES(A/I/O/T)
          Count  of nodes with this particular configuration by node state
          in the form "available/idle/other/total".
   PARTITION
          Name of a partition.  Note that the suffix  "*"  identifies  the
          default partition.
   ROOT   Is   the   ability  to  allocate  resources  in  this  partition
          restricted to user root, yes or no.
   OVERSUBSCRIBE
          Will jobs allocated resources in  this  partition  oversubscribe
          those compute resources (i.e. CPUs).  no indicates resources are
          never  oversubscribed.   exclusive  indicates  whole  nodes  are
          dedicated to jobs (equivalent to srun --exclusive option, may be
          used even with select/cons_res managing individual  processors).
          force   indicates   resources   are   always   available  to  be
          oversubscribed.  yes indicates resource may be oversubscribed or
          not per job's resource allocation.
   STATE  State   of  the  nodes.   Possible  states  include:  allocated,
          completing, down,  drained,  draining,  fail,  failing,  future,
          idle,  maint,  mixed,  perfctrs, power_down, power_up, reserved,
          and unknown plus Their abbreviated  forms:  alloc,  comp,  down,
          drain,  drng,  fail, failg, futr, idle, maint, mix, npc, pow_dn,
          pow_up, resv, and unk respectively.  Note that  the  suffix  "*"
          identifies nodes that are presently not responding.
   TMP_DISK
          Size of temporary disk space in megabytes on these nodes.
   Node  state  codes are shortened as required for the field size.  These
   node states may be followed by a special character  to  identify  state
   flags  associated  with  the  node.   The  following node sufficies and
   states are used:
   *   The node is presently not responding and will not be allocated  any
       new work.  If the node remains non-responsive, it will be placed in
       the  DOWN  state  (except  in  the  case  of  COMPLETING,  DRAINED,
       DRAINING, FAIL, FAILING nodes).
   ~   The  node is presently in a power saving mode (typically running at
       reduced frequency).
   #   The node is presently being powered up or configured.
   $   The node is currently  in  a  reservation  with  a  flag  value  of
       "maintenance" or is scheduled to be rebooted.
   ALLOCATED   The node has been allocated to one or more jobs.
   ALLOCATED+  The  node  is allocated to one or more active jobs plus one
               or more jobs are in the process of COMPLETING.
   COMPLETING  All jobs associated with this node are in  the  process  of
               COMPLETING.   This  node  state will be removed when all of
               the job's processes have terminated and  the  Slurm  epilog
               program  (if  any) has terminated. See the Epilog parameter
               description  in  the   slurm.conf   man   page   for   more
               information.
   DOWN        The  node  is  unavailable for use. Slurm can automatically
               place nodes in this state if some  failure  occurs.  System
               administrators  may  also  explicitly  place  nodes in this
               state. If  a  node  resumes  normal  operation,  Slurm  can
               automatically return it to service. See the ReturnToService
               and   SlurmdTimeout   parameter   descriptions    in    the
               slurm.conf(5) man page for more information.
   DRAINED     The  node  is  unavailable for use per system administrator
               request.  See the update node command  in  the  scontrol(1)
               man   page   or   the   slurm.conf(5)  man  page  for  more
               information.
   DRAINING    The node is currently executing a  job,  but  will  not  be
               allocated  to  additional  jobs.  The  node  state  will be
               changed to state DRAINED when the last job on it completes.
               Nodes  enter  this  state per system administrator request.
               See the update node command in the scontrol(1) man page  or
               the slurm.conf(5) man page for more information.
   ERROR       The  node is currently in an error state and not capable of
               running any jobs.  Slurm can automatically place  nodes  in
               this  state  if some failure occurs.  System administrators
               may also explicitly place nodes in this state.  If  a  node
               resumes normal operation, Slurm can automatically return it
               to  service.  See  the  ReturnToService  and  SlurmdTimeout
               parameter  descriptions  in  the slurm.conf(5) man page for
               more information.
   FAIL        The node is expected to fail soon and  is  unavailable  for
               use  per system administrator request.  See the update node
               command in the scontrol(1) man page  or  the  slurm.conf(5)
               man page for more information.
   FAILING     The  node  is currently executing a job, but is expected to
               fail  soon  and  is  unavailable   for   use   per   system
               administrator  request.  See the update node command in the
               scontrol(1) man page or the slurm.conf(5) man page for more
               information.
   FUTURE      The node is currently not fully configured, but expected to
               be available at some point in  the  indefinite  future  for
               use.
   IDLE        The  node is not allocated to any jobs and is available for
               use.
   MAINT       The node is currently in a reservation with a flag value of
               "maintenance" or is scheduled to be rebooted.
   MIXED       The  node  has  some of its CPUs ALLOCATED while others are
               IDLE.
   PERFCTRS (NPC)
               Network Performance Counters associated with this node  are
               in  use,  rendering  this  node as not usable for any other
               jobs
   POWER_DOWN  The node is currently  powered  down  and  not  capable  of
               running any jobs.
   POWER_UP    The node is currently in the process of being powered up.
   RESERVED    The  node  is  in an advanced reservation and not generally
               available.
   UNKNOWN     The Slurm controller has just started and the node's  state
               has not yet been determined.
   Some  sinfo  options  may  be  set  via  environment  variables.  These
   environment variables, along  with  their  corresponding  options,  are
   listed  below.  (Note:  Commandline  options will always override these
   settings.)
   SINFO_ALL           -a, --all
   SINFO_FORMAT        -o <output_format>, --format=<output_format>
   SINFO_PARTITION     -p <partition>, --partition=<partition>
   SINFO_SORT          -S <sort>, --sort=<sort>
   SLURM_CLUSTERS      Same as --clusters
   SLURM_CONF          The location of the Slurm configuration file.
   SLURM_TIME_FORMAT   Specify the format used to report  time  stamps.  A
                       value  of  standard,  the  default value, generates
                       output            in            the            form
                       "year-month-dateThour:minute:second".   A  value of
                       relative returns only "hour:minute:second"  if  the
                       current  day.   For other dates in the current year
                       it prints the "hour:minute"  preceded  by  "Tomorr"
                       (tomorrow),  "Ystday"  (yesterday), the name of the
                       day for the coming week (e.g. "Mon", "Tue",  etc.),
                       otherwise  the  date  (e.g.  "25  Apr").  For other
                       years it returns a date month and  year  without  a
                       time  (e.g.   "6 Jun 2012"). All of the time stamps
                       use a 24 hour format.
                       A valid strftime() format can  also  be  specified.
                       For example, a value of "%a %T" will report the day
                       of the week and a time stamp (e.g. "Mon 12:34:56").
Report basic node and partition configurations: > sinfo PARTITION AVAIL TIMELIMIT NODES STATE NODELIST batch up infinite 2 alloc adev[8-9] batch up infinite 6 idle adev[10-15] debug* up 30:00 8 idle adev[0-7] Report partition summary information: > sinfo -s PARTITION AVAIL TIMELIMIT NODES(A/I/O/T) NODELIST batch up infinite 2/6/0/8 adev[8-15] debug* up 30:00 0/8/0/8 adev[0-7] Report more complete information about the partition debug: > sinfo --long --partition=debug PARTITION AVAIL TIMELIMIT JOB_SIZE ROOT OVERSUBS GROUPS NODES STATE NODELIST debug* up 30:00 8 no no all 8 idle dev[0-7] Report only those nodes that are in state DRAINED: > sinfo --states=drained PARTITION AVAIL NODES TIMELIMIT STATE NODELIST debug* up 2 30:00 drain adev[6-7] Report node-oriented information with details and exact matches: > sinfo -Nel NODELIST NODES PARTITION STATE CPUS MEMORY TMP_DISK WEIGHT FEATURES REASON adev[0-1] 2 debug* idle 2 3448 38536 16 (null) (null) adev[2,4-7] 5 debug* idle 2 3384 38536 16 (null) (null) adev3 1 debug* idle 2 3394 38536 16 (null) (null) adev[8-9] 2 batch allocated 2 246 82306 16 (null) (null) adev[10-15] 6 batch idle 2 246 82306 16 (null) (null) Report only down, drained and draining nodes and their reason field: > sinfo -R REASON NODELIST Memory errors dev[0,5] Not Responding dev8
Copyright (C) 2002-2007 The Regents of the University of California. Produced at Lawrence Livermore National Laboratory (cf, DISCLAIMER). Copyright (C) 2008-2009 Lawrence Livermore National Security. Copyright (C) 2010-2016 SchedMD LLC. This file is part of Slurm, a resource management program. For details, see <http://slurm.schedmd.com/>. Slurm is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. Slurm is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
scontrol(1), smap(1), squeue(1), slurm_load_ctl_conf (3), slurm_load_jobs (3), slurm_load_node (3), slurm_load_partitions (3), slurm_reconfigure (3), slurm_shutdown (3), slurm_update_job (3), slurm_update_node (3), slurm_update_partition (3), slurm.conf(5)
Personal Opportunity - Free software gives you access to billions of dollars of software at no cost. Use this software for your business, personal use or to develop a profitable skill. Access to source code provides access to a level of capabilities/information that companies protect though copyrights. Open source is a core component of the Internet and it is available to you. Leverage the billions of dollars in resources and capabilities to build a career, establish a business or change the world. The potential is endless for those who understand the opportunity.
Business Opportunity - Goldman Sachs, IBM and countless large corporations are leveraging open source to reduce costs, develop products and increase their bottom lines. Learn what these companies know about open source and how open source can give you the advantage.
Free Software provides computer programs and capabilities at no cost but more importantly, it provides the freedom to run, edit, contribute to, and share the software. The importance of free software is a matter of access, not price. Software at no cost is a benefit but ownership rights to the software and source code is far more significant.
Free Office Software - The Libre Office suite provides top desktop productivity tools for free. This includes, a word processor, spreadsheet, presentation engine, drawing and flowcharting, database and math applications. Libre Office is available for Linux or Windows.
The Free Books Library is a collection of thousands of the most popular public domain books in an online readable format. The collection includes great classical literature and more recent works where the U.S. copyright has expired. These books are yours to read and use without restrictions.
Source Code - Want to change a program or know how it works? Open Source provides the source code for its programs so that anyone can use, modify or learn how to write those programs themselves. Visit the GNU source code repositories to download the source.
Study at Harvard, Stanford or MIT - Open edX provides free online courses from Harvard, MIT, Columbia, UC Berkeley and other top Universities. Hundreds of courses for almost all major subjects and course levels. Open edx also offers some paid courses and selected certifications.
Linux Manual Pages - A man or manual page is a form of software documentation found on Linux/Unix operating systems. Topics covered include computer programs (including library and system calls), formal standards and conventions, and even abstract concepts.