<!--#include virtual="header.txt"-->

<H1>Gang Scheduling</H1>

SLURM version 1.2 and earlier supported only the dedication of resources
to jobs.
Beginning in SLURM version 1.3, gang scheduling is supported.
Gang scheduling is when two or more jobs are allocated to the same resources
and these jobs are alternately suspended to let all of the tasks of each
job have full access to the shared resources for a period of time.

A resource manager that supports timeslicing can improve its responsiveness
and utilization by allowing more jobs to begin running sooner. Shorter-running
jobs no longer have to wait in a queue behind longer-running jobs. Instead they
can be run "in parallel" with the longer-running jobs, which allows them
to finish sooner. Throughput is also improved because overcommitting the
resources provides opportunities for "local backfilling" to occur (see
example below).

In SLURM 1.3.0 the <I>sched/gang</I> plugin provides timeslicing. When enabled,
it monitors each of the partitions in SLURM. If a new job has been allocated to
resources in a partition that have already been allocated to an existing job,
then the plugin will suspend the new job until the configured
<I>SchedulerTimeSlice</I> interval has elapsed. Then it will suspend the
running job and let the new job make use of the resources for a
<I>SchedulerTimeSlice</I> interval. This will continue until one of the
jobs terminates.

<H2>Configuration</H2>

There are several important configuration parameters relating to
gang scheduling:

<B>SelectType</B>: The SLURM <I>sched/gang</I> plugin supports nodes
allocated by the <I>select/linear</I> plugin and socket/core/CPU resources
allocated by the <I>select/cons_res</I> plugin.

<B>SelectTypeParameter</B>: Since resources will be getting overallocated
with jobs, the resource selection plugin should be configured to track the
amount of memory used by each job to ensure that memory page swapping does
not occur. When <I>select/linear</I> is chosen, we recommend setting
<I>SelectTypeParameter=CR_Memory</I>. When <I>select/cons_res</I> is
chosen, we recommend including Memory as a resource (e.g.
<I>SelectTypeParameter=CR_Core_Memory</I>).

<B>DefMemPerTask</B>: Since job requests may not explicitly specify
a memory requirement, we also recommend configuring <I>DefMemPerTask</I>
(default memory per task). It may also be desirable to configure
<I>MaxMemPerTask</I> (maximum memory per task) in <I>slurm.conf</I>.

<B>JobAcctGatherType and JobAcctGatherFrequency</B>:
If you wish to enforce memory limits, accounting must be enabled
using the <I>JobAcctGatherType</I> and <I>JobAcctGatherFrequency</I>
parameters. If accounting is enabled and a job exceeds its configured
memory limits, it will be canceled in order to prevent it from
adversely affecting other jobs sharing the same resources.

<B>SchedulerType</B>: Configure the <I>sched/gang</I> plugin by setting
<I>SchedulerType=sched/gang</I> in <I>slurm.conf</I>.

<B>SchedulerTimeSlice</B>: The default timeslice interval is 30 seconds.
To change this duration, set <I>SchedulerTimeSlice</I> to the desired interval
(in seconds) in <I>slurm.conf</I>. For example, to set the timeslice interval
to one minute, set <I>SchedulerTimeSlice=60</I>. Shorter intervals increase
the overhead of gang scheduling.

<B>Shared</B>: Configure the partition's <I>Shared</I> setting to
<I>FORCE</I> for all partitions in which timeslicing is to take place.
The <I>FORCE</I> option now supports an additional parameter that controls
how many jobs can share a resource (FORCE[:max_share]). By default the
max_share value is 4. To allow up to 6 jobs from this partition to be
allocated to a common resource, set <I>Shared=FORCE:6</I>.

To enable gang scheduling after making the configuration changes
described above, restart SLURM if it is already running. Any change to the
plugin settings in SLURM requires a full restart of the daemons. If you
only change the partition <I>Shared</I> setting, that can be updated with
<I>scontrol reconfig</I>.
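
Putting the settings above together, a minimal <I>slurm.conf</I> excerpt
might look like the following. The node and partition names are taken from
the examples below; the memory size and accounting frequency are
illustrative assumptions, not recommendations:

<PRE>
# Gang scheduling sketch; adjust values for your site
SchedulerType=sched/gang
SchedulerTimeSlice=30
SelectType=select/cons_res
SelectTypeParameters=CR_Core_Memory
# Assumed defaults: 512 MB per task, accounting sampled every 30 seconds
DefMemPerTask=512
JobAcctGatherType=jobacct_gather/linux
JobAcctGatherFrequency=30
# Partition in which jobs timeslice, up to 4 jobs per resource
PartitionName=active Nodes=n[12-16] Default=YES Shared=FORCE:4
</PRE>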

For an advanced topic discussion on the potential use of swap space,
see "Making use of swap space" in the "Future Work" section below.

<H2>Timeslicer Design and Operation</H2>

When enabled, the <I>sched/gang</I> plugin keeps track of the resources
allocated to all jobs. For each partition an "active bitmap" is maintained that
tracks all concurrently running jobs in the SLURM cluster. Each time a new
job is allocated to resources in a partition, the <I>sched/gang</I> plugin
compares these newly allocated resources with the resources already maintained
in the "active bitmap". If these two sets of resources are disjoint then the new
job is added to the "active bitmap". If these two sets of resources overlap then
the new job is suspended. All jobs are tracked in a per-partition job queue
within the <I>sched/gang</I> plugin.
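
In essence, this admission test is a bitwise overlap check. A minimal C
sketch of the idea (hypothetical types and helper names, not the actual
plugin source) is:

<PRE>
#include &lt;stdbool.h&gt;
#include &lt;stdint.h&gt;

typedef uint64_t bitmap_t;   /* one bit per node (or core) in this sketch */

/* Return true if the new job's resources do not overlap the
 * partition's "active bitmap" and the job may therefore run now. */
static bool job_may_run(bitmap_t active, bitmap_t new_job)
{
    return (active &amp; new_job) == 0;
}

/* Add a runnable job's resources to the "active bitmap". */
static bitmap_t activate(bitmap_t active, bitmap_t new_job)
{
    return active | new_job;
}
</PRE>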

A separate <I>timeslicer thread</I> is spawned by the <I>sched/gang</I> plugin
on startup. This thread sleeps for the configured <I>SchedulerTimeSlice</I>
interval. When it wakes up, it checks each partition for suspended jobs. If
suspended jobs are found then the <I>timeslicer thread</I> moves all running
jobs to the end of the job queue. It then reconstructs the "active bitmap" for
this partition beginning with the suspended job that has waited the longest to
run (this will be the first suspended job in the run queue). Each following job
is then compared with the new "active bitmap", and if the job can be run
concurrently with the other "active" jobs then the job is added. Once this is
complete then the <I>timeslicer thread</I> suspends any currently running jobs
that are no longer part of the "active bitmap", and resumes jobs that are new to
the "active bitmap".

This <I>timeslicer thread</I> algorithm for rotating jobs is designed to prevent
jobs from starving (remaining in the suspended state indefinitely) and to be as
fair as possible in the distribution of runtime while still keeping all of the
resources as busy as possible.
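
A compact C sketch of one rotation pass, using the same hypothetical types
and helpers as the sketch above (a simplification of the behavior just
described, not the plugin's actual code):

<PRE>
#include &lt;stddef.h&gt;

struct job {
    bitmap_t resources;   /* resources allocated to this job */
    bool     running;     /* currently running vs. suspended */
};

/* One timeslicer pass over a partition's job queue.  The queue is
 * assumed to be ordered so that previously running jobs have already
 * been moved to the end; the longest-suspended job comes first. */
static void rotate(struct job *queue, size_t njobs)
{
    bitmap_t active = 0;
    for (size_t i = 0; i &lt; njobs; i++) {
        if (job_may_run(active, queue[i].resources)) {
            active = activate(active, queue[i].resources);
            if (!queue[i].running)
                queue[i].running = true;   /* resume  */
        } else if (queue[i].running) {
            queue[i].running = false;      /* suspend */
        }
    }
}
</PRE>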

The <I>sched/gang</I> plugin suspends jobs via the same internal functions that
support <I>scontrol suspend</I> and <I>scontrol resume</I>. A good way to
observe the operation of the timeslicer is by running <I>watch squeue</I> in a
terminal window.

<H2>A Simple Example</H2>

The following example is configured with <I>select/linear</I>,
<I>sched/gang</I>, and <I>Shared=FORCE</I>. This example takes place on a small
cluster of 5 nodes:

<PRE>
[user@n16 load]$ <B>sinfo</B>
PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
active*      up   infinite      5   idle n[12-16]
</PRE>

Here are the scheduler settings (the last two settings are the relevant ones):

<PRE>
[user@n16 load]$ <B>scontrol show config | grep Sched</B>
SchedulerRootFilter     = 1
SchedulerTimeSlice      = 30
SchedulerType           = sched/gang
</PRE>

The <I>myload</I> script launches a simple load-generating app that runs
for the given number of seconds (a hypothetical stand-in for this script is
sketched after this example). Submit <I>myload</I> to run on all nodes:

<PRE>
[user@n16 load]$ <B>sbatch -N5 ./myload 300</B>
sbatch: Submitted batch job 3
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
    3    active myload user  R  0:05     5 n[12-16]
</PRE>
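
The <I>myload</I> script itself is not shown in this document. A minimal
stand-in (an assumption for illustration, not the script actually used
above) just burns CPU on every allocated node for the requested number of
seconds:

<PRE>
#!/bin/sh
# myload: hypothetical load generator; $1 is the runtime in seconds.
# srun starts one spinning shell per allocated node.
srun /bin/sh -c "
  end=\$(( \$(date +%s) + $1 ))
  while [ \$(date +%s) -lt \$end ]; do :; done
"
</PRE>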

Submit it again and watch the <I>sched/gang</I> plugin suspend it:

<PRE>
[user@n16 load]$ <B>sbatch -N5 ./myload 300</B>
sbatch: Submitted batch job 4
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
    3    active myload user  R  0:13     5 n[12-16]
    4    active myload user  S  0:00     5 n[12-16]
</PRE>

After 30 seconds the <I>sched/gang</I> plugin swaps jobs, and now job 4 is the
running one:

<PRE>
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
    4    active myload user  R  0:08     5 n[12-16]
    3    active myload user  S  0:41     5 n[12-16]
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
    4    active myload user  R  0:21     5 n[12-16]
    3    active myload user  S  0:41     5 n[12-16]
</PRE>

After another 30 seconds the <I>sched/gang</I> plugin sets job 3 running again:

<PRE>
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
    3    active myload user  R  0:50     5 n[12-16]
    4    active myload user  S  0:30     5 n[12-16]
</PRE>

<B>A possible side effect of timeslicing</B>: Note that jobs that are
immediately suspended may cause their srun commands to produce the following
output:

<PRE>
[user@n16 load]$ <B>cat slurm-4.out</B>
srun: Job step creation temporarily disabled, retrying
srun: Job step creation still disabled, retrying
srun: Job step creation still disabled, retrying
srun: Job step creation still disabled, retrying
srun: Job step created
</PRE>

This occurs because <I>srun</I> is attempting to launch a job step in an
allocation that has been suspended. The <I>srun</I> process will continue in a
retry loop to launch the job step until the allocation has been resumed and the
job step can be launched.

When the <I>sched/gang</I> plugin is enabled, this type of output in the user
jobs should be considered benign.

<H2>More Examples</H2>

The following example shows how the timeslicer algorithm keeps the resources
busy. Job 10 runs continuously, while jobs 9 and 11 are timesliced:

<PRE>
[user@n16 load]$ <B>sbatch -N3 ./myload 300</B>
sbatch: Submitted batch job 9
[user@n16 load]$ <B>sbatch -N2 ./myload 300</B>
sbatch: Submitted batch job 10
[user@n16 load]$ <B>sbatch -N3 ./myload 300</B>
sbatch: Submitted batch job 11
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
    9    active myload user  R  0:11     3 n[12-14]
   10    active myload user  R  0:08     2 n[15-16]
   11    active myload user  S  0:00     3 n[12-14]
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   10    active myload user  R  0:50     2 n[15-16]
   11    active myload user  R  0:12     3 n[12-14]
    9    active myload user  S  0:41     3 n[12-14]
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   10    active myload user  R  1:04     2 n[15-16]
   11    active myload user  R  0:26     3 n[12-14]
    9    active myload user  S  0:41     3 n[12-14]
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
    9    active myload user  R  0:46     3 n[12-14]
   10    active myload user  R  1:13     2 n[15-16]
   11    active myload user  S  0:30     3 n[12-14]
</PRE>

The next example displays "local backfilling":

<PRE>
[user@n16 load]$ <B>sbatch -N3 ./myload 300</B>
sbatch: Submitted batch job 12
[user@n16 load]$ <B>sbatch -N5 ./myload 300</B>
sbatch: Submitted batch job 13
[user@n16 load]$ <B>sbatch -N2 ./myload 300</B>
sbatch: Submitted batch job 14
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   12    active myload user  R  0:14     3 n[12-14]
   14    active myload user  R  0:06     2 n[15-16]
   13    active myload user  S  0:00     5 n[12-16]
</PRE>

Without timeslicing and without the backfill scheduler enabled, job 14 would
have to wait for job 13 to finish.

This is called "local" backfilling because the backfilling only occurs with jobs
close enough in the queue to get allocated by the scheduler as part of
oversubscribing the resources. Recall that the number of jobs that can
overcommit a resource is controlled by the <I>Shared=FORCE:max_share</I> value,
so this value effectively controls the scope of "local backfilling".

Normal backfill algorithms check <U>all</U> jobs in the wait queue.

<H2>Consumable Resource Examples</H2>

The following two examples illustrate the primary difference between
<I>CR_CPU</I> and <I>CR_Core</I> when consumable resource selection is enabled
(<I>select/cons_res</I>).

When <I>CR_CPU</I> (or <I>CR_CPU_Memory</I>) is configured then the selector
treats the CPUs as simple, <I>interchangeable</I> computing resources. However,
when <I>CR_Core</I> (or <I>CR_Core_Memory</I>) is enabled the selector treats
the CPUs as individual resources that are <U>specifically</U> allocated to jobs.
This subtle difference is highlighted when timeslicing is enabled.

In both examples 6 jobs are submitted. Each job requests 2 CPUs per node, and
all of the nodes contain two quad-core processors. The timeslicer will initially
let the first 4 jobs run and suspend the last 2 jobs. The manner in which these
jobs are timesliced depends upon the configured <I>SelectTypeParameter</I>.

In the first example <I>CR_Core_Memory</I> is configured. Note that jobs 46 and
47 don't <U>ever</U> get suspended. This is because they are not sharing their
cores with any other job. Jobs 48 and 49 were allocated to the same cores as
jobs 44 and 45. The timeslicer recognizes this and timeslices only those jobs:

<PRE>
[user@n16 load]$ <B>sinfo</B>
PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
active*      up   infinite      5   idle n[12-16]
[user@n16 load]$ <B>scontrol show config | grep Select</B>
SelectType              = select/cons_res
SelectTypeParameters    = CR_CORE_MEMORY
[user@n16 load]$ <B>sinfo -o "%20N %5D %5c %5z"</B>
NODELIST             NODES CPUS  S:C:T
n[12-16]             5     8     2:4:1
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 44
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 45
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 46
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 47
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 48
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 49
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   44    active myload user  R  0:09     5 n[12-16]
   45    active myload user  R  0:08     5 n[12-16]
   46    active myload user  R  0:08     5 n[12-16]
   47    active myload user  R  0:07     5 n[12-16]
   48    active myload user  S  0:00     5 n[12-16]
   49    active myload user  S  0:00     5 n[12-16]
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   46    active myload user  R  0:49     5 n[12-16]
   47    active myload user  R  0:48     5 n[12-16]
   48    active myload user  R  0:06     5 n[12-16]
   49    active myload user  R  0:06     5 n[12-16]
   44    active myload user  S  0:44     5 n[12-16]
   45    active myload user  S  0:43     5 n[12-16]
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   44    active myload user  R  1:23     5 n[12-16]
   45    active myload user  R  1:22     5 n[12-16]
   46    active myload user  R  2:22     5 n[12-16]
   47    active myload user  R  2:21     5 n[12-16]
   48    active myload user  S  1:00     5 n[12-16]
   49    active myload user  S  1:00     5 n[12-16]
</PRE>

Note the runtime of all 6 jobs in the output of the last <I>squeue</I> command.
Jobs 46 and 47 have been running continuously, while jobs 44 and 45 are
splitting their runtime with jobs 48 and 49.

The next example has <I>CR_CPU_Memory</I> configured and the same 6 jobs are
submitted. Here the selector and the timeslicer treat the CPUs as countable
resources, which results in all 6 jobs sharing time on the CPUs:

<PRE>
[user@n16 load]$ <B>sinfo</B>
PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
active*      up   infinite      5   idle n[12-16]
[user@n16 load]$ <B>scontrol show config | grep Select</B>
SelectType              = select/cons_res
SelectTypeParameters    = CR_CPU_MEMORY
[user@n16 load]$ <B>sinfo -o "%20N %5D %5c %5z"</B>
NODELIST             NODES CPUS  S:C:T
n[12-16]             5     8     2:4:1
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 51
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 52
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 53
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 54
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 55
[user@n16 load]$ <B>sbatch -n10 -N5 ./myload 300</B>
sbatch: Submitted batch job 56
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   51    active myload user  R  0:11     5 n[12-16]
   52    active myload user  R  0:11     5 n[12-16]
   53    active myload user  R  0:10     5 n[12-16]
   54    active myload user  R  0:09     5 n[12-16]
   55    active myload user  S  0:00     5 n[12-16]
   56    active myload user  S  0:00     5 n[12-16]
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   51    active myload user  R  1:09     5 n[12-16]
   52    active myload user  R  1:09     5 n[12-16]
   55    active myload user  R  0:23     5 n[12-16]
   56    active myload user  R  0:23     5 n[12-16]
   53    active myload user  S  0:45     5 n[12-16]
   54    active myload user  S  0:44     5 n[12-16]
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   53    active myload user  R  0:55     5 n[12-16]
   54    active myload user  R  0:54     5 n[12-16]
   55    active myload user  R  0:40     5 n[12-16]
   56    active myload user  R  0:40     5 n[12-16]
   51    active myload user  S  1:16     5 n[12-16]
   52    active myload user  S  1:16     5 n[12-16]
[user@n16 load]$ <B>squeue</B>
JOBID PARTITION   NAME USER ST  TIME NODES NODELIST
   51    active myload user  R  3:18     5 n[12-16]
   52    active myload user  R  3:18     5 n[12-16]
   53    active myload user  R  3:17     5 n[12-16]
   54    active myload user  R  3:16     5 n[12-16]
   55    active myload user  S  3:00     5 n[12-16]
   56    active myload user  S  3:00     5 n[12-16]
</PRE>

Note that the runtime of all 6 jobs is roughly equal. Jobs 51-54 ran first so
they're slightly ahead, but so far all jobs have run for at least 3 minutes.

At the core level this means that SLURM relies on the Linux kernel to move jobs
around on the cores to maximize performance. This is different from when
<I>CR_Core_Memory</I> was configured and the jobs would effectively remain
"pinned" to their specific cores for the duration of the job. Note that
<I>CR_Core_Memory</I> supports CPU binding, while <I>CR_CPU_Memory</I> does not.

<H2>Future Work</H2>

Priority scheduling and preemptive scheduling are other forms of gang
scheduling that are currently under development for SLURM.

<B>Making use of swap space</B>: (Note that this topic is not currently
scheduled for development unless someone would like to pursue it.) Timeslicing
provides an interesting mechanism for high performance jobs to make use of
swap space. The optimal scenario is one in which suspended jobs are
"swapped out" and active jobs are "swapped in". The swapping activity would
only occur once every <I>SchedulerTimeSlice</I> interval.

However, SLURM should first be modified to include support for scheduling jobs
into swap space and to provide controls to prevent overcommitting swap space.
For now this idea could be experimented with by disabling memory support in the
selector and submitting appropriately sized jobs.
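
For example, one way to set up such an experiment (an illustrative
assumption, not a tested recipe) would be to timeslice on cores without
tracking memory:

<PRE>
# Sketch: select cores without tracking memory, so suspended jobs may
# be paged out to swap; a longer timeslice amortizes the swap traffic.
SelectType=select/cons_res
SelectTypeParameters=CR_Core
SchedulerType=sched/gang
SchedulerTimeSlice=120
</PRE>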

<p style="text-align:center;">Last modified 17 March 2008</p>

<!--#include virtual="footer.txt"-->