information between major SLURM updates?</a></li>
<li><a href="#health_check">Why doesn't the <i>HealthCheckProgram</i>
execute on DOWN nodes?</a></li>
<li><a href="#batch_lost">What is the meaning of the error
"Batch JobId=# missing from master node, killing it"?</a></li>
<li><a href="#accept_again">What does the message
"srun: error: Unable to accept connection: Resources temporarily unavailable"
indicate?</a></li>
<li><a href="#task_prolog">How could I automatically print a job's
SLURM job ID to its standard output?</a></li>
<li><a href="#moab_start">I run SLURM with the Moab or Maui scheduler.
How can I start a job under SLURM without the scheduler?</a></li>
<li><a href="#orphan_procs">Why are user processes and <i>srun</i>
running even though the job is supposed to be completed?</a></li>
<li><a href="#slurmd_oom">How can I prevent the <i>slurmd</i> and
<i>slurmstepd</i> daemons from being killed when a node's memory
is exhausted?</a></li>
<li><a href="#ubuntu">I see the host of my calling node as 127.0.1.1
instead of the correct IP address. Why is that?</a></li>
<h2>For Users</h2>
<p><a name="comp"><b>1. Why is my job/node in COMPLETING state?</b></a><br>
When a job is terminating, both the job and its nodes enter the COMPLETING state.
<p>Note that SLURM has two configuration parameters that may be used to
automate some of this process.
<i>UnkillableStepProgram</i> specifies a program to execute when
non-killable processes are identified.
<i>UnkillableStepTimeout</i> specifies how long to wait for processes
to terminate.
See "man slurm.conf" for more information about these parameters.</p>
<p><a name="rlimit"><b>2. Why are my resource limits not propagated?</b></a><br>
When the <span class="commandline">srun</span> command executes, it captures the
resource limits in effect at submit time. These limits are propagated to the allocated
nodes before initiating the user's job. The SLURM daemon running on that node then
tries to establish identical resource limits for the job being initiated.
There are several possible reasons for not being able to establish those
resource limits:
<ul>
<li>The hard resource limits applied to SLURM's slurmd daemon are lower
than the user's soft resource limits on the submit host. Typically
the slurmd daemon is initiated by the init daemon with the operating
system default limits. This may be addressed either through use of the
ulimit command in the /etc/sysconfig/slurm file or enabling
<a href="#pam">PAM in SLURM</a>.</li>
<li>The user's hard resource limits on the allocated node are lower than
the same user's soft resource limits on the node from which the
job was submitted. It is recommended that the system administrator
establish uniform hard resource limits for users on all nodes
within a cluster to prevent this from occurring.</li>
</ul>
<p>NOTE: This may produce the error message "Can't propagate RLIMIT_...".
The error message is printed only if the user explicitly specifies that
the resource limit should be propagated or the srun command is running
with verbose logging of actions from the slurmd daemon (e.g. "srun -d6 ...").</p>
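<p>For the first case above, the limits inherited by <i>slurmd</i> can be
raised at daemon start time. A minimal sketch, assuming your init script
sources /etc/sysconfig/slurm before launching <i>slurmd</i> (the values
are only examples):</p>
<pre>
# /etc/sysconfig/slurm: raise the limits slurmd starts with
ulimit -n 8192        # open files
ulimit -l unlimited   # locked memory
</pre>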
SLURM output expecting the old format (e.g. LSF, Maui or Moab).
<p><a name="file_limit"><b>25. What causes the error
"Unable to accept new connection: Too many open files"?</b></a><br>
The srun command automatically increases its open file limit to
the hard limit in order to process all of the standard input and output
connections to the launched tasks. It is recommended that you set the
open file hard limit to 8192 across the cluster.</p>
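<p>One common way to do this, assuming the pam_limits module is in use on
your nodes, is an entry in /etc/security/limits.conf using the value
recommended above:</p>
<pre>
# /etc/security/limits.conf: raise the hard open-file limit for all users
*    hard    nofile    8192
</pre>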
<p><a name="slurmd_log"><b>26. Why does the setting of <i>SlurmdDebug</i>
fail to log job step information at the appropriate level?</b></a><br>
There are two programs involved here. One is <b>slurmd</b>, which is
a persistent daemon running at the desired debug level. The second
program is <b>slurmstepd</b>, which executes the user job and its

slurmdbd you can also query any cluster using the slurmdbd from any
other cluster's nodes.
<p><a name="debug"><b>29. How can I build SLURM with debugging symbols?</b></a><br>
Set your CFLAGS environment variable before building.
You want the "-g" option to produce debugging information and
"-O0" to set the optimization level to zero (off). For example:<br>
CFLAGS="-g -O0" ./configure ...
<p><a name="state_preserve"><b>30. How can I easily preserve drained node
information between major SLURM updates?</b></a><br>
Major SLURM updates generally have changes in the state save files and
communication protocols, so a cold-start (without state) is generally
required. If you have nodes in a DRAIN state and want to preserve that
<p><a name="health_check"><b>31. Why doesn't the <i>HealthCheckProgram</i>
execute on DOWN nodes?</b></a><br>
Hierarchical communications are used for sending this message. If there
are DOWN nodes in the communications hierarchy, messages will need to
be re-routed. This limits SLURM's ability to tightly synchronize the
execution of the <i>HealthCheckProgram</i> across the cluster, which
could adversely impact performance of parallel applications.
The use of CRON or node startup scripts may be better suited to ensure
that <i>HealthCheckProgram</i> gets executed on nodes that are DOWN
in SLURM. If you still want to have SLURM try to execute
<p><a name="batch_lost"><b>32. What is the meaning of the error
"Batch JobId=# missing from master node, killing it"?</b></a><br>
A shell is launched on node zero of a job's allocation to execute
the submitted program. The <i>slurmd</i> daemon executing on each compute
node will periodically report to the <i>slurmctld</i> what programs it
is executing. If a batch program is expected to be running on some
node (i.e. node zero of the job's allocation) and is not found, the
message above will be logged and the job cancelled. This typically is
associated with exhausting memory on the node or some other critical
failure that cannot be recovered from. The equivalent message in
earlier releases of SLURM is
"Master node lost JobId=#, killing it".
<p><a name="accept_again"><b>33. What does the message
"srun: error: Unable to accept connection: Resources temporarily unavailable"
indicate?</b></a><br>
This has been reported on some larger clusters running SUSE Linux when
a user's resource limits are reached. You may need to increase limits
for locked memory and stack size to resolve this problem.
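<p>One way to raise these limits on the affected nodes, assuming pam_limits
is in use, is through /etc/security/limits.conf; the values below are only
illustrative:</p>
<pre>
# /etc/security/limits.conf: raise locked-memory and stack limits
*    hard    memlock    unlimited
*    hard    stack      unlimited
</pre>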
<p><a name="task_prolog"><b>34. How could I automatically print a job's
SLURM job ID to its standard output?</b></a><br>
The configured <i>TaskProlog</i> is the only thing that can write to
the job's standard output or set extra environment variables for a job
or job step. To write to the job's standard output, precede the message
with "print ". To export environment variables, output a line of this
form "export name=value". The example below will print a job's SLURM
job ID and allocated hosts for a batch job only.</p>
<pre>
#!/bin/sh
#
# Sample TaskProlog script that will print a batch job's
# job ID and node list to the job's stdout
#

if [ X"$SLURM_STEP_ID" = "X" -a X"$SLURM_PROCID" = "X"0 ]
then
  echo "print =========================================="
  echo "print SLURM_JOB_ID = $SLURM_JOB_ID"
  echo "print SLURM_NODELIST = $SLURM_NODELIST"
  echo "print =========================================="
fi
</pre>
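<p>The script must be executable and referenced from <i>slurm.conf</i>;
the path below is only an illustration:</p>
<pre>
# slurm.conf excerpt: run the script above as each task is launched
TaskProlog=/etc/slurm/task_prolog.sh
</pre>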
<p><a name="moab_start"><b>35. I run SLURM with the Moab or Maui scheduler.
1156
How can I start a job under SLURM without the scheduler?</b></a></br>
1157
When SLURM is configured to use the Moab or Maui scheduler, all submitted
1158
jobs have their priority initialized to zero, which SLURM treats as a held
1159
job. The job only begins when Moab or Maui decide where and when to start
1160
the job, setting the required node list and setting the job priority to
1161
a non-zero value. To circumvent this, submit your job using a SLURM or
1162
Moab command then manually set its priority to a non-zero value (must be
1163
done by user root). For example:</p>
1165
$ scontrol update jobid=1234 priority=1000000
1167
<p>Note that changes in the configured value of <i>SchedulerType</i> only
1168
take effect when the <i>slurmctld</i> daemon is restarted (reconfiguring
1169
SLURM will not change this parameter. You will also manually need to
1170
modify the priority of every pending job.
1171
When changing to Moab or Maui scheduling, set every job priority to zero.
1172
When changing from Moab or Maui scheduling, set every job priority to a
1173
non-zero value (preferably fairly large, say 1000000).</p>
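<p>A minimal sketch of such a bulk priority update, run as user root
(adjust the priority value to suit your site):</p>
<pre>
# Give every pending job a non-zero priority
for jobid in $(squeue --state=PENDING --noheader --format=%i); do
    scontrol update jobid=$jobid priority=1000000
done
</pre>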
<p><a name="orphan_procs"><b>36. Why are user processes and <i>srun</i>
running even though the job is supposed to be completed?</b></a><br>
SLURM relies upon a configurable process tracking plugin to determine
when all of the processes associated with a job or job step have completed.
Those plugins relying upon a kernel patch can reliably identify every process.
Those plugins dependent upon process group IDs or parent process IDs are not
reliable. See the <i>ProctrackType</i> description in the <i>slurm.conf</i>
man page for details. We rely upon the sgi_job plugin for most systems.</p>
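<p>For example, that plugin would be selected in <i>slurm.conf</i> as shown
below; use whichever kernel-supported plugin is available on your systems:</p>
<pre>
# slurm.conf excerpt: use a reliable, kernel-assisted process tracking plugin
ProctrackType=proctrack/sgi_job
</pre>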
<p><a name="slurmd_oom"><b>37. How can I prevent the <i>slurmd</i> and
<i>slurmstepd</i> daemons from being killed when a node's memory
is exhausted?</b></a><br>
You can set the value of <i>/proc/self/oom_adj</i> for
<i>slurmd</i> and <i>slurmstepd</i> by initiating the <i>slurmd</i>
daemon with the <i>SLURMD_OOM_ADJ</i> and/or <i>SLURMSTEPD_OOM_ADJ</i>
environment variables set to the desired values.
A value of -17 typically will disable killing.</p>
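<p>For example, assuming your init script sources /etc/sysconfig/slurm
before starting <i>slurmd</i>, lines such as the following would do this:</p>
<pre>
# /etc/sysconfig/slurm: keep the OOM killer away from the SLURM daemons
SLURMD_OOM_ADJ=-17
SLURMSTEPD_OOM_ADJ=-17
export SLURMD_OOM_ADJ SLURMSTEPD_OOM_ADJ
</pre>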
<p><a name="ubuntu"><b>38. I see the host of my calling node as 127.0.1.1
instead of the correct IP address. Why is that?</b></a><br>
Some systems by default will put your host in the /etc/hosts file as:</p>
<pre>
127.0.1.1   snowflake.llnl.gov   snowflake
</pre>
<p>This will cause srun and other things to grab 127.0.1.1 as its
address instead of the correct address, making it so the
communication doesn't work. The solution is to either remove this line or
set a different NodeAddr that is known by your other nodes.</p>
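<p>For example, the node's entry in <i>slurm.conf</i> could pin the real
address explicitly; the host name is taken from the example above and the
address is only a placeholder:</p>
<pre>
# slurm.conf excerpt: bind the node name to its routable address
NodeName=snowflake NodeAddr=192.168.1.10
</pre>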
<p class="footer"><a href="#top">top</a></p>
<p style="text-align:center;">Last modified 12 June 2009</p>
<!--#include virtual="footer.txt"-->