Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

echo 'CONDOR_CONFIG=/etc/condor/glidein-slurm.conf' > /var/run/condor/config
echo 'STARTD.DAEMON_SHUTDONW = State == "Unclaimed" && Activity == "Idle" && (MyCurrentTime - EnteredCurrentActivity) > 600' > /var/run/condor/condor_config.local

echo 'MASTER.DAEMON_SHUTDOWN = STARTD_StartTime == 0' >> /var/run/condor/condor_config.local

systemctl start condor
# loop until condor_startd is no longer a running process
systemctl stop condor
rm -f /var/run/condor/condor_config.local
rm -f /var/run/condor/config
exit

...

If jobs are waiting in the HTCondor cluster (perhaps only vlapipe jobs)

If nodes are available in the Slurm Cluster (If not perhaps send email)

Launch one Pilot job

Sleep some amount of time, presumably more than the time HTCondor takes to launch a job


Problems



Ideas

Running condor_master -f from the Pilot script might work better if I set both STARTD and MASTER DAEMON_SHUTDOWN