You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Next »

  • upgrade testpost-master to RHEL7 so it can run HTCondor-8.9 and Slurm
  • upgrade nmpost-master to RHEL7 so it can run HTCondor-8.9 and Slurm
  • Configure nmpost-master so that it can flock to CHTC
  • Configure testpost-master so that it can flock to CHTC and unconfigure testpost-serv-1
  • Implement some sort of mechanism to keep vlass jobs on vlass nodes, hera jobs on hera nodes, etc


  • Set a PoolName for the testpost and nmpost clusters.  E.g. NRAO-NM-PROD and NRAO-NM-TEST.  They don't have to be allcaps.



  • To Do
    • Change slurm so that nodes come up properly after a reboot instead of "unexpectedly rebooted"
    • SLURM runs two jobs that each ask for all the memory (#SBATCH --mem=0) on the same node.  That seems wrong.  May need to look info partition oversubscription settings.
  • No labels