...

  • Configure DHCP on zia to boot all nmpost nodes to RHEL-7.8.1.5
  • Switch remaining nmpost nodes from Torque/Moab to Slurm on nmpost-serv-1
    • Change to the snapshot directory
      • cd /opt/services/diskless_boot/RHEL-7.8.1.5/nmpost/snapshot
    • Enable slurmd to start on boot
      • for x in nmpost{001..090}* ; do echo 'SLURMD_OPTIONS="--conf-server nmpost-serv-1"' > ${x}/etc/sysconfig/slurmd ; done
      • for x in nmpost{001..090}* ; do  echo '/etc/sysconfig/slurmd' >> ${x}/files ; done
    • Disable Torque from starting on boot
      • rm -f nmpost*/etc/sysconfig/pbs_mom
      • for x in nmpost* ; do sed -i '/^\/etc\/sysconfig\/pbs_mom/d' ${x}/files ; done
      • for x in nmpost* ; do sed -i '/^\/etc\/ssh\/shosts.equiv/d' ${x}/files ; done
      • for x in nmpost* ; do sed -i '/^\/etc\/ssh\/ssh_known_hosts/d' ${x}/files ; done
    • Reboot each node
  • Replace the Torque nodescheduler, nodeextendjob, and nodesfree with their Slurm versions on zia
    • cd /home/local/Linux/rhel7/x86_64/stow
    • #edit cluster/share/cluster/*.sh and change 'nodescheduler-slurm' to 'nodescheduler' in the slurm email functions
    • stow -D cluster
    • (cd cluster/bin ; rm -f nodescheduler ; ln -s nodescheduler-slurm nodescheduler)
    • (cd cluster/bin ; rm -f nodescheduler-test ; ln -s nodescheduler-test-slurm nodescheduler-test)
    • (cd cluster/bin ; rm -f nodeextendjob ; ln -s nodeextendjob-slurm nodeextendjob)
    • (cd cluster/bin ; rm -f nodesfree ; ln -s nodesfree-slurm nodesfree)
    • stow cluster
  • Uncomment nmpost lines in nmpost-serv-1:/etc/slurm/slurm.conf
    • On nmpost-serv-1 restart with systemctl restart slurmctld
    • On nmpost-master restart with systemctl restart slurmd
  • Remove the bold note about Slurm in the docs on info.nrao.edu
  • Remove pam_pbssimpleauth.so from files in /etc/pam.d in the OS image
  • Remove /usr/lib64/security/pam_pbssimpleauth.* from the OS image
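
Before rebooting the nodes, it may be worth confirming that the snapshot edits above took effect. The following is a minimal sketch, not part of the original procedure: check_node is a hypothetical helper, and it assumes each node directory has the layout implied by the steps above (an etc/sysconfig directory and a files list).

```shell
#!/usr/bin/env bash
# Sketch only: check_node is a hypothetical helper, not an existing tool.
# It prints one line per problem found in a node's snapshot directory
# and returns non-zero if anything looks wrong.
check_node() {
    local dir=$1 rc=0

    # slurmd options file should exist and point at the config server
    if ! grep -qs -- '--conf-server nmpost-serv-1' "${dir}/etc/sysconfig/slurmd"; then
        echo "${dir}: missing or wrong etc/sysconfig/slurmd"
        rc=1
    fi

    # the files list should include the slurmd options file
    if ! grep -qsx '/etc/sysconfig/slurmd' "${dir}/files"; then
        echo "${dir}: /etc/sysconfig/slurmd not listed in files"
        rc=1
    fi

    # Torque leftovers: the pbs_mom options file and any reference to it
    if [ -e "${dir}/etc/sysconfig/pbs_mom" ]; then
        echo "${dir}: etc/sysconfig/pbs_mom still present"
        rc=1
    fi
    if grep -qs 'pbs_mom' "${dir}/files"; then
        echo "${dir}: files still references pbs_mom"
        rc=1
    fi

    return $rc
}
```

Run from the snapshot directory, for example: for x in nmpost{001..090}* ; do check_node "$x" ; done. No output means every checked node passed.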

...