Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Interactive: The ability to assign all or part of a node to a user with shell level access (nodescheduler, qsub -I, etc),  minimal granularity is per NUMA node, finer would be useful.

    • What is it that we like about nodescheduler over something like qsub -I or srun --pty bash or condor_submit -i
      • It's not tied to any tty so a user can login multiple times from multiple places to their reserved node without requiring screen or tmux or VNC vnc.  It also means that users aren't all going through nmpost-master.
      • Its creation is asynchronous.  If the cluster is full you don't wait around for your reservation to start, you get an email message when it is ready.
      • It's time limited (e.g. two weeks).  We might be able to do the same with a queue/partition setting but could we then extend that reservation?
      • We get to define the shape of a reservation (whole node, NUMA node, etc).  If we just let people use qsub -I they could reserve all sorts of sizes which may be less efficient.  Then again it may be more efficient.  But either way it is simpler for our users.
    • Slurm
      • I don't see how Slurm can reserve NUMA nodes so we will have to just reserve X tasks with Y memory.
      • I don't know how to keep Slurm from giving a user multiple portions of the same host.  With Moab I used naccesspolicy=uniqueuser  This prevents the ambiguity of which ssh connection goes to which cgroup.
    • HTCondor
      • Can HTCondor even do this?

...