Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Current Questions


See retired nodes

2024-04-15 krowe: Say I set a few nodes to offline with a command like condor_off -startd -peaceful -name nmpost120  How can I later check to see which nodes are offline?

  • condor_status -offline returns nothing
  • condor_status -long nmpost120 returns nothing about being offline
  • The following shows nodes where startd has actually stopped but it doesn't show nodes that are set offline but still running jobs (e.g. Retiring)
    • condor_status -master -constraint 'STARTD_StartTime == 0'
  • This shows nodes that are set offline but still running jobs (a.k.a. Retiring)
    • condor_status |grep Retiring

ANSWER: 2022-06-27

condor_status -const 'Activity == "Retiring"'

offline ads, which is a way for HTCondor to update the status of a node after the startd has exited.

condor_drain -peaceful # CHTC is working on this.  I think this might be the best solution.


K8s

2024-04-15 krowe: There is a lot of talk around NRAO about k8s these days. Can you explain if/how HTCondor works with k8s?  I'm not suggesting we run HTCondor on top of k8s but I would like to know the options.

...