Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • How can you tell which job is assossiated associated with an email given the email message doesn't have include a working dir or the assigned batch_name?
  • Is there a config option that will cause condor to not start?  We have diskless nodes and it is easier to modify the config file then change systemd.
  • Bug where James's jobs are all put on the same core.  Here is top -u krowe showing the Last Used Cpu (SMP) after I submitted five sleep jobs to the same host.
    • Is this just a side effect of condor using cpuacct instead of cpuset in cgroup?
    • Is this a failure of the Linux kernel to schedule things on separate cores?
    • Is this because cpu.shares is set to 100 instead of 1024?

...

  • Bug in condor_annex: The following will wait for an annex named krowe - annex - casa5 (note the spaces).  If I pass $(myannex) as an argument to a shell script, the spaces are not there.  Underscores instead of hyphens cause different problems.
    • include.htc
      • myannex = krowe-annex-casa5
    • submit.htc
      • include : include.htc
      • executable = /bin/sleep
      • arguments = 127
      • +MayUseAWS = True
      • requirements = AnnexName == $(myannex)
      • queue
  • Actually, I think these aren't bug exactly but limitations on using macros.  The AnnexName needs to be quoted but how can I quote a macro?
    • No: requirements = AnnexName == "$(myannex)"
    • No: myannex = "krowetest2"
    • No: myannex = \"krowetest2\"
    • No:myannex = "\"krowetest2\""


  • Bug in condor_annex: Underscores in the AnnexName prevent the annex from moving into the pool.
    • Also when I try to terminate an annex with underscores (e.g. krowe_annex_casa5) with the command condor_off -annex krowe_annex_casa5 I get the following error
      • Found no ClassAds when querying pool (local)
      • Can't find addresses for master's for constraint 'AnnexName =?= "krowe_annex_casa5"'
      • Perhaps you need to query another pool.

  • Torque has this command called pbsnodes that can not only offline/drain a node but keeps a note about it that all can see in one place.  I know I can use condor_off to drain a node but is there a central place keep notes so I can remember a month later why I set a certain node to drain?Can we get an increase in quota for /software/nu_jrobnett.  Quota appears to be 4GB which is not enough for 2 version of our software package (it's close).CHTC will increase our quota


  • How can I set a variable in a DAG file that I can then use in the submit file in a conditional?  None of the following seem to work
    • DAG:
      • VARS step01 CHTC=""

      • VARS step05 CHTC="True"
    • Submit:
      • if defined $(CHTC)
        • requirements = PoolName == "CHTC"
      • endif
    • or
    • DAG:
      • #VARS step01 CHTC="True"
      • VARS step05 CHTC="True"
    • Submit:
      • if defined $(CHTC)
        • requirements = PoolName == "CHTC"
      • endif
    • or
    • DAG:
      • VARS step01 CHTC="False"
      • VARS step05 CHTC="True"
    • Submit:
      • chtc_var = $(CHTC)
      • if $(chtc_var)
        • requirements = PoolName == "CHTC"
      • endif
    • even though when I pass $(chtc_var) as arguments to the shell script, the shell script sees it as True.
    • or
    • DAG:
      • VARS node1 file="chtc.htc"

      • VARS node2 file="aws.htc"
    • Submit:
      • include : $(file)

...