Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • How can I find out what hosts are available for given requirements (LongJobs, memory, staging)
    • condor_status -compact -constraint "HasChtcStaging==true" -constraint 'DetectedMemory>500000' -constraint "CanRunLongJobs isnt Undefined"
  • It looks to me like most hosts at CHTC are setup to run LongJobs.  The following shows a small list of about 20 hosts.  Is the correct?
    • condor_status -compact -constraint "CanRunLongJobs is Undefined"
  • How can I know if my job swapped?
    • Actually, it looks like the e2000 nodes don't have swap so this may not be an issue.
  • Is there a ganglia server or some other monitor service at CHTC we can view?
  • Are there bugs in the condor.log output of a DAG node?  For example, I have a condor.log file that clearly shows the job taking about three hours to run yet at the bottom lists user time of 13 hours and system time of 1 hour.  https://open-confluence.nrao.edu/download/attachments/40541486/step07.py.condor.log?api=v2
  • Condor Annex processing in AWS.  Is there support for spot market
  • What network should we use to do ssh from CHTC?  Is there a class B or several class Cs?

...