...
- How can I find out what hosts are available for given requirements (LongJobs, memory, staging)
- condor_status -compact -constraint "HasChtcStaging==true" -constraint 'DetectedMemory>500000' -constraint "CanRunLongJobs isnt Undefined"
- It looks to me like most hosts at CHTC are setup to run LongJobs. The following shows a small list of about 20 hosts. Is the correct?
- condor_status -compact -constraint "CanRunLongJobs is Undefined"
- How can I know if my job swapped?
- ANSWER: CHTC nodes have no or minimal swap space.
- Is there a ganglia server or some other monitor service at CHTC we can view?
We have a bunch of ganglia and grafana graphs for the system, but I think they are restricted to campus folks and tend to show system-wide utliization and problem -- are there particular nationalizations you'd like to see?
- Are there bugs in the condor.log output of a DAG node? For example, I have a condor.log file that clearly shows the job taking about three hours to run yet at the bottom lists user time of 13 hours and system time of 1 hour. https://open-confluence.nrao.edu/download/attachments/40541486/step07.py.condor.log?api=v2
And as for the cpu usage report, there could very well be a bug, but first, is your job multi-threaded or multi-process? If so, the cpu usage will be the aggregate across all cpu cores.
- Yes they are all parallel jobs to some extent so I accept your answer for that job. But I have another job that took 21 hours of wallclock time and yet the condor.log shows 55 minutes of user and 5:34 hours of system time
- Condor Annex processing in AWS. Is there support for spot market
- ANSWER: Condor Annex does indeed support the spot market. It is a bit more work to set up because you don't say "give my X of Y", but "I'll pay d1 dollars for machines like X1 and d2 for machines like X2, etc.".
- ANSWER: Condor Annex does indeed support the spot market. It is a bit more work to set up because you don't say "give my X of Y", but "I'll pay d1 dollars for machines like X1 and d2 for machines like X2, etc.".
- What network mask should we use to allow ssh from CHTC into NRAO? Is it a class B or several class Cs?
- ANSWER: The ip (v4 !) ranges for CHTC execute nodes are
128.104.100.0/22
128.104.55.0/24
128.104.58.0/23
128.105.244.0/23
- ANSWER: The ip (v4 !) ranges for CHTC execute nodes are
...