Numbers are in hours


Large data set VLASS1.2.sb36491855.eb36574404.58585.53016267361_datacolumn.ms with full parameters

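| Step | NRAO (steps-all-parallel9) | NRAO/CHTC (steps-all-parallel10) | NRAO/AWS (steps-all-parallel16) |
|------|------|------|------|
| 01 | 9.4 | 9.2 | 12.3 |
| 05 | 60.2 | killed at 72 hours | 65.9 |
| 06 | 24 | | 24.4 |
| 07 | 11.8 | | 14.4 |
| 15 | 55.2 | | |
| 16 | 6.1 | | |
| 23 | 230.8 | | |
| 24 | 46 | | |
| Total | 443.5 | | |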

Small data set test.ms with full parameters

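| Step | NRAO (steps-all-parallel12) | NRAO/CHTC (steps-all-parallel15) | NRAO/AWS (steps-all-parallel14) |
|------|------|------|------|
| 01 | 1.8 | 2.0 | 1.9 |
| 05 | 8.6 | 56.8 | 5.1 |
| 06 | 3.0 | 3.9 | 2.0 |
| 07 | 2.0 | 2.3 | 2.2 |
| 15 | 6.9 | 56.3 | 4.3 |
| 16 | 1.4 | 1.7 | 1.4 |
| 23 | 8.3 | 47.8 | 5.3 |
| 24 | 14.1 | | 16.8 |
| Total | 46.1 | | 39.0 |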
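To see which steps account for the CHTC slowdown, the small data set numbers can be tabulated and compared against the NRAO column. The following is a minimal sketch, not part of the original benchmark scripts: the values are read from the small data set table assuming one decimal place per cell (the NRAO and AWS totals add up under that reading), and the names `SMALL`, `column_total`, and `slowdown` are hypothetical.

```python
# Hours per step for the small data set (test.ms), transcribed from the table.
# Tuple order: NRAO, NRAO/CHTC, NRAO/AWS. None marks a cell with no reported value.
SMALL = {
    "01": (1.8, 2.0, 1.9),
    "05": (8.6, 56.8, 5.1),
    "06": (3.0, 3.9, 2.0),
    "07": (2.0, 2.3, 2.2),
    "15": (6.9, 56.3, 4.3),
    "16": (1.4, 1.7, 1.4),
    "23": (8.3, 47.8, 5.3),
    "24": (14.1, None, 16.8),
}


def column_total(table, col):
    """Sum one column, or return None if any step has no reported value."""
    values = [row[col] for row in table.values()]
    if any(v is None for v in values):
        return None
    return round(sum(values), 1)


def slowdown(table, col, baseline=0):
    """Per-step ratio of column `col` to the baseline column (NRAO by default)."""
    return {
        step: round(row[col] / row[baseline], 1)
        for step, row in table.items()
        if row[col] is not None
    }


if __name__ == "__main__":
    print("NRAO total:", column_total(SMALL, 0))
    print("AWS total:", column_total(SMALL, 2))
    for step, ratio in slowdown(SMALL, 1).items():
        print(f"step {step}: CHTC/NRAO = {ratio}x")
```

Under this reading, steps 05, 15, and 23 run roughly six to eight times slower at CHTC than at NRAO, while steps 01, 06, 07, and 16 stay within about 30% of the NRAO times.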


CPUs at CHTC are noticeably slower than CPUs at NRAO. For example, each of their c20xx machines (e20{03..18}) has two Intel Xeon Silver 4114 2.20GHz processors and 0.5TB to 1TB of memory, while each of their large-memory machines (mem3, mem2001, mem2002) has four Intel Xeon E7-4820 v4 2.00GHz processors and 2TB to 4TB of memory. Possible reasons for this slowdown:

  • cfcache on cephfs
  • Slower CPUs
  • Multiple users
  • Hyperthreading

I ran a small data set test with full parameters at CHTC that copied cfcache from /staging to local disk; step 05 then took only 16.7 hours, down from 56.8 hours in the steps-all-parallel15 run.
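The copy-to-local-disk approach could look something like the sketch below. This is an illustration under assumptions, not the script used for the test above: the function name `stage_cfcache`, the `scratch_root` argument, and any concrete paths are hypothetical, and the job would still need to point the imaging step at the returned local path.

```python
import shutil
import tempfile
from pathlib import Path


def stage_cfcache(shared_cfcache, scratch_root=None):
    """Copy a cfcache directory to node-local disk and return the local path.

    shared_cfcache: path to the cfcache on shared storage (e.g. under /staging).
    scratch_root:   local directory to copy into; defaults to the system temp dir.
    """
    src = Path(shared_cfcache)
    if not src.is_dir():
        raise FileNotFoundError(f"cfcache not found: {src}")

    # Create a fresh, uniquely named directory on local disk so that
    # concurrent jobs on the same node do not overwrite each other's copy.
    local_root = Path(tempfile.mkdtemp(prefix="cfcache_", dir=scratch_root))
    dest = local_root / src.name

    # copytree copies the whole directory tree; dest must not exist yet.
    shutil.copytree(src, dest)
    return dest
```

A job wrapper would call `stage_cfcache(...)` once before step 05 and remove the local copy when the job finishes. The copy itself takes time, so it only pays off when the steps that read cfcache are dominated by shared-filesystem access.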


