
Page for tracking an apparent slowdown between CASA-5 and CASA-6 for VLASS calibration: https://open-jira.nrao.edu/browse/PIPE-568

Comparing CASA-5 and CASA-6 (casa-pipeline-validation-8) across the two different CPUs available for batch processing in NM and CV shows that the newer CPUs (E5-2640v3) run a small calibration job (6.7GB) about 1.25 times faster than the old CPUs (E5-2670), and that CASA-6 is slower than CASA-5 in every case.  There was no significant run-time difference between NM and CV for similar hardware and software.  Results are in minutes.

Here is the full pipeline script I have used for all of these tests: casa_pipescript.py. For some tests, I commented out all but hifv_importdata.

Full, serial pipeline with small dataset

RHEL7 - 6.7GB dataset with NM Lustre-2.5.5 (results are in minutes)

CASA | nmpost051 (E5-2640v3) | cvpost020 (E5-2640v3) | nmpost038 (E5-2670) | cvpost003 (E5-2670)
5 | 114, 117 | 110, 111 | 144, 143 | 140, 141
6 | 156*, 164* | 156*, 158* | 200*, 201* | 197*, 199*
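The ratios quoted in the summary can be checked directly from this table. The following is a minimal sketch, assuming each cell holds two independent run times and that a plain mean over hosts and runs is an acceptable summary; the names `runs` and `mean_minutes` are illustrative only and not part of any pipeline tooling.

```python
from statistics import mean

# Run times in minutes, copied from the table above (two runs per cell).
# Keys are (CASA major version, host, CPU model).
runs = {
    ("5", "nmpost051", "E5-2640v3"): [114, 117],
    ("5", "cvpost020", "E5-2640v3"): [110, 111],
    ("5", "nmpost038", "E5-2670"): [144, 143],
    ("5", "cvpost003", "E5-2670"): [140, 141],
    ("6", "nmpost051", "E5-2640v3"): [156, 164],
    ("6", "cvpost020", "E5-2640v3"): [156, 158],
    ("6", "nmpost038", "E5-2670"): [200, 201],
    ("6", "cvpost003", "E5-2670"): [197, 199],
}


def mean_minutes(casa, cpu):
    """Mean run time over all hosts and runs for one CASA version and CPU model."""
    return mean(
        t
        for (c, _host, p), times in runs.items()
        if c == casa and p == cpu
        for t in times
    )


# How much faster the newer CPU is, per CASA version (old time / new time).
cpu_speedup = {
    casa: mean_minutes(casa, "E5-2670") / mean_minutes(casa, "E5-2640v3")
    for casa in ("5", "6")
}

# How much slower CASA-6 is than CASA-5, per CPU model.
casa6_slowdown = {
    cpu: mean_minutes("6", cpu) / mean_minutes("5", cpu)
    for cpu in ("E5-2640v3", "E5-2670")
}

for casa, ratio in cpu_speedup.items():
    print(f"CASA-{casa}: E5-2640v3 is {ratio:.2f}x faster than E5-2670")
for cpu, ratio in casa6_slowdown.items():
    print(f"{cpu}: CASA-6 takes {ratio:.2f}x as long as CASA-5")
```

With these numbers the CPU speedup comes out near 1.26 for both CASA versions, and CASA-6 takes roughly 1.4 times as long as CASA-5 on either CPU.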


RHEL7 - 6.7GB dataset after NM upgrade Lustre-2.10.8 and CV results copied from last test (results are in minutes)

CASA | NM (E5-2640v3) | CV (E5-2640v3) | NM (E5-2670) | CV (E5-2670)
5 | 113, 110 | 110, 111 | 142, 141 | 140, 141
6 | 155* | 156*, 158* | 198* | 197*, 199*

Mar. 3, 2020 krowe: I tried nmpost051-casa6-rhel7 with the latest casa-pipeline-validation-17.  The run time was the same, as were the tclean() errors.

"*" Means it completed with tclean() errors


Full, new, serial pipeline with small dataset

Mar. 17, 2020 I started using the same pipeline script that Brian is currently using.

RHEL7 - 6.7GB dataset with NM Lustre-2.10.x (results are in minutes).  I tested a CASA-6 job with and without cf.validate_parameters = False, and both jobs took the same amount of time, +/- 1 minute.

"*" Means "SEVERE pipeline.hifv.tasks.flagging No flag summary statistics"


Full, new, serial pipeline with large dataset

Mar. 17, 2020 I started using the same pipeline script that Brian is currently using.

RHEL7 - 350GB dataset with NM Lustre-2.10.x, CASA-pipeline-5.6.3-9 or CASA 6.0.0.23a100.dev17 (results are in minutes)

CASA | NM (E5-2640v3) | CV (E5-2640v3) | NM (E5-2670) | CV (E5-2670)
5 | 3,350*^ | 3,362*^ | 4,605*^ | 4,480*^
6 | 4,016* | 3,943* | 5,671* | 5,253*

"*" Means "SEVERE pipeline.hifv.tasks.flagging No flag summary statistics"

"^" Means "SEVERE setjy No rows were selected"


Full, new, serial pipeline with large dataset and profiling metrics

Mar. 17, 2020 I started using the same pipeline script that Brian is currently using.

RHEL7 - 350GB dataset with NM Lustre-2.10.x, CASA-pipeline-5.6.3-9 or CASA 6.0.0.23a100.dev17 (results are in minutes)

"*" Means "SEVERE pipeline.hifv.tasks.flagging No flag summary statistics"

"^" Means "SEVERE setjy No rows were selected"


Full, new, serial pipeline with large dataset and times per pipeline task

Comparing two profiling jobs against one of Brian's jobs on the same hardware (E5-2670) in NM.  Times were calculated from the CASA logs.  Times are in minutes.
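One way such per-task times could be extracted from a CASA log is sketched below. It assumes each log line starts with a `YYYY-MM-DD HH:MM:SS` timestamp and that tasks are bracketed by lines containing `Begin Task: <name>` and `End Task: <name>`; the sample lines, the function name `task_minutes`, and the `prefix` filter are hypothetical and should be checked against a real log before use.

```python
import re
from datetime import datetime

# Assumed line shape: "<YYYY-MM-DD HH:MM:SS> ... Begin Task: <name>"
# and "<YYYY-MM-DD HH:MM:SS> ... End Task: <name>".
MARKER = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}).*?\b(Begin|End) Task: (\w+)"
)


def task_minutes(lines, prefix="hifv_"):
    """Return [(task, minutes), ...] in completion order.

    Only tasks whose name starts with `prefix` are kept, so CASA tasks
    called from inside a pipeline task are ignored. Repeated tasks such
    as hifv_checkflag produce one entry per invocation.
    """
    started = {}
    durations = []
    for line in lines:
        m = MARKER.match(line)
        if not m or not m.group(3).startswith(prefix):
            continue
        stamp = datetime.strptime(m.group(1), "%Y-%m-%d %H:%M:%S")
        kind, task = m.group(2), m.group(3)
        if kind == "Begin":
            started[task] = stamp
        elif task in started:
            elapsed = stamp - started.pop(task)
            durations.append((task, elapsed.total_seconds() / 60))
    return durations


# Hypothetical log excerpt, for illustration only.
sample = [
    "2020-03-17 10:00:00\tINFO\thifv_importdata::::casa\tBegin Task: hifv_importdata",
    "2020-03-17 10:05:00\tINFO\timportasdm::::casa\tBegin Task: importasdm",
    "2020-03-17 13:00:00\tINFO\timportasdm::::casa\tEnd Task: importasdm",
    "2020-03-17 14:07:00\tINFO\thifv_importdata::::casa\tEnd Task: hifv_importdata",
]

durations = task_minutes(sample)
for task, minutes in durations:
    print(f"{task}: {minutes:.0f} min")
```

Returning a list rather than a dictionary keeps the separate hifv_checkflag invocations apart, which matches how they are reported per row on this page.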

Large dataset (350GB) times are in minutes


Task | kent2-pr-c5-l-70 | kent2-pr-c6-l-70 | CASA-6 Bkent
hifv_importdata | 247 | 425 | 392
hifv_hanning | 175 | 188 | 460
hifv_flagdata | 272 | 323 | 452
hifv_vlasetjy | 75 | 199 | 357
hifv_priorcals | 254 | 281 | 494
hifv_testBPdcals | 74 | 84 | 123
hifv_flagbaddef | 0 | 1 | 0
hifv_checkflag | 68 | 70 | 69
hifv_semiFinalBPdcals | 75 | 153 | 154
hifv_checkflag | 189 | 254 | 253
hifv_solint | 66 | 89 | 105
hifv_fluxboot2 | 104 | 181 | 175
hifv_finalcals | 162 | 182 | 177
hifv_circfeedpolcal | 31 | 33 | 32
hifv_flagcal | 0 | 1 | 0
hifv_applycals | 205 | 212 | 437
hifv_checkflag | 1741 | 1840 | 2930
hifv_statwt | 645 | 710 | 500
hifv_plotsummary | 101 | 346 | 350
TOTAL (minutes) | 4484 | 5573 | 7460
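To see which tasks account for most of the CASA-5 to CASA-6 difference between the two profiling jobs, the rows can be ranked by added minutes. This is a minimal sketch using only the kent2-pr-c5-l-70 and kent2-pr-c6-l-70 columns above; it ranks by absolute difference rather than ratio because two tasks took 0 minutes under CASA-5. The variable names are illustrative.

```python
# (task, CASA-5 minutes, CASA-6 minutes), copied in table order from the
# kent2-pr-c5-l-70 and kent2-pr-c6-l-70 columns. hifv_checkflag runs three times.
rows = [
    ("hifv_importdata", 247, 425),
    ("hifv_hanning", 175, 188),
    ("hifv_flagdata", 272, 323),
    ("hifv_vlasetjy", 75, 199),
    ("hifv_priorcals", 254, 281),
    ("hifv_testBPdcals", 74, 84),
    ("hifv_flagbaddef", 0, 1),
    ("hifv_checkflag", 68, 70),
    ("hifv_semiFinalBPdcals", 75, 153),
    ("hifv_checkflag", 189, 254),
    ("hifv_solint", 66, 89),
    ("hifv_fluxboot2", 104, 181),
    ("hifv_finalcals", 162, 182),
    ("hifv_circfeedpolcal", 31, 33),
    ("hifv_flagcal", 0, 1),
    ("hifv_applycals", 205, 212),
    ("hifv_checkflag", 1741, 1840),
    ("hifv_statwt", 645, 710),
    ("hifv_plotsummary", 101, 346),
]

total5 = sum(row[1] for row in rows)
total6 = sum(row[2] for row in rows)
extra_total = total6 - total5

# Rank tasks by the minutes CASA-6 adds over CASA-5, largest first.
added = sorted(((c6 - c5, task) for task, c5, c6 in rows), reverse=True)

print(f"CASA-5 total: {total5} min, CASA-6 total: {total6} min")
for extra, task in added[:5]:
    print(f"{task}: +{extra} min ({100 * extra / extra_total:.0f}% of the difference)")
```

The column sums come to 4484 and 5572 minutes; the 5573 in the TOTAL row presumably reflects rounding of the individual task times. On these numbers hifv_plotsummary, hifv_importdata, and hifv_vlasetjy together account for about half of the added time.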

