You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 17 Next »

This page is for tracking the testing of the CASA 6 cube refactor documented here:

https://docs.google.com/document/d/1Jv-t_4k5Vv1Rgh_eYeU4j84UWrrnO3d6INLBFtuMD6s/edit?usp=sharing


There are three primary goals to our tests:

  1. Test the behavior of Tier0 parallelization of calibrator imaging in the calibration pipelline (provides CASA6 based calibrates MSes as a side effect for imaging run) 
  2. Demonstrate that the refactored code has the desired memory footprint effect.  We'll start with the referenced data set and then expand to larger data sets.  
  3. Demonstrate the runtime cost of the refactored code and whether it's a fixed overhead so it's contribution goes to zero for larger data sets or whether the overhead scales with image complexity


Phase 1, calibrator imaging tests run vs hifacal.py

ALMA dataset (project)

casa-CAS-9386-51 (CASA 6.1.0.54a9386.dev51)

Pipeline master-v0.1-145-ge322387-dirty (hifacal.py)

2017.1.00717.Scomplete
2017.1.01214.Stesting
2017.1.00884.Stesting
E2E6.1.00080.Stesting
2017.1.00983.Stesting

For all tests below Record tclean parameters and telemetry data for each of the 3 tclean calls.

Run each standard ALMA imaging pipeline generated data set through the following 3 casa revs.  All tests run with 8 way parallelization and 128GB memory limit.  All tests run within AWS.

ALMA dataset (project)

casa-pipeline-release-5.6.1-8.el7

pipeline rev. 42866 hifatargets.py

Casa6 version and pipe rev TBD

hifatargets.py

casa-CAS-9386-51 (CASA 6.1.0.54a9386.dev51)

Pipeline master-v0.1-145-ge322387-dirty hifatargets.py

2017.1.00717.Scomplete (local run)not startedtesting
2017.1.01214.Scomplete (local run)
not started
testing
2017.1.00884.Scomplete (local run)
not startedtesting
E2E6.1.00080.Scomplete (local run)
not startednot started
2017.1.00983.Scomplete (local run)
not startedtesting


The following tests vary memory environment for each data set, all tests using casa-CAS-9386-51 (CASA 6.1.0.54a9386.dev51) refactor code. All tests run within AWS

ALMA dataset (project)

128 GB memory 8 way parallelization

hifatargets.py

256 GB memory 8 way parallelization

hifatargets.py

512 GB memory 8 way parallelization

hifatargets.py

2017.1.00717.Stestingtestingtesting
2017.1.01214.Stestingtesting

testing

2017.1.00884.Stestingtestingtesting
E2E6.1.00080.Snot startednot startednot started
2017.1.00983.Stestingtestingtesting

As a control the following two data sets will be run on NRAO clusters as a check against AWS runs.  Of the 5 test data sets 2017.1.0084.S has the highest memory footprint, 2017.1.00983.S is the longest running.

ALMA dataset (project)

128 GB memory 8 way parallelization

256 GB memory 8 way parallelization

512 GB memory 8 way parallelization


2017.1.00884.Stestingnot startednot started
2017.1.00983.Snot startednot startedtesting
  • No labels