CASA 6 cube imaging refactor

This page is for tracking the testing of the CASA 6 cube refactor documented here:

https://docs.google.com/document/d/1Jv-t_4k5Vv1Rgh_eYeU4j84UWrrnO3d6INLBFtuMD6s/edit?usp=sharing

There are three primary goals to our tests:

Test the behavior of Tier0 parallelization of calibrator imaging in the calibration pipelline (provides CASA6 based calibrates MSes as a side effect for imaging run)
Demonstrate that the refactored code has the desired memory footprint effect. We'll start with the referenced data set and then expand to larger data sets.
Demonstrate the runtime cost of the refactored code and whether it's a fixed overhead so it's contribution goes to zero for larger data sets or whether the overhead scales with image complexity

Phase 1, calibrator imaging tests run vs hifacal.py

ALMA dataset (project)	casa-CAS-9386-51 (CASA 6.1.0.54a9386.dev51) Pipeline master-v0.1-145-ge322387-dirty (hifacal.py)
2017.1.00717.S	complete
2017.1.01214.S	testing
2017.1.00884.S	testing
E2E6.1.00080.S	testing
2017.1.00983.S	testing

For all tests below Record tclean parameters and telemetry data for each of the 3 tclean calls.

Run each standard ALMA imaging pipeline generated data set through the following 3 casa revs. All tests run with 8 way parallelization and 128GB memory limit. All tests run within AWS.

ALMA dataset (project)	casa-pipeline-release-5.6.1-8.el7 pipeline rev. 42866 hifatargets.py	Casa6 version and pipe rev TBD hifatargets.py	casa-CAS-9386-51 (CASA 6.1.0.54a9386.dev51) Pipeline master-v0.1-145-ge322387-dirty hifatargets.py
2017.1.00717.S	complete (local run)	not started	testing
2017.1.01214.S	complete (local run)	not started	testing
2017.1.00884.S	complete (local run)	not started	testing
E2E6.1.00080.S	complete (local run)	not started	not started
2017.1.00983.S	complete (local run)	not started	testing

The following tests vary memory environment for each data set, all tests using casa-CAS-9386-51 (CASA 6.1.0.54a9386.dev51) refactor code. All tests run within AWS

ALMA dataset (project)	128 GB memory 8 way parallelization hifatargets.py	256 GB memory 8 way parallelization hifatargets.py	512 GB memory 8 way parallelization hifatargets.py
2017.1.00717.S	testing	testing	testing
2017.1.01214.S	testing	testing	testing
2017.1.00884.S	testing	testing	testing
E2E6.1.00080.S	not started	not started	not started
2017.1.00983.S	testing	testing	testing

As a control the following two data sets will be run on NRAO clusters as a check against AWS runs. Of the 5 test data sets 2017.1.0084.S has the highest memory footprint, 2017.1.00983.S is the longest running.

ALMA dataset (project)

128 GB memory 8 way parallelization

256 GB memory 8 way parallelization

512 GB memory 8 way parallelization

2017.1.00884.S

testing

not started

2017.1.00983.S

not started

testing

Space shortcuts

Page tree