The DSOC cluster will be offline October 28 and 29 for Lustre improvements and to switch the cluster from RHEL6 to RHEL7. This document enumerates the steps we will take beforehand to prepare for the outage and the steps we will take afterwards to confirm that things are working properly and to restore access. Note that these upgrades won't touch the control systems for the AAT/PPI or VLASS, but they will touch the environment in which the workflows for both execute.

1 Week Before the Shutdown (October 21)

Morning of the Shutdown (October 28)

Morning After the Shutdown (October 30)

  • User-facing things that need to be tested on the AAT/PPI production system (https://archive-new.nrao.edu):
    • Downloads of VLA EBs: SDM-only, SDM, basic MS, CMS - ops tests on SRDP-348, SRDP-356
    • Downloads of VLA calibrations - X - ML: failed to download my own proprietary data (but did not try before the upgrade) - JT: I have not had trouble yet.
    • AUDI imaging (JT-309617329)
    • ALMA restored MS download (JT-309621798)
    • ASDM download (JT-309629153)
    • ALMA basic MS download (JT-309625789)
    • Proprietary periods respected for VLA and ALMA
  • User-facing things that need to be tested on the legacy archive production system (https://archive.nrao.edu):
    • Downloads of VLA EBs: basic MS
  • Stakeholders (Mark Lacy, Drew Medlin) test critical operations-facing functions of the AAT/PPI under RHEL7
  • Operations-facing things that need to be tested:
    • EB ingestion
    • CIPL being triggered
    • CIPL working
    • calibration ingestion (QAPass)
  • GO/NO-GO decision, 3pm MDT October 30th (Drew Medlin, John Tobin, Mark Lacy, Stephan Witz):
    • If GO: Stephan Witz to work with CIS to undo the DNS change and MOTD banners, and re-enable CIPL triggers
    • If NO-GO: SSA to iterate with stakeholders until the result is GO. Note that this may mean putting out a bugfix release of AAT/PPI 3.6 and doing the same for VLASS
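
The checklist above feeds a single GO/NO-GO call, so it may help to tally results in one place. The following is a minimal sketch only, not an existing tool: the `Check` class and `decide` function are hypothetical names, and the rule that any failed or untested item blocks a GO is an assumption, since this page does not state the decision rule.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Check:
    """One checklist item (hypothetical structure, for illustration only)."""
    area: str                      # e.g. "AAT/PPI", "legacy archive", "operations"
    name: str                      # the item as written in the checklist
    passed: Optional[bool] = None  # None means not yet tested


def decide(checks: List[Check]) -> Tuple[str, List[Check]]:
    """Return the decision and the items blocking a GO.

    Assumption: a failed OR untested item blocks a GO.
    """
    blocking = [c for c in checks if c.passed is not True]
    return ("GO" if not blocking else "NO-GO"), blocking


# Example entries; the pass/fail values here are placeholders, not real results.
checks = [
    Check("AAT/PPI", "Downloads of VLA EBs", True),
    Check("AAT/PPI", "Downloads of VLA calibrations", False),
    Check("operations", "EB ingestion"),
]

decision, blocking = decide(checks)
print(decision)
for c in blocking:
    status = "FAILED" if c.passed is False else "UNTESTED"
    print(f"  {status}: [{c.area}] {c.name}")
```

Listing untested items separately from failed ones keeps the 3pm discussion focused on what still needs a tester versus what needs a fix.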

Stakeholder tests

VLASS

    • Run QL calibration job (test epoch)
    • Run QL imaging job (as part of reprocessing)
    • A&A / R&A QL imaging job (as part of reprocessing)
    • Create scheduling block and products (test epoch)

