The data taken by the VLA and VLBA have various issues at times that need to be communicated to potential users of those data. These issues need to be documented in the metadata for those datasets such that the information is preserve for future utilization of those data. Recently the data issues and annotations from the legacy archive were migrated to the new NRAO archive and displayed with those data. However, there are still some obvious shortcomings in that DAs cannot annotated data themselves, they must make use of Jira tickets that take the time of the SSA developers to attach issue to metadata. Also, the legacy archive has a mechanism to automatically add annotations for some common issues, like missing BDFs. The new archive does not detect and annotate these issues.


We request the following:

  1. An annotation tool that can be used by the DAs to annotate data issues to individual VLA, VLBA, GBT, and ALMA datasets/correlations in the new archive. 
    1. This tool should allow selection of multiple datasets for annotation of identical issues through various mechanisms.
      1. by a project code (for projects that have multiple datasets/correlations)
      2. by date range
      3. list of FSIDs
      4. by configuration
      5. by band
    2. The need for GBT and ALMA is to be able to alert users to uncorrected data issues from those observatories.
      1. e.g., ALMA renormalization issue, VLA restore bug
    3. The capability to remove data issues is also needed for when calibrations/data get fixed in the archive
      1. e.g., ALMA renormalization issue, VLA restore bug
    4. This is encapsulated in: 
  2. The new archive needs to implement the automated annotations that the legacy archive has in place.
    1. These are expected to just be the detection of missing BDFs, but other functionality should also be replicated.
    2. e.g., see the data of the project 21A-147 observed on May 13, 2021 in legacy archive vs. new archive
    3. This work is encapsulated in:  
  3. (added 03/28/2022) A second type of annotations are needed to describe data issues that prevented the pipeline from operating correctly.
    1. These should appear as an informational indicator (similar to the yellow '!' used for issues with the data themselves, but should appear associated with the Cals column)
    2. DAs should be able to enter free-format text that will be associated with the EB to document why the pipeline failed in this instance and corrective action a user could take