Below is a list of items that are priorities for Archive implementation, some of these are Legacy AAT retirement tickets, others are new development. The order in the tables is in priority order.


Main topics (click link to jump to section in page):

Data Delivery

Search/Data Selection

VLBA-specific

VLA(VLBA) Metadata presentation

Legacy VLA Data

Data Integrity

DA Tools

Missing Data

Other

ALMA

Product Versioning



Data Delivery

Priority/Done?NotesJiRA ticket /Confluence page
CriticalDownloads via Scriptable query interface
criticalA friendly error is needed when a users is not authorized to access proprietary data.

confusing to staff as well

-need to kill RH, WS 3.0+

criticalRestore data using same version of CASA as calibrated


mediumPut warning that only one CMS at a time can be requested in AAT


could be a flyover text (currently has a flyover circle with slash)
high (WS 2.8/2.9)

Clean up the nested directory data delivery mess


legacy VLA data deliveries are one level deeper in directories than an SDM

*May need revisions to account for VLBA differences.

WS 2.7 or 3.0?
planned 4.2.2

Don't restrict download number limit.

New ticket to remove the '/100 string from webpage'

We would like no limit on the number of files, but keep the 10TB for the total downloaded volume.
4.2.1

Fix select-all functionality

  • Always have select all available (only displayed files/SDM or from all pages)


  • make select-all work properly with select/deselect


mediumDownload weblogs directly (and/or serve them as well!)
mediumBulk download of Calibration tarballs


lowarchive should ask for confirmation to continue if the download or processing job is considered largeTo be requestedThe front end can show size estimates for various products.
lowData Delivery times out if tar file(s) are too large/time consuming


will not be an issue with taring not being the default.
lowarchive-new delivery not chmod'd/chowned



√ 2022.2 pipelineoption to return UNTAR'd file does not work pipeline topic
√ (4.2.0)The email sent by the archive tool will include the wget command in the near future.

 


open up local delivery restrictions (to /home and /lustre). 

The part related to opening the download destination to nm-### accounts on lustre is

This should cover users with visitor accounts wanting to download data to lustre, and local staff wanting to download data to their local machines. We have a problem with the quota of the observer accounts (I downloaded 8 TB of data to my observer account!); CIS remains irresponsive!
disable checksums by default


make tar downloads default to 'off'



Search interface/Data Selection

Priority/Done?Notes

high

Fix Frequency-dependent FOV and position searches

  • exact same feature as legacy archive, must select to do the freq. dependent FOV
  • legacy archive has this feature


high (4.2.2)

Position searches provide distance from searched position and sort by distance

  • legacy archive feature

User request, but is a legacy AAT feature
medium

Treat wild cards consistently in AAT search interface

  • account for * in source names


lowAllow search return of observation scans?

Helpdesk ticket for a search return we do not have in AAT, but legacy archive provides.

high

Allow search filtering based on total time on source.

Based on legacy archive helpdisk question from user. UC feedback as well.
mediumFix mosaic position searches for VLASS and ALMA mosaics (subscan vs. scan)



Rearrange boxes in 'Search Inputs'Ticket to be created.
√ (4.2.0)Scriptable query interface

Download number limit: 100 data sets, but download data volume limit to 10TB per request.


Doesn't fully solve problem see also


This change has already been made by Stephan, but 100 files is too limiting for various cases (take BM360 as one example). We would more likely want no limit on the number of files, but keep the 10TB for the total downloaded volume.


VLBA Stuff

Priority/Done?NotesJiRA ticket /Confluence page
High

VLBA: DA archive tools (note specifically the limitation of USNO data not having PI)

  • CHECK IF ALL REQUIREMENTS ON CONFLUENCE ARE SATISFIED
  • Current tools should be able to add PI for USNO

The VLBA DA tools for changing PI and setting proprietary period have been complete for some time. The confluence page also mentions a tool for annotations; maybe that part should be moved to the Data Integrity section?

HighVLBA+EVLA projects without metadata and sometimes also mismatched PI


High (4.2.1)Proxy issue for huge VLBA projects


High (4.2.1)Slow VLBA indexing


MedWrong band designation in VLBA data (old data only?)

This is in both the legacy archive and the new archive tool. Issue needs more research on how prevalent it is; unclear.
Med
  • For observing VLBA projects that incorporate other dishes (e.g. +Y1, +Y27 etc), the VLA files should be associated with the relevant VLBA segment and not considered as "mixed" projects. This has been descoped from ticket  but needs to be picked up in future AAT releases

Ticket created Anna Kapinska, Emmanuel Momjian, and Anthony Sowinski should check it to make sure it's correct and specifies all requirements.


MedDefault Instrument of VLBA projects in AAT/PPI
MedAdd PFT (proposal finder tool) link in Project View for VLBA projects


LowImport remaining 8 GMVA files


LowMark4 files with 0 length getting added to new archive metadata

Small number of files affected.
LowIncomplete VLBA data of observations crossing midnight (open in case more cases show up)


HighCreate USNO download acknowledgement


(reopened) Low?
Remove Legacy Archive label and link from new archive

Update (26-Sep-2022, Anna): the link is still present on some of the AAT subpages, especially the download status page when request for data is submitted.

The new archive has a link to the legacy archive at the top. This needs to be removed since the legacy archive will not be accessible from outside.

√ 4.2.0Add wget command in emails generated by the AAT upon downloading data


√ 4.1.1

import FITSAIPS files without matching projects (51 files) and project GP0024


√ 4.1.1

Add FITSAIPS files to new archive and segment association


√ 4.1.1old VLBA project metadata issues


√ 4.1.1VLBA data structure in the new archive - improved interface
√ (4.1.1)Database has zeros for min/max frequencies for some archive files


877 VLBA correlations with missing files entries


proprietary clock for VLBA data

done
VLBA Reingestion Refinements

ongoing → done
VLBA Polarization Duplication

done
String Terminators in VLBA fields

done
BL473 unlock and missing segments

done
VLBA data structure in the new archive

done
USNO Proprietary Download System/A way to deliver data from projects that are not archiveddone





VLA Metadata Presentation 

Priority/Done?NotesJiRA ticket /Confluence page
critical

Scan list does not show coordinates between dec= 00 and dec = -1 correctly


Critical (4.3.0)Make scanlist information useful 

JT: revise to capture what we want about the frequency/setup information

This ticket has multiple points to make the new tool match the Legacy archive and to have the displayed data be much more useful.
High (4.3/4.4)VLA Metadata display updates. This has many items including 'telescope observing logs'.
Medium

Incorrect metadata sometimes displayed for projects with duplicate proposal IDs between VLA and VLBA


MediumVLA/VLBA proposal code degeneracy


MediumIndexing of VLA low band projects


Subscans table holding both Radian and Degree positions

done
archive search returning incorrect projects


done





Legacy VLA Data

Priority/Done?NotesJiRA ticket /Confluence page
Criticalcorrupt legacy VLA data in the AAT


HighMetadata Issues in Legacy VLA data

Needs input from Bryan
HighLegacy VLA data missing error codes in the new archive tool

Mapping of quality errors were lost. Stephan will continue following up with Bryan. Blocker for LA retirement.
Highmetadata errors in VLBA (BD114) and old VLA data in new archive

This has been split into two tickets: one VLBA and one VLA
Mediumlegacy VLA metadata error: EBs that claim to have run for 2+ days

requires re-ingestion


Data Integrity

Priority/Done?NotesJiRA ticket /Confluence page
BlockerGap Analysis Between Legacy and New Archive


can be done once RLSW finishes the AIPSFITS files
CriticalAutomated annotations for missing scans/BDF, Automated Error Recognition of Ingested Data
CriticalDisplay of archive ingestion/observation issues


related to SSA-7190
CriticalIdentification of ingestion problems and automatically fix

Example ticket of a silent failure: 


create missing db entries for Flag.xml


Done
fix ASDM.xml files missing FlagTable entities


Done


DA Tools for Data Editing

Priority/Done?NotesJiRA ticket /Confluence page
Tool for setting Proprietary Period of science observations on the VLA and the VLBAhttps://open-confluence.nrao.edu/pages/viewpage.action?spaceKey=SPR&title=Tool+for+setting+Proprietary+Period+of+science+observations+on+the+VLA+and+the+VLBA
CriticalDA annotation tools/Archive data annotation tools


Missing Data

Priority/Done?NotesJiRA ticket /Confluence page
Critical

Tix to be created 

May require discussion about how new archive will display images and the uvfits datasets.


HighALMA Standard images into AAT


HighLarge project support - ALFALFA + Other collection ingest


Other

Priority/Done?NotesJiRA ticket /Confluence pageDetails

Other tickets (so they are not forgotten)Tickets remaining are listed throughout their respective categories.
Critical

Proposal Contact Author not added to data.nrao.edu database

Bug when a project uses different PI and Contact Author
MediumPreserve login between AAT and my.nrao.edu


√ (4.2.0)Remove pop-up page when accessing data.nrao.edu



ALMA

Priority/Done?NotesJiRA ticket /Confluence pageDetails
CriticalALMA MOUS/calibration ingest unreliable; ALMA butler implementation into production



Product Versioning

Priority/Done?NotesJiRA ticket /Confluence pageDetails
CriticalNeed to allow multiple versions of products (images/calibrations) in the archive