Projects

  • NM Cluster scheduler replacement
  • ALMA profiling
  • Alamo Starlink DONE
  • GPU support
  • NAASC scheduler replacement plan POSTPONED
  • ngVLA Data Processing ConOps
  • VLASS, assist with cube imaging plan DONE
  • Create RADIAL specification

Discussion items

Item | Who | Notes
HERA hardware | James
  • aocoss13 130466 racked, booted. Needs Lustre.  Stolen to repair aocoss04.
    • 2022-02-10 krowe: spoke with Matthew today and explained that HERA paid for a 13th OSS and there are only 12 in production.  He didn't understand the HERA angle but now seems to agree that the broken OSS should get fixed even if that means just buying a new chassis.  We will see.
    • 2022-03-16 there is motion on this.  I think CIS is replacing the hardware.
Slurm bug | krowe, jrobnett | Respond to SchedMD's question about the support contract.
HTCondor show and tell | krowe

Give a show-and-tell for Amy and the DAs on Thursday, Mar. 17 at 9am.

Make a slide about how to submit jobs manually.

Use James's Zoom.  Tell Amy and the DAs.

VLASS cron job | krowe

Done: VLASS wants a cron job set up to copy data hourly.  Putting it here so I don't forget.

It is in place, but a pgrep -f /lustre/aoc/projects/vlass/bin/pb-psf-storage.py check would be good to keep it from running twice.  The problem is that cron won't let me use the -f option.

Of course you can't use pgrep -f in a crontab entry because it will always find itself.  So I wrote a wrapper script.  It now checks for itself before running.
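
The wrapper script itself is not recorded in these notes. As a minimal sketch of one way to get the same "don't run twice" guarantee, assuming Python 3 on Linux: rather than searching the process table with pgrep, the wrapper below takes a non-blocking flock on a lock file, so it cannot match itself. This is a different technique from the pgrep check described above, and the function name, the script name run_once.py, and the lock path are all hypothetical.

```python
import fcntl
import subprocess
import sys


def run_exclusively(cmd, lock_path):
    """Run cmd unless another process already holds a lock on lock_path.

    Returns the command's exit code, or None if the lock was already held
    (meaning a previous run is still in progress and this one was skipped).
    """
    lock_file = open(lock_path, "w")
    try:
        try:
            # Non-blocking exclusive lock: fails at once instead of waiting.
            fcntl.flock(lock_file, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except OSError:
            return None
        return subprocess.run(cmd).returncode
    finally:
        # Closing the file releases the lock, even if the command failed.
        lock_file.close()


if __name__ == "__main__" and len(sys.argv) > 2:
    # Hypothetical usage: run_once.py LOCK_PATH COMMAND [ARGS...]
    exit_code = run_exclusively(sys.argv[2:], sys.argv[1])
    sys.exit(0 if exit_code is None else exit_code)
```

An hourly crontab entry would then call the wrapper with a lock path (for example a hypothetical /tmp/pb-psf-storage.lock) followed by /lustre/aoc/projects/vlass/bin/pb-psf-storage.py. Because the kernel drops the lock when the wrapper exits, a crashed run does not leave a stale lock behind.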

DMS computing plan | James | Review DMS computing plan: https://sharepoint.nrao.edu/dms/_layouts/15/WopiFrame.aspx?sourcedoc=/dms/Shared%20Documents/Management/Science%20Information%20Services/DMS%20Scientific%20Computing%20Management%20Plan.docx&action=default
RADIAL plan | James | We need to characterize cluster computer services from the perspective of what can be supported by the remote host and what needs to be supported by NRAO.
Data reduction workshop | fmadsen | Frank Schinzel asked if I could provide support for a day to work with participants on imaging on the cluster.  Scheduled for October 18th.
VLASS SE memory | krowe | Test an SE continuum imaging job on a dedicated node to see what the memory footprint should be.  It's currently 50GB; could it be 32GB with swap?  Ask Amy for a test job.
