...

Projects

  • NM Cluster scheduler replacement
  • ALMA profiling
  • Alamo Starlink DONE
  • GPU support
  • NAASC scheduler replacement plan ngVLA Data Processing ConOps POSTPONED
  • VLASS, assist with cube imaging plan DONE
  • Create RADIAL specification

Discussion items

Item | Who | Notes
HERA hardware | James
  • aocoss13 130466 racked, booted. Needs Lustre.  Stolen to repair aocoss04.
    • 2022-02-10 krowe: spoke with Matthew today and explained that HERA paid for a 13th OSS and there are only 12 in production.  He didn't understand the HERA angle but now seems to agree that the broken OSS should get fixed even if that means just buying a new chassis.  We will see.
    • 2022-03-16 there is motion on this.  I think CIS is replacing the hardware.
Slurm bug | krowe, jrobnett

We now have a Slurm contract for 1 year at $11,750.  PO: 375504

https://bugs.schedmd.com/show_bug.cgi?id=13548

I need to submit some bugs.

CARTA glide-ins | James | Need to investigate gliding in based on a lack of free slots rather than on idle jobs. Can one query HTCondor for a CARTA-shaped slot (cores, memory, disk)?
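
One possible way to ask that question is through the HTCondor Python bindings. This is only a sketch: the function names `slot_constraint` and `find_slots` are made up for illustration, and the CARTA shape (cores, memory, disk) has to be supplied by the caller because the notes do not define it. It assumes the standard machine ClassAd attributes `Cpus`, `Memory` (MiB) and `Disk` (KiB), and that on partitionable slots these report the resources still unallocated.

```python
def slot_constraint(cpus, memory_mb, disk_kb, unclaimed_only=False):
    """Build a ClassAd constraint for slots with at least these resources.

    Assumes the machine ad attributes Cpus, Memory (MiB) and Disk (KiB).
    """
    expr = (
        f"Cpus >= {int(cpus)} && Memory >= {int(memory_mb)} "
        f"&& Disk >= {int(disk_kb)}"
    )
    if unclaimed_only:
        # Optional filter, mostly useful for static slots.
        expr += ' && State == "Unclaimed"'
    return expr


def find_slots(cpus, memory_mb, disk_kb, unclaimed_only=False):
    """Query the collector for slot ads that could fit a job of this shape."""
    # Imported here so slot_constraint() works without HTCondor installed.
    import htcondor

    collector = htcondor.Collector()  # pool taken from the local configuration
    return collector.query(
        htcondor.AdTypes.Startd,
        constraint=slot_constraint(cpus, memory_mb, disk_kb, unclaimed_only),
        projection=["Name", "Cpus", "Memory", "Disk", "State"],
    )
```

An empty result from `find_slots(...)` would then be the signal that no slot of that shape is free, which could drive the glide-in decision. The same constraint string can be passed to `condor_status -constraint` for a quick check by hand.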
Condor Week | All

http://htcondor.org/HTCondorWeek2022 (May 23 to May 26)

  • ngVLA presentation?
  • nraorsync presentation? ~krowe/nraorsync.odp (it has notes)
Vacation | krowe, fmadsen

Jun. 14, 2022 - Jun. 23, 2022 (7 days)

May 31, 2022 - Jun. 7, 2022 (6 days)

NVMe drives | jrobnett, krowe

Ticket 139472: buy 30 NVMe drives for nmpost nodes.  What type to get?

The one test drive (SAMSUNG_MZPLJ6T4HALA-00007) arrived, and I was able to partition, format, and mount it on testpost001.  Peter ordered 32 drives.

HERA CHAMP | krowe

Ask CIS if we can have a few more fastX licenses for HERA CHAMP (Jun 6 - 10)

krowe submitted a ticket for this (140530)

herastore02 | krowe

Make the four volumes on herastore02 NFS-accessible to only the herapost nodes.

krowe submitted a ticket for this (140533)

Data reduction workshop | fmadsen | Frank Schinzel asked if I could provide support for a day, working with participants on imaging on the cluster. Scheduled for October 18th.

HERA jupyterhub | krowe | Done: configure jupyterhub on herapost-master to restart on failure.


...