Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

ItemWhoNotes
HERA hardwareJames

herastore01

  • herastore01b Needs firmware
  • herastore01c Needs firmware
  • herastore01d 127013 racked, disked
.  Needs power
  • ,
SAS
  • powered,
firmware?
  • SASed,
format
  • firmwared, formatted.  Needs mount.

herastore02 and four shelves 129289 129289

  • herastore02 racked, powered, OSed.  Needs /opt. CIS borrowing for NGAS firmware upgrades.
  • 02a racked, disked, firmwared.  Needs power, SAS, firmware, format, mount.
  • Done: 02b racked, firmwaredhavenHaven't purchased disks yet.  Needs firmware.herastore02
  • Done: 02c racked, firmwaredNeeds power, OS.02c racked. havenHaven't purchased disks yet.  Needs firmware.
  • Done: 02d racked, firmwared. haven  Haven't purchased disks yet.  Needs firmware.

Done: aoc253k-pdu-1 has critical alamrs 132028.  During the power outage they replaced the PDU with the spare.

aocoss13 130466 racked, booted. Needs Lustre.  Stolen to repair aocoss04.

Lustre project quotasjrobnett

Lustre project quotas are still not quite right. 130900

Leo's new scripts are doing the wrong thing.  (/lustre/aoc/admin/bin/set_quota_lustre.sh) is setting user quotas for /lustre/aoc/users.  Leo has created project quotas for /lustre/aoc/projects and some of /lustre/aoc/users.

VLASS RAM swapkroweRestore VLASS memory in nodes at NMT  131614
More HERA nodesjrobnett, krowe
  • Done: new herapost-master and make old herapost-master a compute node.
  • Done: new IB card/cable for new herapost-master 132576
  • Done: Buy an IB switch for HERA racks.  $13,300 133166
  • Rack Connect switch and connect to fabric.  Requires some re-arranging of ports.  133166
  • Cards/cables req: 182337, 182338.  Install in new nodes.
  • Boot three 2U nodes with 24 cores each with GPU kits but no GPUs for now
nmngasjrobnett, krowe114896
  • nmngas{01..04}c racked, firmwared, powerd, SASd.  Needs firmware, power, SAS, format, mount.
  • nmngas{01..04}c-mirror still in box.  Needs racked, firmwarefirmwared, disks, power, SAS, powerd, SASd.  Needs format, mount.
master nodesjrobnett, kroweDONE: 122408 upgrade testpost-master and nmpost-master.  testpost-master is done.  nmpost-master replaced on Sep. 15, 2021.  It did kill all HTCondor jobs.
nmgnas updatejrobnettNGAS replacement
HTCondor Euro weekallhttps://indico.cern.ch/e/htcondor2021  Starts Sep. 20, 2021.
  • Done: Ticket 114896 sadly didn't mention formatting or mounting volumes so it was closed.
  • krowe submitted ticket 134766 to format and mount the new volumes.
VLASS memory usagejrobnettWe need to investigate memory usage for VLASS SE imaging jobs.
Order test GPUSjrobnettNeed to order test GPUs against 114412506.6432HERA accountsjrobnettGet with CIS about HERA accounts.  HERA maintains spreadsheet.  Helpdesk sets status (opened/closed).

Jira
serverDMS JIRA
columnskey,summary,updated,assignee,priority,status
maximumIssues20
jqlQueryfilter=12536
serverIdeb2e750b-a83a-387e-8345-36eee8a98f01

...