Date
06 Jul
Goals
- Begin imaging with GPUs
- Don't die
Discussion items
Item | Who | Notes | |||
---|---|---|---|---|---|
HERA hardware | James | herastore01d 127013 Racked, disked. Needs power, SAS, firmware?, format, mount. herastore02 and four shelves 129289. Done: aoc253k-pdu-1 has critical alarms 132028. During the power outage they replaced the PDU with the spare. aocoss13 130466 racked, booted. Needs Lustre. Stolen to repair aocoss04. | |||
Lustre project quotas | jrobnett | Lustre project quotas are still not quite right. 130900 krowe isn't sure Leo's new scripts are doing the right thing. It looks to me like the script (/lustre/aoc/admin/bin/set_quota_lustre.sh) is setting user quotas for users, sciops, and observers. | |||
VLASS RAM swap | krowe | Restore VLASS memory in nodes at NMT 131614 | |||
More HERA nodes | jrobnett, krowe |
| |||
nmngas | jrobnett, krowe |
| |||
master nodes | jrobnett, krowe | 122408 Upgrade testpost-master and nmpost-master. testpost-master is done. nmpost-master is scheduled for Sep. 15, 2021. | |||
nmngas update | jrobnett | NGAS replacement | |||
HTCondor requirements | krowe | We need to set requirements for cluster nodes. SSA wants to run on NMT machines. Is requirements = (HasLustre =!= True) really the best way to do that? How about two axes (HasLustre and Partition == VLASS)? | |||
Agedu | jrobnett | Worth pondering ordering of executions and timestamps | |||
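
For the HTCondor requirements question above, here is a minimal Python sketch of how the two candidate expressions would evaluate under ClassAd semantics. It does not use the HTCondor API. The node ads, the string form of the Partition value, and all function names are hypothetical and for illustration only. It relies on two ClassAd rules: `=!=` never evaluates to UNDEFINED, while `==` against a missing attribute does, and a requirements expression only matches when it evaluates to True.

```python
# Sketch only: plain-Python emulation of ClassAd evaluation, not the HTCondor API.
UNDEFINED = None  # stand-in for the ClassAd UNDEFINED value


def is_not_identical(a, b):
    """ClassAd `=!=`: always True or False, even when a side is UNDEFINED."""
    return not (type(a) is type(b) and a == b)


def equals(a, b):
    """ClassAd `==`: UNDEFINED if either side is UNDEFINED; strings ignore case."""
    if a is UNDEFINED or b is UNDEFINED:
        return UNDEFINED
    if isinstance(a, str) and isinstance(b, str):
        return a.lower() == b.lower()
    return a == b


def logical_and(a, b):
    """ClassAd `&&`: False wins; otherwise UNDEFINED propagates."""
    if a is False or b is False:
        return False
    if a is UNDEFINED or b is UNDEFINED:
        return UNDEFINED
    return True


def lustre_only(ad):
    """requirements = (HasLustre =!= True)"""
    return is_not_identical(ad.get("HasLustre", UNDEFINED), True)


def two_axis(ad):
    """requirements = (HasLustre =!= True) && (Partition == "VLASS")"""
    return logical_and(
        lustre_only(ad),
        equals(ad.get("Partition", UNDEFINED), "VLASS"),
    )


def matches(result):
    """A job matches only when requirements evaluates to True."""
    return result is True


# Hypothetical machine ads; attribute values are assumptions, not real config.
NODES = {
    "aoc-node": {"HasLustre": True},
    "nmt-vlass-node": {"Partition": "VLASS"},
    "unlabeled-node": {},
}

if __name__ == "__main__":
    for name, ad in NODES.items():
        print(name, matches(lustre_only(ad)), matches(two_axis(ad)))
```

With these made-up ads, a node that advertises neither attribute matches the first expression but not the second, which may be worth weighing when choosing between the two forms.
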
...