...
Item | Who | Notes | |||
---|---|---|---|---|---|
HERA hardware | James | herastore01
herastore02 and four shelves 129289 129289
Done: aoc253k-pdu-1 has critical alamrs 132028. During the power outage they replaced the PDU with the spare. aocoss13 130466 racked, booted. Needs Lustre. Stolen to repair aocoss04. | Lustre project quotas | jrobnett | Lustre project quotas are still not quite right. 130900 Leo's new scripts are doing the wrong thing. (/lustre/aoc/admin/bin/set_quota_lustre.sh) is setting user quotas for /lustre/aoc/users. Leo has created project quotas for /lustre/aoc/projects and some of /lustre/aoc/users. |
VLASS RAM swap | krowe | Restore VLASS memory in nodes at NMT 131614 | |||
More HERA nodes | jrobnett, krowe |
| |||
nmngas | jrobnett, krowe114896 |
| |||
master nodes | jrobnett, krowe | DONE: 122408 upgrade testpost-master and nmpost-master. testpost-master is done. nmpost-master replaced on Sep. 15, 2021. It did kill all HTCondor jobs. | |||
nmgnas update | jrobnett | NGAS replacement | |||
| |||||
VLASS memory usage | jrobnett | We need to investigate memory usage for VLASS SE imaging jobs. | |||
Order test GPUS | jrobnett | Need to order test GPUs against 114412506.6432 | HTCondor Euro week | all | https://indico.cern.ch/e/htcondor2021 Starts Sep. 20, 2021. |
Jira | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
|
...