...
Item | Who | Notes |
---|---|---|
Gibson/NMT network | James | Need to rethink the rsync to the gibson network path |
OOM error for some torque jobs | James | Twice now VLASS imaging jobs have been killed by torque for going over memory. This shouldn't ever happen. |
HERA orders | James | Ticket to rack herastore01d: 127013. Ticket to rack herastore02 and shelves: 129289. |
herastore01 | krowe | Done: five missing/bad disks on VD_3 (helpdesk ticket 127465). Done: Mar. 4, 2021: rebuild herastore01-4, tell Paul and he will put data back on it and let Librarian know about it. Remove backup copies krowe made of the shelf (herastore01a). |
nmpost091 | krowe, Jrobnett | Done: nmpost091 is up at NMT (127446 and 126684). Should we change the hourly rsync to run every other hour so it doesn't conflict with itself as often? Never mind: it is seeing James's rsync for his job and exiting. Perhaps I should just remove the check or make it smarter. Done: Change sync-cfcache.sh to exit if it sees itself running. |
Return VLASS nodes to torque | krowe | Done: Jobs are draining off so we can start returning some nodes. 127669 Keep one node like nmpost071 but not in condor and offline. |
Tobin imaging step | James | Need to add imaging step to htcondor workflow (works, waiting for word to roll into git repo), pbmask=0.4. Should krowe fold step25 into the git repo? |
nmpost004 | krowe | jtobin is borrowing nmpost004 on Jan. 21, 2021 |
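
For the nmpost091 item above, one way a script can exit when it sees itself running, without tripping on someone else's rsync, is a lock directory instead of a process-name check. This is only a sketch and not necessarily what sync-cfcache.sh does now: `LOCKDIR`, `acquire_lock`, `release_lock`, and `run_sync` are hypothetical names, and the real rsync command is left out because it is not given in these notes.

```shell
#!/bin/sh
# Sketch only: LOCKDIR and the function names are hypothetical,
# not taken from the real sync-cfcache.sh.
LOCKDIR="${LOCKDIR:-${TMPDIR:-/tmp}/sync-cfcache.lock.d}"

acquire_lock() {
    # mkdir is atomic, so only one caller can create the directory.
    mkdir "$LOCKDIR" 2>/dev/null
}

release_lock() {
    rmdir "$LOCKDIR" 2>/dev/null
}

run_sync() {
    if ! acquire_lock; then
        echo "sync-cfcache already running, skipping this run"
        return 0
    fi
    # The real rsync invocation would go here; it is not given in these notes.
    echo "lock held, syncing"
    release_lock
}

run_sync
```

Because the lock belongs to this script alone, an unrelated rsync started by a user's job does not cause a skip. A run that is killed before `release_lock` leaves a stale lock directory behind, so it would need manual cleanup or a trap.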
...