...
Item | Who | Notes | |||
---|---|---|---|---|---|
HERA hardware | James | herastore02 135532
aocoss13 130466 racked, booted. Needs Lustre. Stolen to repair aocoss04. | Order test GPUS | jrobnett | Tesla T4, SW, HH, 16 . 5cm long, in nmpost064RTX A4000, SW, FH, 24cm long (too long for R640), in gpuhost003RTX A5000, DW, FH, FL, in gpuhost003 |
Glideins | krowe | 135553 Port RHEL-7.8.1.5 to CV. Started, doesn't boot. cvpost006 135799 Install HTCondor on cvpost. Started, needs to be host_based.factory/pilot scripts. 136304 Change routing on gibson /etc/sysconfig/network-scripts/route-p1p1 10.7.7.0/24 via 146.88.10.1 and set DEFROUTE=no for ifcfg-p1p2 | |||
T4 HERA | krowe | Working: Felipe is done with nmpost064 So I told plaplant about it. | |||
MPI and Slurm | krowe | Understand and better document how to use MPI in Slurm | |||
HTCondor plugin | krowe | fix _condor_stdout and _condo_stderr in the plugin if possible. Tell Charlotte | |||
HERA GPU in Slurm | krowe | Working: Put a herapost GPU server in Slurm such that it requires a special something to use or at least is last in line to run jobs. Tell plaplant when ready to test. | |||
Slurm | krowefmadsen | test a some casa pipeline job with Slurmjobs with Slurm (parallel, multinode, machinefile, no machine file, no -n) | |||
Hera product storage | jrobnett | HERA would like some form of product storage that can be exposed to the outside world. |
Jira | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
|
...