...
This is conceptual at this point.
- We pre-stage data on the remote head node
- We then either submit a job locally and it flocks to the remote site or we login to the remote site and submit from there.
- Can we use a nifty filesystem to simplify this (Ceph or that LHC fs)?
- This might be a good phase2 problem to solve.
- Is this kinda what nraorsync does?
- The remote execute hosts transfer data from the remote head node
- The job uploads resulting data to the remote head node
- We retrieve data from the remote head node
Other
- Keep each rack as similar to the other racks as possible.
- Test system at NRAO should be one of everything.
- Since we are making our own little OSG, should we try to leverage OSG for this or not? Or do we want to make each POD a pool and flock?
- Should we try to buy as much as we can from one vendor like Dell to simplify things?
- APC sells a packaged rack on a pallet ready for shipping. We could fill this with gear and ship it. Not sure if that is a good idea or not. We will not be able to move the unit into the server room while still on the pallet because no doorway is tall enough. We would have to roll it off the pallet (it comes with a ramp and the rack is on casters) move it into the server room, fill and configure it, roll it out of the server room, roll it back onto the pallet, probably remove the bottom server(s) so we can attach it to the pallet, then re-add the bottom server(s). We could use the double glass doors for this but there is a lip on the transition. We could use the doors in the PRA closet as it has no lip but would require a lot of moving of shelves and stuff.
- APC NetShelter SX packaged:
- On Pallet: Height 85.79in (2179mm) Width 43.5in (1105mm)
- On Casters: Height 78.39in 1991mm) Width 23.62in (600mm)
- Double Glass doors: Height: 80in (2032mm) (because of the 2in maglock)
- NRAO-NM wide server doors: Height: 83in (2133mm) Width: 48in (1187mm)
- I could start prototyping now using AWS.
- Do we want jobs to flock or do we want to submit jobs on the remote host and have pre-transfered data? Involve SSA and VLASS in this question.
- If jobs are submitted from the remote host does that mean SSA will want a container on that remote host?
...
- Voltage in server room (120V or 208V or 240V)
- Receptacles in server room (L5-30R or L21-30R or ...)
- Single or dual power feeds?
- Is power from below or from above?
- Door width and height and path to server room.
- Can a rack-on-pallet fit upright? Height: 85.79inches (2179mm) Width: 43.5inches (1105mm)
- Can a rack-on-casters fit upright? Height: 78.39inches (1991mm) Width: 23.62inches (600mm)
- NRAO-NM wide server door Height: 84inches (2108mm) Width: 46.75inches (1219mm)
- Firewalls
- How are you going to use this?
- Do you care if this is in your DNS zone or ours?
- Is NAT available for the execute hosts?
Resources
- USNO correlator (Mark Wainright)
- VLBA Control Computers (William Colburn)
- Red Hat maintenance (William Colburn)
- Virtual kickstart (William Colburn)
- Switch models and ethernet (Jeff Long)
- HTCondor best practices (Greg Thain)
- OSG (Lauren Michael)
- SDSC at UCSD
- TACC at UT Austin
- IDIA https://www.idia.ac.za/
...