Overview

On Jul. 25, 2022, Jeff Kern asked K. Scott Rowe to lead a tiger team investigating the various issues that have affected the ALMA Archive hosted in CV over the past several weeks to months. Initially, the team consisted only of K. Scott.

Communication lines

Documented Issues

Diagrams

Timeline of events

Benchmarks


Table 3: iperf3 to/from ingress_sbox (Mb/s)

            na-arc-1    na-arc-2    na-arc-3    na-arc-4    na-arc-5
            10.0.0.2    10.0.0.21   10.0.0.19   10.0.0.5    10.0.0.6
na-arc-1    -           4,000       2,000       4,000       3,000
na-arc-2    4,000       -           2,000       4,000       3,000
na-arc-3    0.3         0.3         -           0.3         3,000
na-arc-4    4,000       4,000       2,000       -           3,000
na-arc-5    0.3         0.3         2,000       0.3         -

Table 4: iperf3 to/from ingress_sbox (Mb/s)

                natest-arc-1    natest-arc-2    natest-arc-3
                10.0.0.2        10.0.0.8        10.0.0.4
natest-arc-1    -               900             700
natest-arc-2    900             -               700
natest-arc-3    300             300             -
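
The throughput figures in the iperf3 tables could be collected in a repeatable way by running iperf3 with its JSON output flag (-J) and extracting the summary fields. The following is a minimal sketch, not the method actually used for these benchmarks. The function name summarize_iperf3 and the sample report values are hypothetical, and the sketch assumes a TCP test whose JSON report contains end.sum_sent and end.sum_received.

```python
def summarize_iperf3(report: dict) -> dict:
    """Extract throughput in Mb/s and TCP retransmits from a parsed `iperf3 -J` report."""
    end = report["end"]
    sent = end["sum_sent"]
    received = end["sum_received"]
    return {
        "sent_mbps": sent["bits_per_second"] / 1e6,
        "received_mbps": received["bits_per_second"] / 1e6,
        # "retransmits" is only reported for TCP tests, so it may be absent.
        "retransmits": sent.get("retransmits"),
    }


# Hypothetical, hand-written report fragment for illustration only;
# these numbers are NOT measurements from the hosts in this document.
sample_report = {
    "end": {
        "sum_sent": {"bits_per_second": 4.0e9, "retransmits": 12},
        "sum_received": {"bits_per_second": 3.9e9},
    }
}

summary = summarize_iperf3(sample_report)
print(summary)
```

In practice the report would come from something like json.load() on the output of iperf3 -c <host> -J, run once per pair of hosts in the tables.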


TCP Retransmissions

Dropped packets

I see dropped Rx packets on interface ens1f0np0 on naasc-vs-2 at a rate of about 100 packets per minute. You can see this with watch ifconfig ens1f0np0. This is especially interesting because there isn't much traffic on naasc-vs-2 right now: it is hosting only one VM guest (na-arc-6), and that guest is running only the docker agent container. I am not seeing any dropped packets on na-arc-6 itself.

I see dropped Rx packets on all the other naasc-vs hosts as well. Hosts naasc-vs-3 and naasc-vs-5 show only about 2 dropped packets per minute, while naasc-vs-4 shows about 100 per minute. I don't think the drops are related to the TCP retransmissions, because I don't see any retransmissions when sending to naasc-vs-4 even though it drops packets at the higher rate.
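
The per-minute drop rates quoted above were read by eye from watch ifconfig. A small script could take the same measurement consistently across hosts. This is a rough sketch under stated assumptions: it reads the Linux sysfs counter /sys/class/net/<iface>/statistics/rx_dropped, which should be close to the "dropped" value ifconfig shows, though that equivalence is an assumption here. The function names are hypothetical.

```python
import time
from pathlib import Path


def read_rx_dropped(iface: str) -> int:
    """Read the cumulative Rx dropped-packet counter for iface from sysfs (Linux only)."""
    path = Path(f"/sys/class/net/{iface}/statistics/rx_dropped")
    return int(path.read_text().strip())


def drops_per_minute(first: int, second: int, seconds: float) -> float:
    """Convert two cumulative counter readings taken `seconds` apart into drops per minute."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return (second - first) * 60.0 / seconds


def sample_drop_rate(iface: str, interval: float = 60.0) -> float:
    """Sample the counter twice, `interval` seconds apart, and return drops per minute."""
    before = read_rx_dropped(iface)
    start = time.monotonic()
    time.sleep(interval)
    after = read_rx_dropped(iface)
    return drops_per_minute(before, after, time.monotonic() - start)
```

Calling sample_drop_rate("ens1f0np0") on each naasc-vs host would give figures directly comparable to the roughly 100 and 2 packets per minute noted above.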

Comparisons

naasc-vs-2, 3, 4, 5

Identical

Differences


Questions

To Do


Answers


Done

Conclusions

NAASC Archive Stabilization Solutions


People (not necessarily team members)

References