Скачать презентацию How is the Internet Performing Les Cottrell Скачать презентацию How is the Internet Performing Les Cottrell

b0a3d5d56d37e7bf42b0a1e91e4db71a.ppt

  • Количество слайдов: 47

How is the Internet Performing? Les Cottrell – SLAC Lecture # 2 presented at How is the Internet Performing? Les Cottrell – SLAC Lecture # 2 presented at the 26 th International Nathiagali Summer College on Physics and Contemporary Needs, 25 th June – 14 th July, Nathiagali, Pakistan Partially funded by DOE/MICS Field Work Proposal on Internet End-to-end Performance Monitoring (IEPM), also supported by IUPAP 1

Overview • Internet characteristics – packet sizes, protocols, hops, hosts … – complexity, flows, Overview • Internet characteristics – packet sizes, protocols, hops, hosts … – complexity, flows, applications • Application requirements • How the Internet worldwide is performing as seen by various measurements and metrics • How well are requirements met? • Many sources of measurements CAIDA/Skitter Matrix Ping. ER/IEPM Surveyor 2

Packet size • primarily 3 sizes: Measured Feb 2000 at Ames Internet e. Xchange Packet size • primarily 3 sizes: Measured Feb 2000 at Ames Internet e. Xchange Cu, mulative probability % – close to minimum=telnet and ACKs, 1500 (max Ethernet payload, e. g. FTP, HTTP); ~ 560 Bytes for TCP implementations not using max transmission unit discovery Mean ~ 420 Bytes, median ~ 80 Bytes Packets ~ 84 M packets, < 0. 05% fragmented Bytes Packet size (bytes) 3

Internet protocol use • There are 3 main protocols in use on the Internet: Internet protocol use • There are 3 main protocols in use on the Internet: SLAC protocol flows ICMP TCP Flows/10 min In Out – UDP (connectionless datagrams, best effort delivery), – TCP (Connection oriented, “guaranteed” delivery) – ICMP (Control Message protocol) TCP dominates today UDP Time Feb-May 2001 4

Web use characteristics • Size of web objects varies from site to site, server Web use characteristics • Size of web objects varies from site to site, server to server and by time of day. – Typical medians vary from 1500 to 4000 bytes • Also varies by object type, e. g. medians for – movies few 100 KB to MBs, postscript & audio few 100 KB – text, html, applets and images few thousand KB Big peaks for error messages Bytes 5

Hops • Hop counts seen from 4 Skitter sites (Japan, S. Cal, N. Cal, Hops • Hop counts seen from 4 Skitter sites (Japan, S. Cal, N. Cal, E. Canada, i. e. 10 -15 hops on average RTT Weak RTT dependence on hop count 95% 50% 5% Hops Hop Count 6

Autonomous Systems (AS) Disperson • Color indicates the AS responsible for the router at Autonomous Systems (AS) Disperson • Color indicates the AS responsible for the router at the hop, height is number of probes for that route • Seen by Skitter at Palo Alto US (F root name server) Hop number 7

Country dispersion • Seen from Japan • After 3 to 4 hops most goes Country dispersion • Seen from Japan • After 3 to 4 hops most goes to US. Probes – In some cases goes US & back to jp – Some goes to UK & onto other European countries Hops 8

Route maps • Simple routes from TRIUMF, Canada to several sites already gets quite Route maps • Simple routes from TRIUMF, Canada to several sites already gets quite complex TRIUMF DESY UW SLAC FNAL CERN KEK 9

Getting more complex • Ping. ER Beacon sites in US seen from TRIUMF, Vancouver Getting more complex • Ping. ER Beacon sites in US seen from TRIUMF, Vancouver (from Andrew Daviel, TRIUMF) 10

Connections by country NL IT US Unknown RU UK JP DE 11 Connections by country NL IT US Unknown RU UK JP DE 11

Richness of connectivity • Angle = longitude of AS HQ in whois records • Richness of connectivity • Angle = longitude of AS HQ in whois records • Radius=1 -log(outdegree(AS)+1)/(maxoutdegree + 1) – Outdegree = number of next Hops As’ accepting traffic • Deeper blue & red more connections • All except 1 of top 15 AS’ are in US, exception in Canada • Few links between ISPs in Europe and Asia 12

Hosts by regions • Jan 2001, 109 Million hosts – Source: Internet Software Consortium Hosts by regions • Jan 2001, 109 Million hosts – Source: Internet Software Consortium (www. isc. org) • see web site also for hosts/population Notes: • Many. com are in N. America • S. Asia = in (36 K), pk (6 K), lk, bd • E. Asia=jp, cn, my, sg, tw, hk, th, id, bn, mm • Mid East=il, kw, lb, ae, tr, sa • TLDs with hosts~238 • Total TLDs~258 13

Backbone utilization Shows utilization of I 2/Abilene backbone links, NB Backbone < 30% loaded Backbone utilization Shows utilization of I 2/Abilene backbone links, NB Backbone < 30% loaded Most losses at exchange points & edges 14

Flow sizes SNMP Real A/V AFS file server Heavy tailed, in ~ out, UDP Flow sizes SNMP Real A/V AFS file server Heavy tailed, in ~ out, UDP flows shorter than TCP, packet~bytes 75% TCP-in < 5 k. Bytes, 75% TCP-out < 1. 5 k. Bytes (<10 pkts) UDP 80% < 600 Bytes (75% < 3 pkts), ~10 * more TCP than UDP Top UDP = AFS (>55%), Real(~25%), SNMP(~1. 4%) 15

Flow lengths TCP outbound flows Measured by Netflows tied off at 30 mins Active Flow lengths TCP outbound flows Measured by Netflows tied off at 30 mins Active time in secs • 60% of TCP flows less than 1 second • Would expect TCP streams longer lived – But 60% of UDP flows over 10 seconds, maybe due to heavy use of AFS at SLAC – Another (CAIDA) study indicates UDP flows are shorter 16 than TCP flows

Typical Internet traffic by Application • CERFnet link • Dominated by WWW (http) Mail Typical Internet traffic by Application • CERFnet link • Dominated by WWW (http) Mail WWW FTP Real. Audio 17

SLAC Traffic profile Mbps in SLAC offsite links: OC 3 to ESnet, 1 Gbps SLAC Traffic profile Mbps in SLAC offsite links: OC 3 to ESnet, 1 Gbps to Stanford U & thence OC 12 to I 2 OC 48 to NTON HTTP Profile bulk-data xfer dominates Last 6 months Mbps out iperf 2 Days FTP SSH bbftp 18

SLAC Internet Application usage Ames IXP: approximately 60 -65% was HTTP, about 13% was SLAC Internet Application usage Ames IXP: approximately 60 -65% was HTTP, about 13% was NNTP Uwisc: 34% HTTP, 24% FTP, 13% Napster 19

What does performance depend on? • End-to end internet performance seen by applications depends What does performance depend on? • End-to end internet performance seen by applications depends on: – round trip times – packet loss – jitter – reachability – bottleneck bandwidth – implementation/configurations – application requirements • Data transmitted in packets 20

Application requirements • Based on ITU Y 1541 • The Vo. IP loss of Application requirements • Based on ITU Y 1541 • The Vo. IP loss of 10^-3 used to be 0. 25 but that assumed random flat loss – actual loss is often bursty • Tail drop in routers • Sync loss in circuits, bridge spanning tree reconfiguration, route changes 21

RTT from ESnet to Groups of Sites RTT ~ distance/(0. 6*c) + hops * RTT from ESnet to Groups of Sites RTT ~ distance/(0. 6*c) + hops * router delay Router delay = queuing + clocking in & out + processing ITU G. 114 300 ms RTT limit for voice 20%/year 22

RTT Region to Region OK White 0 -64 ms Green 64 -128 ms Yellow RTT Region to Region OK White 0 -64 ms Green 64 -128 ms Yellow 128 -256 ms NOT OK Pink 256 -512 ms Red > 512 ms OK within regions, N. America OK with Europe, Japan 23

Brazil 300 ms E. Coast Europe & S. America RTT (ms) E. Coast US Brazil 300 ms E. Coast Europe & S. America RTT (ms) E. Coast US W. Coast US Frequency RTT from California to world Europe 0. 3*0. 6 c 300 ms RTT (ms. ) Longitude (degrees) Source = Palo Alto CA, W. Coast 24 Data from CAIDA Skitter project

RTT(ms) RTT from Japan to world Longitude Seen from Japan 25 RTT(ms) RTT from Japan to world Longitude Seen from Japan 25

 • Gives quality measure • Seen from San Diego, US Skitter • Steeper • Gives quality measure • Seen from San Diego, US Skitter • Steeper = less jitter, i. e. better • Small values better Cumulative % Cumulative RTT distributions RTT ms 26

Routes are not symmetric Advanced to U. Chicago RTT ms • Min, 50% & Routes are not symmetric Advanced to U. Chicago RTT ms • Min, 50% & 90% RTT measured by Surveyor • Notice big differences in RTTs • May be due to different paths in the 2 directions or to different loading U. Chicago to Advanced 27

Loss seen from US to groups of Sites 50% imp rove men t / Loss seen from US to groups of Sites 50% imp rove men t / ye ar ETSI DTR/TIPHON-05001 V 1. 2. 5 threshold for good speech 28

Detailed example of improvements Increase of bandwidth by factor of 460 in 6 years, Detailed example of improvements Increase of bandwidth by factor of 460 in 6 years, more than kept pace - factor of 50 times improvement in loss Note valleys when students on vacation 29

Loss to world from US Using year 2000, fraction of world’s population/country from www. Loss to world from US Using year 2000, fraction of world’s population/country from www. nua. ie/surveys/how_many_online/ 30

How are the U. S. Nets doing? In general performance is good (i. e. How are the U. S. Nets doing? In general performance is good (i. e. <= 1%) ESnet holding steady, still better than others Edu (v. BNS/Abilene) &. com improving 31

Losses for 28 days in May 2001 % Loss DNS Internet WWW ISP • Losses for 28 days in May 2001 % Loss DNS Internet WWW ISP • Measured by MIDS to 583 DNS services, 383 Web services, 1367 Internet (ping) hosts, & 1225 ISPs (routers) 32

Losses between Regions 33 Losses between Regions 33

Bulk throughput • Important for long TCP flows where we want to copy large Bulk throughput • Important for long TCP flows where we want to copy large amounts of data from one site to another in a relatively short time, e. g. file transfer • Depends on RTT, loss, timeouts, window sizes 34

Throughput quality TCPBW < 1/(RTT*sqrt(loss)) Note E. Europe catching up Macroscopic Behavior of the Throughput quality TCPBW < 1/(RTT*sqrt(loss)) Note E. Europe catching up Macroscopic Behavior of the TCP Congestion Avoidance Algorithm, Matthis, Semke, Mahdavi, Ott, Computer Communication Review 27(3), July 1997 35

Throughput also depends on window • Optimal window size depends on: – Bandwidth end Throughput also depends on window • Optimal window size depends on: – Bandwidth end to end, i. e. min(BWlinks) AKA bottleneck bandwidth – Round Trip Time (RTT) – For TCP keep pipe full • Window (sometime called pipe) ~ RTT*BW – Can increase bandwidth by orders of magnitude Src Rcv • If no loss Throughput ~ Window/RTT CK A t = bits in packet/link speed RTT 36

“Jitter” from N. America to W. Europe “Jitter” = IQR(ipdv), where ipdv(i) =RTT(i) – “Jitter” from N. America to W. Europe “Jitter” = IQR(ipdv), where ipdv(i) =RTT(i) – RTT(i-1) 214 pairs ETSI: DTR/TIPHON-05001 V 1. 2. 5 (1998 -09) good speech < 75 ms jitter 37

“Jitter” between regions ETSI: DTR/TIPHON-05001 V 1. 2. 5 (1998 -09) 75 ms=Good 125 “Jitter” between regions ETSI: DTR/TIPHON-05001 V 1. 2. 5 (1998 -09) 75 ms=Good 125 ms=Med Jitter varies with loading 225 ms=Poor 38

SLAC-CERN Jitter ETSI/TIPHON delay jitter threshold (75 ms) 39 SLAC-CERN Jitter ETSI/TIPHON delay jitter threshold (75 ms) 39

Reachability Within N. America, & W. Europe loss, RTT and jitter is acceptable for Reachability Within N. America, & W. Europe loss, RTT and jitter is acceptable for Vo. IP But what about reachability 40

Reachability – Outage Probability Surveyor probes randomly 2/second Measure time (Outage length) consecutive probes Reachability – Outage Probability Surveyor probes randomly 2/second Measure time (Outage length) consecutive probes don’t get through Heavy tailed outage lengths (packet loss not Poisson) http: //www-iepm. slac. stanford. edu/monitoring/surveyor/outage. html 41

Europe seen from U. S. Monitor site Beacon site (~10% sites) HENP country Not Europe seen from U. S. Monitor site Beacon site (~10% sites) HENP country Not HENP & not monitored 200 ms 1% loss 7% loss 650 ms 10% loss 42

Asia seen from U. S. 10% loss 3. 6% loss 0. 1% loss 250 Asia seen from U. S. 10% loss 3. 6% loss 0. 1% loss 250 ms 640 ms 450 ms 43

Latin America, Africa & Australasia 4% Loss 170 ms 220 ms 700 ms 2% Latin America, Africa & Australasia 4% Loss 170 ms 220 ms 700 ms 2% Loss 350 ms 44

Animated monthly 2000 20% loss 200 ms RTT Big is Bad 20% unreachable 45 Animated monthly 2000 20% loss 200 ms RTT Big is Bad 20% unreachable 45

RTT worldwide from the Matrix 46 RTT worldwide from the Matrix 46

More Information • IEEE Communications, May 2000, Vol 38, No 5, pp 120 -159 More Information • IEEE Communications, May 2000, Vol 38, No 5, pp 120 -159 • IEPM/Ping. ER home site – www-iepm. slac. stanford. edu/ • CAIDA/Skitter home site – www. caida. org/home/ • Matrix Net home site – www. matrix. net/index. html • Surveyor home site: – www. advanced. org/csg-ippm/ 47