Dai Davies, DANTE DRCN 2014 Ghent April 1-3 2014.

Post on 28-Mar-2015

217 views 1 download

Transcript of Dai Davies, DANTE DRCN 2014 Ghent April 1-3 2014.

Dai Davies, DANTE

DRCN 2014 Ghent

April 1-3 2014

Network Reliability – the Role of Excellence in Network Operations

2Connect | Communicate | Collaborate

All About GÉANT

• Resilient Topology• Resilient Physical

Routing• Minimal Hardware

Resilience• Relies on Vendor

Sparing (4hr response time)

3Connect | Communicate | Collaborate

GÉANT Availability - February 2014

IP Access Availability No of Accesses

IP Access availability above 99.99% 60 (α)

IP Access availability between 99.9% and 99.99% 0

IP Access availability between 99% and 99.9% 0

IP Access availability below 99% 2 (β)

Total Accesses 62

α - Effectively 100% Availability

β – Romania, Serbia

4Connect | Communicate | Collaborate

Anatomy of GÉANT Operations

5Connect | Communicate | Collaborate

Ticket Count by Process Type

Type Of Ticket Ticket count

Incident 709Maintenance 894Service Requests 94Housekeeping 42Security 13Escalation/Complaint 1

6Connect | Communicate | Collaborate

Analysis of GÉANT Trouble Tickets

709

894

94

42 131Ticket Count

IncidentMaintenanceService RequestsHousekeepingSecurityEscalation/Complaint

7Connect | Communicate | Collaborate

Analysis of GÉANT Incident Tickets

561

67

58

Ticket Count

Circuit Related

Hardware

S/W

8Connect | Communicate | Collaborate

Operations’ issues

Performance

9Connect | Communicate | Collaborate

Performance Parameters

Availability (Sorted)

Available Bandwidth (Statistical)

One-Way delay (Path)

Packet Loss (Potentially Fatal)

Packet Delay Variation(Jitter) -Important for Real Time Applications

10Connect | Communicate | Collaborate

Performance Monitoring.

Offers:-

- Multi-Domain Monitoring Capability

- Supported by Measurement Archive (Trending)

- Capable of Measuring

• One-Way delay

• Packet Delay Variation (Jitter)

• Packet Loss

• Available Bandwidth using BWCTL

• +ping, traceroute etc.

11Connect | Communicate | Collaborate

PerfSonar at its best

12Connect | Communicate | Collaborate

Operations issues

Security

13Connect | Communicate | Collaborate

14Connect | Communicate | Collaborate

GÉANT Security Tickets 2013-2014

July

Augus

t

Septe

mbe

r

Octob

er

Novem

ber

Decem

ber

Janu

ary

Febru

ary

Non S

ecur

ity0

200

400

600

800

1,000

1,200

Number of tickets

Typical Month

15Connect | Communicate | Collaborate

GÉANT –Security Tickets

Type of attack Tickets created

D-Dos TCP SYN Flood 39

D-Dos UDP Flood 302

Dos TCP SYN Flood 18

Dos UDP Flood 602

Network Scan 5812

Port Scan 257

16

First Level Support in FED4FIRE

Developer(Subject Matter

Expert)

Sys Admin(Operational

Support)

Ticketing System / processes

First Level Support

Alarms

Dashboard

17

FED4FIRE Dashboard

Network Reachability

Overall Health

Internal Summary

Check

18

Fed4FIRE - FLS Experience

Incident (Dashboard) Incident (Reported) Maintenance Bug0

5

10

15

20

25

30

35

19Connect | Communicate | Collaborate

www.geant.net

www.twitter.com/GEANTnews | www.facebook.com/GEANTnetwork | www.youtube.com/GEANTtv

Connect | Communicate | Collaborate

Thank you!