Short-Term Conflict Resolution for Unmanned Aircraft Traffic Management

Short-Term Conflict Resolution forUnmanned Aircraft Traffic Management

Hao Yi Ong and Mykel J. Kochenderfer

Stanford University

September 22, 2015

Outline

Introduction

Mathematical formulation

Approach

Numerical experiments

UTM-α: A distributed framework

Conclusion and future work

Introduction 2

Unmanned aircraft traffic management

I automating conflict avoidance is critical to integrating smallunmanned aerial systems (UAS) to civil airspace

I automation to augment controllers

Introduction 3

Conflict avoidance

given potential conflicts between drones, find the best set of advisories

I focus on horizontal or co-altitude conflict resolution

– NASA’s proposed airspace for UAS is under 150 m (500 ft)– flight altitudes need to avoid disruption and buildings

I conflict defined as loss of minimum separation between aircraft

Introduction 4

Complications

I multiagent problem

– system must coordinate between many aircraft– large search space for solution

I uncertainty in system

– imperfect sensor measurements of current state– variable pilot response, vehicle performance, etc. affect future path

I appropriate trade-off between safety and efficiency not obvious

Introduction 5

robust and efficient method

I tractable approach to complex stochastic problem

– account for uncertainty in large problem with reasonable computeresources

I balances between airspace safety and efficiency

– ensure safety and provide timely conflict alerts to aircraft

I arbitrary-scale optimization

– real-time conflict resolution for large airspace

Introduction 6

Previous approaches

I mathematical programming (MIP, SCP) [SVFH05, ASD12]

– can work well for simple vehicle networks

I distributed convex optimization [OG15]

– hard to incorporate stochastic objectives/constraints

I Markov decision process formulation (ACAS X) [KC11, CK12]

– assumes “white noise” accelerations for intruder aircraft

I and many more. . . [KY00]

Introduction 7

Overview

idea: decomposition + coordination

multi-agent pairwise joint policy

decompose pairwise solutions

+ coordination

ac1: left ac2: right ac3: straight

Introduction 8

Conflict Resolution as a UTM service

UTM server client side

resolution server

filtered sector status updates

advisories

status updates

Introduction 9

Conflict Resolution as a Standalone service

I derivative UTM client service

– subscribes to UTM server to track aircraft and deconflict routes– pub-sub system that operators subscribe to to receive advisories

I current implementation uses standalone model

– still unclear about end architecture for resolution service– clean approach to get started with prototype system– implemented with scalability and modularity in mind

Introduction 10

Outline

Introduction

Approach

Mathematical formulation 11

Markov decision process (MDP)

defined by the tuple (S,A, T,R)I S and A are the sets of all possible states and actions, respectively

I T (s, a, s′) gives the probability of transitioning into state s′ bytaking action a at the current state s

I R (s, a) gives the reward for taking action a at the current state s

environment T(s,a,s’ )!

action a!

state s!

reward R(s,a)!

Value iteration

agent π ⋆(s )"

action a"

state s"

I want to find the optimal policy π? (s)

– gives action that maximizes the utility Q? (s, a) from any given state

π? (s) = argmaxa∈A

Q? (s, a)

I value iteration updates value function guess Q until convergence

– expensive one-off compute but cheap policy extraction (Q lookup)

Q (s, a) := R (s, a) +∑s′∈S

T (s, a, s′)maxa′∈A

Q (s′, a′)

Multiagent MDP

extension of MDP to cooperative multiagent setting

I similar to case where centralized planner has access to system state

I also defined by the tuple (S,A, T,R), except

– S and A are all possible joint states and actions– T and R operate on elements of S and A

Short-term conflict resolution

I horizontal conflict resolution between n aircraft

– conflict defined as loss of minimum separation distance, 500 m– aircraft at risk if it could experience a conflict within two minutes

I alert aircraft that need corrective maneuvers

– but not to the extent that alerts become a nuisance

Resolution advisories

I at each time step T = 5 s, system issues a joint advisory

– joint advisory φ chosen out of a finite set of corrective bank anglesfor each aircraft

I action set A = {−20◦,−10◦, 0◦, 10◦, 20◦,COC}n

– positive and negative angles correspond to left and right banks– COC is a clear-of-conflict status advisory—nothing needs to be done

I higher resolution comes at the cost of computational complexity

Dynamics

I ith aircraft state si = (xi, yi, ψi, vi, pi)

– latitude and longitude xi and yi– heading ψi

– groundspeed vi– responsiveness indicator pi

I aircraft follow Dubin’s kinematic model

– simplicity avoids risk of overfitting complicated models

xi = vi cosψi yi = vi sinψi ψi =g tanφivi

bank: φi!

(xi , yi )! vi!

Pilot model

I advisory response determined stochastically by Bernoulli process

I 5 s average response delay to first corrective advisory asrecommended by ICAO [ICA07]

– when responding, the pilot executes the advisory for 5 s time step

non-responsive: white noise path

responsive: corrective action

COC 5 s average response lag

Reward function

balance competing objectives

I maximize safety

loss of separation

separation

closeness penalty

I minimize disruption

advisory COC ±0° ±10° ±20°

conflict penalty

Outline

Introduction

Approach

Approach 20

Curse of dimensionality

I in an MMDP, S and A are all possible joint states and actions

I problem: S and A scale exponentially with number of agents

3 drones: 1000 states

2 drones: 100 states

1 drone: 10 states

Approach 21

Decompose and coordinate

+ coordination

Approach 22

MMDP decomposition

decompose

pairwise solutions +

coordination =

Approach 23

Pairwise conflict resolution

decompose full MMDP into O(n2)

pairwise encounters

I action set, pilot response model, and reward function unchanged

I use relative positions and bearing to reduce state space

– arbitrarily set one aircraft as ownship, and the other as intruder– speeds remain in global reference frame

s =(xrel, yrel, ψrel, vownship, vintruder, pownship, pintruder

(0, 0 )!

v ownship

ψ rel

(x rel

, y rel

v intruder

Approach 24

Discretization

I in general, no analytical solution to problem with continuous statespace

I approximate value function Q over discretized state space

– discretize state space via multilinear interpolation– iteratively update Q until convergence

Q (s, a) := R (s, a) +∑s′∈S

T (s, a, s′)maxa′∈A

Q (s′, a′)

Approach 25

Approximating environment noise

sigma-point sampling

speed bank

weights

sigma-point sample

Approach 26

Implementation

I discretize continuous state space into 9.6 million states

– expensive but one-off computation

I Q converges in 7 hours (40 iterations over S ×A)

– solver: parallel implementation in Julia– machine: 20 2.3 GHz Intel Xeon cores with 125 GB RAM

Variable Minimum Maximum Number of values

xrel, yrel −3000 m 3000 m 51ψrel 0◦ 360◦ 37vownship, vintruder 10 m s−1 20 m s−1 5pownship, pintruder - - 2

Approach 27

Pairwise policy visualization

passing intrusion (240◦ bearing)

I−2,000−1,000 0 1,000 2,000

−2,000

−1,000

Ownship action

−2,000−1,000 0 1,000 2,000−2,000

−1,000

Intruder action

Approach 28

Example advisory execution

passing intrusion at (1000 m, 0 m)

ownship: 10° left bank intruder: 10° left bank

Approach 29

Accounting for pilot response

pilots responsive (top) and non-responsive (bottom)

−2,000−1,000 0 1,000 2,000−2,000

−1,000

)Ownship action

−2,000−1,000 0 1,000 2,000−2,000

−1,000

Intruder action

−2,000−1,000 0 1,000 2,000−2,000

−1,000

Ownship action

−2,000−1,000 0 1,000 2,000−2,000

−1,000

Intruder action

Approach 30

Accounting for pilot response

ownship responsive (top) and intruder responsive (bottom)

−2,000−1,000 0 1,000 2,000−2,000

−1,000

)Ownship action

−2,000−1,000 0 1,000 2,000−2,000

−1,000

Intruder action

−2,000−1,000 0 1,000 2,000−2,000

−1,000

Ownship action

−2,000−1,000 0 1,000 2,000−2,000

−1,000

Intruder action

Approach 31

Multithreat coordination

+ coordination

Approach 32

Utility fusion

idea: combine pairwise utilities Qij to form proxy for global utility Q

I Qij operates on the pairwise MDP state-action pair (sij , aij) foraircraft i and j

I fusion strategies π(s) = argmaxa∈A

Q(s, a)

– max-min: “worst-case”

Qmin(s, a) = mini<j

{Uij(sij , aij)}

– max-sum: “average”

Qsum(s, a) =∑i<j

Uij(sij , aij)

problem: action space A scales exponentially with number of aircraft

Approach 33

Search heuristic: Alternating maximization

I maximize global utility by varying one action and fixing the rest

I e.g., fix drone 1 and 2’s actions (R and L) and vary drone 3’s action

1.1 1.3 0.9

2.1 1.1 0.9

1.3 2.3 0.8

2.3 1.4 0.3

0.9 1.7 0.8

0.8 2.2 1.4

2.1 1.1 0.3

argmax

pair 1 pair 2

L R S L R S

aircraft 3 action: L

Q13(s, a)! Q23(s, a)!

Approach 34

Distributed variant I: Parallel search

greedy local maximization

broadcast action

repeat until convergence

Approach 35

Distributed variant II: Consensus search

parallel alt. maximization

broadcast joint action

greedy local maximization

incorporate msgs

Approach 36

Consensus search message

I parallel search could incur large communication cost

I messages reduce communication between compute nodes, but riskyielding unsynchronized joint action across aircraft

I incorporate messages by “averaging” what intruders think ownshipshould do to approximate consensus

broadcast joint action

incorporate msgs

R L? L

R R 0?

R? R L

Approach 37

Joint policy visualization

triple aircraft encounter max-min (top) and max-sum policies (bottom)

−2,000−1,000 0 1,000 2,000

−2,000

First aircraft action

−2,000−1,000 0 1,000 2,000

−2,000

Second aircraft action

−2,000−1,000 0 1,000 2,000

−2,000

Third aircraft action

−2,000−1,000 0 1,000 2,000

−2,000

First aircraft action

−2,000−1,000 0 1,000 2,000

−2,000

Second aircraft action

−2,000−1,000 0 1,000 2,000

−2,000

Third aircraft action

Approach 38

Outline

Introduction

Approach

Numerical experiments 39

Encounter model

I generate uniformly random starting aircraft states in annulus

– speeds uniformly random– headings initialized to point to annulus center– resample if loss of separation

Aircraft model

I ODE solver for dynamics

– continuous Dubin’s kinematic model– Gaussian noise in acceleration and banking

I corrective bank angles mapped to PID control policies

Baseline methods

I closest threat

– multithreat scenario decomposed into pairwise encounters– aircraft execute solution to encounter with closest intruder

I uncoordinated

– aircraft assumes all other aircraft are white-noise intruders– executes greedy local maximization to find own action

Experiment

I >1 million simulations from 2 to 10 aircraft

– code: Julia implementation– machine: 3.4 GHz Intel i7 processor with 32 GB RAM

I ∼10 ms decision times (vs. 5 s decision period)

– serial solve times of <10 ms for up to 10 aircraft sufficient forserver-based resolution system

– unoptimized code that doesn’t account for communication costs fordistributed variants

Safety performance

I max-min utility fusion

4 5 6 7 8 9 10

9×10−3

number of aircraft

uncoordcentralizedparallelconsensus

I max-sum utility fusion

4 5 6 7 8 9 10

×10−2

number of aircraft

uncoordcentralizedparallelconsensus

I distributed variants perform as well as serial version

I coordinated methods >10 times better than closest threat heuristic

I coordinated methods ∼10% better than uncoordinated method

Safety performance takeaways

I no near mid-air collisions (NMACs) in simulations

– NMAC threshold distance defined at 30 m (100 ft)– due to penalty on smaller separation distances even in conflict

I simulations validate all algorithmic variants

– distributed variants perform as well as serial version– coordinated methods significantly better than baselines

Trade-off: Safety vs. alert rate

I vary relative penalty between loss of separation and disruption

I best performance at “knee” of Pareto frontier

0 2 4 6

×10−2

conflict probability

tecentralizedparallelconsensus

Outline

Introduction

Approach

UTM-α: A distributed framework 47

Practical conflict avoidance

I recall: pub-sub system that UTM clients subscribe to for advisories

– standalone system that subscribes to UTM server for flight tracking

I goals: robustness, scalability, and modularity

– cluster computing framework for large-scale data processing– streaming analytics for real-time conflict resolution– built-in system fault-tolerance

System design

I why Kafka?

– designed as unified, high-throughput, low-latency platform forhandling real-time data feeds

– open source with active industry use (originally developed byLinkedIn)

I why Spark?

– easy, high-level API– ease of operations with minimal fuss over fault-tolerance– enormous, active community and industry deployments (CISCO,

Netflix, Intel,. . . )

Driver-worker model

I driver-worker model for conflict resolution

– driver loads up policy object that contains lookup table andalgorithm for conflict resolution

– workers receives policy object via broadcast and subsequentlydelegated tasks by driver

I formatted conflict data stored as an “infinite” stream of resilientdistributed datasets (RDDs)

– distributed, partitioned collection of conflict objects (JSON)– automatically rebuilt on failure

Pub-sub model

I ingestor publishes conflict objects to conflict Kafka topic

– subscribes to and ingests UTM server stream and crudely identifiespotential conflicts by proximity

– formats conflict data as JSON strings

I advisor publishes advisories to advisory Kafka topic

– producers are worker nodes in cluster generating advisories– consumers are UTM operator clients flying drones

Working model

I Spark driver launches workers/executors and broadcasts policy object

I workers receive conflict data in parallel from conflict Kafka topicas RDD stream

I workers publish advisories to advisory Kafka topic for UTMoperator clients

Outline

Introduction

Approach

Conclusion and future work 54

Summary

I three variants of coordination-based conflict resolution algorithm

– 1 centralized/serial and 2 distributed variants

I millisecond solve times for real-time application for multiple aircraft

I robust distributed system for practical conflict avoidance

Implementation

I policy generation on Julia scientific programming language

https://github.com/sisl/ConflictAvoidanceDASC

I distributed system with Apache Kafka and Spark Streaming

https://bitbucket.org/sisl/utm-alpha

Further research

I tracking aircraft via belief states

– use the UTM client server to track reported aircraft states– generate local set of belief states for all aircraft to better estimate

state uncertainty in real-time

I best action for drone in case of communication loss

– figure out the response state of the aircraft via belief state tracking– where to go in case of communication loss, what flight profile to

take, and what UTM should assume about the drones flight

I dealing with adversarial flights

Further software development

I standardize resolution advisory and advisory receipt formats withfocus on minimizing message size

I quantify delay times for sending packets between advisory server andclients in order to model it into the problem

References

I Federico Augugliaro, Angela P Schoellig, and Raffaello D’Andrea.Generation of collision-free trajectories for a quadrocopter fleet: Asequential convex programming approach.In IEEE/RSJ International Conference on Intelligent Robots andSystems (IROS), pages 1917–1922, October 2012.

I J. P. Chryssanthacopoulos and M. J. Kochenderfer.Decomposition methods for optimized collision avoidance withmultiple threats.Journal of Guidance, Control, and Dynamics, 35(2):398–405, 2012.

I International Civil Aviation Organization ICAO.Surveillance, radar and collision avoidance.International Standards and Recommended Practices, 4, 2007.

References

I M.J. Kochenderfer and J.P. Chryssanthacopoulos.Robust airborne collision avoidance through dynamic programming.Project Report ATC-371, MIT Lincoln Laboratory, 2011.

I J. K. Kuchar and L. C. Yang.A review of conflict detection and resolution modeling methods.IEEE Trans. Intell. Transp. Syst., 1(4):179–189, 2000.

I H. Y. Ong and J. C. Gerdes.Cooperative collision avoidance via proximal message passing.In American Control Conference (ACC), July 2015.

I Tom Schouwenaars, Mario Valenti, Eric Feron, and Jonathan How.Implementation and flight test results of MILP-based UAV guidance.In IEEE Aerospace Conference, pages 1–13. IEEE, 2005.

Impact of search timeout

conflict decreases with iteration limit but with diminishing returns

2 3 4 5 6

×10−3

iteration limit

Restriction messages for consensus search

2 4 6 8 100.5

×10−3

number of uavs

consensus (maxmin)

restriction (maxmin)

I instead of broadcasting joint action, aircraft broadcast restrictionmessages that indicate what other aircraft shouldn’t do

– e.g., reward straight path when it receives no left and no rightmessages at a decision period

Short-Term Conflict Resolution for Unmanned Aircraft Traffic Management

Engineering

Transcript of Short-Term Conflict Resolution for Unmanned Aircraft Traffic Management

Life Without Conflict (Gujarati)

Process for the Management of Air Traffic Controllers Licences, Εdition_2_ 21022013

Low Cost Traffic Building Strategies

r.unitn.itr.unitn.it/filesresearch/images/maths-opthysys/Workshop/slides/ga... · PDEs on networks Vehicular traffic (scalar and systems) Air traffic management Gas pipelines (systems)

Adaptive Routing of Video Traffic in ATM Networksconta.uom.gr/conta/publications/PDF/Adaptive routing of...allocation, service discipline, video traffic. I. Introduction ATM is the

C A P A C I T Y than you think - kanbanway.com€™ve-got-more... · than you think Charan Atreya ... Phantom traffic jam •Phantom traffic jam – explanation •Experiment . Phantom

Wireless Cache Invalidation Schemes with Link Adaptation and Downlink Traffic Presented by Ying Jin.

TGF- induces miR-182 to sustain NF-kB · Authorship note: Libing Song and Liping Liu contributed equally to this work. Conflict of interest: The authors have declared that no conflict

International perspectives: European Youth Forum for Road ... · 1st Flashmob for Traffic Safety dedicated to Pedestrians 10th May 2013, Syntagma Square Greece . Why Flashmob? Ideal

industrial buildings 1 - UPT industrial buildings Florea Dinu Lecture 9: ... arrangement of operation = traffic tracks ... Θ light cranes only (if any ...

Functional Hazard Assessment (FHA) Report for Unmanned Aircraft · PDF file · 2014-08-14ε Functional Hazard Assessment (FHA) Report for Unmanned Aircraft Systems P09005.10.5 Commercial

1 ENVIRONMENT AND DEVELOPMENT Conflict or Reconciliation ? PERSA FOKIALI Associate Professor University of the Aegean 2011.

Analysis of PM 10 using Scanning Electron Microscopy Comparison between a Traffic Site and a Forest Site S. Wagener 1, J.K. Gietl, O. Klemm University.

Network Traffic Analysis With Query Driven Visualization ...agl.cs.unm.edu/~chris2d/papers/HPC_Analytics_Final.pdf · Analysis and Continuous Queries (SMACQ) allows ad-hoc, relational

ΓΕΩΡΓΙΟΣ K. ΒΟΣΒΟΤΕΚΑΣ - epemy.gr · Σύγκρουση συμφερόντων “Conflict of interest” Κανένα για αυτήν την παρουσίαση

Λύσεις ασύρματης δικτύωσης · Market Trends Explosive growth in mobile computing. By 2020: Wi-Fi traffic will exceed wired traffic by 21%(1) 21 Billion Mobile

Conflict of Interest · Conflict of Interest • Grant Support: NIH, ... Effect of IL-4 priming of human MCs on ... primarily acts to block CysLT production "

ethnic conflict in a strategic area: the case of Cyprus

Physics 2D Lecture Slides Jan 10 - modphys.ucsd.edumodphys.ucsd.edu/2dw04/slides/f03/jan10.pdf · ``A paradox is not a conflict within reality. It is a conflict between reality and

κοινά ψηφιακά πολιτιστικά αγαθα 3 (unicode encoding conflict)