gregory j. barlow 1,2 and choong k. oh 2 1 the robotics institute, carnegie mellon university

1

Robustness Analysis of Genetic Programming Controllers for

Unmanned Aerial Vehicles

Gregory J. Barlow1,2 and Choong K. Oh2

1 The Robotics Institute, Carnegie Mellon University2 The U.S. Naval Research Laboratory

2

Motivation

• Evolutionary robotics (ER) controllers may evolve in simulation or on real robots, but the true test of performance must happen in real-world conditions

• Testing unfit controllers may be dangerous or expensive for some robots

3

Transference

• For controllers evolved in simulation, evaluation in a noisy environment does not ensure good transference if simulated noise is not consistent with true noise

• If a controller performs well over a wide range of state and sensor noise conditions in simulation, prior work suggests that the controller should transfer well

4

Evolving controllers for unmanned aerial vehicles

• Unmanned aerial vehicles (UAVs) require assurance of off-design performance

• Even under noise not considered during evolution, controllers must still be able to efficiently accomplish the task

• Poorly performing controllers could cause crashes, possibly destroying the UAV

5

Overview

• Controller evolution (Barlow et al., 2004)• Goals• Robustness testing• Results• Conclusions

6

Controller evolution

Evolve unmanned aerial vehicle (UAV) navigation controllers able to:• Fly to a target radar based only on

sensor measurements• Circle closely around the radar• Maintain a stable and efficient flight

path throughout flight

7

Controller Requirements

• Autonomous flight controllers for UAV navigation

• Reactive control with no internal world model

• Able to handle multiple radar types including mobile radars and intermittently emitting radars

• Robust enough to transfer to real UAVs

8

Simulation

• To test the fitness of a controller, the UAV is simulated for 4 hours of flight time in a 100 by 100 square nmi area

• The initial starting positions of the UAV and the radar are randomly set for each simulation trial

9

Sensors

• UAVs can sense the angle of arrival (AoA) and amplitude of incoming radar signals

10

UAV Control

EvolvedController

AutopilotUAVFlight

Sensors

Roll angle

11

Radars

• Stationary, continuously emitting• Mobile, continuously emitting• Stationary, intermittently emitting with

regular period• Stationary, intermittently emitting with

irregular period• Mobile, intermittently emitting with

regular period

12

Transference

• To encourage good transference to real UAVs, during evolution:

• Modeled only the sidelobes of radars• Added noise to the modeled radar emissions• Set accuracy of the angle of arrival values to be

within ±10°• Evolved controllers were successfully

tested on wheeled mobile robots (Barlow et al., 2005)

13

Multi-objective GP

• We had four desired behaviors which often conflicted, so we used NSGA-II (Deb et al., 2002) with genetic programming to evolve controllers

• Each evaluation ran 30 simulations• Each of 50 evolutionary runs had a

population size of 500• We used environmental incremental

evolution to produce controllers evolved for a total of 1800 generations

15

Functions and Terminals

Functions• Prog2, Prog3, IfThen, IfThenElse, And, Or, Not,

<, <=, >, >=, < 0, > 0, =, +, -, *, /, X < 0, Y < 0, X > max, Y > max, Amplitude > 0, AmplitudeSlope > 0, AmplitudeSlope < 0, AoA > Arg, AoA < Arg

Terminals• HardLeft, HardRight, ShallowLeft, ShallowRight,

WingsLevel, NoChange, rand, 0, 1

16

Considerations

• We have many acceptable controllers on the Pareto front, but we need to choose one “best” controller

• Controllers may be optimized to the simulation parameters, may not be robust to other noise levels or sources

• Fitness values are only measured over 30 trials

17

Goals

• Choose a single “best” evolved controller for future flight tests

• Evaluate the robustness of the best evolved controllers to sensor and state noise to assure off-design performance

• Compare evolved controllers to human designed controllers

18

Test functions

1. Flying to the radar• Percent error in time to radar

2. Circling the radar• Average circling distance

3. Efficient flight• Percent error in flying with a roll angle of zero

degrees

4. Stable flight• Cost of sharp, sudden turns

19

Test functions

T

iii

T

ii

in

T

iout

outintotal

rollanglerollangleT

testT

test

T

T

testT

TTtest

DD

T

TTT

11}10 angle {roll4

1}rangein if{2

expect

expect1

level} wings{if}rangein if{

3expect

expect1

expect

11

distance11

111

45

seconds 3600hour 1

knots 80

seconds 14400

20

Baseline Values

Flying to the radar ≤ 0.2• Error in flight time to radar less than 20%

Circling the radar ≤ 2• Average distance less than 2 nmi

Efficient flight ≤ 0.5• ~50% of time (not in-range) with roll angle = 0

Turn Cost ≤ 0.05• Turn sharply less than 0.5% of the time

21

Performance metrics

• Failures• Percent of trials that don’t meet the baseline values

• Normalized maximum• Magnitude of failure normalized by the baseline value

• Normalized mean• Means for each test function normalized by the baseline

value and then averaged

• Average rank• Combination of first three performance metrics

22

Performance metrics

3

14

1

1 13

,2

1

),(3

1),(

)),((min)),((max

)),((min),(),(

),(11

1),(

),(maxmax),(

),(

kk

kGkG

kGkk

M

m m

R

r

N

nmm

m

mmNRM

tfnormtfmetric

tgmetrictgmetric

tgmetrictfmetrictfnorm

baseline

nrtestNR

baseline

Mtfmetric

baseline

baselinenrtesttfmetric

N

Ftfmetric

23

Selecting controllers for testing

1. GP produced 25,000 controllers2. Based on prior work, 1,602 had

acceptable mean fitness values3. We ran 100 simulations on each of the

five radar types for each of these 1,602 and chose ~300.

4. We cut these down to 10 using the normalized maximum performance metric

24

Designed controllers

• Hand-written• Based on functions and terminals available

to GP and knowledge of good GP strategies

• Proportional-derivative (PD)• Takes AoA as input (approximates

derivative)• PID performed poorly with mobile radars,

so integral term was not used

25

Robustness tests

• Robustness tests fell into five categories: AoA error, amplitude error, varied airspeed, heading error, and wind effects (position error)

• For every combination of radar type and controller, we performed 10,000 simulations, for a total of 50,000 simulations per controller per test

26

Robustness tests

• Angle of arrival error ±{10, 15, 20, 30}°• Amplitude error {6, 12} dB• UAV airspeed {50, 80, 100} knots• Heading error {0, 0.5, 1, 1.5, 2}°• Wind (position error) {0, 5, 10, 20, 30}

knots

27

Results

• For each test, we ranked the 12 controllers based on the four performance metrics

• We combined these results into an overall ranking to determine the best controller

• The best evolved controller fails gracefully and compares well to the PD controller

28

Rankings

Overall ranking best 2 3 4 5 6 7 8 9 10 11 12

failures G D E F J H A C B pd I hd

norm maximum pd D I F G hd J E B A H C

norm mean D E hd G F J H pd A C B I

average rank D G E pd F J H hd A B C I

Control case ranking best 2 3 4 5 6 7 8 9 10 11 12

failures pd A B C D E F G H I J hd

norm maximum pd G E J F I B A D C H hd

norm mean pd I J A B C G E D F H hd

average rank pd J G I E B A F C D H hd

29

Comparison

Stationary,

intermittently emitting

Mobile, intermittently

emitting

Average for all five radar types

Test type D pd D pd D pd

control case 1.18 0.07 13.87 0.03 7.13 0.04

AoA=15 6.55 100.0 16.74 100.0 9.88 100.0

AoA=20 20.10 100.0 24.81 100.0 16.15 100.0

AoA=25 35.39 100.0 34.90 100.0 23.49 100.0

AoA=30 77.15 100.0 85.25 100.0 90.24 100.0

Amp=12 1.41 0.04 13.21 0.07 7.24 0.04

Speed=50 0.56 100.0 16.53 100.0 6.63 100.0

Speed=100 2.63 92.23 14.83 93.00 8.96 92.69

Stationary,

intermittently emitting

Mobile, intermittently

emitting

Average forall five radar

types

Test type D pd D pd D pd

Head=0.5 1.70 0.19 14.38 0.18 7.51 0.17

Head=1.0 4.48 2.50 16.08 2.60 9.66 2.56

Head=1.5 9.29 54.30 20.31 55.03 13.32 55.06

Head=2.0 19.85 98.55 29.38 98.64 19.58 98.60

Wind=5 2.40 0.01 15.70 0.14 8.41 0.12

Wind=10 21.13 1.39 30.82 1.11 24.98 0.51

Wind=20 49.96 95.12 60.77 94.83 66.77 96.45

Wind=30 63.43 96.64 86.10 99.67 75.44 98.39

30

Conclusions

• Selected a single best controller for future flight tests

• Established the off-design performance of evolved UAV controllers; evolved controllers failed gracefully

• Performance of the best controller compares favorably with PD control

• Established the limits of performance for these evolved controllers

31

Acknowledgements

• Financial support was provided by Swampworks project office of the Office of Naval Research

• The U.S. Naval Research Laboratory (Code 5730) provided computation time on their Beowulf cluster

• Gregory J. Barlow is supported by a National Defense Science and Engineering Graduate Fellowship

gregory j. barlow 1,2 and choong k. oh 2 1 the robotics institute, carnegie mellon university

Documents