advances in artificial intelligence and the impact on test ... · • deep learning revolution...

19
www.hume.vt.edu T. Charles Clancy, PhD Executive Director , Hume Center for National Security and Technology Interim Executive Director , Commonwealth Cyber Initiative Bradley Professor of Cybersecurity , Electrical and Computer Engineering Advances in Artificial Intelligence and the Impact on Test and Evaluation (T&E)

Upload: others

Post on 05-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

www.hume.vt.edu

T. Charles Clancy, PhDExecutive Director, Hume Center for National Security and Technology

Interim Executive Director, Commonwealth Cyber Initiative

Bradley Professor of Cybersecurity, Electrical and Computer Engineering

Advances in Artificial Intelligence and the Impact on Test and Evaluation (T&E)

Page 2: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

[Incomplete] Artificial Intelligence Taxonomy

Apr 2019 | DATAWorks 2

Reasoning Learning

Programmed Supervised Reinforcement Unsupervised

Planning/Logic/Expert Systems

Optimization

Agent-Based Systems

Bayesian Networks

Regression

Neural Networks

Genetic Algorithms

Markov Decision Processes

Clustering

Principal Component Analysis

Deep Learning

AutoencodersConvolutional Neural Networks Recurrent Neural Networks

Deep Q Learning Generative Adversarial Nets

Page 3: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

• Deep Learning revolution fueled by competition – ImageNet• ImageNet crowd-sourced database has grown to include 14M images, hand-annotated with

20K categories• In 2012, AlexNet was a breakout winner and used an 8-Layer CNNs, with CNNs being feasible

for the first time due to GPU resources

ImageNet

Apr 2019 | DATAWorks 3

Page 4: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

ImageNet

Apr 2019 | DATAWorks 4

Page 5: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

ImageNet

Apr 2019 | DATAWorks 5

GoogLeNet, 2014 Winner – 22 Layers

ResNet (Microsoft), 2015 Winner – 152 Layers

Page 6: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Deep Reinforcement Learning

• Use DNN to learn optimal policy for engaging in a stateful environment

• Advances in game play• AlphaGo

• Fully observable, quantized state• Learn policy (6 weeks of GPU training)

• Video Games• 2015 – Atari Breakout (Google)• Today – Complex first person shooters

Apr 2019 | DATAWorks 6

Page 7: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Generative Adversarial Networks

• Related concepts have existed since 1990, but in limited form

• Ian Goodfellow published current GAN concept in 2014

• Field has taken off

Apr 2019 | DATAWorks 7

Edmond de BelamyGAN-generated painting

Sold at auction for $432,500

Page 8: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Deep Fakes

• Ability to leverage latent space to create specific images

• Tie together audio and video synchronously

Apr 2019 | DATAWorks 8

Page 9: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Tools

• Tools are widely available to develop, train, and execute networks in a plug-and-play way

• Google TensorFlow seamlessly integrates GPU capabilities

• Python integration makes it easy to compose networks

• Right now more “art” than “science” with how networks are constructed to achieve good performance

• No rigorous methodology for composing layers into networks

• “Hyperparameter optimization” is a fancy way of saying “brute force” network structure until it works

Apr 2019 | DATAWorks 9

Page 10: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Impact on T&E

Apr 2019 | DATAWorks 10

AI for T&E

T&E for AI

Page 11: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

AI for T&E

• Q: how can we use new AI algorithms to improve the T&E process?

Apr 2019 | DATAWorks 11

Page 12: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

AI for T&E (2)

Synthetic Data Generation

• Approach• Use small sample of input data to train a GAN• Generate large amounts of input data across

large latent space

• Applications• Primary applicable to testing individual

units/components• Identify characteristics of failure modes• (Leverage latent space representation)

State Space Exploration/Excitation

• Approach• Create an RL-based agent that seeks to interact

with system• Create sequences of interactions that drive

different state transitions within the system under test

• Applications• Subsystem and system-level testing• Suitable for T&E on stateful systems• Identify sequences that result in “bad” states

Apr 2019 | DATAWorks 12

Page 13: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

AI Test Automation

Apr 2019 | DATAWorks 13

Page 14: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

T&E for AI

• Many systems, military and commercial, are incorporating AI algorithms as part of their design and implementation

• AI for machine perception – DCNNs for sensing the environment, and detecting/identifying objects within it

• AI for system control – DRLs for understanding the environment and making optimal decisions within the estimated state

• Machine learning algorithms – challenge with repeatability and explainability• Under what circumstances does it fail?• How robust is it to noise?• How robust is it to failures (correlated noise)?• How robust is it to adversarial inputs?

Apr 2019 | DATAWorks 14

Page 15: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Examples

Apr 2019 | DATAWorks 15

Page 16: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Examples

Apr 2019 | DATAWorks 16

Page 17: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Examples

Apr 2019 | DATAWorks 17

Page 18: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Examples

Apr 2019 | DATAWorks 18

Page 19: Advances in Artificial Intelligence and the Impact on Test ... · • Deep Learning revolution fueled by competition – ImageNet • ImageNet crowd-sourced database has grown to

Opportunities

Systematic Testing

• Leverage AI to test AI• Generate large synthetic data sets• Use latent space to identify areas where AI

model fails

• Tests that characterize many models against range of criteria to identify those with most resilience

• Noise• Distortion

Formal Verification

• Black box testing from digital circuits• Neural network has similar structure to circuit• Systematic exploration of all paths• Identify undertrained areas in network that most

frequently lead to divergent output, and adjust model parameters to compensate

• Very preliminary research in this area

Apr 2019 | DATAWorks 19

Adversarial AINeed to develop a threat model for AI models – impact adversary can have depending on access

Training data, model parameters, runtime data, etc