learning fusion planning decision · develop fundamental theory for learning, aggregation, and...

19
Hero VoI MURI: ARL 06/14 VOI Hero VoI MURI: ARL 06/14 VOI Value-centered Information Theory for Adaptive Learning, Inference, Tracking, and Exploitation W911NF-11-1-0391 Al Hero * (PI), Raj Nadakuditi * , Randy Moses + , Emre Ertin + , Doug Cochran @ , Michael Jordan ^ , Stefano Soatto & , Angela Yu % , John Fisher # , Jon How # , Alan Willsky # Sensing Planning Sensing Action Sensing Action Sensing Action Sensing Action Y1 Y2 Y3 Y4 Target state 0 Target state 4 Y0 Learning Fusion Decision Aim: Develop fundamental theory for learning, aggregation, and exploitation of information 1. Learning and representation in high dimension 2. Distributed information fusion 3. Adaptive sensing and planning Principal thrusts * University of Michigan + Ohio State University @ Arizona State University ^ UC Berkeley & UCLA % UCSD # MIT

Upload: others

Post on 25-Jul-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOI

Value-centered Information Theory for Adaptive Learning, Inference, Tracking, and Exploitation

W911NF-11-1-0391Al Hero* (PI), Raj Nadakuditi*, Randy Moses+, Emre Ertin+, Doug Cochran@,

Michael Jordan^, Stefano Soatto&, Angela Yu%, John Fisher#, Jon How#, Alan Willsky#

Sensing Planning

SensingAction

SensingAction

SensingAction

SensingAction

Y1 Y2 Y3 Y4

Target state 0 Target state 4

Y0

Learning Fusion

Decision

Aim: Develop fundamental theory for learning, aggregation, and exploitation of information

1. Learning and representation in high dimension2. Distributed information fusion3. Adaptive sensing and planning

Principal thrusts* University of Michigan+ Ohio State University@ Arizona State University^ UC Berkeley& UCLA% UCSD# MIT

Page 2: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Newstadt, Zelnio, H 2013

Two example problems

• Objective: detect and localize targets.

• Sensors: – EO: high spatial resolution– Radar: high Doppler resolution– IR: high caloric resolution– Analyst: high context resolution

• Action/resource constraints– Cannot use all sensors at any time– Cannot use any sensor for all time

• Decisions/Tasks– Detect targets– Track targets– Localize targets

• Objective: find emergent agents and communities in social media

• Sensors: – Firehose: 100% of public tweets – Gardenhose: 10% of tweets/sec– Spritzer: 1% of tweets/sec– Key on user streams or topic streams

• Action/resource constraints– Can stream users or topics but not

both at the same time

• Decisions/Tasks– Detect communities– Track communities– Identify central nodes

Automated Target Recognition Twitter social analytics

Xu and H, 2010

Page 3: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOI

Value of information (VoI) accrued from a snapshot depends on task and sensor

𝑀𝑀𝑀𝑀𝑀𝑀(𝑛𝑛)

𝑃𝑃𝑒𝑒(𝑛𝑛)

0 𝑛𝑛 𝑛𝑛 + 1

Sensor Bdeployed

Sensor Adeployed

Sensor Adeployed

VoI(B)(for detection)

VoI(B)(for estimation)

Page 4: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOIOur approach: fundamental design principles

• Develop theory of VoI in terms of fundamental limits depending on the task, the selected performance criterion, and the resource constraints.

• Apply VoI theory to develop better feature learning algorithms, information fusion methods, and sensor planning strategies.

Num

ber o

f var

iabl

es (p

)

Samples (n)

Direction ofimproving VoI

𝑃𝑃𝑒𝑒 𝐿𝐿𝐿𝐿𝐿𝐿 = 𝑒𝑒−𝑛𝑛𝑛𝑛𝑛𝑛 𝑆𝑆𝑆𝑆𝑆𝑆,𝑝𝑝 < 0.01

𝑀𝑀𝑀𝑀𝑀𝑀 �̂�𝜃 = 𝐹𝐹𝐹𝐹𝐹𝐹−1(𝑆𝑆𝑆𝑆𝑆𝑆,𝑝𝑝)𝑛𝑛

< 0.1𝑚𝑚2

Page 5: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOITowards a fundamental theory of VoI

• A theory is “a coherent group of general propositions used as principles of explanation for a class of phenomena” (Websters 2010).

• For us these propositions and principles should be able to1. provide design principles for tractable surrogate measures of VoI2. specify fundamental limits and approximations for resource-constrained

collection systems3. lead to new VoI-preserving information fusion strategies4. improve understanding of information exploitation in terms of mission value 5. predict value of including humans-in-the loop for sensing, processing and

decision making

• This MURI has made progress on developing such propositions and principles.

Page 6: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOI

VoI via Markov Decision Processes

• Multi-stage (three-step) plan-ahead scheduling tree

• Nodes represent data gathered at each stage• Optimal planning policy π will maximize Bellman’s value function

O0

O11

a0=1a0=0

O200

O3000 O3

001 O3010 O3

011 O3100 O3

101 O3110 O3

111

O201 O2

10 O211

O10

a1=0

a2=0 a2=0 a2=0 a2=0

a1=0 a1=1

a2=1a2=1

a1=1

a2=1 a2=1

Source: Blatt H 2006

Page 7: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOIWhy is this problem difficult?

• There is lots of relevant theory…1. Communication theory: Shannon theory, rate exponents, entropy inequalities 2. Signal representation theory: statistical modeling/representation,

detection/estimation, convex relaxation3. Control theory: Markov Decision Processes (MDPs), value-functions, bandit

approximations

• …and these are foundational building blocks that we are using… • …but, there are gaps that have to be filled

– Existing theories are inadequate when there are algorithmic complexity constraints– Existing theories are inadequate when there are human factors – Shannon theory was developed for communications and almost all propositions hold

only for infinite block length (not real-time)– MDPs do not scale well to practical problem sizes

• We have made progress on filling these gaps

Page 8: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOIHighlights of cumulative progress

• Fundamental limits: fundamental tradeoffs and phase transitions give insight into relative merits of different models/constraints and establish benchmarks for algorithm performance

– Sample complexity vs computational complexity in function estimation (Jordan 13)

– Streaming Bayes inference algorithm has minimal loss in performance(Jordan 14)

– Sample complexity vs model complexity in covariance estimation (Hero 12, 13, 14)

– Sample complexity vs signal-to-noise ratio in signal subspace processing (PCA)

(Nadakuditi 13)– Sampling complexity vs geometry in active vision “actionable information”

(Soatto 12, Fisher 13)

Page 9: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOI

Fundamental limits: bounds + phase transitions

p=#s

enso

rs

n=#snapshotsU

nach

ieva

ble

regi

on

MSE phase transition regions for spatio-temporal models (Hero 13)n=#snapshots

t=ru

ntim

e

Tradeoff between runtime and sample complexity (Jordan 13)

0 10 20 30 40 50 60 70 80 90 100

20 uncorrelated sequences

Time index i

snap

shot

sens

or n

et

K-PCA

Page 10: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOI

Learning and RepresentationSpatio-temporal Kronecker PCA

• Problem: Standard dimensionality reduction (PCA, ISOMAP, etc) does not account for spatio-temporal patterning of covariance matrix. A new type of PCA needs to be developed for denoising space-time covariance

Ref: Tsiligkaridis and Hero, IEEE Trans on SP, to appear 2014

• Universal model: Any spatio-temporal covariance has Kronecker-sum form (for some r)

Results:• Fast KPCA algorithm developed for model • KPCA has better phase transitions (similar

to Kglasso) than ordinary PCA • KPCA achieves much better energy

compression than PCA

91% of energy 48% of energyIn 1st component in 1st component

KPCA Spectrum PCA Spectrum

Page 11: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Learning and representationStreaming Variational Bayes

Ref: T. Broderick, N. Boyd, A. Wibisono, A. C. Wilson, and M. I. Jordan. NIPS 2013

Problem: We want the modularity, flexibility, and coherent treatment of uncertainty of Bayesian learning, but Bayesian methods traditionally haven’t been formulated in the streaming or Big Data settings.

Results: We develop SDA-Bayes, a framework for (S)treaming, (D)istributed, (A)synchronouscomputation of a Bayesian posterior.

We demonstrate the usefulness of our framework by modeling topics in two large-scale document collections (3.6 M Wikipedia documents and 350 K Nature documents). We show that our algorithm performs comparably to a state-of-the-art batch algorithm but has the added advantage of being streaming and distributed.

(a) Wikipedia (b) Nature

Page 12: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOIHighlights of cumulative progress

• Information-driven fusion: theory of distributed information collection and fusion using VoI proxies have led to better algorithms

– VoI-preserving distributed learning of graphical models without message passing

(Hero 13)– VoI-aware censoring for tracking in sensor nets

(How 12, Moses 12)– Decentralized human and human-machine collaborative decision-making

(Hero 14, Yu 13)

Page 13: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOI

Distributed information fusionOptShrink: Data-driven low rank denoising & VoI estimation

Ref. R. Nadakuditi, “OptShrink: An algorithm for improved low-rank signal matrix denoising by optimal data-driven singular value shrinkage.” Accepted for publication in Trans. on Information Theory, 2014. http://arxiv.org/abs/1306.6042

Problem: given a low-rank signal buried in noise –optimally estimate low-rank component and return a VoI-type metric of the mean-squared estimation error. Such signal-plus-noise matrices arise in modeling sensor array measurements, hyperspectral scenes

Model: signal matrix is low-rank; noise matrix has isotropically random left and right singular vectors but not necessarily Gaussian. Open Solved problem: Optimal data-driven low-rank signal matrix denoising and data-driven eigen-VoImetric of denoising qualityProblem solution: Shrink singular values using “D-transform” (from non-commutative information theory); optimal to shrink larger empirical singular values less – suboptimality of convex penalty functions (including nuclear norm).

Page 14: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOI

Distributed information fusionin sensor networks

Meng,Wei,Wiesel,H, AISTATS 2013(NP Award)

||𝐂𝐂 − 𝐂𝐂𝑟𝑟 ||𝐹𝐹2

≤ 𝐦𝐦𝐦𝐦𝐦𝐦𝒓𝒓𝒓𝒓𝒓𝒓𝒓𝒓 𝐑𝐑 ≤𝒓𝒓

||𝐑𝐑 − 𝑹𝑹 𝐂𝐂 ||𝑭𝑭𝟐𝟐 + 𝑪𝑪𝒓𝒓 𝒑𝒑𝟐𝟐 + 𝒒𝒒𝟐𝟐

𝒓𝒓

Theorem [1]: For pq x pq sample cov C the MSE of Kronecker PCA of rank r is bounded

Objective: predict states measured by SNModel: Gauss-Markov random field:

Standard: Iterative ML by message passingProposed: Non-iterative by 2-NN relaxation

Page 15: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOIHighlights of cumulative progress

• Information exploitation: theoretical bounds and approximations to VoI allow more accurate performance prediction for sensor planning

– Mission-adaptive planning(Hero\&How 13)

– Submodularity for MI and other VoI proxies in multistage planning (Fisher 12)

– Information geometric planning and navigation (Cochran 12, 13, Cochran&Hero 13)

– VoI for multistage cooperative foraging (Yu 12)

Page 16: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOIHighlights of cumulative progress

• VoI function approximation: Tractable proxies and surrogate functions can capture qualitative properties of VoI that are useful for system design and analysis

– Weighted or constrained entropy and MI surrogates (Soatto 12, Fisher 12, Ertin 13, Cochran 12)

– New convex and concave-convex proxies for VoI(Hero 12-13, Ertin 12, Cochran 13)

– Mission-adaptive convex VoI surrogates (Hero\&How 13)

– Proxies for reward in adversarial tracking and pursuit games (Ertin 12)

Page 17: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOIHighlights of cumulative progress

• Humans-in-the-loop: theoretically optimal strategies for involving humans-in-the-loop for sensing, processing and decision-making

– Collaborative 20 questions models for human-aided search (Hero 12, 13)

– Identifiability of human preference ranking from partial observations (preference trees)

(Jordan 13)– Multiarmed bandit models for human interaction in foraging

(Yu 13)

Page 18: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

Hero VoI MURI: ARL 06/14

VOISome things we have learned

• There are important fundamental VoI tradeoffs that can be exploited – Model vs sample complexity for fusion (Hero 12, Nadakuditi 13)– Computation vs sample complexity for fusion (Jordan 12, Fisher 13)– Energy vs geometry in active vision (Soatto 12)

• Computational bottlenecks can be overcome by using good proxies– Information theoretic surrogates (Soatto 12, Fisher 12, Cochran 12)– Convex and concave-convex proxies (Hero 12-13, Ertin 12, Cochran 13)– Submodular myopic strategies (Fisher 12)

• Distributed processing framework can benefit from VoI perspective– Second order marginal MLE for SN (Hero 13)– VoI-aware censoring for tracking in sensor nets (How 12, Moses 12)– Cooperative human-machine interaction (Yu 13, Hero 13)

Page 19: Learning Fusion Planning Decision · Develop fundamental theory for learning, aggregation, and exploitation of information. 1. Learning and representation in high dimension 2. Distributed

Hero VoI MURI: ARL 06/14

VOI

• Progress on software radar testbed, Emre Ertin

• VoI for humans, Angela Yu