Learning and Testing Submodular Functions



Learning and Testing Submodular Functions

Grigory Yaroslavtsev

Columbia University
October 26, 2012

With Sofya Raskhodnikova (SODA’13)

+ Work in progress with Rocco Servedio

Submodularity
• Discrete analog of convexity/concavity, law of diminishing returns
• Applications: optimization, algorithmic game theory

Let $f \colon 2^X \rightarrow \mathbb{R}$.
• Discrete derivative: $\partial_x f(S) = f(S \cup \{x\}) - f(S)$
• Submodular function: $\partial_x f(S) \ge \partial_x f(T)$ for all $S \subseteq T \subseteq X$ and $x \in X \setminus T$
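The definitions above can be checked by brute force on a small ground set; a minimal sketch (the `cover` and `thresh` example functions are hypothetical, chosen for illustration):

```python
from itertools import combinations

def subsets(ground):
    """All subsets of the ground set, as frozensets."""
    for r in range(len(ground) + 1):
        for c in combinations(sorted(ground), r):
            yield frozenset(c)

def derivative(f, x, S):
    """Discrete derivative: f(S + x) - f(S)."""
    return f(S | {x}) - f(S)

def is_submodular(f, ground):
    """Check that the derivative only shrinks on larger sets:
    d_x f(S) >= d_x f(T) for all S <= T and x outside T."""
    return all(derivative(f, x, S) >= derivative(f, x, T)
               for T in subsets(ground)
               for S in subsets(T)
               for x in ground - T)

# Coverage function f(S) = |union of the sets indexed by S|: submodular.
sets = {1: {"a", "b"}, 2: {"b", "c"}, 3: {"c", "d"}}
cover = lambda S: len(set().union(*[sets[i] for i in S]))
# Threshold function [|S| >= 2]: not submodular (derivatives grow).
thresh = lambda S: 1 if len(S) >= 2 else 0

ground = frozenset(sets)
print(is_submodular(cover, ground))   # True
print(is_submodular(thresh, ground))  # False
```

The exhaustive check is exponential in |X|, so it only serves as a sanity test on tiny examples.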

Exact learning
• Q: Reconstruct a submodular $f$ with poly(|X|) queries (for all arguments)?
• A: Only approximation (multiplicative) possible [Goemans, Harvey, Iwata, Mirrokni, SODA’09]
• Q: On a $1-\epsilon$ fraction of points (PAC-style learning with membership queries under the uniform distribution)?
• A: Almost as hard [Balcan, Harvey, STOC’11].

$$\Pr_{\text{randomness of } \boldsymbol{A}}\Big[\Pr_{\boldsymbol{S}\sim U(2^X)}\big[\boldsymbol{A}(\boldsymbol{S})=\boldsymbol{f}(\boldsymbol{S})\big]\ge 1-\epsilon\Big]\ge \frac{1}{2}$$

Approximate learning
• PMAC-learning (Multiplicative), with poly(|X|) queries (over arbitrary distributions [BH’11])
• PAAC-learning (Additive)
– Running time: … [Gupta, Hardt, Roth, Ullman, STOC’11]
– Running time: poly(…) [Cheraghchi, Klivans, Kothari, Lee, SODA’12]

$$\Pr_{\text{randomness of } \boldsymbol{A}}\Big[\Pr_{\boldsymbol{S}\sim U(2^X)}\big[\,|\boldsymbol{f}(\boldsymbol{S})-\boldsymbol{A}(\boldsymbol{S})|\le \boldsymbol{\beta}\,\big]\ge 1-\epsilon\Big]\ge \frac{1}{2}$$

Learning: summary of results

• Goemans, Harvey, Iwata, Mirrokni: approximation everywhere; time Poly(|X|)
• Balcan, Harvey: PMAC (Multiplicative); works under arbitrary distributions
• Gupta, Hardt, Roth, Ullman: PAAC (Additive); tolerant queries
• Cheraghchi, Klivans, Kothari, Lee: PAAC (Additive); SQ-queries, agnostic
• Our result with Sofya: PAC (bounded integral range R); time Poly(|X|); agnostic
• For all algorithms: …

Learning: Bigger picture

Subadditive ⊇ XOS (= Fractionally subadditive) ⊇ Submodular ⊇ Gross substitutes ⊇ OXS ⊇ Additive (linear)

[Badanidiyuru, Dobzinski, Fu, Kleinberg, Nisan, Roughgarden, SODA’12]

Query models: value queries, demand queries

Other positive results:
• Learning valuation functions [Balcan, Constantin, Iwata, Wang, COLT’12]
• PMAC-learning (sketching) valuation functions [BDFKNR’12]
• PMAC learning Lipschitz submodular functions [BH’10] (concentration around average via Talagrand)

Discrete convexity
• Monotone convex
• Convex

[Plots: integer-valued functions on {1, 2, 3, …, n} with range at most R; the monotone example changes value only for i ≤ R, the non-monotone example only for i ≤ R or i ≥ n − R.]

Discrete submodularity
• Monotone submodular: determined by its values on $|S| \le R$ [diagram: Boolean lattice between $\emptyset$ and $X$]
• Submodular: determined by its values on $|S| \le R$ and $|S| \ge |X| - R$ [diagram: Boolean lattice between $\emptyset$ and $X$]
• Case study: R = 1 (Boolean submodular functions): Monotone submodular = (monomial); Submodular = (2-term CNF)

Discrete monotone submodularity
• Monotone submodular: $f(S_1 \cup S_2) \ge \max(f(S_1), f(S_2))$ [diagram: region $|S| \le R$]

Discrete monotone submodularity
• Theorem: for monotone submodular $f$ with range $\{0, \dots, R\}$: $f(S) = \max_{T \subseteq S,\, |T| \le R} f(T)$
• The direction $f(S) \ge \max_{T \subseteq S,\, |T| \le R} f(T)$ is immediate (by monotonicity)

Discrete monotone submodularity

• S smallest subset of such that • we have => Restriction of on is monotone increasing =>

|𝑺|≤𝑹

𝑇

𝑆 ′

𝑆 ′
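The theorem above can be verified exhaustively on a small example; a sketch, using a hypothetical coverage function (monotone submodular, integral range bounded by the universe size):

```python
from itertools import combinations

def subsets_of(S, max_size=None):
    """Subsets of S, optionally only those of size <= max_size."""
    S = sorted(S)
    top = len(S) if max_size is None else min(max_size, len(S))
    for r in range(top + 1):
        for c in combinations(S, r):
            yield frozenset(c)

# Coverage function over a universe of size 3: monotone submodular
# with integral range {0, 1, 2, 3}, so R = 3.
sets = {1: {"a"}, 2: {"a", "b"}, 3: {"b", "c"}, 4: {"c"}}
cover = lambda S: len(set().union(*[sets[i] for i in S]))
ground = frozenset(sets)
R = 3

# The theorem: f(S) equals the max of f over subsets T of S with |T| <= R.
ok = all(cover(S) == max(cover(T) for T in subsets_of(S, R))
         for S in subsets_of(ground))
print(ok)  # True
```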

Representation by a formula
• Theorem: for monotone submodular $f$: $f(S) = \max_{T \subseteq S,\, |T| \le R} f(T)$
• Notation switch: sets $S \subseteq X$ become Boolean vectors $x \in \{0,1\}^n$; $f \colon \{0,1\}^n \rightarrow \{0, \dots, R\}$
• (Monotone) = no negations
• (Monotone) submodular $f$ can be represented as a (monotone) pseudo-Boolean 2R-DNF with constants in $\{0, 1, \dots, R\}$
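One natural reading of a pseudo-Boolean DNF, used here as an assumption, is the maximum over terms of a constant gated by a monomial; a minimal evaluator (the example formula is hypothetical):

```python
# A pseudo-Boolean DNF: list of (constant, variables) terms.
# f(x) = max over terms of (constant if all the term's variables are 1, else 0).
def eval_pb_dnf(terms, x):
    value = 0
    for c, vars_ in terms:
        if all(x[i] for i in vars_):
            value = max(value, c)
    return value

# Width-2 example with constants in {0, 1, 2}: f = max(2*x0*x1, 1*x2).
terms = [(2, [0, 1]), (1, [2])]

print(eval_pb_dnf(terms, [1, 1, 0]))  # 2
print(eval_pb_dnf(terms, [0, 1, 1]))  # 1
print(eval_pb_dnf(terms, [0, 0, 0]))  # 0
```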

Discrete submodularity
• Submodular $f$ can be represented as a pseudo-Boolean 2R-DNF with constants in $\{0, 1, \dots, R\}$
• Hint [Lovasz] (Submodular monotonization): Given submodular $f$, define $f_{mon}(S) = \min_{T \supseteq S} f(T)$. Then $f_{mon}$ is monotone and submodular.
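The monotonization hint can be checked by brute force on a small example; a sketch, with a hypothetical non-monotone submodular function (coverage minus a linear cost):

```python
from itertools import combinations

def subsets(ground):
    """All subsets of the ground set, as frozensets."""
    for r in range(len(ground) + 1):
        for c in combinations(sorted(ground), r):
            yield frozenset(c)

# Submodular but non-monotone: coverage minus a cost of 1 per element.
sets = {1: {"a"}, 2: {"a", "b"}, 3: {"b"}}
ground = frozenset(sets)
f = lambda S: 2 * len(set().union(*[sets[i] for i in S])) - len(S)

# Lovasz monotonization: f_mon(S) = min over supersets T of S of f(T).
f_mon = lambda S: min(f(S | E) for E in subsets(ground - S))

# f_mon should be monotone ...
monotone = all(f_mon(S) <= f_mon(T)
               for T in subsets(ground) for S in subsets(T))
# ... and still submodular (via f(A) + f(B) >= f(A|B) + f(A&B)).
submodular = all(f_mon(A) + f_mon(B) >= f_mon(A | B) + f_mon(A & B)
                 for A in subsets(ground) for B in subsets(ground))
print(monotone, submodular)  # True True
```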

Learning pB-formulas and k-DNF
• Class of pB-DNF of width $k$ with constants in $\{0, \dots, R\}$
• i-slice of $f$ defined as $f_i(x) = 1$ iff $f(x) \ge i$
• $f$ is a pB $k$-DNF iff its i-slices are $k$-DNF and $f(x) = \max\{i : f_i(x) = 1\}$ (0 if no slice fires)
• PAC-learning:

$$\Pr_{\text{rand}(\boldsymbol{A})}\Big[\Pr_{\boldsymbol{S}\sim U(\{0,1\}^n)}\big[\boldsymbol{A}(\boldsymbol{S})=\boldsymbol{f}(\boldsymbol{S})\big]\ge 1-\epsilon\Big]\ge \frac{1}{2}$$

Learning pB-formulas and k-DNF
• Learn every i-slice on a $1-\epsilon$ fraction of arguments
• Learning $k$-DNF (let Fourier sparsity = $s$):
– Kushilevitz-Mansour (Goldreich-Levin): … queries/time
– “Attribute-efficient learning”: … queries
– Lower bound: … queries to learn a random $k$-junta ($k$-DNF) up to constant precision
• Optimizations:
– Slightly better than KM/GL by looking at the Fourier spectrum of $k$-DNF (see SODA paper: switching lemma => bound)
– Do all R iterations of KM/GL in parallel by reusing queries
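The slicing step above can be illustrated directly: each i-slice is the Boolean indicator of $f(x) \ge i$, and $f$ is recovered as the largest $i$ whose slice fires. A sketch (the example function is hypothetical):

```python
from itertools import product

R = 3

# A hypothetical pseudo-Boolean function with range {0, ..., R}.
f = lambda x: min(R, 2 * (x[0] & x[1]) + x[2])

# i-slice: the Boolean function f_i(x) = 1 iff f(x) >= i.
def slice_i(f, i):
    return lambda x: 1 if f(x) >= i else 0

slices = [slice_i(f, i) for i in range(1, R + 1)]

# Reconstruct f from its slices: f(x) = max {i : f_i(x) = 1}, or 0 if none.
def reconstruct(x):
    return max([i for i in range(1, R + 1) if slices[i - 1](x)], default=0)

ok = all(reconstruct(x) == f(x) for x in product([0, 1], repeat=3))
print(ok)  # True
```

Learning each of the R slices separately, then taking the max, is exactly why a per-slice error budget of $\epsilon / R$ suffices for the combined hypothesis.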

Property testing
• Let $C$ be the class of submodular functions
• How to (approximately) test whether a given $f$ is in $C$?
• Property tester: (Randomized) algorithm for distinguishing:
1. $f \in C$
2. $f$ is $\epsilon$-far from $C$ (an $\epsilon$ fraction of values must change to put $f$ in $C$)
• Key idea: $k$-DNFs have small representations:
– [Gopalan, Meka, Reingold CCC’12] (using quasi-sunflowers [Rossman’10]): for every $k$-DNF formula F there exists a $k$-DNF formula F’ of size … such that …
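For contrast with the structural approach above, the most naive tester just samples potential violations of the derivative inequality; a sketch (this is not the paper's algorithm, and the `cover`/`thresh` examples are hypothetical):

```python
import random

def naive_submodularity_tester(f, n, queries=1000, seed=0):
    """Sample random S <= T and x outside T; reject on any violated
    derivative inequality d_x f(S) >= d_x f(T).  One-sided: always
    accepts submodular functions, rejects only with a certificate."""
    rng = random.Random(seed)
    for _ in range(queries):
        T = {i for i in range(n) if rng.random() < 0.5}
        S = {i for i in T if rng.random() < 0.5}
        outside = [i for i in range(n) if i not in T]
        if not outside:
            continue
        x = rng.choice(outside)
        d_small = f(S | {x}) - f(S)
        d_large = f(T | {x}) - f(T)
        if d_small < d_large:
            return False  # certificate of non-submodularity
    return True

n = 6
# Coverage function (submodular): element i covers point i // 2.
cover = lambda S: len({i // 2 for i in S})
# Threshold function [|S| >= 4]: far from submodular on n = 6.
thresh = lambda S: 1 if len(S) >= 4 else 0

print(naive_submodularity_tester(cover, n))   # True
print(naive_submodularity_tester(thresh, n))  # False
```

Such violation sampling gives no guarantee for functions that are far from submodular but have sparse violations, which is why the implicit-learning approach on the next slides is needed.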

Testing by implicit learning
• Good approximation by juntas => efficient property testing [surveys: Ron; Servedio]
– …-approximation by a …-junta
– Good dependence on …
• [Blais, Onak]: sunflowers for submodular functions
– Query complexity: … (independent of n)
– Running time: exponential in … (we think it can be reduced to O(…))
– We have a lower bound for testing $k$-DNF (reduction from Gap Set Intersection: distinguishing a random $k$-junta vs. a $(k + O(\log \cdot))$-junta requires … queries)

Previous work on testing submodularity
[Parnas, Ron, Rubinfeld ’03; Seshadhri, Vondrak, ICS’11]:
• Upper bound: …
• Lower bound: …
• Special case: coverage functions [Chakrabarty, Huang, ICALP’12]
• Gap in query complexity

Thanks!
