approximating boolean functions with small-depth circuits (slides)

27
Approximating Boolean functions with depth-2 circuits Eric Blais (MIT) and Li-Yang Tan (Columbia) Simons Institute for Theory of Computing 28 August 2013, Berkeley

Upload: hanga

Post on 01-Jan-2017

234 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Approximating Boolean Functions With Small-Depth Circuits (slides)

Approximating Boolean functions with depth-2 circuits

Eric Blais (MIT) and Li-Yang Tan (Columbia)

Simons Institute for Theory of Computing28 August 2013, Berkeley

Page 2: Approximating Boolean Functions With Small-Depth Circuits (slides)

DNFs

π‘₯1 π‘₯17Β¬ π‘₯3 π‘₯5 π‘₯2

size

width

number of terms

max number of literals in any term

term

Page 3: Approximating Boolean Functions With Small-Depth Circuits (slides)

1

1

1

1

1

{0,1}𝑛

Size 𝑠, width 𝑀 DNF

≑Union of 𝑠 subcubes of

dimension β‰₯ 𝑛 βˆ’ 𝑀

π‘₯1 π‘₯17Β¬ π‘₯3 π‘₯5 π‘₯2

1

Page 4: Approximating Boolean Functions With Small-Depth Circuits (slides)

A simple exercise often used to introduce complexity theory:

Any DNF computing PARITY has size β‰₯ 2π‘›βˆ’1 and width β‰₯ 𝑛.

DNFs and PARITY

Every Boolean function: DNF size ≀ 2π‘›βˆ’1, width ≀ 𝑛.

⟹ PARITY = hardest function

But what about approximation?

DNF only has to be correct on 0.99-fraction of inputs {0,1}𝑛

PARITY10000

11111111000000000000

111111111111111100000000000000000000

1111111111111111111111110000000000000000000000000000

111111111111111111111111111111110000000000000000000000000000

11111111111111111111111100000000000000000000

1111111111111111000000000000

111111110000

1

Definition: 𝑓 is an πœ€-approximator for 𝑔 if Pr 𝑓 π‘₯ β‰  𝑔 π‘₯ ≀ πœ€

Page 5: Approximating Boolean Functions With Small-Depth Circuits (slides)

Tradeoffs between accuracy and efficiency in circuit complexity

Basic, seemingly simple, problems open even for DNFs!

1. Is approximating PARITY asymptotically easier than computing it exactly?

2. Is PARITY also the hardest function to approximate?

3. Universal bounds on approximability of every Boolean function?

Starting point of this research

Page 6: Approximating Boolean Functions With Small-Depth Circuits (slides)

Approximating PARITY with DNFs

Does 0.1-approximating PAR require DNF size Ξ©(2𝑛),

or can we 0.1-approximate PAR with size π‘œ(2𝑛)?

Approximation not much easier: Ξ©(2𝑛) vs.

Approximation a lot easier: ≀ 2𝑛/exp(𝑛)

Does 0.1-approximating PAR require DNF width 𝑛 βˆ’ 𝑂(1),

or can we 0.1-approximate PAR with width 𝑛 βˆ’ πœ”(1)?

Approximation not much easier: 𝑛 βˆ’ 𝑂(1) vs.

Approximation a lot easier: 𝑛 βˆ’ Ξ©(𝑛)

size

width

Theorem [Lupanov 61]:Any DNF computing PARITY has size β‰₯ 2π‘›βˆ’1 and width β‰₯ 𝑛.

Page 7: Approximating Boolean Functions With Small-Depth Circuits (slides)

Previous work: correlation bounds between PAR and AC0

Long and fruitful line of research.

Started in the 80’s [FSS 84, Ajtai 83, HΓ₯stad 86], remains active today.

[HΓ₯stad 12]: correlation of size-𝑠 DNF with PARITY 2βˆ’Ξ© 𝑛/log(𝑠) .

⟹ any DNF that agrees with PAR on 99% of inputs has size 2Ξ© 𝑛 .

A small AC0 circuit agrees with PAR on at most 1

2+ tiny fraction of inputs.

But still leaves open exponential gap of Ξ©(2𝑛) vs. ≀ 2𝑛/exp(𝑛).

Page 8: Approximating Boolean Functions With Small-Depth Circuits (slides)

our results in this work

Page 9: Approximating Boolean Functions With Small-Depth Circuits (slides)

Approximating PARITY with DNFs

Theorem [Blais-T.]:

PAR can be πœ€-approximated by a DNF of size 2 1βˆ’2πœ€ 𝑛 and width (1 βˆ’ 2πœ€)𝑛.

Exponential savings on size, linear savings on width.

(Almost) matching lower bounds:

Theorem [Blais-T.]:

Any DNF that πœ€-approximates PARITY has size 2 1βˆ’4πœ€ 𝑛 and width (1 βˆ’ 2πœ€)𝑛.

Theorem [Lupanov 61]:Any DNF computing PARITY has size β‰₯ 2π‘›βˆ’1 and width β‰₯ 𝑛.

Page 10: Approximating Boolean Functions With Small-Depth Circuits (slides)

00000000000000000000

111111111111111111111111

Parity can be 0.01-approximated by the union of Ξ©(𝑛) dimensional subcubes.

Each covers exponentially many points.

Incurs error 50% within each subcube

Yet overall error only 1%!

Solution: overlap heavily over 0-inputs, essentially disjoint over 1-inputs.

PARITY

Theorem [Blais-T.]:

PAR can be πœ€-approximated by a DNF of size 2 1βˆ’2πœ€ 𝑛 and width 1 βˆ’ 2πœ€ 𝑛.

10000

11111111000000000000

111111111111111100000000000000000000

000000000000000000000000000011111111111111111111111111111111

0000000000000000000000000000111111111111111111111111

1111111111111111000000000000

111111110000

1

Page 11: Approximating Boolean Functions With Small-Depth Circuits (slides)

Universal bounds on DNF size

PARITY = hardest function to compute exactly. Same true for approximation?

Theorem [Blais-T.]:Every function can be 0.1-approximated by a DNF of size ≀ 2𝑛/log(𝑛).

PARITY exponentially easier to approximate than almost all functions!

Theorem [Blais-T.]: No!Any DNF that 0.1-approximates a random function has size β‰₯ 2𝑛/𝑛.

Is there a function that requires size Ξ©(2𝑛) to approximate

or can we prove π‘œ(2𝑛) upper bound for all functions?

Theorem [Blais-T.]:

PAR can be πœ€-approximated by a DNF of size ≀ 2 1βˆ’2πœ€ 𝑛.

Page 12: Approximating Boolean Functions With Small-Depth Circuits (slides)

Theorem [Blais-T.]: Yes! Every function can be 0.1-approximated by a DNF of width ≀ 𝑛 βˆ’ Ξ©(𝑛).

Random function: every 1-monochromatic subcube has dimension ≀ log(𝑛).

[Blais-T.] All cubes can be made exponentially larger at the cost of small constant error.

Universal bounds on DNF width

1010110110000010101101010101011110100000111100000011101010101010101010101001

11101011110101010101010011010101101010100001011010101

10111011010101011000110001110010101011

1000111000110101001110

100111

Parity can be 0.1-approximated by union of Ξ©(𝑛)-dimensional subcubes.

Same true for any function?

11110000001110101010101010101001

11110000001110101010101010101010100111110000001110101010101010101001

11101011110101010101010011010101101010100001011010101

10111011010101011000110001110010101011

1000111000110101001110

100111

Page 13: Approximating Boolean Functions With Small-Depth Circuits (slides)

The rest of this talk

1. Universal upper bound on DNF size.

2. Universal upper bound on DNF width.

3. DNF approximator for PARITY.

4. Open problems.

* Unfortunately, will not have time for lower bounds.

Page 14: Approximating Boolean Functions With Small-Depth Circuits (slides)

Error on 0-inputs

Error on 1-inputs

DNF size

Theorem: Every function can be 0.1-approximated by a DNF of size ≀ 2𝑛/log(𝑛).

Goal: small family of subcubes, covers almost all 1-inputs, but almost none of 0-inputs. Seems tough!

1010110110000010101101010101011110100000111100000011101010101010101010101001

11101011110101010101010011010101101010100001011010101

10111011010101011000110001110010101011

1000111000110101001110

100111

11110000001110101010101010101001

11110000001110101010101010101010100111110000001110101010101010101001

11101011110101010101010011010101101010100001011010101

10111011010101011000110001110010101011

1000111000110101001110

100111

1. Randomly flip tiny fraction of 0’s to 1’s.

2. Include all β€œlarge” 1-monochromatic subcubes.

First try:

Page 15: Approximating Boolean Functions With Small-Depth Circuits (slides)

Theorem:Every function f can be 0.1-approximated by a DNF of size ≀ 2𝑛/log(𝑛).

1. Flip each 0-input to 1 with tiny probability.

2. Define special subcubes, every π‘₯ is contained in β€œmany” special subcubes.

3. Include each 1-monochromatic special subcube in approximator with small probability.

Any 1-input π‘₯ likely to be covered.

DNF approximator has small size.

Tiny fraction of 0’s covered.

1010110110000010101101010101011110100000111100000011101010101010101010101001

11101011110101010101010011010101101010100001011010101

10111011010101011000110001110010101011

1000111000110101001110

100111

11110000001110101010101010101001

11110000001110101010101010101010100111110000001110101010101010101001

11101011110101010101010011010101101010100001011010101

10111011010101011000110001110010101011

1000111000110101001110

100111

Page 16: Approximating Boolean Functions With Small-Depth Circuits (slides)

1. Flip each 0-input to 1 independently with probability πœ€/2.

w.p. 1-π‘œ(1) at most πœ€ fraction 0-inputs flipped.

Conditioned on this, error on 0-inputs ≀ πœ€.

Remains to consider error on 1-inputs and DNF size:

1

11

1

1

1

1

11

1

1

11

1

1

1

1

1

1

11

1

1

1

1

1

1

1

1

1

1

w.p. β‰₯ 3/4, error on 1-inputs ≀ πœ€.

w.p. β‰₯ 3/4, DNF size 𝑂(2𝑛/log(𝑛)).

Page 17: Approximating Boolean Functions With Small-Depth Circuits (slides)

2. Let 𝑑 ∼ log log(𝑛), partition 𝑛 into 𝑛/𝑑 blocks of size 𝑑.

β€œSpecial” subcube := 100011 ****** 110101 111101 000101 110101

d d

All *’s in exactly one block.

Every π‘₯ is contained in 𝑛/𝑑 special subcubes.

3. Each special subcube included with probability exactly πœ€2𝑑

.

111111

11111111111111111111111

1111111111111111111111111111

1111111111111

1

Pr π‘₯ π‘›π‘œπ‘‘ π‘π‘œπ‘£π‘’π‘Ÿπ‘’π‘‘ = 1 βˆ’ πœ€2𝑑 𝑛/𝑑

≀ πœ€/4.

E # 𝑠𝑒𝑏𝑐𝑒𝑏𝑒𝑠 𝑖𝑛𝑐𝑙𝑒𝑑𝑒𝑑 = πœ€2𝑑⋅𝑛

𝑑⋅ 2π‘›βˆ’π‘‘

∼ 2𝑛/log(𝑛).

Page 18: Approximating Boolean Functions With Small-Depth Circuits (slides)

Theorem: Every function can be 0.1-approximated by a DNF of size ≀ 2𝑛/log(𝑛).

Page 19: Approximating Boolean Functions With Small-Depth Circuits (slides)

The rest of this talk

Universal upper bound on DNF size.

2. Universal upper bound on DNF width.

3. DNF approximator for PARITY.

4. Open problems.

Page 20: Approximating Boolean Functions With Small-Depth Circuits (slides)

1. Fix 𝑑 = 0.001𝑛. Approximator will have width 𝑛 βˆ’ 𝑑 =𝑛 βˆ’ Ξ©(𝑛) .

2. Cover 99.9% of {0,1}𝑛 with ∼ 10 β‹… 2𝑛/π‘£π‘œπ‘™(𝑑) balls of radius 𝑑. Essentially a partition.

2. Construct width 𝑛-𝑑 = 𝑛-Ξ©(𝑛) DNF for each ball B satisfying:

99.9% correct within B, always 0 outside B.

3. Final approximator: OR of sub-approximators.

(OR of DNFs = DNF)

11101011110101010101010011010101101010100001011010101

100011100101010111000111000110

10100111010011

1010110110000010101101010101011110100000

11101011110101010101010011010101101010100001011010101

10111011010101011000110001110010101011

1000111000110101001110

10011

11110000001110101010101010101001

11110000001110101010101010101001111100000011101010101010101010101001

111100000011101010101010101010101001

1

101110110101010110001

1

10100101000101010101001

Theorem:Every function can be 0.1-approximated by a DNF of width ≀ 𝑛 βˆ’ Ξ©(𝑛).

B110110100101000101

11011010010101111011010011

10101100100

1

Page 21: Approximating Boolean Functions With Small-Depth Circuits (slides)

Small-width approximators for Hamming balls

*

010100101000101010101001

110110100101000101110110100101011

110110100111010110

01001

10100001000100000100001

99.99% of points lie on surface

Suffices to be 100% correct on surface

One width 𝑛-𝑑 term for each point

𝑑 = 0.001𝑛

0

Page 22: Approximating Boolean Functions With Small-Depth Circuits (slides)

Theorem:Every function can be 0.1-approximated by a DNF of width ≀ 𝑛 βˆ’ Ξ©(𝑛).

Page 23: Approximating Boolean Functions With Small-Depth Circuits (slides)

The rest of this talk

Universal upper bound on DNF size.

Universal upper bound on DNF width.

3. DNF approximator for PARITY.

4. Open problems.

Page 24: Approximating Boolean Functions With Small-Depth Circuits (slides)

π‘₯ = 1001101010100101000101010101001

𝑦 𝑧

PAR(π‘₯) = PAR(𝑦) βŠ• PAR(𝑧)

Consider 𝐹(π‘₯) = PAR(𝑦) ∨ PAR(𝑧):

PAR(π‘₯) = 1 ⟹ 𝐹(π‘₯) = 1

PAR(π‘₯) = 0 ⟹𝐹(π‘₯) = 0 half the time.

PAR(𝑦) and PAR(𝑧) have trivial DNFs of size 2 𝑛/2 βˆ’1 and width 𝑛/2.

⟹ 1/4 -approximate PAR with size 2𝑛/2 and width 𝑛/2.

Theorem:

PAR can be πœ€-approximated by a DNF of size 2 1βˆ’2πœ€ 𝑛 and width (1 βˆ’ 2πœ€)𝑛.

Pr[𝐹(π‘₯) = PAR(π‘₯)] = 3/4.

πœ€ = 1/4

Page 25: Approximating Boolean Functions With Small-Depth Circuits (slides)

The rest of this talk

Universal upper bound on DNF size.

Universal upper bound on DNF width.

DNF approximator for PARITY.

4. Open problems.

Page 26: Approximating Boolean Functions With Small-Depth Circuits (slides)

Open problems

Any DNF that 0.1-approximates a random function has size β‰₯ 2𝑛/𝑛.

1. Close this gap.

2. Explicit hard function showing β‰₯ 2𝑛/poly 𝑛 .

Every function can be 0.1-approximated by a DNF of size ≀ 2𝑛/log(𝑛).

Page 27: Approximating Boolean Functions With Small-Depth Circuits (slides)

thank you!