a dichotomy in the complexity of counting database repairs · complexity classes the complexity...

42
A dichotomy in the Complexity of Counting Database Repairs Dany Maslowski Université de Mons, Belgium October 17th, 2014

Upload: others

Post on 21-Feb-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

A dichotomy in the Complexity of CountingDatabase Repairs

Dany Maslowski

Université de Mons, Belgium

October 17th, 2014

Page 2: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Outline

1 Complexity Classes and the Complexity Dichotomy

2 Uncertain Database Model

3 The problem ]CERTAINTY(q)

4 Probabilistic Database Model

5 Uncertain Databases vs Probabilistic Databases

6 Conclusion

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 2 / 13

Page 3: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Complexity classes

The complexity class ]PThe class ]P contains all function problems which consist of countingthe number of accepting computation paths of a non-deterministicpolynomial-time Turing machine.

The complexity class FPThe complexity class FP contains all counting problems in ]P whichcan be solved in deterministic polynomial time.

SAT is in NP : Are there any variable assignments that satisfy a givenBoolean formula ?

]SAT is in ]P : How many variable assignments satisfy a givenBoolean formula ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 3 / 13

Page 4: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Complexity classes

The complexity class ]PThe class ]P contains all function problems which consist of countingthe number of accepting computation paths of a non-deterministicpolynomial-time Turing machine.

The complexity class FPThe complexity class FP contains all counting problems in ]P whichcan be solved in deterministic polynomial time.

SAT is in NP : Are there any variable assignments that satisfy a givenBoolean formula ?

]SAT is in ]P : How many variable assignments satisfy a givenBoolean formula ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 3 / 13

Page 5: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Complexity classes

The complexity class ]PThe class ]P contains all function problems which consist of countingthe number of accepting computation paths of a non-deterministicpolynomial-time Turing machine.

The complexity class FPThe complexity class FP contains all counting problems in ]P whichcan be solved in deterministic polynomial time.

SAT is in NP : Are there any variable assignments that satisfy a givenBoolean formula ?

]SAT is in ]P : How many variable assignments satisfy a givenBoolean formula ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 3 / 13

Page 6: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

The Complexity Dichotomy

Effective FP-]P dichotomyA class C of counting problems exhibits an effective FP-]P-dichotomyif all problems in C are either in FP or ]P-hard under polynomial-timeTuring reduction and it is decidable whether a given problem in C is inFP or ]P-hard.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 4 / 13

Page 7: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Uncertain database

DefinitionA database in which primary keys need not be satisfied.

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

3×2 repairs

Repair (or possible world)A maximal subset of tuples that satisfy primary keys.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 5 / 13

Page 8: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Uncertain database

DefinitionA database in which primary keys need not be satisfied.

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

3×2 repairs

Repair (or possible world)A maximal subset of tuples that satisfy primary keys.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 5 / 13

Page 9: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Uncertain database

DefinitionA database in which primary keys need not be satisfied.

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

3×2 repairs

Repair (or possible world)A maximal subset of tuples that satisfy primary keys.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 5 / 13

Page 10: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Uncertain database

DefinitionA database in which primary keys need not be satisfied.

R Conf Year Town

EDBT 2016 Mons

EDBT 2016 Brussels

EDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade Belgium

Belgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

3×2 repairs

Repair (or possible world)A maximal subset of tuples that satisfy primary keys.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 5 / 13

Page 11: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Uncertain database

DefinitionA database in which primary keys need not be satisfied.

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

3×2 repairs

Repair (or possible world)A maximal subset of tuples that satisfy primary keys.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 5 / 13

Page 12: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Uncertain database

DefinitionA database in which primary keys need not be satisfied.

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

3×2 repairs

Repair (or possible world)A maximal subset of tuples that satisfy primary keys.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 5 / 13

Page 13: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Certainty

Certain Boolean queryA Boolean query is certain if it evaluates to true on each repair.

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

q1 = ∃x(R(EDBT,2016,x)∧S(x ,Belgium))

q2 = ∃x∃y(R(EDBT,2016,x)∧S(x ,y)∧T (y ,Europe))

q1 is not certain, q2 is certain

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 6 / 13

Page 14: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Certainty

Certain Boolean queryA Boolean query is certain if it evaluates to true on each repair.

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

q1 = ∃x(R(EDBT,2016,x)∧S(x ,Belgium))

q2 = ∃x∃y(R(EDBT,2016,x)∧S(x ,y)∧T (y ,Europe))

q1 is not certain, q2 is certain

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 6 / 13

Page 15: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Certainty

Certain Boolean queryA Boolean query is certain if it evaluates to true on each repair.

R Conf Year Town

EDBT 2016 MonsEDBT 2016 Brussels

EDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels Belgium

Belgrade Belgium

Belgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

q1 = ∃x(R(EDBT,2016,x)∧S(x ,Belgium))

q2 = ∃x∃y(R(EDBT,2016,x)∧S(x ,y)∧T (y ,Europe))

q1 is not certain, q2 is certain

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 6 / 13

Page 16: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Certainty

Certain Boolean queryA Boolean query is certain if it evaluates to true on each repair.

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

q1 = ∃x(R(EDBT,2016,x)∧S(x ,Belgium))

q2 = ∃x∃y(R(EDBT,2016,x)∧S(x ,y)∧T (y ,Europe))

q1 is not certain, q2 is certain

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 6 / 13

Page 17: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

The problem CERTAINTY(q)

DefinitionFor a fixed Boolean query q, the problem CERTAINTY(q) is :

INPUT An uncertain database db.

OUTPUT Is q true in every repair of db ?

First-order expressibility of CERTAINTY(q) has been studied by Wijsenin [Wij12].

A dichotomy P/co-NP-complete of CERTAINTY(q) has been studied byPema and Kolaitis in [KP12] and by Koutris and Suciu in [KS12].

RemarkAll complexity results concern data complexity

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 7 / 13

Page 18: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

The problem CERTAINTY(q)

DefinitionFor a fixed Boolean query q, the problem CERTAINTY(q) is :

INPUT An uncertain database db.

OUTPUT Is q true in every repair of db ?

First-order expressibility of CERTAINTY(q) has been studied by Wijsenin [Wij12].

A dichotomy P/co-NP-complete of CERTAINTY(q) has been studied byPema and Kolaitis in [KP12] and by Koutris and Suciu in [KS12].

RemarkAll complexity results concern data complexity

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 7 / 13

Page 19: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

The problem CERTAINTY(q)

DefinitionFor a fixed Boolean query q, the problem CERTAINTY(q) is :

INPUT An uncertain database db.

OUTPUT Is q true in every repair of db ?

First-order expressibility of CERTAINTY(q) has been studied by Wijsenin [Wij12].

A dichotomy P/co-NP-complete of CERTAINTY(q) has been studied byPema and Kolaitis in [KP12] and by Koutris and Suciu in [KS12].

RemarkAll complexity results concern data complexity

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 7 / 13

Page 20: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

The problem CERTAINTY(q)

DefinitionFor a fixed Boolean query q, the problem CERTAINTY(q) is :

INPUT An uncertain database db.

OUTPUT Is q true in every repair of db ?

First-order expressibility of CERTAINTY(q) has been studied by Wijsenin [Wij12].

A dichotomy P/co-NP-complete of CERTAINTY(q) has been studied byPema and Kolaitis in [KP12] and by Koutris and Suciu in [KS12].

RemarkAll complexity results concern data complexity

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 7 / 13

Page 21: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

The problem ]CERTAINTY(q)

DefinitionFor a fixed Boolean query q, the counting problem ]CERTAINTY(q) is :

INPUT An uncertain database db.OUTPUT How many repairs of db satisfy q ? ?

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

q1 = ∃x(R(EDBT,2016,x)∧S(x ,Belgium))

q2 = ∃x∃y(R(EDBT,2016,x)∧S(x ,y)∧T (y ,Europe))

q1 is true in 5/6 repairs, q2 in 6/6 repairs.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 8 / 13

Page 22: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

The problem ]CERTAINTY(q)

DefinitionFor a fixed Boolean query q, the counting problem ]CERTAINTY(q) is :

INPUT An uncertain database db.OUTPUT How many repairs of db satisfy q ? ?

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

q1 = ∃x(R(EDBT,2016,x)∧S(x ,Belgium))

q2 = ∃x∃y(R(EDBT,2016,x)∧S(x ,y)∧T (y ,Europe))

q1 is true in 5/6 repairs, q2 in 6/6 repairs.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 8 / 13

Page 23: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

The problem ]CERTAINTY(q)

DefinitionFor a fixed Boolean query q, the counting problem ]CERTAINTY(q) is :

INPUT An uncertain database db.OUTPUT How many repairs of db satisfy q ? ?

R Conf Year Town

EDBT 2016 MonsEDBT 2016 Brussels

EDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels Belgium

Belgrade Belgium

Belgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

q1 = ∃x(R(EDBT,2016,x)∧S(x ,Belgium))

q2 = ∃x∃y(R(EDBT,2016,x)∧S(x ,y)∧T (y ,Europe))

q1 is true in 5/6 repairs, q2 in 6/6 repairs.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 8 / 13

Page 24: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

The problem ]CERTAINTY(q)

DefinitionFor a fixed Boolean query q, the counting problem ]CERTAINTY(q) is :

INPUT An uncertain database db.OUTPUT How many repairs of db satisfy q ? ?

R Conf Year TownEDBT 2016 MonsEDBT 2016 BrusselsEDBT 2016 Belgrade

S Town CountryMons BelgiumBrussels BelgiumBelgrade BelgiumBelgrade Serbia

T Country ContBelgium EuropeSerbia EuropeTunisia Africa

q1 = ∃x(R(EDBT,2016,x)∧S(x ,Belgium))

q2 = ∃x∃y(R(EDBT,2016,x)∧S(x ,y)∧T (y ,Europe))

q1 is true in 5/6 repairs, q2 in 6/6 repairs.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 8 / 13

Page 25: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Contributions

Complexity result [MW13]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query without self-join, exhibits an effectiveFP-]P-dichotomy

Complexity result [MW14]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query (possibly with self-joins) in which all primary keysconsist of a single attribute, exhibits an effective FP-]P-dichotomy.

if q = ∃x(R(EDBT,2016,x)∧S(x ,Belgium)), ]CERTAINTY(q) ∈ FP ;

if q = ∃x ,y(R(x ,y)∧R(y ,a)), ]CERTAINTY(q) is hard for ]P ;

the query ∃x ,y ,z(R(x ,y ,a)∧R(y , t ,a)) does not meet our criterions.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 9 / 13

Page 26: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Contributions

Complexity result [MW13]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query without self-join, exhibits an effectiveFP-]P-dichotomy

Complexity result [MW14]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query (possibly with self-joins) in which all primary keysconsist of a single attribute, exhibits an effective FP-]P-dichotomy.

if q = ∃x(R(EDBT,2016,x)∧S(x ,Belgium)), ]CERTAINTY(q) ∈ FP ;

if q = ∃x ,y(R(x ,y)∧R(y ,a)), ]CERTAINTY(q) is hard for ]P ;

the query ∃x ,y ,z(R(x ,y ,a)∧R(y , t ,a)) does not meet our criterions.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 9 / 13

Page 27: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Contributions

Complexity result [MW13]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query without self-join, exhibits an effectiveFP-]P-dichotomy

Complexity result [MW14]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query (possibly with self-joins) in which all primary keysconsist of a single attribute, exhibits an effective FP-]P-dichotomy.

if q = ∃x(R(EDBT,2016,x)∧S(x ,Belgium)), ]CERTAINTY(q) ∈ FP ;

if q = ∃x ,y(R(x ,y)∧R(y ,a)), ]CERTAINTY(q) is hard for ]P ;

the query ∃x ,y ,z(R(x ,y ,a)∧R(y , t ,a)) does not meet our criterions.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 9 / 13

Page 28: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Contributions

Complexity result [MW13]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query without self-join, exhibits an effectiveFP-]P-dichotomy

Complexity result [MW14]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query (possibly with self-joins) in which all primary keysconsist of a single attribute, exhibits an effective FP-]P-dichotomy.

if q = ∃x(R(EDBT,2016,x)∧S(x ,Belgium)), ]CERTAINTY(q) ∈ FP ;

if q = ∃x ,y(R(x ,y)∧R(y ,a)), ]CERTAINTY(q) is hard for ]P ;

the query ∃x ,y ,z(R(x ,y ,a)∧R(y , t ,a)) does not meet our criterions.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 9 / 13

Page 29: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Contributions

Complexity result [MW13]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query without self-join, exhibits an effectiveFP-]P-dichotomy

Complexity result [MW14]The class of problems ]CERTAINTY(q), where q is a Booleanconjunctive query (possibly with self-joins) in which all primary keysconsist of a single attribute, exhibits an effective FP-]P-dichotomy.

if q = ∃x(R(EDBT,2016,x)∧S(x ,Belgium)), ]CERTAINTY(q) ∈ FP ;

if q = ∃x ,y(R(x ,y)∧R(y ,a)), ]CERTAINTY(q) is hard for ]P ;

the query ∃x ,y ,z(R(x ,y ,a)∧R(y , t ,a)) does not meet our criterions.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 9 / 13

Page 30: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Block-Independent-Disjoint Probabilistic DatabaseR conf rank frequency P

ICDT A biennial 0.3ICDT A annual 0.6KDD A annual 0.5KDD B annual 0.5

R conf rank frequencyICDT A biennialKDD A annual

R conf rank frequencyKDD B annual

Possible world w1 with Possible world w2 withP(w1) = 0.3×0.5 = 0.15 P(w2) = 0.1×0.5 = 0.05

Complexity result (Dalvi and Suciu [DS07])Let q be a Boolean query. The problem PROBABID(q) is : for a givenBlock-Independent-Disjoint Database, compute P(q).The class of problems PROBABID(q), where q is a Boolean conjunctive query withoutself-join, exhibits an effective FP-]P-dichotomy.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 10 / 13

Page 31: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Block-Independent-Disjoint Probabilistic DatabaseR conf rank frequency P

ICDT A biennial 0.3ICDT A annual 0.6KDD A annual 0.5KDD B annual 0.5

R conf rank frequencyICDT A biennialKDD A annual

R conf rank frequencyKDD B annual

Possible world w1 with Possible world w2 withP(w1) = 0.3×0.5 = 0.15 P(w2) = 0.1×0.5 = 0.05

Complexity result (Dalvi and Suciu [DS07])Let q be a Boolean query. The problem PROBABID(q) is : for a givenBlock-Independent-Disjoint Database, compute P(q).The class of problems PROBABID(q), where q is a Boolean conjunctive query withoutself-join, exhibits an effective FP-]P-dichotomy.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 10 / 13

Page 32: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Uncertain Databases vs Probabilistic DatabasesR Conf Year Town P

EDBT 2016 Mons 1/3EDBT 2016 Brussels 1/3EDBT 2016 Belgrade 1/3

S Town Country PMons Belgium 1Brussels Belgium 1Belgrade Belgium 1/2Belgrade Serbia 1/2

T Country Cont PBelgium Europe 1Serbia Europe 1Tunisia Africa 1

Difference about the modelsUniform probability for tuples in a same block.Probabilities in a block sum up to 1.

Difference about the dichotomy resultsThere exists queries q such that ]CERTAINTY(q) is in P andPROBABID(q) is ]P-hard.No result exist about queries with self-joins in BID databasesmodel.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 11 / 13

Page 33: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Uncertain Databases vs Probabilistic DatabasesR Conf Year Town P

EDBT 2016 Mons 1/3EDBT 2016 Brussels 1/3EDBT 2016 Belgrade 1/3

S Town Country PMons Belgium 1Brussels Belgium 1Belgrade Belgium 1/2Belgrade Serbia 1/2

T Country Cont PBelgium Europe 1Serbia Europe 1Tunisia Africa 1

Difference about the modelsUniform probability for tuples in a same block.Probabilities in a block sum up to 1.

Difference about the dichotomy resultsThere exists queries q such that ]CERTAINTY(q) is in P andPROBABID(q) is ]P-hard.No result exist about queries with self-joins in BID databasesmodel.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 11 / 13

Page 34: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

ConclusionSummary

Uncertain Database Model ;

Conjunctive queries without self-join ;

Conjunctive queries in which all primary keys consist of a singleattribute ;

CERTAINTY(q) ]CERTAINTY(q) ;

Uncertain Database as a special case of Probabilistic Database ;

Effective FP-]P-dichotomies.

Open questions

Can we adapt our dichotomy result - that concerns queries withself-joins - to BID databases ?

Does the class of problems ]CERTAINTY(q), where q is an union ofconjunctive queries, exhibit an effective FP-]P-dichotomy ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 12 / 13

Page 35: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

ConclusionSummary

Uncertain Database Model ;

Conjunctive queries without self-join ;

Conjunctive queries in which all primary keys consist of a singleattribute ;

CERTAINTY(q) ]CERTAINTY(q) ;

Uncertain Database as a special case of Probabilistic Database ;

Effective FP-]P-dichotomies.

Open questions

Can we adapt our dichotomy result - that concerns queries withself-joins - to BID databases ?

Does the class of problems ]CERTAINTY(q), where q is an union ofconjunctive queries, exhibit an effective FP-]P-dichotomy ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 12 / 13

Page 36: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

ConclusionSummary

Uncertain Database Model ;

Conjunctive queries without self-join ;

Conjunctive queries in which all primary keys consist of a singleattribute ;

CERTAINTY(q) ]CERTAINTY(q) ;

Uncertain Database as a special case of Probabilistic Database ;

Effective FP-]P-dichotomies.

Open questions

Can we adapt our dichotomy result - that concerns queries withself-joins - to BID databases ?

Does the class of problems ]CERTAINTY(q), where q is an union ofconjunctive queries, exhibit an effective FP-]P-dichotomy ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 12 / 13

Page 37: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

ConclusionSummary

Uncertain Database Model ;

Conjunctive queries without self-join ;

Conjunctive queries in which all primary keys consist of a singleattribute ;

CERTAINTY(q) ]CERTAINTY(q) ;

Uncertain Database as a special case of Probabilistic Database ;

Effective FP-]P-dichotomies.

Open questions

Can we adapt our dichotomy result - that concerns queries withself-joins - to BID databases ?

Does the class of problems ]CERTAINTY(q), where q is an union ofconjunctive queries, exhibit an effective FP-]P-dichotomy ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 12 / 13

Page 38: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

ConclusionSummary

Uncertain Database Model ;

Conjunctive queries without self-join ;

Conjunctive queries in which all primary keys consist of a singleattribute ;

CERTAINTY(q) ]CERTAINTY(q) ;

Uncertain Database as a special case of Probabilistic Database ;

Effective FP-]P-dichotomies.

Open questions

Can we adapt our dichotomy result - that concerns queries withself-joins - to BID databases ?

Does the class of problems ]CERTAINTY(q), where q is an union ofconjunctive queries, exhibit an effective FP-]P-dichotomy ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 12 / 13

Page 39: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

ConclusionSummary

Uncertain Database Model ;

Conjunctive queries without self-join ;

Conjunctive queries in which all primary keys consist of a singleattribute ;

CERTAINTY(q) ]CERTAINTY(q) ;

Uncertain Database as a special case of Probabilistic Database ;

Effective FP-]P-dichotomies.

Open questions

Can we adapt our dichotomy result - that concerns queries withself-joins - to BID databases ?

Does the class of problems ]CERTAINTY(q), where q is an union ofconjunctive queries, exhibit an effective FP-]P-dichotomy ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 12 / 13

Page 40: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

ConclusionSummary

Uncertain Database Model ;

Conjunctive queries without self-join ;

Conjunctive queries in which all primary keys consist of a singleattribute ;

CERTAINTY(q) ]CERTAINTY(q) ;

Uncertain Database as a special case of Probabilistic Database ;

Effective FP-]P-dichotomies.

Open questions

Can we adapt our dichotomy result - that concerns queries withself-joins - to BID databases ?

Does the class of problems ]CERTAINTY(q), where q is an union ofconjunctive queries, exhibit an effective FP-]P-dichotomy ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 12 / 13

Page 41: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

ConclusionSummary

Uncertain Database Model ;

Conjunctive queries without self-join ;

Conjunctive queries in which all primary keys consist of a singleattribute ;

CERTAINTY(q) ]CERTAINTY(q) ;

Uncertain Database as a special case of Probabilistic Database ;

Effective FP-]P-dichotomies.

Open questions

Can we adapt our dichotomy result - that concerns queries withself-joins - to BID databases ?

Does the class of problems ]CERTAINTY(q), where q is an union ofconjunctive queries, exhibit an effective FP-]P-dichotomy ?

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 12 / 13

Page 42: A dichotomy in the Complexity of Counting Database Repairs · Complexity classes The complexity class ]P The class ]P contains all function problems which consist of counting the

Thank you for your attention !

Nilesh N. Dalvi and Dan Suciu.Management of probabilistic data : foundations and challenges.In Leonid Libkin, editor, PODS, pages 1–12. ACM, 2007.

Phokion G. Kolaitis and Enela Pema.A dichotomy in the complexity of consistent query answering for queries with two atoms.Inf. Process. Lett., 112(3) :77–85, 2012.

Paraschos Koutris and Dan Suciu.A dichotomy on the complexity of consistent query answering for atoms with simple keys.CoRR, abs/1212.6636, 2012.

Dany Maslowski and Jef Wijsen.A dichotomy in the complexity of counting database repairs.Journal of Computer and System Sciences, 79(6) :958 – 983, 2013.

Dany Maslowski and Jef Wijsen.Counting database repairs that satisfy conjunctive queries with self-joins.In 17th International Conference on Database Theory, pages 155–164, 2014.

Jef Wijsen.Certain conjunctive query answering in first-order logic.ACM Trans. Database Syst., 37(2) :9, 2012.

D. Maslowski (UMONS) DBDBD 2014 October 17th, 2014 13 / 13