the search for new resonances using the atlas detector at the lhc: a statistical analysis

48
The search for new resonances using the ATLAS detector at the LHC: A statistical analysis Eric Williams Columbia University April 20, 2012 Memorial Sloan Kettering Cancer Center

Upload: eric-williams

Post on 08-Jul-2015

132 views

Category:

Technology


3 download

DESCRIPTION

General overview of how particle physics works

TRANSCRIPT

Page 1: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

The search for new resonances using the ATLASdetector at the LHC: A statistical analysis

Eric Williams

Columbia University

April 20, 2012

Memorial Sloan Kettering Cancer Center

Page 2: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Overview

The Large Hadron Collider

The ATLAS detector

Why new particles?

How to find new particles?

The statistical analysis of search results

E. Williams (Columbia U.) April 20, 2012 2 / 40

Page 3: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

The Large Hadron Collider (LHC)The LHC is the world’s largest and highest-energy particle accelerator.

Located just outside of Geneva, Switzerland at the European Organization for NuclearResearch (CERN)

Collides counter-rotating proton beams at center-of-mass energy = 7 TeV (!)

Consists of a 17 mile long tunnel, 300 feet underground

Beams collide at the centers of four experiments (detectors):

ATLAS, ALICE, CMS and LHC-b

E. Williams (Columbia U.) April 20, 2012 3 / 40

Page 4: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

The Large Hadron Collider (LHC)

Many challenges:

1232 superconducting (1.9 K) dipole magnets at 8.33 T

7600 km Nb-Ti superconducting cable carrying 11850 A

Ultra-high vacuum: ∼ 10−13 atm ( 110 × Pmoon)

Number of bunches per beam (2011): 1380 with 1× 1011 protons perbunch

E. Williams (Columbia U.) April 20, 2012 4 / 40

Page 5: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

The ATLAS Detector

The ATLAS (A Toroidal LHC ApparatuS) detector is designed to be a ‘general-purpose’detector undertaking a broad range of physics analyses.

E. Williams (Columbia U.) April 20, 2012 5 / 40

Page 6: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

The ATLAS Detector

It’s BIG!

E. Williams (Columbia U.) April 20, 2012 6 / 40

Page 7: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

The ATLAS Detector

ATLAS is composed of components, each optimized for particular functions

Inner Detector: measures the momentum andtrajectories charged particles

Electromagnetic Calorimeters: measures theenergies of electrons, photons, and others

Hadronic Calorimeters: measures the energiesof the hadronic particles (‘jets’, protons,neutrons)

Muon System: measures the momenta ofmuons in the event

The goal of particle detection is to reconstruct the kinematics of each collision(particle energies, directions, charges and masses), to determine whethersomething “interesting”1 happened during that event

1In this talk, the term “interesting” refers to relevance for new particle detection

E. Williams (Columbia U.) April 20, 2012 7 / 40

Page 8: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Why search for new particles?

Page 9: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Why new particles?

The Standard Model of particle physics is a wildly successful theory,

that it provides a description of the fundamental particles and their interactions:

however...

E. Williams (Columbia U.) April 20, 2012 9 / 40

Page 10: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Why new particles?

The Standard Model of particle physics is a wildly successful theory,

that it provides a description of the fundamental particles and their interactions:

however...E. Williams (Columbia U.) April 20, 2012 9 / 40

Page 11: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Why new particles?

... as it stands, it is an incomplete theory of nature (even with a Higgs):

Does not incorporate dark matter or dark energy

Says nothing about gravity

Neutrinos have mass??

Matter-antimatter asymmetry of the universe

Has inherent ‘unnaturalness’ in the hierarchy problem(why is gravity so weak?)

No Higgs seen yet... or has it?? (more on this later)

Bullet cluster as evidence of dark

matter

Proposed solutions to these issues all have one thing in common: new particles!

particle/theory motivation what is it? seen at LHC?

Supersymmetry Hierarchy problem & Dark matter All SM particle have SUSY ’twin’ Not yet

Extra dimensions2 Gravity & Hierarchy Problem 4D space-time rests on ’brane’ Not yetof larger dimensional space

Higgs Explains particle mass Generator of electroweak maybe (not yet)symmetry breaking

2My particular interest (thesis topic)

E. Williams (Columbia U.) April 20, 2012 10 / 40

Page 12: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to search for new particles?

Page 13: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles?

The problem: y-axis – the production rate (or cross section) of a given process

7  

E. Williams (Columbia U.) April 20, 2012 12 / 40

Page 14: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles?

7  

The problem: y-axis – the production rate (or cross section) of a given process

The true challenge in the search for new physics is finding the signal…

E. Williams (Columbia U.) April 20, 2012 13 / 40

Page 15: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles?

underneath all this background!

7  

The true challenge in the search for new physics is finding the signal…

The problem: y-axis – the production rate (or cross section) of a given process

Discovery rate ~11 orders of magnitude below the noise*!

E. Williams (Columbia U.) April 20, 2012 14 / 40

Page 16: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles?

underneath all this background!

7  

The true challenge in the search for new physics is finding the signal…

The problem: y-axis – the production rate (or cross section) of a given process

Discovery rate ~11 orders of magnitude below the noise*!

* Today’s “noise” is 1979’s (W) and 1984’s (Z) Nobel Prizes

E. Williams (Columbia U.) April 20, 2012 15 / 40

Page 17: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles?

How to hack through 11 orders of magnitude of particle jungle?

t

Three steps:

1 ‘Trigger’: An event filter designed to keep only ‘interesting’ events

2 Kinematic ‘cuts’: Use distinguishing variables specific to the decay ofour particle of interest to discriminate between signal and background

3 Statistical analysis: Quantify the statistical significance of discoveryor exclusion

E. Williams (Columbia U.) April 20, 2012 16 / 40

Page 18: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 1) Trigger

There are 20 million collisions per second in the ATLAS detector. Event filtering(triggering) is not only necessary for identifying the ‘interesting’ events out of the hugebackgrounds, but also crutial to controling unmanageably large amount of data output.

Without triggers: 2× 107 crossingss× ∼ 10 collisions

crossing× ∼ 4 MB

event→∼ TB

sof data!

With triggers: Recorded event rate reduced to few hundred events per second

Example of what to trigger on:

not “interesting” “interesting”

E. Williams (Columbia U.) April 20, 2012 17 / 40

Page 19: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

A “cut” on an event, is simply a choice of whether or not to keep the event, dependingits observable properties. Examples:

Data quality: Was the detector working when event was recorded?

Object quality: Were the objects (particles) in the event recorded properly?

Kinematic: Do the objects in the event look more like signal than they dobackground?

Example:Searching for Higgs: H →WW → eνeν: expect to see 2 electrons, 2 neutrinos

Can cut (remove) background events:

Z → µµ/ττ or (no electrons or neutrinos)

W → eν/µν/τν (single electron or neutrino)

tt̄→W+bW−b̄→ eνeν + bb̄ (veto on bb̄)

But it is often the case that the background ‘looks’ just like the signal:e.g. pp→WW → eνeν or Z → ee + detector mis-measurement (looks like νν)

E. Williams (Columbia U.) April 20, 2012 18 / 40

Page 20: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

A “cut” on an event, is simply a choice of whether or not to keep the event, dependingits observable properties. Examples:

Data quality: Was the detector working when event was recorded?

Object quality: Were the objects (particles) in the event recorded properly?

Kinematic: Do the objects in the event look more like signal than they dobackground?

Example:Searching for Higgs: H →WW → eνeν: expect to see 2 electrons, 2 neutrinos

Can cut (remove) background events:

Z → µµ/ττ or (no electrons or neutrinos)

W → eν/µν/τν (single electron or neutrino)

tt̄→W+bW−b̄→ eνeν + bb̄ (veto on bb̄)

But it is often the case that the background ‘looks’ just like the signal:e.g. pp→WW → eνeν or Z → ee + detector mis-measurement (looks like νν)

E. Williams (Columbia U.) April 20, 2012 18 / 40

Page 21: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

A “cut” on an event, is simply a choice of whether or not to keep the event, dependingits observable properties. Examples:

Data quality: Was the detector working when event was recorded?

Object quality: Were the objects (particles) in the event recorded properly?

Kinematic: Do the objects in the event look more like signal than they dobackground?

Example:Searching for Higgs: H →WW → eνeν: expect to see 2 electrons, 2 neutrinos

Can cut (remove) background events:

Z → µµ/ττ or (no electrons or neutrinos)

W → eν/µν/τν (single electron or neutrino)

tt̄→W+bW−b̄→ eνeν + bb̄ (veto on bb̄)

But it is often the case that the background ‘looks’ just like the signal:e.g. pp→WW → eνeν or Z → ee + detector mis-measurement (looks like νν)

E. Williams (Columbia U.) April 20, 2012 18 / 40

Page 22: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

A “cut” on an event, is simply a choice of whether or not to keep the event, dependingits observable properties. Examples:

Data quality: Was the detector working when event was recorded?

Object quality: Were the objects (particles) in the event recorded properly?

Kinematic: Do the objects in the event look more like signal than they dobackground?

Example:Searching for Higgs: H →WW → eνeν: expect to see 2 electrons, 2 neutrinos

Can cut (remove) background events:

Z → µµ/ττ or (no electrons or neutrinos)

W → eν/µν/τν (single electron or neutrino)

tt̄→W+bW−b̄→ eνeν + bb̄ (veto on bb̄)

But it is often the case that the background ‘looks’ just like the signal:e.g. pp→WW → eνeν or Z → ee + detector mis-measurement (looks like νν)

E. Williams (Columbia U.) April 20, 2012 18 / 40

Page 23: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

Can you tell the difference?

“Ordinary” event with 2 “jets” Z boson decaying to electron + antielectronits discovery was rewarded by a Nobel prize

in 1984

E. Williams (Columbia U.) April 20, 2012 19 / 40

Page 24: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

Can you tell the difference?

“Ordinary” event with 2 “jets” Z boson decaying to electron + antielectronits discovery was rewarded by a Nobel prize

in 1984

E. Williams (Columbia U.) April 20, 2012 19 / 40

Page 25: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

We know what the signal (and background) decay products should look likethrough Monte-Carlo simulations

From these simulations, we can identify certain variables that will give bestsignal/background separation

Many additional cuts are made on object energies, directions, combinations,composition, etc.

E. Williams (Columbia U.) April 20, 2012 20 / 40

Page 26: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

We know what the signal (and background) decay products should look likethrough Monte-Carlo simulations

From these simulations, we can identify certain variables that will give bestsignal/background separation

Many additional cuts are made on object energies, directions, combinations,composition, etc.

E. Williams (Columbia U.) April 20, 2012 20 / 40

Page 27: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

After optimizing cuts made on final state particles for best signal to background

rejection, we choose a ‘discriminating variable’:

A quantity that gives best separation between signal and background after allthe event level cuts are applied

Common discriminating variables:

Number of estimated signal/background/data eventsPhysical observable (below), e.g. invariant mass of the decay particlesOutput from neural net or decision tree

Use as input to statistical analysis!

E. Williams (Columbia U.) April 20, 2012 21 / 40

Page 28: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

After optimizing cuts made on final state particles for best signal to background

rejection, we choose a ‘discriminating variable’:

A quantity that gives best separation between signal and background after allthe event level cuts are applied

Common discriminating variables:

Number of estimated signal/background/data eventsPhysical observable (below), e.g. invariant mass of the decay particlesOutput from neural net or decision tree

Use as input to statistical analysis!

E. Williams (Columbia U.) April 20, 2012 21 / 40

Page 29: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

How to find new particles? 2) Kinematic “cuts”

After optimizing cuts made on final state particles for best signal to background

rejection, we choose a ‘discriminating variable’:

A quantity that gives best separation between signal and background after allthe event level cuts are applied

Common discriminating variables:

Number of estimated signal/background/data eventsPhysical observable (below), e.g. invariant mass of the decay particlesOutput from neural net or decision tree

Use as input to statistical analysis!

E. Williams (Columbia U.) April 20, 2012 21 / 40

Page 30: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Statistical Analysis of Search Results

Page 31: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Signal Significance

The ultimate goal of an experimental search for a new particle is to state whether or nota statistically significant observation of the signal has been made. In other words, toanswer the canonical question:

Given the data, is it possible to distinguish between two hypotheses?

Three main steps toward answering this question:

1 Define a test-statistic which optimizes theseparation of the signal+backgroundhypothesis (H1) and the background-onlyhypothesis (H0)

2 Run an appropriate number ofpseudo-experiments (Frequentist) for bothhypothesis, incorporating all signal andbackground nuisance parameters (systematics)in a coherent way (Bayesian).

3 Define confidence levels designating exclusionsor discoveries 2012 Higgs → γγ 4.7fb−1 result

E. Williams (Columbia U.) April 20, 2012 23 / 40

Page 32: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

1) Define test-statistic: Likelihood-Ratio

Neyman-Pearson lemma suggests that the most powerful test forstatistically separating two point hypotheses is the likelihood-ratiotest, that is:

Λ =L(s + b|x)

L(b|x)

s = signalb = backgroundx = dataL = likelihood

Rate of signal or background events follow a Poisson distribution,appropriate choice for likelihood functional form:

L(s + b) =(s + b)xe−(s+b)

x!, L(b) =

(b)xe−b

x!

E. Williams (Columbia U.) April 20, 2012 24 / 40

Page 33: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

1) Define test-statistic: Likelihood-Ratio

With this choice, combining likelihoods from multiple channels (e.g. X → Y andX → Z) as well as from multiple bins within a discriminating variable (e.g.M(X)) is natural:

Λ(x) =

channels∏i

bins∏j

(sij + bij)xij e−(sij+bij)

xij !/

(bij)xij e−(bij)

xij !.

In the high-statistics limit the distributions of -2 ln Λ are expected to converge to(χ2s+b − χ2

b), thus it is more common to use:

NLLR(x) = −2 ln(Λ(x))

= −2

channels∑i

bins∑j

[sij − xij ln

(1 +

sijbij

)]

This test statistic decreases monotonically for increasingly signal-like (decreasinglybackground-like) experiments. Can be used to order data outcomes relative toeach other in hypothesis significance

E. Williams (Columbia U.) April 20, 2012 25 / 40

Page 34: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

2) Pseudo-Experiments: A Semi-Frequentist Approach

Assuming that the data is drawn randomly from a Poisson parent distribution, wecan create pdfs of NLLR(x) for both the signal+background hypothesis (H1) andthe background-only (H0) hypothesis, by conducting pseudo-experiments

Systematic uncertainties (nuisance parameters) are incorporated by sampling abifurcated Gaussian distribution with the ±σ uncertainties estimated for eachsource (hence ‘Semi’-Frequentist)

The pseudo-experiment background (Bmj ) and signal (Smj ) yields are then given as:

Bmj = B0,m

j (1 +

Nbkgdsys∑i

gbkgdi )

Smk = S0,m

k (1 +

Nsigsys∑i

gsigi )

Where B0,m (S0,m) is the nominal background (signal) poisson yield for channel j (k)and bin m. gbkgd (gsig) is the contribution from systematic uncertainty i.

E. Williams (Columbia U.) April 20, 2012 26 / 40

Page 35: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

2) Pseudo-Experiments: A Semi-Frequentist Approach

Running O(20k) pseudo-experiments, we evaulate the NLLR distributions underthe H0, NLLR(x = Db), and H1, NLLR(x = Ds+b), hypotheses. Where:

Db =

Nbins∑m

Nb∑j

Bmj , Ds+b =

Nbins∑m

(

Nb∑j

Bmj +

Ns∑k

Smk )

N

-2ln(Λ(x))

Bkgd Only

Sig + Bkgd

E. Williams (Columbia U.) April 20, 2012 27 / 40

Page 36: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

2) Pseudo-Experiments: A Semi-Frequentist Approach

Running O(20k) pseudo-experiments, we evaulate the NLLR distributions underthe H0, NLLR(x = Db), and H1, NLLR(x = Ds+b), hypotheses. Where:

Db =

Nbins∑m

Nb∑j

Bmj , Ds+b =

Nbins∑m

(

Nb∑j

Bmj +

Ns∑k

Smk )

N

-2ln(Λ(x))

Bkgd Only

Sig + Bkgd

NLLR(xdata)

Location of measured data on NLLR pdf (Prior Predictive Ensemble) used toquantify exclusion/discovery

E. Williams (Columbia U.) April 20, 2012 28 / 40

Page 37: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

3) Modified Frequentist Confidence Levels: CLs

Confidence levels defined as the fraction of outcomes predicted to falloutside of the specified confidence interval

CLs+b: fraction of H1 pseudo-experiments less signal-like than data

CLs+b = Ps+b(X ≥ Xobs) =

∫ ∞NLLR(x=Dobs)

P(x = Ds+b) dP

CLb: fraction of H0 pseudo-experiments less signal-like than data

CLb = Pb(X ≥ Xobs) =

∫ ∞NLLR(x=Dobs)

P(x = Db) dP

Therefore...

High CLs+b → data signal-like. (otherwise, used for exclusion)High CLb (or low 1− CLb) → data not background like.

For discovery, (1-CLb) ≡ p-value = the probability, under H0 hypothesis, that backgroundfluctuated to produce observed signal. Typically require (1-CLb) < 5σ(4.3× 10−7) toclaim discovery

E. Williams (Columbia U.) April 20, 2012 29 / 40

Page 38: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

3) Modified Frequentist Confidence Levels: CLs

N

-2ln(Λ(x))

Bkgd Only

Sig + Bkgd

NLLR(xdata)

1-CLb

CLs+b

Therefore...

High CLs+b → data signal-like. (otherwise, used for exclusion)High CLb (or low 1− CLb) → data not background like.

For discovery, (1-CLb) ≡ p-value = the probability, under H0 hypothesis, that backgroundfluctuated to produce observed signal. Typically require (1-CLb) < 5σ(4.3× 10−7) toclaim discovery

E. Williams (Columbia U.) April 20, 2012 30 / 40

Page 39: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

3) Modified Frequentist Confidence Levels: CLs

The strictly frequentist CLs+b confidence level, while a powerfulstatistical tool, is unstable if the background model dramaticallydisagrees with the data:

Background overestimated → low CLs+b → possible exclusion!Background underestimated → high CLs+b → possible discovery!

The solution: The modified frequentist confidence level, CLs

CLs ≡CLs+b

CLb

Normalizing CLs+b with CLb removes the dependence on backgroundmodelling and leads to more conservative limits on H1 hypothesis, aswell as lower false exclusion rate (type II error) than nominal (1− CL)

A signal model is then excluded at or above95% confidence level if CLs ≤ 0.05

E. Williams (Columbia U.) April 20, 2012 31 / 40

Page 40: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

CLs in action

Latest results from the ATLAS Higgs searchSummary plots from complicated combination of 7 different Higgsdecay channels

E. Williams (Columbia U.) April 20, 2012 32 / 40

Page 41: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

CLs in action

Latest results from the ATLAS Higgs searchSummary plots from complicated combination of 7 different Higgsdecay channels

CLs provides ranges of 95% CL exclusion:

113–116 GeV 131-238 GeV 251-466 GeV

E. Williams (Columbia U.) April 20, 2012 33 / 40

Page 42: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

CLs in action

If we look in the non-excluded regionsThe significance of an excess is quantified by (1−CLb) = p-value ≡ the probability(p0) that a background-only experiment is more signal-like than observed

Observed local significances for a 126 GeV Higgs boson is 3.5σ (expected SMHiggs significance is 2.5σ)

Observed global probability for such an excess to be found in the full search range,in the absence of a signal, is approximately 1.4%, corresponding to 2.2σ.

E. Williams (Columbia U.) April 20, 2012 34 / 40

Page 43: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Other results and the future!These statistical methods have been adopted collaboration wide and many limits havebeen set thus far:

E. Williams (Columbia U.) April 20, 2012 35 / 40

Page 44: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Other results and the future!

In 2012, the ATLAS will collect 4× the amount of data and the LHChas ramped up to a center-of-mass energy of 8 TeV!

Will also be able to probe even further for new physics.

This means, assuming Higgs, a 5σ Higgs discover will happen withinthe year!

Stay tuned!

E. Williams (Columbia U.) April 20, 2012 36 / 40

Page 45: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Backups

Page 46: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Cross section exclusion

It is common to express confidence interval exclusions in terms of signal cross sections

A confidence limit for exclusion is defined as the value of a parameter (e.g. cross section)which is excluded at a specified confidence level.

E. Williams (Columbia U.) April 20, 2012 38 / 40

Page 47: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Cross section exclusion

It is common to express confidence interval exclusions in terms of signal cross sections

A confidence limit for exclusion is defined as the value of a parameter (e.g. cross section)which is excluded at a specified confidence level.

E. Williams (Columbia U.) April 20, 2012 39 / 40

Page 48: The search for new resonances using the ATLAS detector at the LHC: A statistical analysis

Signal mass exclusion

Cross section exclusion plots can also be used to express an upper limit on signal massexclusionSignal model excluded where signal limits cross observed limits (Randall-SundrumGraviton search shown as example)

E. Williams (Columbia U.) April 20, 2012 40 / 40