GeneChips and Microarray Expression Data David Paoletti

Post on 23-Dec-2015

Page 1: GeneChips and Microarray Expression Data David Paoletti

GeneChips and Microarray Expression Data

David Paoletti

Page 2: GeneChips and Microarray Expression Data David Paoletti

The Problem

• Determine gene expression (activity)

• What proteins are being produced by a group of cells?

Page 3: GeneChips and Microarray Expression Data David Paoletti

The Assumption

• The RNA present in the cell determines what proteins are being produced

• Efficiency

Page 4: GeneChips and Microarray Expression Data David Paoletti

The Why

• Understanding

• Toxicology

• Drug design
– Evaluation
– Specificity
– Response

Page 5: GeneChips and Microarray Expression Data David Paoletti

What is a GeneChip?

• 1.28 x 1.28 cm glass wafer

• 500,000 features
– 24 x 24 µm probe site
– 25-mer oligo, complementary

• PM: perfect match

• MM: mismatch

2.5 M copies

GeneChip

Page 6: GeneChips and Microarray Expression Data David Paoletti

The Solution

Page 7: GeneChips and Microarray Expression Data David Paoletti

The Gains

• Speed

• Possibility

• Sensitivity

• Reproducibility

Page 8: GeneChips and Microarray Expression Data David Paoletti
Page 9: GeneChips and Microarray Expression Data David Paoletti

The Process

• Cells

• Poly-A RNA

• cDNA

• IVT (in vitro transcription)

• Biotin-labeled antisense cRNA

• Fragment (heat, Mg2+)

• Labeled fragments

• Hybridize, wash/stain, scan

Page 10: GeneChips and Microarray Expression Data David Paoletti

Hybridization and Staining

• GeneChip + biotin-labeled cRNA → hybridized array

• Hybridized array + SAPE (streptavidin-phycoerythrin)

Page 11: GeneChips and Microarray Expression Data David Paoletti

Specialized Equipment

Page 12: GeneChips and Microarray Expression Data David Paoletti
Page 13: GeneChips and Microarray Expression Data David Paoletti

How Features Are Chosen

• Gene sequence (5’ to 3’)

• Multiple oligo probes (25-mers)

• Perfect match / mismatch

Page 14: GeneChips and Microarray Expression Data David Paoletti

Feature Values

83 112 96 32

47 382 165 87

55 246 140 93

104 552 187 65

Remove outermost rows and columns

Find 75th percentile of remaining values

This value is taken as representative of this feature
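
The three steps above can be sketched in Python. The function names and the linear-interpolation percentile convention are assumptions, since the slide does not say how the 75th percentile is interpolated. The pixel block is the example from this slide.

```python
def percentile(values, pct):
    """Percentile using linear interpolation between closest ranks (assumed convention)."""
    ordered = sorted(values)
    if len(ordered) == 1:
        return float(ordered[0])
    rank = (pct / 100.0) * (len(ordered) - 1)
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def feature_intensity(pixels, pct=75):
    """Drop the outermost rows and columns, then take the given percentile.

    Assumes the pixel block is at least 3 x 3, so something remains after trimming.
    """
    inner = [row[1:-1] for row in pixels[1:-1]]
    values = [v for row in inner for v in row]
    return percentile(values, pct)


# Example pixel block from the slide.
pixels = [
    [83, 112, 96, 32],
    [47, 382, 165, 87],
    [55, 246, 140, 93],
    [104, 552, 187, 65],
]
intensity = feature_intensity(pixels)
```

After trimming, the remaining values are 382, 165, 246 and 140. Under the interpolation convention assumed here the feature value comes out as 280; a different percentile convention would give a different number.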

Page 15: GeneChips and Microarray Expression Data David Paoletti

Background Noise Removal

• The array is divided into 16 equal sectors

• For each sector
– Find the lowest 2% of the feature intensities
– Average these
– Subtract this average from the intensity value of all features in the sector
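
A minimal sketch of the sector-based background subtraction described above. The 4 x 4 arrangement of the 16 sectors, the function name, and the rule of using at least one feature per sector are assumptions made for illustration.

```python
def subtract_background(grid, sectors_per_side=4, fraction=0.02):
    """Return a copy of grid with a per-sector background level subtracted.

    grid is a list of rows of feature intensities. The 16 sectors are
    assumed to form a 4 x 4 layout over the array.
    """
    n_rows, n_cols = len(grid), len(grid[0])
    result = [row[:] for row in grid]
    for si in range(sectors_per_side):
        r0 = si * n_rows // sectors_per_side
        r1 = (si + 1) * n_rows // sectors_per_side
        for sj in range(sectors_per_side):
            c0 = sj * n_cols // sectors_per_side
            c1 = (sj + 1) * n_cols // sectors_per_side
            values = sorted(
                grid[r][c] for r in range(r0, r1) for c in range(c0, c1)
            )
            # Lowest 2% of the sector's features (at least one feature).
            k = max(1, int(len(values) * fraction))
            background = sum(values[:k]) / k
            for r in range(r0, r1):
                for c in range(c0, c1):
                    result[r][c] = grid[r][c] - background
    return result


# Toy 8 x 8 array: each sector is 2 x 2, so the background is the sector minimum.
toy = [[100 + r * 8 + c for c in range(8)] for r in range(8)]
corrected = subtract_background(toy)
```

On a real array each sector holds thousands of features, so the lowest 2% is a meaningful sample; the toy array only exercises the control flow.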

Page 16: GeneChips and Microarray Expression Data David Paoletti

Noise Calculation

$$Q_{raw} = \frac{1}{N} \sum_{i \in bg} \frac{stdev_i}{\sqrt{pixel_i}}$$

$$Q = Q_{raw} \cdot SF \cdot NF$$
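
The noise formula on this slide can be sketched as below, reading it as: raw Q is the average, over the N background features, of each feature's pixel standard deviation divided by the square root of its pixel count, and Q is raw Q multiplied by SF and NF. Treating SF and NF as a scaling factor and a normalization factor, and using the population standard deviation, are assumptions; the slide defines neither.

```python
import math
import statistics


def raw_noise(background_features):
    """Average of stdev_i / sqrt(pixel_i) over the background features.

    background_features is a list of pixel-intensity lists, one per feature.
    """
    terms = []
    for pixels in background_features:
        stdev = statistics.pstdev(pixels)  # population stdev (assumed)
        terms.append(stdev / math.sqrt(len(pixels)))
    return sum(terms) / len(terms)


def noise(background_features, sf=1.0, nf=1.0):
    """Q = raw Q * SF * NF, with SF and NF supplied by the caller."""
    return raw_noise(background_features) * sf * nf


# Two toy background features of four pixels each.
features = [[10, 10, 10, 10], [8, 12, 8, 12]]
q = noise(features, sf=2.0, nf=3.0)
```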

Page 17: GeneChips and Microarray Expression Data David Paoletti

Average Difference Intensity

• For a given gene
– For each probe pair for the given gene
• Calculate the difference PM - MM
– Calculate μ, σ for this set
– If abs((PM - MM) - μ) ≥ 3σ, delete from set
– Remaining set is "pairs in avg"

$$AvgDiff = \frac{1}{\#\,\text{pairs in avg}} \sum_{i \in \text{pairs in avg}} (PM_i - MM_i)$$
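
The average difference procedure can be sketched as follows: compute PM - MM for every probe pair, drop pairs whose difference lies 3 or more standard deviations from the mean, and average what remains. Using the sample standard deviation, a single trimming pass, and keeping every pair when the spread is zero are assumptions; the function name is hypothetical.

```python
import statistics


def average_difference(pairs, n_sigma=3.0):
    """Mean of PM - MM over the probe pairs that survive outlier trimming.

    pairs is a list of (PM, MM) intensity tuples for one gene.
    """
    diffs = [pm - mm for pm, mm in pairs]
    if len(diffs) < 2:
        return statistics.mean(diffs)
    mu = statistics.mean(diffs)
    sigma = statistics.stdev(diffs)  # sample stdev (assumed)
    if sigma == 0:
        return mu  # all differences identical; nothing to trim
    kept = [d for d in diffs if abs(d - mu) < n_sigma * sigma]
    return statistics.mean(kept)


# 19 well-behaved probe pairs plus one extreme pair.
pairs = [(150, 50)] * 19 + [(1050, 50)]
avg_diff = average_difference(pairs)
```

In this example the extreme pair (difference 1000) is removed, leaving an average difference of 100. With only a handful of pairs, no single value can lie 3 standard deviations from the mean, so trimming only takes effect on larger probe sets.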

Page 18: GeneChips and Microarray Expression Data David Paoletti

Positive & Negative Probe Pairs

• PM - MM ≥ SDT

• PM/MM ≥ SRT

If both true, mark as positive

• MM - PM ≥ SDT

• MM/PM ≥ SRT

If both true, mark as negative

SDT = Q · STDmult

By default, SRT = 1.5, STDmult = 2.0 (low density), 4.0 (high)
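
A minimal sketch of the positive/negative test, assuming the comparisons are "greater than or equal" and that intensities are positive, so the ratio tests can be written as multiplications to avoid dividing by zero. The "neutral" label for pairs that pass neither test is an assumption, as is the function name.

```python
def classify_pair(pm, mm, q, srt=1.5, std_mult=2.0):
    """Classify one probe pair as 'positive', 'negative' or 'neutral'.

    q is the noise value Q; SDT = Q * STDmult as on the slide.
    """
    sdt = q * std_mult
    # PM - MM >= SDT and PM / MM >= SRT
    if pm - mm >= sdt and pm >= srt * mm:
        return "positive"
    # MM - PM >= SDT and MM / PM >= SRT
    if mm - pm >= sdt and mm >= srt * pm:
        return "negative"
    return "neutral"


# With Q = 10 and the low-density default STDmult = 2.0, SDT is 20.
calls = [
    classify_pair(100, 50, q=10),
    classify_pair(50, 100, q=10),
    classify_pair(100, 90, q=10),
]
```

For a high-density array the slide's default would be `std_mult=4.0`.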

Page 19: GeneChips and Microarray Expression Data David Paoletti

Voting Methods for Absolute Call

• Positive/negative ratio

PNR = #pos / #neg

• Positive fraction

PF = #pos / #used

• Log average ratio

$$LA = \frac{10}{\#\,\text{pairs in avg}} \sum_{i \in \text{pairs in avg}} \log(PM_i / MM_i)$$
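
The three voting metrics can be computed as in the sketch below. Taking the logarithm to base 10, returning infinity for PNR when there are no negative pairs, and the function and parameter names are assumptions; the slide does not specify them.

```python
import math


def voting_metrics(n_pos, n_neg, n_used, pairs_in_avg):
    """Return (PNR, PF, LA) for one gene.

    n_pos, n_neg: counts of positive and negative probe pairs.
    n_used: number of probe pairs used.
    pairs_in_avg: list of (PM, MM) tuples with positive intensities.
    """
    pnr = n_pos / n_neg if n_neg else math.inf
    pf = n_pos / n_used
    log_sum = sum(math.log10(pm / mm) for pm, mm in pairs_in_avg)
    la = 10.0 * log_sum / len(pairs_in_avg)
    return pnr, pf, la


pnr, pf, la = voting_metrics(
    n_pos=6, n_neg=2, n_used=10, pairs_in_avg=[(100, 10), (1000, 10)]
)
```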

Page 20: GeneChips and Microarray Expression Data David Paoletti

Decision Matrix

        Absent  |  Marginal  |  Present
PNR            3.00         4.00
PF             0.33         0.43
LA             0.90         1.30
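
One way to read the matrix is that each metric has two thresholds separating Absent, Marginal and Present. The sketch below applies those thresholds to a single metric. Which side of each boundary is inclusive is an assumption, and how the three per-metric results are combined into one absolute call is not shown on the slide, so it is not attempted here.

```python
# (Absent/Marginal boundary, Marginal/Present boundary) for each metric.
THRESHOLDS = {
    "PNR": (3.00, 4.00),
    "PF": (0.33, 0.43),
    "LA": (0.90, 1.30),
}


def metric_call(metric, value):
    """Place one metric value in the Absent / Marginal / Present range."""
    low, high = THRESHOLDS[metric]
    if value < low:
        return "Absent"
    if value < high:
        return "Marginal"
    return "Present"


example_calls = {
    "PNR": metric_call("PNR", 2.0),
    "PF": metric_call("PF", 0.40),
    "LA": metric_call("LA", 1.5),
}
```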

Page 21: GeneChips and Microarray Expression Data David Paoletti

Average Difference and Absolute Call

• Which of these do you base a decision on, for whether a gene is being expressed?

• Use the absolute call for decision

• Use average difference to compare those which are present

Page 22: GeneChips and Microarray Expression Data David Paoletti

Conclusions

• Incredible amalgam of biological and computational processes

• Allows analyses that would not be performed otherwise

• Already of proven worth

Page 23: GeneChips and Microarray Expression Data David Paoletti

References

• Moore, S K; Making chips to probe genes, IEEE Spectrum, March 2001, 54-60.

• GeneChip Gene Expression Algorithm Training, Part I: Absolute Analysis; Affymetrix.

• Berberich, S, and McGorry, M; GeneChip protocols; Wright State University.