genomics of gene regulation

43
Genomics of Gene Regulation Genomic and Proteomic Approaches to Heart, Lung, Blood and Sleep Disorders Jackson Laboratories Ross Hardison October 6, 2010

Upload: job-jefferson

Post on 08-Jan-2018

221 views

Category:

Documents


1 download

DESCRIPTION

Heritable variation in gene regulation “Simple” Mendelian traits, e.g. thalassemias Variation in expression is common in normal individuals Variation in expression may be a major contributor to complex traits (including heart, lung, blood and sleep disorders)

TRANSCRIPT

Page 1: Genomics of Gene Regulation

Genomics of Gene Regulation

Genomic and Proteomic Approaches to Heart, Lung, Blood and Sleep DisordersJackson Laboratories

Ross HardisonOctober 6, 2010

Page 2: Genomics of Gene Regulation

Heritable variation in gene regulation

“Simple” Mendelian traits, e.g. thalassemias

Variation in expression is common in normal individuals

Variation in expression may be a major contributor to complex traits (including heart, lung, blood and sleep disorders)

Page 3: Genomics of Gene Regulation

Deletions of noncoding DNA can affect gene expression

Forget and Hardison, Chapter in Disorders of Hemoglobin, 2nd edition

Page 4: Genomics of Gene Regulation

Substitutions in promoters can affect expression

Forget and Hardison, Chapter in Disorders of Hemoglobin, 2nd edition

Page 5: Genomics of Gene Regulation

Variation of gene expression among individuals

• Levels of expression of many genes vary in humans (and other species)

• Variation in expression is heritable• Determinants of variability map to discrete genomic intervals• Often multiple determinants• This variation indicates an abundance of cis-regulatory variation in

the human genome• "We predict that variants in regulatory regions make a greater

contribution to complex disease than do variants that affect protein sequence" Manolis Dermitzakis, ScienceDaily– Microarray expression analyses of 3554 genes in 14 families

• Morley M … Cheung VG (2004) Nature 430:743-747– Expression analysis of EBV-transformed lymphoblastoid cells from all 270

individuals genotypes in HapMap• Stranger BE … Dermitzakis E (2007) Nature Genetics 39:1217-1224

Page 6: Genomics of Gene Regulation

Risk loci in noncoding regions

(2007) Science 316: 1336-1341

Page 7: Genomics of Gene Regulation

DNA sequences involved in regulation of gene transcription

Protein-DNA interactionsChromatin effects

Page 8: Genomics of Gene Regulation

Distinct classes of regulatory regions

Maston G, Evans S and Green M (2006) Annu Rev Genomics Hum Genetics 7:29-59

Act in cis, affecting expression of a gene on the same chromosome.

Cis-regulatory modules (CRMs)

Page 9: Genomics of Gene Regulation

General features of promoters

• A promoter is the DNA sequence required for correct initiation of transcription

• Most promoters are at the 5’ end of the gene.

Maston, Evans & Green (2006) Ann Rev Genomics & Human Genetics, 7:29-59

TATA box + Initiator:Core or minimal promoter. Site of assembly of preinitiation complex

Upstream regulatory elements:Regulate efficiency of utilization of minimal promoter

RNA polymerase II

Page 10: Genomics of Gene Regulation

Conventional view of eukaryotic gene promoters

Maston, Evans & Green (2006) Ann Rev Genomics & Human Genetics, 7:29-59

Page 11: Genomics of Gene Regulation

Most promoters in mammals are CpG islands

TATA, no CpG island10-20% of promoters

CpG island, no TATA80-90% of promoters

Carninci … Hayashizaki (2006)Nature Genetics 38:626

Page 12: Genomics of Gene Regulation

Differences in specificity of start sites for transcription for TATA vs CpG island promoters

Carninci … Hayashizaki (2006)Nature Genetics 38:626

Frac

tion

of m

RN

As

Page 13: Genomics of Gene Regulation

Enhancers• Cis-acting sequences that cause an increase in expression of a gene• Act independently of position and orientation with respect to the

gene.

Pennacchio et al., http://enhancer.lbl.gov/

Tested UCE

Over half of ultraconserved noncoding sequences aredevelopmental enhancersPennacchio et al. (2006) Nature 444:499-502

lacZUCE prluciferaseprCRM

About half of the enhancers predicted by interspecies alignments are validated in erythroid cellsWang et al. (2006) Genome Research 16:1480- 1492

Page 14: Genomics of Gene Regulation

CRMs are clusters of specific binding sites for transcription factors

Hardison (2002) on-line textbook Working with Molecular Genetics http://www.bx.psu.edu/~ross/

Page 15: Genomics of Gene Regulation

Repression by PcG proteins via chromatin modification

Polycomb Group (PcG) Repressor Complex 2: ESC, E(Z), NURF-55, and PcG repressor SU(Z)12Methylates K27 of Histone H3 via the SET domain of E(Z)

me3H3 N-tailK27 OFF

Page 16: Genomics of Gene Regulation

trx group (trxG) proteins activate via chromatin changes

• SWI/SNF nucleosome remodeling• Histone H3 and H4 acetylation• Methylation of K4 in histone H3

– Trx in Drosophila, MLL in humans• http://www.igh.cnrs.fr/equip/cavalli/

link.PolycombTeaching.html#Part_3

Me1,2,3

H3 N-tail

K4 ON

Page 17: Genomics of Gene Regulation

Features interrogated by ChIP-seq and RNA-seq assays

CTCF

DNase hypersensitive sites

Page 18: Genomics of Gene Regulation

Chromatin immunoprecipitation: Greatly enrich for DNA occupied by a protein

Elaine Mardis (2007) Nature Methods 4: 613-614

Page 19: Genomics of Gene Regulation

Enrichment of sequence tags reveals function

Barbara Wold & Richard M Myers (2008) “Sequence Census Methods” Nature Methods 5:19-21

Page 20: Genomics of Gene Regulation

Illumina (Solexa) short read sequencing

- 8 lanes per run- 10 M to 20 M reads of 36 nucleotides (or longer) per run. - 1 lane can produce enough reads to map locations of a transcription factor in a mammalian genome.

Page 21: Genomics of Gene Regulation

Example of ChIP-seq

ChIP vs NRSF = neuron-restrictive silencing factorJurkat human lymphoblast line

NPAS4 encodes neuronal PAS domain protein 4

Johnson DS, Mortazavi A, Myers RM, Wold B. (2007) Genome-Wide Mapping of in Vivo Protein-DNA Interactions. Science 316:1497-1502.

Page 22: Genomics of Gene Regulation

Distinctive histone modifications and protein binding at promoters and enhancers

Heintzman …Ren (2007) Nature Genetics 39:311-308; Birney et al. (2007) Nature, 447:799-816

• Promoters– H3K4me3,

H3K4me2– RNA Pol II

• Enhancers– H3K4me1– P300

coactivator

Page 23: Genomics of Gene Regulation

Genomic features at T2D risk variants

Overlap of risk associated variants with DHSs and other epigenetic features suggest a role in transcriptional regulation. Overlap with an exon of a noncoding RNA suggests a role in post-transcriptionalregulation. Different hypotheses to test in future work.

UCSC Genome Browser, Regulation tracks, ENCODE http://genome.ucsc.edu/

Page 24: Genomics of Gene Regulation

Variants in 8q24 associated with cancer risk

UCSC Genome Browser, Regulation tracks, ENCODE http://genome.ucsc.edu/

Page 25: Genomics of Gene Regulation

Factor occupancy at cancer-associated variant

UCSC Genome Browser, Regulation tracks, ENCODE http://genome.ucsc.edu/

Page 26: Genomics of Gene Regulation

Genomics of Erythroid Gene Regulation

Page 27: Genomics of Gene Regulation

Hematopoiesis

Page 28: Genomics of Gene Regulation

GATA1, partners, teammates

Page 29: Genomics of Gene Regulation

Gata1–

ES cells

in vitro hematopoietic differentiationerythropoietinstem cell factor

G1E

Somatic cell model to study GATA1 function

immortalize

Erythroid progenitorsBFU-e, CFU-e

G1E-ER4+estradiol

estradiol

Differentiated erythroblasts

add back GATA-1, hybrid protein with ER

G1E-ER4

Page 30: Genomics of Gene Regulation

Restoration of GATA1 in G1E cells mimics many of the steps in erythroid differentiation

Page 31: Genomics of Gene Regulation

Repress proliferative genes, induce differentiation genes

Page 32: Genomics of Gene Regulation

Features interrogated by ChIP-seq and RNA-seq assays

CTCF

DNase hypersensitive sites

Page 33: Genomics of Gene Regulation

ChIP-seq finds previously known distal CRMs: Hbb LCR

-

+Combine

Known CRMs

GATA1

Page 34: Genomics of Gene Regulation

Discrete regions with activating and repressive chromatin modifications

-

+

GATA1Active

Page 35: Genomics of Gene Regulation

Chromatin state distinguishes on from off, not induction from repression

Constitutivehetero-chromatin?

Facultativehetero-chromatin?

Dynamic chromatinbut mostly repressed

Euchromatin

Page 36: Genomics of Gene Regulation

GATA1 activates Zfpm1 by displacing

GATA2 and retaining

TAL1

Page 37: Genomics of Gene Regulation

GATA1 represses

Kit by displacing

GATA2 and TAL1

Page 38: Genomics of Gene Regulation

All GATA1-occupied segments active as enhancers are also occupied by SCL and LDB1

Page 39: Genomics of Gene Regulation

Chromatin state precedes GATA1-induced TF changes

Chromatin stateestablished (mostly):ActiveRepressedDead zones

Induction and repression:Dynamics of transcription factor bindingwithin the already-established chromatin context.

Chromatin condensesNucleus removed

Page 40: Genomics of Gene Regulation

Binding site motifs in occupied DNA segments can be deeply preserved during evolution Consensus binding site motif for GATA-1: WGATAR or YTATCW

5997constrained

7308not constrained

2055no motif

Page 41: Genomics of Gene Regulation

GATA1-occupied segments conserved between mouse and human are tissue-specific enhancers

Collaboration with Len Pennacchio, Hardison lab, ENCODE

Page 42: Genomics of Gene Regulation

Summary: Genomics of Gene Regulation

• Genetic determinants of variation in expression levels may contribute to complex traits - phenotype is not just determined by coding regions

• Biochemical features associated with cis-regulatory modules are being determined genome-wide for a range of cell types.

• These can be used to predict CRMs, but occupancy alone does not necessarily mean that the DNA is actively involved in regulation.

• Genome-wide data on biochemical signatures of functional sequences (DHS, chromatin modifications, transcription factor occupancy, transcripts, etc.) provide candidates for explaining how variants in noncoding regions contribute to phenotypes

Page 43: Genomics of Gene Regulation

Weisheng Wu, Yong Cheng, Demesew Abebe, Cheryl Keller Capone,Ying Zhang, Ross, Swathi Ashok Kumar, Christine Dorman, David King ….Tejaswini Mishra, Nergiz Dogan, Chris Morrissey, Deepti Jain

Collaborating labs: Mitch Weiss and Gerd Blobel (Childrens’ Hospital of Philadelphia), James Taylor (Emory)Webb Miller, Francesca Chiaromonte, Yu Zhang, Stephan Schuster, Frank Pugh, Bob Paulson (PSU), Greg Crawford (Duke), Jason Ernst, Manolis Kellis (MIT)

Thanks

Funding: NIH NIDDK, NHGRI (ARRA), Huck Institutes of Life Sciences and Institute for Cyberscience, PSU

Francesca Chiaromonte