sci cafe humangenome&health

62
Insights from The Human Genome Project about Risk of Disease Toby G. Rossman, Ph.D. The Nelson Institute of Environmental Medicine NYU-Langone School of Medicine

Upload: toby-rossman

Post on 21-Nov-2014

323 views

Category:

Education


0 download

DESCRIPTION

Presented at Hudson Valley Science Cafe

TRANSCRIPT

Page 1: Sci cafe humangenome&health

Insights from The Human Genome Project about Risk of Disease

Toby G. Rossman, Ph.D.

The Nelson Institute of Environmental Medicine

NYU-Langone School of Medicine

Page 2: Sci cafe humangenome&health

In 2003, scientists in the Human Genome Project obtained the DNA sequence of the 3 billion base pairs making up the human genome.

The significance of this work to our health is an ongoing project.

Page 3: Sci cafe humangenome&health

THE HOPE:

Page 4: Sci cafe humangenome&health

Genes and The Genome

Genes are made of DNA and provide the directions for building all of the proteins that make our bodies function. Genes are passed down by parents to their offspring.

The Genome is the sum total of all genetic material (DNA) in a cell.

Page 5: Sci cafe humangenome&health

Introduction to DNA

A double helix (2 strands) with a sugar-phosphate backbone.

Attached to each sugar are one of 4 “bases”: A, G, T, C

Base pair:C always pairs with GA always pairs with T

The sequence of the bases (e.g.: AATGCCCTGAACGTT) contain the information (genetic code).

Page 6: Sci cafe humangenome&health

DNA is the genetic material within the nucleus.

What DNA does

The process of transcription

creates an RNA using

DNA information.

The RNA leaves the nucleus.

Cytoplasm

Nucleus

DNA

RNA

Protein

Replication

Transcription

Translation

The process of translation takes place in the cytoplasm.

I creates a protein using

RNA information.

DNA is the genetic material within the nucleus. It is many genes long.

DNA is replicated before a cell divides. Each daughter cell receives an identical copy.

The DNA in a gene is transcribed to create an RNA copy, using information from one DNA strand.

The RNA leaves the nucleus.

In the cytoplasm, the RNA is translated to create a protein

using RNA information.

Page 7: Sci cafe humangenome&health

The “Central Dogma” of biologyThe “Central Dogma” of biology

DNADNA

mRNARNAcarries the messagecarries the message

Protein

transcriptiontranscription

translationtranslation

A protein is a string of amino acids

stores all informationstores all information

Codon = 3 bases

Page 8: Sci cafe humangenome&health

All known organisms use the same genetic code. (Unity of life on earth; we are all related).

The genetic code is universal.

The genetic code is degenerate.Some codons encode the same amino acid. e.g. GGU, GGC, GGA, and GGG all encode glycine.Degeneracy is mostly at the third base of the codon.

Some codons have additional functions.AUG encodes methionine, but is also a start codon.

UAA, UAG, and UGA do not encode an amino acid.These codons signal termination of the protein (stop codon).

The Genetic Code

Page 9: Sci cafe humangenome&health

DNA is packed into chromosomes (highly condensed)

Page 10: Sci cafe humangenome&health

Genes are present both in the nucleus and in mitochondria

1 2 3 4 5 6 7 8 9 10 11 12

Sex-chromosomes

mtDNA

16,569 bp

Autosomes

Mitochondrial DNA

Nuclear DNA

3.2 billion bp

Chromosomes located in cell nucleus

1 chromosome in mitochondria (hundreds

of copies in cell cytoplasm)

Inherited only from mother

13 14 15 16 17 18 19 20 21 22 X Y

Page 11: Sci cafe humangenome&health

An important technique: Hybridization

Heating the cellular DNA will separate the 2 strands.

You can then identify a piece of DNA by hybridization, using a “probe” of the complementary strand (at least 12 bases). This also works with RNA.

HOT

COLD

Page 12: Sci cafe humangenome&health

Painting the human chromosomesPainting the human chromosomes

Chromosome painting

Page 13: Sci cafe humangenome&health

A pair of homologous chromosomes (one from father and one from mother)

Gene – unit of DNA information about a trait

Alleles – slightly different versions of a gene

p

Q

Homozygous: both alleles are the same

Heterozygous: the alleles are different

Most genes are in the 23 pairs of chromosomes in the cell nucleus.

Chromosomes contain thousands of genes.

SOME DEFINITIONS

Page 14: Sci cafe humangenome&health

• The human genome is nearly the same in all people. We are >99.9% alike.

• Only about 1.5% of the human genome contains genes that are translated into proteins.

• Most of our DNA is transcribed into RNA, but not translated.

Some important lessons from the Human Genome Project

Page 15: Sci cafe humangenome&health

Only a small % gets translated into protein

NEXT SLIDE

Page 16: Sci cafe humangenome&health

The DNA in a gene is interrupted. It is not all expressed.

Exons are expressed; Introns are removed

Alternative splicing of exons allows more than 1 protein from each transcript

EXON = expressed

INTRON = not expressed

intronexon exon exon exonintron intron

Page 17: Sci cafe humangenome&health

It turns out that 90% of the genome is actively transcribed into RNA.

Although initially dismissed as “junk DNA”, recent evidence suggests

that many of the RNAs transcribed from this DNA play major biological

roles in control of gene expression, cellular development and

metabolism.

For example:

The Human Genome encodes only ~20,000 protein-coding genes. But

we have about 100,000 different proteins! Alternative splicing yields

many more than 20,000 proteins. RNA molecules may help control

which splicing event occurs.

What is all that DNA for?

We are at the tip of a very large iceberg!

Page 18: Sci cafe humangenome&health

Genetic variation

Single nucleotide polymorphisms (SNPs)

• Our genes are >99.9% alike (unless we are identical twins).

• Gene sequences can differ at a single base (called single nucleotide polymorphism or SNP).

• The human genome has at least 10 million SNPs.

Page 19: Sci cafe humangenome&health

Understanding SNPs

• Most variations are meaningless and do not affect

our ability to survive or adapt.

• For example: silent mutations in DNA, which

change the DNA, but do not change the amino acid

the DNA codes for.

• Other mutations may change the amino acid

sequence of a protein, but not the overall function

of that protein.

• Some variation leads to disease. Single-gene

disorders include sickel cell anemia, cystic fibrosis

and Huntington disease.

Page 20: Sci cafe humangenome&health

There are other types of genetic variation besides SNPs

Eleanor Rigby picks up the rice in the church. (Original sequence)

Eleanor Rigby picks up the lice in the church. (SNP)

Eleanor Rigby picks up the church. (deletion)

Eleanor Rigby picks up the rice and beans in the church. (insertion)

Eleanor Rigby picku pt her icei nt hec hurch. (-1 frameshift)

Eleanor Rigby picks hcruhc eht ni ecir eht pu. (inversion)

Eleanor Rigby picks up the rice in the church. Eleanor Rigby picks up the rice in the church. (gene duplification)

Rearrangement: Gene gets transposed to another chromosome.

Page 21: Sci cafe humangenome&health

Gene duplication: A major event in evolutionGene duplications produce copy number variations (CNV) in humans

Page 22: Sci cafe humangenome&health

Copy number for amylase-1 gene is higher in populations with a starchy diet.

Amylase digests starch.

Perry et al., Nature Genetics 2007

50%

Gene copy number/cell

Copy number variation (CNV)

Page 23: Sci cafe humangenome&health

How different are human beings from one another?

Differences in SNPs: about 10 million (~3/1000 bases)

Large deletions and insertions: about 15 million base differences

Copy number variations: can lead to very large numbers of base changes

Page 24: Sci cafe humangenome&health

Risk Factors for Disease

A risk factor increases your risk of developing a disease or health problem.

Behaviors and lifestyle (including diet)

“Environment”

Genes

+ = Environment

How can you tell which is more important? Sometimes a single gene mutation leads to disease. THE EASY CASES

Page 25: Sci cafe humangenome&health

SICKLE CELL ANEMIAAn example of a single gene disease

• Caused by a mutant allele h of a hemoglobin gene H.

• 1/500 black Americans have the disease. (homozygous hh)

• 1/10 is heterozygous (Hh) for the sickle cell gene

• The mutant gene is even more common among West African Blacks.

• In some parts of Africa, the fraction of individuals with this disease is 1/25

• The mutant allele confers some resistance to malaria.

Page 26: Sci cafe humangenome&health

Fig. 11.2

Sickel cell anemia is caused by a single base change (SNP)

Page 27: Sci cafe humangenome&health

Some other genetic disorders caused by single genes

Cystic fibrosis 1/2,000 Caucasians

1/17,000 African Americans

1/9,000 Hispanics

Obstructive lung disorder, infections, heart failure

Tay-Sachs disease 1/300,000 in US

1/3500 in Ashkenazi Jewish

Destroys nervous system at about 8 months. Rarely live past age 2.

Huntington’s disease 1/20,000 (Western countries), less freq. in Africa & Asia

Gradual deteriation of nervous system

Maple syrup urine disease 1/9,000 to 1/300,000

1/176 in PA Mennonites

Vomiting, seizures, mental retard., coma. Death in 2 years

Alpha-1-antitrypsin 1/2000 Early lung disorder (by 40 years)

BRCA1 or BRCA2 breast cancer

Will discuss later along with other cancer risk genes

Increased risk for breast cancer

Phenylketonurea (PKU) Mental retardation, excretion of phenylalanine

Hemophilia B X-linked recessive Lack of clotting factor IX

Disorder Incidence (US) Symptoms

Page 28: Sci cafe humangenome&health

Complexities in single gene diseases: Cystic Fibrosis

Cystic fibrosis (CF) is caused by an autosomal recessive mutation in the gene CFTR (cystic fibrosis trans-membrane conductance regulator).

However, it is not possible to predict the exact phenotype of this disease from analysis of CFTR mutations. Other genes modify the disease.

Page 29: Sci cafe humangenome&health

Polygenic inheritanceThe Harder cases

• These underlie some of the more clinically important

human diseases including

• Heart disease

• Stroke

• Diabetes

• Schiozphrenia

Page 30: Sci cafe humangenome&health

Body Mass Index (BMI)

• At least 17 genes interact to control body weight: Genes that affect how much we eat, metabolic rate and fat distribution.

• One of these genes encodes a protein hormone called leptin. Eating stimulates fat cells to secrete leptin. Leptin travels to the brain and signals it to supress appetite and increase metabolism to digest the food.

• Low levels of leptin indicate starvation, which triggers hunger and decreases the metabolic rate.

• However, body weight is not entirely genetically determined.

Page 31: Sci cafe humangenome&health
Page 32: Sci cafe humangenome&health

Breast cancer risk differs in different countries: Heredity or Environment?

US

Page 33: Sci cafe humangenome&health

• Adult use associated with risk (about 10% increase for each drink per day)

• All studies have reported impact of early age alcohol use on breast cancer risk

• About twice the risk of breast cancer for women below 35 years

• Alcohol use increases estrogen levels

• Adequate folic acid (B vitamin) may decrease risk in women who have more than 1 drink per day

e.g. Alcohol Use

Breast cancer risk cannot be entirely genetic.

For example, diet, obesity, radiation exposure, and alcohol use influence risk.

Page 34: Sci cafe humangenome&health

High incidence in Japanese men in Japan, but lower in High incidence in Japanese men in Japan, but lower in HawaiiHawaii

Another example: stomach cancer

Page 35: Sci cafe humangenome&health
Page 36: Sci cafe humangenome&health

Example of multifactor causation: breast cancer

or

Page 37: Sci cafe humangenome&health

Other inherited cancer syndromes

Syndrome Primary tumors Other tumors/traits

Dominant

Familial retinoblastoma retinoblastoma osteosarcoma

Hered. non-polyposis colon can. colorectal many

Familial adenomatous polyposis colorectal other GI, jaw, brain

Nevoid basal cell carcinoma skin jaw cysts, ovary, fibomas

Familial melanoma skin pancreas

Multiple endocrine neoplasia (1/2) pancreas(1), thyroid(2) other endocrine

Li-Fraumeni syndrome sarcoma, breast brain, leukemias

Recessive

Ataxia telangiectasia lymphoma cerebellar

ataxia

Bloom’s syndrome solid tumors

immunodeficiency,

small stature

Xeroderma pigmentosum skin abnormal

pigmentation

Fanconi’s anemia AML skeletal abnormalities

Page 38: Sci cafe humangenome&health

Exposure to mutagenic/carcinogenic

compounds

DNA adduct

A

T

C

G*

N

N

N

N

NH2

O

O

O

P O-O

O

O

OH

OHHO

NH

O

O

O O-

O

P

O

O

O O-

O

P

N

N

N

NH

O

P O-O

O

O

O

NH

N

O

N

N

NH2

O

Metabolic activation

DNA repair

Mutation

CancerCancer

Chemical carcinogens often cause DNA damage which leads to mutations if the damage is not

repaired

No Cancer

Page 39: Sci cafe humangenome&health

DNA repair defect diseases

Page 40: Sci cafe humangenome&health

Proteins known to bind to BRCA1’s BRCT domains—such as Abraxas, BACH1, and CtIP—are all involved in the DNA repair process.

This meshes with the prevailing view that faulty DNA repair by a mutated BRCA1 results in genomic instability, which ultimately leads to tumorigenesis.

Involvement of BRCA1 in DNA repair

Page 41: Sci cafe humangenome&health

Pennisi E, Science 2007; 318:1842-43.

2007: The Year of GWA Studies

Page 42: Sci cafe humangenome&health

THE NEWEST METHOD FOR GENETIC ANALYSIS:Genome-Wide Association Study (GWAS)

GWAS is an approach that involves rapidly scanning genetic markers across the complete sets of DNA (genomes), of many people to find genetic variations associated with a particular disease.

Such studies are being carried out to find genetic variations that contribute to common, complex diseases, such as asthma, cancer, diabetes, heart disease and mental illnesses.

Page 43: Sci cafe humangenome&health

The Gene Chip (microarray)

Page 44: Sci cafe humangenome&health

Microarray chips

Oligonucleotide probes representing many genes are spotted in an array.

Page 45: Sci cafe humangenome&health
Page 46: Sci cafe humangenome&health

Types of microarrays

Comparative Genomic Hybridization (CGH): for genomic gains and losses or for a change in the number of copies of a particular gene.

Microarray expression analysis: to determine the level of expression of a gene. Reflects subject’s mRNA levels.

SNP or mutation analysis: In this case, gene sequences placed on any given spot within the array will differ from that of other spots, by only one (SNP) or a few specific nucleotides.

Page 47: Sci cafe humangenome&health

http://www.broad.mit.edu/diabetes/scandinavs/type2.html

Genome-Wide Scan for Type 2 Diabetes in a Scandinavian Cohort:

SNP results

Page 48: Sci cafe humangenome&health

Type 2 Diabetes

GWAS have accelerated the identification of type 2 diabetes susceptibility genes. There are now at least 19 loci containing genes that increase risk of type 2 diabetes.

Individually, most of these variants confer a modest risk (odds ratio [OR] = 1.1–1.25) of developing type 2 diabetes.

To date, these approaches have only identified two genes: PPARG (peroxisome proliferator-activated receptor-r ) and KCNJ11(potassium inwardly-rectifying channel J11) robustly implicated in type 2 diabetes susceptibility.

Page 49: Sci cafe humangenome&health

GWAS for SNP Associations with Myocardial Infarction show hot

region on Ch 9

Samani N et al., N Engl J Med 2007; 357:443-53.

Page 50: Sci cafe humangenome&health

Klein et al, Science 2005; 308:385-389.

GWAS for Age-Related Macular Degeneration

Page 51: Sci cafe humangenome&health

WTCCC, Nature 2007; 447:661-678.

Wellcome Trust GWAS of Seven Diseases

Page 52: Sci cafe humangenome&health

BIG SURPRISE: Most SNPs associated with disease susceptibility are in introns and intergenic positions!!

Page 53: Sci cafe humangenome&health

Unique Aspects and problems of GWAS

• GWAS permits examination of inherited genetic variability at unprecedented level of resolution.

• GWAS permits "agnostic" genome-wide evaluation. • Once a genome is measured, it can be related to any trait.• Most robust associations in GWA studies have not been

with genes previously suspected of association with the disease.

• Many associations are in regions that do not harbor genes.

But with more than 500,000 comparisons per study, the potential for false positive results is unprecedented (and expensive!).

Most associations are NOT robust.

Page 54: Sci cafe humangenome&health

Further Reading (and some criticisms)

Taft et al., Non-coding RNAs: regulators of disease. J. Pathology 220:126-139, 2010

Roberts et al., The predictive capacity of personal genome sequencing. Science Translational Medicine 2010 10.1126/scitranslmed.3003380 stm.sciencemag.org

Ioannidis et al., A compendium of genome-wide associations for cancer: Critical synopsis and reappraisal. J. Natl. Cancer Inst. 102:846-858, 2010

Bell, Our changing view of the genomic landscape of cancer. J. Pathology 220:231-243, 2010

Vineis and Pearce, Genome-wide association studies may be misinterpreted: genes versus heritability. Carcinogenesis 32:1295-1298, 2011

Page 55: Sci cafe humangenome&health

CDC, Office of Genomics and Disease Prevention www.cdc.gov/genomics/public/famhist.htm

"DNA Interactive" Site from Cold Spring Harbor Labs: http://www.dnai.org/index.htm

Howard Hughes Medical Institute's "Biointeractive" Site http://www.hhmi.org/biointeractive/genomics/microarray.html

Learn Genetics — Genetic Science Learning Centerhttp://learn.genetics.utah.edu/

National Center for Biotechnologyhttp://www.ncbi.nlm.nih.gov/About/primer/snps.html

http://hapmap.ncbi.nlm.nih.gov/

Animations On How DNA Microarrays Workhttp://www.imagecyte.com/animations/array2.html http://www.bio.davidson.edu/courses/genomics/chip/chip.html

Websites

Page 56: Sci cafe humangenome&health

Some other things besides genes are important

Page 57: Sci cafe humangenome&health
Page 58: Sci cafe humangenome&health
Page 59: Sci cafe humangenome&health

Founder Effects

Occur when a population is established by a small number of people. A mutation in one of the founders becomes prevelent in the resulting population.

Afrikaners (S Africa) Familial hypocholesterolemia, APC, BRCA1/2, Blooms

French Canadians HED (skin disorder), congenital adrenal hyperplasia

Finns hMLH1, diastrophic dysplasia

Icelanders BRCA2

Dutch BRCA1/2, melanoma

Norwegians BRCA1

North Africans Allgrove syndrome

Swedes BRCA1/2

African Americans BRCA1

Germans (Black Forest) Von Hippel-Lindau disease

France (Rhone Alps) Hemophilia B

Sicilians Glycogen storage disease type II

South Italians CDA-II (anemia)

Page 60: Sci cafe humangenome&health

Steps in Chemical Carcinogenesis (also radiation)

Page 61: Sci cafe humangenome&health

HAPMAP

Testing all of the 10 million common SNPs in a person's chromosomes would be extremely expensive. The development of the HapMap will enable geneticists to take advantage of how SNPs and other genetic variants are organized on chromosomes. Genetic variants that are near each other tend to be inherited together. For example, all of the people who have an A rather than a G at a particular location in a chromosome can have identical genetic variants at other SNPs in the chromosomal region surrounding the A. These regions of linked variants are known as haplotypes.

Page 62: Sci cafe humangenome&health

Diseases and Traits with Published GWA Studies (n = 76, 11/17/08)

• Macular Degeneration• Exfoliation Glaucoma

• Lung Cancer• Prostate Cancer• Breast Cancer• Colorectal Cancer• Bladder Cancer• Neuroblastoma• Melanoma• TP53 Cancer

Predispos’n• Chr. Lymph. Leukemia

• Inflamm. Bowel Disease• Celiac Disease• Gallstones• Irritable Bowel

Syndrome

• QT Prolongation • Coronary Disease• Coronary Spasm • Atrial

Fibrillation/Flutter• Stroke• Subarachnoid

Hemorrhage• Intracranial Aneurysm • Hypertension• Hypt. Diuretic Response• Peripheral Artery

Disease

• Syst. Lupus Erythematosus

• Sarcoidosis• Pulmonary Fibrosis• Psoriasis • HIV Viral Setpoint• Childhood Asthma

• Type 1 Diabetes • Type 2 Diabetes• Diabetic Nephropathy • End-St. Renal Disease• Obesity, BMI, Waist,

IR• Height• Osteoporosis• Osteoarthritis• Male Pattern Baldness

• F-Cell Distribution• Fetal Hgb Levels• C-Reactive Protein• ICAM-1• Total IgE Levels• Uric Acid Levels, Gout• Protein Levels• Vitamin B12 Levels• Recombination Rate• Pigmentation

• Lipids and Lipoproteins• Warfarin Dosing• Ximelegatran Adv.

Resp.

• Parkinson Disease• Amyotrophic Lat.

Sclerosis• Multiple Sclerosis• MS Interferon-β

Response • Prog. Supranuclear

Palsy• Alzheimer’s Disease in

ε4+• Cognitive Ability• Memory• Hearing• Restless Legs Syndrome • Nicotine Dependence• Methamphetamine

Depend.• Neuroticism• Schizophrenia• Sz. Iloperidone

Response• Bipolar Disorder• Family Chaos• Narcolepsy• Attention Deficit

Hyperactivity• Personality Traits

• Rheumatoid Arthritis• RA Anti-TNF Response