mcb 104 - quiz 2 combined answer key

Upload: azgaray7

Post on 23-Feb-2018

213 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/24/2019 MCB 104 - Quiz 2 Combined Answer Key

    1/6

    MCB 104 Quiz 2 combined answer key (Different sections had different questions.)

    Lecture 13:

    1) During a river boat trip on the Amazon river, you encounter a small tribe of people that have

    lived there for hundreds of years. Youre curious how isolated theyve been from other tribes in

    the area, and decided to take a whole genome sequencing approach to identify unique

    polymorphisms to their tribe. (Note that all sequences in the following problem are oriented 5 ->

    3 and are from the same strand).

    (A) Assemble the following reads from the Amazonian tribe genome project into a

    contig.

    There must be at least 5 bases that match in order for two reads to be considered

    overlapping. Order the given reads into a contig, and list the read numbers in order (for

    example, #5 -> #6 -> #7). (2pts)[3 -> 1 -> 2]

    Read #1: GATGCTCGCCGTAGGAGACG

    Read #2: GGAGACGTTCAGCGATCCTG

    Read #3: CACGACAATCGGGCGATGCT

    (B) In addition to the contig you found in A (Contig A), you also obtain two more

    contigs:

    Contig B: TTGCGAGTCAAGCGTTCACGACAATCGCCCGATC

    Contig C: TCGAATTCGATTAGGGCGATTCCGTTAGATTCCCA

    Now you would like to assemble all three into scaffolds, so you use additional paired-end

    reads which span the contigs, shown below:

    Paired-end read #1: TCCTGGAATCG.....Gap.....CGCATTCGAATT

    Paired-end read #2:GCCCGATCCTG.....Gap.....ACCCACGACAA

    Using the information above from the paired-end sequencing, assemble the three contigsinto scaffolds. List the contigs in order (for example, G -> E -> F), and indicate which

    paired-end read connects which contigs. (4pts) [ B -> PE#2 -> A -> PE#1 -> C]

    2) You have three DNA fragments that are 100bp long. You did paired-end sequencing and

    below is the result

    Fragment1: ACGCTCGTTATGCTA------------------------------------GCGTTATGTCGATTC

    Fragment2: GAGGCGTTACGTCAA------------------------------------AGTCTATTGGCTAAA

    Fragment3: TATTGGCTAAATCGA------------------------------------ATAGGACGCTCGTTA

    You overlapped these results and discovered that the fragments partially overlap:___________________________ Fragment 1

    ___________________________ Fragment (i)

    ___________________________ Fragment (ii)

    a) What is fragment (i) and fragment (ii)? [2p]

    Fragment (i) = Fragment 3

    Fragment (ii) = Fragment 2

  • 7/24/2019 MCB 104 - Quiz 2 Combined Answer Key

    2/6

    MCB 104 Quiz 2 combined answer key (Different sections had different questions.)

    b) How many contigs and gaps does this scaffold have? For this question, consider

    single reads with no overlap as contigs. (2pts)

    Contigs: 4, Gaps: 3

    c) What is the total length of the scaffold? Show your work. (2pts)

    100+100+100-11-10 = 279bp

    3) Restriction enzymes cut DNA into fragments according to sequence. The restriction enzyme

    HpyCH4V cleaves at sequences TG^CA (^ indicates where the cleaving occurs). How many

    times does HpyCH4V cleave the following sequence? Please circle or underline the restriction

    enzyme sites in the following sequence [6 points]

    agtaagagaattatgcagtgctgccataaccatgagtgataacactgcggccaacttacttctgacaacgatcggaggaccgaa

    ggagctaaccgcttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaatgaagccatacca

    aacgacgagcgtgacaccacgatgcctgtagcaatggcaacaacgttgcgcaaactattaactggcgaactacttactctagctt

    cccggcaacaattaatagactggatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtttat

    tgctgataaatctggagccggtgagcgtgggtctcgcggtatcat

    tgca

    gcactgg

    4) You do your paired-end sequencing of all the fragments for a whole-genome sequencing

    project, but your computer file becomes corrupted. The reads are intact, but you no longer know

    which reads correspond to which fragment - all the pairing (paired end) data is lost.

    a. 2 pt Will you be able to assemble contigs with your remaining data? Why or why

    not?

    Yes, you still have reads and can overlap them to form a contig

    b. 2 pt Will you be able to make scaffolds? Why or why not?

    Nothing larger than a contig - you dont have the paired end data that linkscontigs together anymore

    c. 2 pt. The computer technicians are able to recover additional sequence reads that

    you didnt know were missing from that file, but they still lack pairing

    information. Some of these new reads partially overlap with two

    non-overlapping reads that you had before. How does this affect your answers to a

    and b?

    Youll be able to make longer contigs, but still no scaffolds

    Lecture 14, Monday lecture:5) After completing the sequencing of the Ornithorhynchus anatinus (duck-billed platypus)

    genome, you next want to know what portions of the platypus genome are transcribed. To find

    transcribed genes within the genome, you purify mRNAs from three different tissue types (brain,

    blood, and liver). You then synthesize cDNA from this mRNA as was covered in lecture, make a

    cDNA library from each tissue, then sequence the end of every clone. Your initial sequencing

    reveals three highly expressed platypus genes, which you name S, B, and M. You find cDNAs

  • 7/24/2019 MCB 104 - Quiz 2 Combined Answer Key

    3/6

    MCB 104 Quiz 2 combined answer key (Different sections had different questions.)

    from gene S in every tissue, but only find cDNAs from gene B in brain tissue and cDNAs from

    gene M in blood.

    (A) Why do you not find cDNAs from genes B and M in every tissue you sampled from?

    (3pts)because gene B is only expressed in the brain/has a brain-specific

    promoter/enhancers/cis-regulatory elements, and gene M is only expressed in the

    blood/has a blood-specific promoter/enhancers/cis-regulatory elements

    (B) It turns out that cDNAs of gene S derived from brain tissue have exons that are not

    present in the cDNAs of gene S from the blood or liver libraries. How could you explain

    this tissue-specific difference in what exons are present in the gene S cDNAs? (3pts)

    splicing differences between tissues

    6) Youve decided to undertake a transcriptome analysis (studying of all the mRNAs expressed

    in a cell) of brain and heart tissue, focusing on mRNAs encoding components of voltage-gated

    calcium channels.

    A. (1 pt) Which enzyme would you use to convert your RNAs to the corresponding cDNAs?

    reverse transcriptase

    B. (2 pts) If the same voltage-gated calcium channel genes are expressed in both brain and

    heart, before you collect sequence data can you predict how much cDNA from each gene

    will be found in brain versus heart tissue? Why or why not?

    no, because even if the same genes are transcribed, the relative number of RNAs could

    vary between tissue types

    C. (3 pts) You discover that in both the brain and heart, there are moreunique types of

    cDNAs (unique type meaning different sets of genomic sequences present in the cDNA)corresponding to voltage-gated calcium channels than there are genes. What is a possible

    explanation for this discrepancy? How does this explanation account for the discrepancy?

    alternative splicing (different exons being included/excluded in the final mRNA) leads to

    multiple RNA types derived from the same gene

    7) Youve discovered a new type of butterfly that looks very similar to the monarch butterfly.

    Since they look so similar, you reason they might share some genetic features. Here is an

    example sequence from genomic DNA and cDNA from both butterflies:

    New butterfly:genomic DNA ATGCAGGAGCATAAATTCCGCAATTGGATTAGGATGCCCAGTATT-

    cDNA ATGCAGGAGCAT TGGATTAGG

    Monarch butterfly:

    genomic DNA ATGCAGGAGCATCCATGAGACAATTGGATTAGGTACTTAAGTATT-

    cDNA ATGCAGGAGCAT TGGATTAGC

    A. (2 pts) Circle the introns and draw boxes around the exons in the genomic DNA

  • 7/24/2019 MCB 104 - Quiz 2 Combined Answer Key

    4/6

    MCB 104 Quiz 2 combined answer key (Different sections had different questions.)

    regions that are present also in the cDNA are exons, regions that are not also

    present in the cDNA are introns

    B. (2 pts) If this genomic region from the new butterfly was the most similar in

    sequence to this genomic region from monarch butterflies (and vice versa), what

    word would you use to describe the relationship between these two genes?

    orthologs

    C. (2 pts) There is a polymorphism in one of the exons. What kind of polymorphism

    is it? A SNP (single nucleotide polymorphism) or single base substitution

    8) mRNAs can not be sequenced directly by dideoxy (Sanger) sequencing.

    a. To make cDNA from mRNA, what one kind of nucleic acid do you need (in addition

    to the mRNA) and what one enzyme would you need to synthesize the cDNA? (2pts)

    A primer (or oligonucleotide or poly-T primer) , and reverse transcriptase

    b. (2 pt) If you put a cDNA into a plasmid and sequenced both ends, what type of reads

    would you generate?

    5 and 3 EST reads

    c. (2 pts) Aside from learning regions of the genome where transcription occurs, what

    else can you learn from comparing individual EST sequences to the genomic sequence in

    cases where one EST read aligns perfectly to two blocks of genomic sequence?

    location of introns/exons

    Lecture 15: Wednesday lecture:

    9) Some paternity tests, which test whether someone is the father of a particular child, work by

    genotyping a large collection of SNPs from both the potential father and the child.a. On average, what percent of genome-wide SNPs on autosomes (non-sex chromosomes)

    would be identical between a parent and their offspring, and why? [3p]

    50%. Because a child inherits half of their chromosomes from each parent, you would

    expect on average 50% of genome-wide genotypes to be identical.

    b) Why would you want to genotype multiple SNPs in a paternity test, and not just a

    single SNP? [3p]

    It would be way too easy for a single SNP to be the same between two individuals just

    by chance or from distant relatedness

    10) You are given the DNA sequence at the same locus for a few individuals:

    Person A GGCCGCGTGCCCAGGGTATATTTGGTCAGTATGGAATGGACT

    Person B GGCCGCGTGCCCACGGTATATTTGGTCAGTATCGAATGGACT

    Person C GGCCGCGTCCCCACGGTATATTAGGTCAGTATGGAATGGACT

  • 7/24/2019 MCB 104 - Quiz 2 Combined Answer Key

    5/6

    MCB 104 Quiz 2 combined answer key (Different sections had different questions.)

    Person 1 GGCCGCGTGCCCACGGTATATTTGGTCAGTATCGAATGGACT

    Person 2 GGCCGCGTGCCCAGGGTATATTTGGTCAGTATGGAATGGACT

    a. (2 points) Rank individuals A, B, and C in order of how closely they are related to Person

    1, based on the above sequences.Person B is most closely related, then Person A, then Person C

    b. (2 points) Rank individuals A, B, and C in order of how closely they are related to Person

    2, based on the above sequences.

    Person A is most closely related, then Person B, then Person C

    c. (2 points) What do we call the bolded regions? How did you use them to answer parts a

    and b?

    SNPs, more closely related individuals share more SNPs

    Lecture 16: Friday lecture:

    11) Transposable elements (indicated by the arrows over yellow segments below) can cause

    deletions. Using the following diagram, draw the products (2 points each) and the crossover

    intermediate (2 points) from the aberrant recombination. Either single-stranded crossover ->

    deletion, or two-stranded crossover -> deletion + duplication

    12) Nonallelic homologous recombination occurs at two regions of homologous sequence that

    arent at allelic positions.

    a. (2 pt) If the homologous sequences are on the same direction on the same strand, what

    outcome do you expect? What is/are the product(s)?

    deletion plasmid with deleted sequence, chromosome missing deleted sequence

    b. (2 pt) If the homologous sequences are in the opposite direction on the same strand, what

    outcome do you expect? What is/are the product(s)?

    inversion the sequence in between will be backwards (inverted)

    c. (2 pt) What result do you expect from nonallelic homologous recombination between two

    strands?

    deletion and a duplication

    13)

  • 7/24/2019 MCB 104 - Quiz 2 Combined Answer Key

    6/6

    MCB 104 Quiz 2 combined answer key (Different sections had different questions.)

    The above diagram shows some parts of two homologous chromosomes. The sequences i) and ii)

    are identical. What are the two non-allelic crossing over events that could occur in these

    chromosomes? Draw the events specifically. [3pts per event]

    Deletion (crossing over within one chromosome or between two chromosomes)

    Duplication (crossing over between two chromosomes)

    See slides for diagrams of crossing-over events