alternative splicing of intrinsically disordered regions ... during evolution, development and...

8
Alternative splicing of intrinsically disordered regions and rewiring of protein interactions Marija Buljan 1 , Guilhem Chalancon 1 , A Keith Dunker 2 , Alex Bateman 3 , S Balaji 1 , Monika Fuxreiter 4 and M Madan Babu 1 Alternatively spliced protein segments tend to be intrinsically disordered and contain linear interaction motifs and/or post- translational modification sites. An emerging concept is that differential inclusion of such disordered segments can mediate new protein interactions, and hence change the context in which the biochemical or molecular functions are carried out by the protein. Since genes with disordered regions are enriched in regulatory and signaling functions, the resulting protein isoforms could alter their function in different tissues and organisms by rewiring interaction networks through the recruitment of distinct interaction partners via the alternatively spliced disordered segments. In this manner, the alternative splicing of mRNA coding for disordered segments may contribute to the emergence of new traits during evolution, development and disease. Addresses 1 MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom 2 Center for Computational Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, IN 46202, USA 3 European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Saffron Walden CB10 1SD, United Kingdom 4 DE OEC-Momentum Laboratory of Protein Dynamics, University of Debrecen, H-4032 Debrecen, Nagyerdei krt 98, Hungary Corresponding authors: Buljan, Marija ([email protected]) and Babu, M Madan ([email protected]) Current Opinion in Structural Biology 2013, 23:443450 This review comes from a themed issue on Sequences and topology Edited by Julian Gough and A Keith Dunker For a complete overview see the Issue and the Editorial Available online 22nd May 2013 0959-440X/$ see front matter, # 2013 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.sbi.2013.03.006 Introduction One of the major challenges in biology is to understand how novel phenotypic traits emerge and to describe their underlying molecular mechanisms. Numerous studies have highlighted the importance of gene duplication and recombination on one side [1,2] and alterations in gene expression regulation on the other side [3] as fundamental for the evolution of new phenotypic traits. Recently, alternative splicing (AS) has been shown to contribute significantly to changes in proteome composition between organisms and to the increase in cellular complexity and diversity of phenotypes among eukaryotes [4,5 ,6 ,7]. While AS plays a key role in the expansion of proteomic and regulatory complexity, the molecular and biochemical functions of differentially spliced exons remain largely uncharacterized [8]. AS tends to avoid protein domains but more often affect intrinsically disordered (ID) regions [912]. However, the general principles behind how these different isoforms contribute to the functional versatility and how they mediate the emergence of novel phenotypes is just beginning to be grasped [5 ,6 ,13 ,14 ]. ID regions are polypeptide segments that do not acquire a defined tertiary structure autonomously but adopt diverse interconverting conformational states [1518]. Frequently, disordered regions fold completely upon binding [19] while in some cases such regions remain disordered even in the bound state [20,21]. Here, we synthesize emerging concepts to shed light on how AS of disordered regions might rewire molecular interactions, and how this may facilitate the emergence of novel phenotypes. In particular, we discuss how such events could be crucial for defining tissue-specific (TS) and organism-specific traits. While AS of structured domains may lead to similar effects, these are not discussed here. How do disordered regions increase the functional versatility of proteins? ID segments increase the functional versatility of proteins by: (a) providing conformational heterogeneity between structured domains, thus increasing the number of confor- mational states (Figure 1a); (b) presenting linear peptide motifs [22] that bind to other proteins, nucleic acids and small molecule ligands, thereby enabling larger protein interaction landscapes (Figure 1b) and (c) presenting amino acids that become post-translationally modified (PTM), thereby altering the chemical nature of the poly- peptide and further increasing its diversity of interactions (Figure 1c). In addition, the overall net charge of the disordered segment, the modification state, and the amino acid composition can all influence the conformational ensemble and the chemical states of the entire protein, thereby altering the functions of the full-length proteins [18,2332]. Splicing of disordered segments rewires interaction networks Spliced ID regions often harbor functionally important sites Earlier computational studies of alternative protein iso- forms from the UniProt database (http://www.uniprot.org/) as well as structures of different isoforms in the PDB Available online at www.sciencedirect.com www.sciencedirect.com Current Opinion in Structural Biology 2013, 23:443450

Upload: others

Post on 22-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Alternative splicing of intrinsically disordered regions ... during evolution, development and disease. Addresses 1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge

Available online at www.sciencedirect.com

Alternative splicing of intrinsical

ly disordered regions andrewiring of protein interactionsMarija Buljan1, Guilhem Chalancon1, A Keith Dunker2, Alex Bateman3,S Balaji1, Monika Fuxreiter4 and M Madan Babu1

Alternatively spliced protein segments tend to be intrinsically

disordered and contain linear interaction motifs and/or post-

translational modification sites. An emerging concept is that

differential inclusion of such disordered segments can mediate

new protein interactions, and hence change the context in which

the biochemical or molecular functions are carried out by the

protein. Since genes with disordered regions are enriched in

regulatory and signaling functions, the resulting protein isoforms

could alter their function in different tissues and organisms by

rewiring interaction networks through the recruitment of distinct

interaction partners via the alternatively spliced disordered

segments. In this manner, the alternative splicing of mRNA coding

for disordered segments may contribute to the emergence of new

traits during evolution, development and disease.

Addresses1 MRC Laboratory of Molecular Biology, Francis Crick Avenue,

Cambridge CB2 0QH, United Kingdom2 Center for Computational Biology and Bioinformatics, Department of

Biochemistry and Molecular Biology, Indiana University School of

Medicine, Indianapolis, IN 46202, USA3 European Bioinformatics Institute, Wellcome Trust Genome Campus,

Hinxton, Saffron Walden CB10 1SD, United Kingdom4 DE OEC-Momentum Laboratory of Protein Dynamics, University of

Debrecen, H-4032 Debrecen, Nagyerdei krt 98, Hungary

Corresponding authors: Buljan, Marija ([email protected])

and Babu, M Madan ([email protected])

Current Opinion in Structural Biology 2013, 23:443–450

This review comes from a themed issue on Sequences and topology

Edited by Julian Gough and A Keith Dunker

For a complete overview see the Issue and the Editorial

Available online 22nd May 2013

0959-440X/$ – see front matter, # 2013 Elsevier Ltd. All rights

reserved.

http://dx.doi.org/10.1016/j.sbi.2013.03.006

IntroductionOne of the major challenges in biology is to understand

how novel phenotypic traits emerge and to describe their

underlying molecular mechanisms. Numerous studies

have highlighted the importance of gene duplication

and recombination on one side [1,2] and alterations in gene

expression regulation on the other side [3] as fundamental

for the evolution of new phenotypic traits. Recently,

alternative splicing (AS) has been shown to contribute

significantly to changes in proteome composition between

organisms and to the increase in cellular complexity and

diversity of phenotypes among eukaryotes [4,5��,6��,7].

www.sciencedirect.com

While AS plays a key role in the expansion of proteomic

and regulatory complexity, the molecular and biochemical

functions of differentially spliced exons remain largely

uncharacterized [8]. AS tends to avoid protein domains

but more often affect intrinsically disordered (ID) regions

[9–12]. However, the general principles behind how these

different isoforms contribute to the functional versatility

and how they mediate the emergence of novel phenotypes

is just beginning to be grasped [5��,6��,13��,14��]. ID

regions are polypeptide segments that do not acquire a

defined tertiary structure autonomously but adopt diverse

interconverting conformational states [15–18]. Frequently,

disordered regions fold completely upon binding [19]

while in some cases such regions remain disordered even

in the bound state [20,21]. Here, we synthesize emerging

concepts to shed light on how AS of disordered regions

might rewire molecular interactions, and how this may

facilitate the emergence of novel phenotypes. In particular,

we discuss how such events could be crucial for defining

tissue-specific (TS) and organism-specific traits. While AS

of structured domains may lead to similar effects, these are

not discussed here.

How do disordered regions increase the functional

versatility of proteins?

ID segments increase the functional versatility of proteins

by: (a) providing conformational heterogeneity between

structured domains, thus increasing the number of confor-

mational states (Figure 1a); (b) presenting linear peptide

motifs [22] that bind to other proteins, nucleic acids and

small molecule ligands, thereby enabling larger protein

interaction landscapes (Figure 1b) and (c) presenting

amino acids that become post-translationally modified

(PTM), thereby altering the chemical nature of the poly-

peptide and further increasing its diversity of interactions

(Figure 1c). In addition, the overall net charge of the

disordered segment, the modification state, and the amino

acid composition can all influence the conformational

ensemble and the chemical states of the entire protein,

thereby altering the functions of the full-length proteins

[18,23–32].

Splicing of disordered segments rewiresinteraction networksSpliced ID regions often harbor functionally important

sites

Earlier computational studies of alternative protein iso-

forms from the UniProt database (http://www.uniprot.org/)

as well as structures of different isoforms in the PDB

Current Opinion in Structural Biology 2013, 23:443–450

Page 2: Alternative splicing of intrinsically disordered regions ... during evolution, development and disease. Addresses 1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge

444 Sequences and topology

Figure 1

Embed post-translational modification sites

Increases conformational heterogeneity Tunes interaction specificity, affinity & avidity

Disordered segments increase the functional versatility of proteins

Connect structured domains together Contain linear peptide motifs

Alters chemical composition and regulates interaction properties

P

Current Opinion in Structural Biology

Disordered segments increase the functional versatility of proteins. Disordered segments are represented in white, and structured domains as gray

and blue blobs. A disordered segment can influence the functional versatility of a protein by increasing the conformational heterogeneity between

domains and the number of conformational states achieved by the protein (left panel). Linear peptide motifs (green rectangles) embedded in

disordered segments can tune the specificity, selectivity, kinetics, affinity and avidity of interactions with distinct partners (blue blobs) (middle panel).

Post-translational modification sites embedded in disordered segments can modify the function by altering the segment’s chemical composition and/

or by serving as regulators of diverse interaction properties (right panel).

(http://www.rcsb.org/) suggest that AS exons in mRNA

often code for ID regions [9,12]. Calculations on a much

larger set of 15 678 proteins with 36 320 alternatively

spliced regions confirms these observations and suggest

that a higher portion of alternatively spliced mRNA codes

for ID regions as compared to that which codes for struc-

tured regions (Dunker et al., unpublished results). AS protein

segments in general [33��] and tissue-specific AS segments

in particular [13��,14��] frequently contain disordered

regions with embedded linear interaction motifs. This

suggests that the alternative inclusion of such functional

elements can trigger changes in protein function by alter-

ing its stability, subcellular location or by influencing its

molecular interactions [13��,14��,33��]. For example, the

alternative inclusion of peptide KEN-motifs affect protein

half-life by targeting specific isoforms for degradation, the

alternative inclusion of NLS signal peptides affect the

subcellular location of proteins, or the alternative inclusion

of motifs that bind PDZ, PTB, SH2, SH3 or WW domains

modulate interactions with these signaling domains

[13��,33��,34,35].

AS disordered regions can also display the following charac-

teristics: the segment (a) lacks linear interaction motifs or

PTM sites [12,13��], thus inclusion of such segments may

increase the conformational entropy or may alter the bind-

ing affinity of the full-length protein due to multiple weak

transient interactions [20,21]; (b) contains multiple occur-

rences of the same motif [33��], thus altering the avidity of

an interaction [36] or increasing the local concentration of

Current Opinion in Structural Biology 2013, 23:443–450

an interacting protein within specific regions in a cell [37];

(c) contains distinct, possibly overlapping, motifs or Mol-

ecular Recognition Features (MoRFs) [38�], thus confer-

ring the property of an interaction hub, and thereby

contributing to multiple functional outcomes, including

moonlighting [39]; and (d) contains multiple occurrences of

PTM sites [13��], thus changing the overall charge distri-

bution and influencing the conformational states and fine-

tuning the interactions with other molecules [25,27] in-

cluding the alteration of partner selection preferences

[38�].

Tissue-specific rewiring of interactions through spliced

ID segments

Buljan et al. [13��] recently analyzed published transcrip-

tome data from several different human tissues and

cancer cell lines [40] and found that tissue-specific AS

protein segments frequently contain ID regions that

embed functional elements such as evolutionarily con-

served interaction motifs and PTM sites. Furthermore,

genes with TS exons tend to occupy central positions in

interaction networks and tend to display distinct inter-

action partners in the respective tissues. Many of these

genes have roles in signaling and development and are

associated with cancer and other human diseases. In an

independent study, Ellis et al. [14��] reported similar

findings, and, using an automated protein–protein inter-

action assay, they demonstrated a direct connection be-

tween the AS disordered segments and protein binding

properties. Specifically, approximately one-third of the

www.sciencedirect.com

Page 3: Alternative splicing of intrinsically disordered regions ... during evolution, development and disease. Addresses 1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge

Rewiring protein interactions by splicing ID regions Buljan et al. 445

analyzed neural-regulated exons were observed to affect

protein–protein interactions. These two studies establish

the concept that AS of ID segments with embedded

functional elements can rewire protein interactions, regu-

latory networks and signaling pathways in a tissue-specific

manner, thus conferring functional versatility to proteins

and increasing interaction network diversity across tissues

[13��] (Figures 2 and 3).

Not only disordered regions but also structured domains

can undergo AS. AS affecting structured domains have

been observed to rewire biomolecular interactions

[41�,42��,43,44]. For instance, an evolutionarily conserved

Figure 2

D1

D2

NT

AB

Acid Box linker(inhibition module)

FGF ReceptorReceptorttFGFFGF Receptor

PEX3

Binary intera

AAAA

DNA

mRNA

Protein

Molecular mechanisms by which spliced disordered segmen

(a) (b)Auto-inhibitory interaction

Molecular mechanisms by which alternatively spliced ID segments influence

can regulate protein interactions via distinct mechanisms. (a) ASDS can infl

masking or competing for the interaction interface via auto-inhibition. For ex

Factor (FGF) receptor prevents the protein from interacting with its ligand F

segment connects the immunoglobulin domains D1 and D2 (cyan, surface a

can directly affect binary interactions, as shown for the binding of the human

(red helix within a disordered segment) (PDB:3MK4). (c) ASDS can influence

dependent protein kinase II (CAMKII) forms a dodecameric complex. Each h

and a C-terminal hub domain. These two domains are connected by a diso

Variation in the linker length directly impacts the sensitivity of CAMKII for ca

www.sciencedirect.com

embryonic stem cell specific AS of mutually exclusive

exons changes the DNA-binding preference of the fork-

head transcription factor FOXP1. The altered residues in

FOXP1’s DNA-binding domain affect specificity of DNA

binding and hence influence pluripotency and differen-

tiation of Embryonic Stem cells [42��].

Molecular mechanism of interaction rewiringvia splicing of disordered segmentsSplicing of disordered segments can affect a number of

properties of protein interactions. Below, we discuss how

such AS events influence intra-molecular and inter-mol-

ecular interactions [13��,45]. Note that through similar

PEX19 motif

Protein complex

NT kinasedomain

CT hubdomain

Ca2+/CaMbinding region

regulatorysegment

linker

(β)CAMK II(β)CAMK II

ction

AAAA

ts influence protein interactions

(c)

Alternatively spliced exon

Disordered segment

Structured domain

Constitutively expressed exon

Current Opinion in Structural Biology

protein interactions. Alternative splicing of disordered segments (ASDS)

uence the interaction of full-length proteins with other biomolecules by

ample, a segment with the Acid Box (AB; red oval) in Fibroblast Growth

GF1 and is subjected to alternative splicing. In the long isoform, this

nd cartoon representation; D1:PDB:2CKN and D2:PDB:1RY7). (b) ASDS

peroxin PEX3 (blue, surface representation) with the PEX19 binding motif

dynamics and kinetics of protein complexes. The Ca2+/calmodulin-

oloenzyme subunit (in cyan) comprises of an N-terminal kinase domain

rdered linker (red), the length of which varies among splice isoforms.

lcium gradients (PDB:3BHH).

Current Opinion in Structural Biology 2013, 23:443–450

Page 4: Alternative splicing of intrinsically disordered regions ... during evolution, development and disease. Addresses 1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge

446 Sequences and topology

Figure 3

Rewiring of protein-protein interaction networks

11 2 3 4

1 2 3 4

1 2 3 4

1 2 3 4

Mutations indisordered regions

Evolution ofsplicing patterns

Impact of mutations and splicing of disordered segments on disease and the evolution of new phenotypes

(across organisms or tissues)

(across organisms or somatic mutations)

Evolution of organism-specific and tissue-specific traitsand manifestation of disease

Increase in interaction network diversityacross tissues and organisms

Across tissues

Across organisms

differential usageof segments

that influence interactions

modificationof segments

that influence interactions

(a) (b)

Current Opinion in Structural Biology

Impact of mutations and splicing of ID segments on disease and the evolution of new phenotypes. (a) Alternative splicing of disordered segments can

cause the rewiring of protein interaction networks across different tissues and organisms. (b) Mutations in disordered regions, either across organisms

or during the lifetime of an organism (e.g. somatic mutations), can modify segments that influence protein interactions leading to new phenotypes or

diseases (e.g. cancer). Likewise, the evolution of new splicing patterns, and their tissue-specific regulation, permits the differential usage of segments

that influence interactions. The combination of both mechanisms not only permits evolution of organism-specific or tissue-specific traits, but can also

be involved in disease development.

principles, AS of disordered segments could also influ-

ence protein–DNA, protein–RNA and protein–small

molecule interaction networks.

Auto-inhibitory interactions

Auto-inhibition modules (IMs) influence interaction of

full-length proteins with other biomolecules by masking

or competing for the interaction interface (Figure 2a).

Trudeau et al. [46��] analyzed auto-inhibited proteins and

demonstrated that such IMs are enriched in intrinsic

disorder. By comparing the properties of proteins with

disordered IMs to those with structured IMs, they show

that ID IMs permit fine-tuning of the equilibrium be-

tween the active and inactive states. The disordered IMs

were observed to be more highly phosphorylated, more

frequently AS, and to contain evolutionarily conserved

linear peptide motifs that often changed their secondary

structure during activation [46��]. Such IMs may not only

be regulated by splicing but may also emerge indepen-

dently in organisms during the course of evolution. For

instance, an interaction between Homeobox HoxA11 and

Forkhead box 01A (Foxo1a) in mammals has evolved only

after a previously present, but masked, binding site in

Current Opinion in Structural Biology 2013, 23:443–450

HoxA11 became exposed through mutations in a disor-

dered region that mediated self-interaction [47]. Thus,

inclusion or skipping of such segments may contribute to

selectivity and altered kinetics in partner recognition.

Such properties may be further modulated through TS

splicing and hence contribute to fine-tuning of molecular

interactions in a spatially and temporally regulated man-

ner [13��].

Binary interactions

Linear motifs, MoRFs and PTMs within ID regions

enable molecular interactions. Thus the AS of ID seg-

ments containing these elements can form new molecu-

lar interactions and thereby lead to the recruitment of the

same biochemical or molecular functions (often

mediated by structured domains in the same polypep-

tide) into a different context (Figure 2b) [13��]. For

instance, differential inclusion of disordered segments

with interaction motifs in a kinase protein could cause the

resulting kinase to recruit completely different protein

substrates. Similarly, the differential inclusion of disor-

dered PTM sites (e.g. phosphorylation sites) could alter

whether a given protein becomes the substrate for a

www.sciencedirect.com

Page 5: Alternative splicing of intrinsically disordered regions ... during evolution, development and disease. Addresses 1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge

Rewiring protein interactions by splicing ID regions Buljan et al. 447

modifying enzyme (e.g. kinase). Thus AS of such dis-

ordered regions has the potential to rewire molecular

interaction networks and alter the specificity, selectivity,

affinity and kinetics of such molecular interactions [13��].Indeed, Merkin et al. [6��] recently investigated AS in

equivalent tissues across diverse organisms and found

that AS often alters whether a protein can be phosphory-

lated, thereby delimiting the scope of kinase signaling.

Similarly, changing the length of a disordered region can

interfere allosterically with interface formation else-

where due to increased conformational entropy

[48,49]. For example, alternative isoforms of the homo-

meric transcription factor Ulthrabithorax (UBX) gene,

which are expressed in a developmental and in a TS

manner, differ only in the length of the disordered

segment adjacent to the DNA-binding homeodomain

[48,50–52]. This AS region in UBX is not directly

involved in binding, but rather modulates the protein’s

dynamics and determines its DNA binding affinity. Thus

AS of the ID region enables UBX isoforms to recognize

different DNA sequences in distinct contexts and to bind

to the same DNA sequence differently. Hence, alterna-

tive inclusion of ID regions can have a significant influ-

ence on the affinity and kinetics of protein interactions

[13��].

Protein complexes

Many proteins form stable homo-oligomeric or hetero-

oligomeric protein complexes in the cell [53,54]. ID

regions in the interaction interface and their PTM often

influence complex formation [55,56]. Expression of the

subunit isoforms with differentially included ID seg-

ments can influence complex formation and cause tun-

able protein properties [57] (Figure 2c). Moreover, the

simultaneous expression of multiple isoforms that form

homo-complexes or hetero-complexes may lead to the

sequestration of functional isoforms into ‘heterogeneous’

oligomeric complexes, thereby leading to transient loss or

gain of function [58]. Depending on the expression level

of the isoform that encodes TS segments, the equilibrium

between the different oligomeric ‘protein-complex

states’ will be influenced, thereby leading to competition

for a common interaction partner [59] or altered kinetics

in response to a signal. This may result in dominant

negative effects or ultra-sensitivity due to molecular

titration effects [60,61]. For example, the increased diver-

sity of cell signaling options caused by receptor dimeriza-

tion is further enhanced by the generation of splice

variants in the case of GPCRs, one of the largest family

of membrane signaling proteins [62].

Implications for evolution, development anddiseaseEvolution and development

During evolution, ID regions contribute more extensively

than structured regions to the rewiring of protein inter-

action networks [63–65] by means of short insertions/

www.sciencedirect.com

deletions or point mutations that modulate interactions

(Figure 3). Fine-tuning of gained or rewired interactions

can also result from mutations of flanking residues around

an interaction motif [66,67]. Furthermore, species-

specific evolution of AS [68] of such altered disordered

regions may further expand the functional repertoire of

orthologous proteins by rewiring protein interaction net-

works in space (e.g. across tissues) or time (e.g. during

development or evolution) [68]. Indeed, the work of

Merkin et al. [6��] and Barbosa-Morais et al. [5��] revealed

the following trends: (a) AS is well conserved only for a

subset of exons and is frequently lineage-specific; (b) TS

spliced segments are enriched in PTM sites and disor-

dered regions; (c) tissue-specific AS often alters protein

phosphorylation sites, delimiting the scope of kinase

signaling; (d) certain splicing events likely remodel

protein interactions involving orthologous genes in equiv-

alent tissues across different organisms; and (e) segments

spliced in such species-specific manner are enriched in ID

regions and frequently occur in regulatory proteins.

Taken together, these studies suggest that diversification

of splicing and sequence mutations in ID regions during

the course of evolution may underlie the emergence of

novel phenotypic traits. These could include both tissue

differentiation and differences among the extant organ-

isms [5��,6��,7,13��] (Figure 3). We speculate that similar

principles might underlie the growing evidence for popu-

lation-specific [69–71] or sex-specific traits within a

species [72].

Disease

While molecular interactions can evolve through a small

number of mutations within ID segments due to the

emergence or loss of binding motifs, this property also

raises the risk of rewiring interactions that lead to

cancer or genetic diseases [73,74] (Figure 3). Indeed,

TS exons are significantly enriched in genes associated

with cancer and embryonic lethality [13��], thus under-

scoring the importance of TS exons in disease devel-

opment and the associated regulatory pathways. Since

TS isoforms likely have distinct molecular interactions,

this may explain how the same mutation may lead to

different phenotypes across tissue or organ types even

when gene expression levels across the tissues are

similar. Such tissue-specific AS-dependent gain or lo-

ss of specific interactions could lead to selective mani-

festation of disease in particular organs or tissue

types [74–76]. The flood of mutation data from the

cancer genome sequencing projects can be better inter-

preted by bearing in mind, not only which genes are

expressed but also which isoforms are expressed in the

tissue of interest. Exciting insights are already being

provided into how mutations that affect splicing events

or cause altered expression of certain isoforms — thus

leading to mis-regulation of protein function — can

cause diseases, such as cancer, in certain tissue types

[77,78].

Current Opinion in Structural Biology 2013, 23:443–450

Page 6: Alternative splicing of intrinsically disordered regions ... during evolution, development and disease. Addresses 1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge

448 Sequences and topology

Conclusion and future directionsRapidly advancing technology makes it possible to obtain

sequences and expression patterns of different isoforms

in diverse tissue types, disease states and different devel-

opmental stages. Thus, some of the major challenges are

to understand how sequence variation and differential

inclusion of exons leads to the rewiring of molecular

interactions at the genomic and molecular level, and then

to predict when these alterations affect phenotype or

cause disease. While interpreting the impact of differen-

tial inclusion of structured domains is more feasible,

understanding the impact of addition or removal of ID

segments remains a challenge. Hence, an important

direction for future research is to develop a comprehen-

sive characterization and categorization of PTM and

molecular interaction sites in ID regions and how the

former alters the latter. These improvements will involve

development and expansion of databases such as ELM

[79], PhosphoELM [80], iELM [81], and improved pre-

dictions of sites for PTMs and molecular interactions.

This knowledge will provide the molecular basis for

interpreting how and when the differential inclusion of

disordered segments can contribute to the generation of

novel phenotypic traits in molecular terms. Such

improved understanding could lead to therapeutic strat-

egies tailored to altering the functions of specific exons,

for example by developing small molecules that target AS

disordered segments (e.g. via antisense oligonucleotides

or targeted delivery of small RNA or small molecules in

specific tissues) while avoiding constitutive exons present

in all isoforms [58,82–86]. Finally, as sequencing individ-

ual human genomes at affordable cost is becoming a

reality, a better understanding of how natural variation

influences disordered regions, AS patterns and the rewir-

ing of molecular interaction networks will have a signifi-

cant impact for personalized medicine and for bettering

human health.

AcknowledgementsThis work was supported by the Medical Research Council (U105185859;M.B., G.C., M.F., S.B., and M.M.B.), the Wellcome Trust (A.B.), HFSP(RGY0073/2010; M.B. and M.M.B.), the EMBO Young Investigator Program(M.M.B.), ERASysBio+ (GRAPPLE; M.M.B.), the Gates CambridgeScholarship and the Knox Trinity Studentship (G.C.) and the Momentumprogram (LP2012-41) of the Hungarian Academy of Sciences (M.F.).

Note added in proofThe reader is referred to two relevant articles that were published after ourreview was accepted for publication: Van Roey et al., Science Signaling2013, http://dx.doi.org/10.1126/scisignal.2003345 and Talavera et al., NatureBiotechnology 2013, http://dx.doi.org/10.1038/nbt.2540.

References and recommended readingPapers of particular interest, published within the period of review,have been highlighted as:

� of special interest

�� of outstanding interest

1. Chothia C, Gough J: Genomic and structural aspects of proteinevolution. Biochem J 2009, 419:15-28.

2. Ohno S: Evolution by Gene Duplication. Springer-Verlag; 1970.

Current Opinion in Structural Biology 2013, 23:443–450

3. King MC, Wilson AC: Evolution at two levels in humans andchimpanzees. Science 1975, 188:107-116.

4. Graveley BR: Alternative splicing: increasing diversity in theproteomic world. Trends Genet 2001, 17:100-107.

5.��

Barbosa-Morais NL, Irimia M, Pan Q, Xiong HY, Gueroussov S,Lee LJ, Slobodeniuc V, Kutter C, Watt S, Colak R et al.: Theevolutionary landscape of alternative splicing in vertebratespecies. Science 2012, 338:1587-1593.

The authors analyzed transcriptomes from equivalent tissues of differentvertebrate species. The work reveals that splice patterns strongly divergealong this evolutionary range. Importantly, investigation of species-spe-cific splicing events showed that (a) these are directed through alterationsin the splicing code within the spliced genes, (b) they frequently affect thedisordered regions and (c) many such events occur in regulatory genesthat bind to nucleic acids.

6.��

Merkin J, Russell C, Chen P, Burge CB: Evolutionary dynamics ofgene and isoform regulation in Mammalian tissues. Science2012, 338:1593-1599.

The authors analyzed tissue transcriptomes from mammals and a birdand found that AS events usually diverge along this evolutionary distance.Exons with the same pattern of tissue-specific splicing within mammalsare, however, conserved, bind to common splice regulators and fre-quently contain phosphosites. Moreover, the authors report interestingexamples where isoform production strongly differs in tissues with highlyexpressed kinases, which enables another level of regulation and rewiringof kinase signaling pathways.

7. Papasaikas P, Valcarcel J: Evolution. Splicing in 4D. Science2012, 338:1547-1548.

8. Nilsen TW, Graveley BR: Expansion of the eukaryotic proteomeby alternative splicing. Nature 2010, 463:457-463.

9. Hegyi H, Kalmar L, Horvath T, Tompa P: Verification ofalternative splicing variants based on domain integrity,truncation length and intrinsic protein disorder. Nucleic AcidsRes 2011, 39:1208-1219.

10. Kriventseva EV, Koch I, Apweiler R, Vingron M, Bork P,Gelfand MS, Sunyaev S: Increase of functional diversity byalternative splicing. Trends Genet 2003, 19:124-128.

11. Pentony MM, Jones DT: Modularity of intrinsic disorder in thehuman proteome. Proteins 2010, 78:212-221.

12. Romero PR, Zaidi S, Fang YY, Uversky VN, Radivojac P,Oldfield CJ, Cortese MS, Sickmeier M, LeGall T, Obradovic Z et al.:Alternative splicing in concert with protein intrinsic disorderenables increased functional diversity in multicellularorganisms. Proc Natl Acad Sci U S A 2006, 103:8390-8395.

13.��

Buljan M, Chalancon G, Eustermann S, Wagner GP, Fuxreiter M,Bateman A, Babu MM: Tissue-specific splicing of disorderedsegments that embed binding motifs rewires proteininteraction networks. Mol Cell 2012, 46:871-883.

The authors demonstrate that human tissue-specific alternativelyspliced segments are strongly enriched in protein disorder and fre-quently contain PTM sites and conserved binding motifs. Further, geneswith such exons were shown to be central in the interaction networks andenriched in those involved in signaling and development. The authorspropose that tissue-specific inclusion of disordered segments thatcontain binding motifs rewires interaction networks and signaling path-ways and that in this way, tissue-specific splicing may contribute tofunctional versatility of proteins and increase the diversity of interactionnetworks across tissues.

14.��

Ellis JD, Barrios-Rodiles M, Colak R, Irimia M, Kim T, Calarco JA,Wang X, Pan Q, O’Hanlon D, Kim PM et al.: Tissue-specificalternative splicing remodels protein–protein interactionnetworks. Mol Cell 2012, 46:884-892.

The authors demonstrate that tissue-specifically included exons arefrequently enriched in intrinsically disordered regions and are likely toinfluence protein interactions. By using protein–protein interaction assayto further analyze the effect of the inclusion of exons present only in brainisoforms, they demonstrate that tissue-specific splicing affected a num-ber of known interactions. Importantly, it was shown that exon inclusioncould both introduce and disrupt protein interactions.

15. Dunker AK, Lawson JD, Brown CJ, Williams RM, Romero P, Oh JS,Oldfield CJ, Campen AM, Ratliff CM, Hipps KW et al.: Intrinsicallydisordered protein. J Mol Graph Model 2001, 19:26-59.

www.sciencedirect.com

Page 7: Alternative splicing of intrinsically disordered regions ... during evolution, development and disease. Addresses 1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge

Rewiring protein interactions by splicing ID regions Buljan et al. 449

16. Tompa P: Intrinsically disordered proteins: a 10-year recap.Trends Biochem Sci 2012, 37:509-516.

17. Dyson HJ, Wright PE: Intrinsically unstructured proteins andtheir functions. Nat Rev Mol Cell Biol 2005, 6:197-208.

18. Babu MM, Kriwacki RW, Pappu RV: Structural biology.Versatility from protein disorder. Science 2012, 337:1460-1461.

19. Wright PE, Dyson HJ: Linking folding and binding. Curr OpinStruct Biol 2009, 19:31-38.

20. Fuxreiter M: Fuzziness: linking regulation to protein dynamics.Mol Biosyst 2012, 8:168-177.

21. Fuxreiter M, Tompa P: Fuzzy complexes: a more stochasticview of protein function. Adv Exp Med Biol 2012, 725:1-14.

22. Davey NE, Van Roey K, Weatheritt RJ, Toedt G, Uyar B,Altenberg B, Budd A, Diella F, Dinkel H, Gibson TJ: Attributes ofshort linear motifs. Mol Biosyst 2012, 8:268-281.

23. Mao AH, Lyle N, Pappu RV: Describing sequence-ensemblerelationships for intrinsically disordered proteins. Biochem J2013, 449:307-318.

24. Meng W, Lyle N, Luan B, Raleigh DP, Pappu RV: Experiments andsimulations show how long-range contacts can form inexpanded unfolded proteins with negligible secondarystructure. Proc Natl Acad Sci U S A 2013, 110:2123-2128.

25. Vuzman D, Levy Y: DNA search efficiency is modulated bycharge composition and distribution in the intrinsicallydisordered tail. Proc Natl Acad Sci U S A 2010, 107:21004-21009.

26. Galea CA, Wang Y, Sivakolundu SG, Kriwacki RW: Regulation ofcell division by intrinsically unstructured proteins: intrinsicflexibility, modularity, and signaling conduits. Biochemistry2008, 47:7598-7609.

27. Mittag T, Orlicky S, Choy WY, Tang X, Lin H, Sicheri F, Kay LE,Tyers M, Forman-Kay JD: Dynamic equilibrium engagement of apolyvalent ligand with a single-site receptor. Proc Natl Acad SciU S A 2008, 105:17772-17777.

28. Soranno A, Buchli B, Nettels D, Cheng RR, Muller-Spath S,Pfeil SH, Hoffmann A, Lipman EA, Makarov DE, Schuler B:Quantifying internal friction in unfolded and intrinsicallydisordered proteins with single-molecule spectroscopy. ProcNatl Acad Sci U S A 2012, 109:17800-17806.

29. Boehr DD, Nussinov R, Wright PE: The role of dynamicconformational ensembles in biomolecular recognition. NatChem Biol 2009, 5:789-796.

30. Gsponer J, Babu MM: The rules of disorder or why disorderrules. Prog Biophys Mol Biol 2009, 99:94-103.

31. Santner AA, Croy CH, Vasanwala FH, Uversky VN, Van YY,Dunker AK: Sweeping away protein aggregation with entropicbristles: intrinsically disordered protein fusions enhancesoluble expression. Biochemistry 2012, 51:7250-7262.

32. Uversky VN, Dunker AK: The case for intrinsically disorderedproteins playing contributory roles in molecular recognitionwithout a stable 3D structure. F1000 Biol Rep 2013, 5:1.

33.��

Weatheritt RJ, Davey NE, Gibson TJ: Linear motifs conferfunctional diversity onto splice variants. Nucleic Acids Res2012, 40:7123-7131.

The authors perform a comprehensive study of the influence of AS andalternative promoter usage on inclusion or skipping of well-annotatedshort linear-motif interaction modules (SLiMs). They show that differentisoforms of the same gene frequently differ in whether or not they containa particular linear motif. Based on the SLiM’s role in the cell, they describethe effect of differential inclusion of these elements on the protein’sfunction for a number of cases.

34. Stamm S, Ben-Ari S, Rafalska I, Tang Y, Zhang Z, Toiber D,Thanaraj TA, Soreq H: Function of alternative splicing. Gene2005, 344:1-20.

35. Weatheritt RJ, Gibson TJ: Linear motifs: lost in (pre)translation.Trends Biochem Sci 2012, 37:333-341.

36. Van Roey K, Gibson TJ, Davey NE: Motif switches: decision-making in cell regulation. Curr Opin Struct Biol 2012, 22:378-385.

www.sciencedirect.com

37. Li P, Banjade S, Cheng HC, Kim S, Chen B, Guo L, Llaguno M,Hollingsworth JV, King DS, Banani SF et al.: Phase transitions inthe assembly of multivalent signalling proteins. Nature 2012,483:336-340.

38.�

Hsu WL, Oldfield CJ, Xue B, Meng J, Huang F, Romero P,Uversky VN, Dunker AK: Exploring the binding diversity ofintrinsically disordered proteins involved in one-to-manybinding. Protein Sci 2013, 22:258-273.

The authors of this study focus on Molecular Recognition Features(MoRFs), which are able to bind different interaction partners. MoRFsare intrinsically disordered regions that undergo disorder-to-order transi-tion after binding. They show that some MoRFs are able to attain differentfolds upon binding different partners and that AS and PTMs within MoRFsare able to contribute to distinct partner binding.

39. Tompa P, Szasz C, Buday L: Structural disorder throws newlight on moonlighting. Trends Biochem Sci 2005, 30:484-489.

40. Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C,Kingsmore SF, Schroth GP, Burge CB: Alternative isoformregulation in human tissue transcriptomes. Nature 2008,456:470-476.

41.�

Davis MJ, Shin CJ, Jing N, Ragan MA: Rewiring the dynamicinteractome. Mol Biosyst 2012, 8:2054-2066.

The authors of this study analyze the human transcriptome data andpredict the likely influence of AS on protein’s connectivity. They focus onthe AS effects on protein domains and show that in nearly 20% of genes,AS of well-established interacting domains could modify protein’s inter-action potential.

42.��

Gabut M, Samavarchi-Tehrani P, Wang X, Slobodeniuc V,O’Hanlon D, Sung HK, Alvarez M, Talukder S, Pan Q, Mazzoni EOet al.: An alternative splicing switch regulates embryonic stemcell pluripotency and reprogramming. Cell 2011, 147:132-146.

In this elegant study, the authors use a large-scale approach for identify-ing ESC-specific splice events. The authors focus on an ESC-specificexon in the FOXP1 gene. The FOXP1 protein isoform, which is expressedin ESCs differs from the canonical isoform in crucial residues in the DNAbinding domain. The authors further demonstrate how this changes theprotein’s binding affinity for DNA and results in the distinct pattern of generegulation, which plays a major role in maintaining stem cell pluripotencyand differentiation.

43. Tress ML, Martelli PL, Frankish A, Reeves GA, Wesselink JJ,Yeats C, Olason PI, Albrecht M, Hegyi H, Giorgetti A et al.: Theimplications of alternative splicing in the ENCODE proteincomplement. Proc Natl Acad Sci U S A 2007, 104:5495-5500.

44. Yura K, Shionyu M, Hagino K, Hijikata A, Hirashima Y, Nakahara T,Eguchi T, Shinoda K, Yamaguchi A, Takahashi K et al.: Alternativesplicing in human transcriptome: functional and structuralinfluence on proteins. Gene 2006, 380:63-71.

45. Uversky VN: Multitude of binding modes attainable byintrinsically disordered proteins: a portrait gallery of disorder-based complexes. Chem Soc Rev 2011, 40:1623-1634.

46.��

Trudeau T, Nassar R, Cumberworth A, Wong ET, Woollard G,Gsponer J: Structure and Intrinsic Disorder in ProteinAutoinhibition. Structure 2013, 21:332-341.

In this work, the authors analyze inhibition modules in auto-inhibitedproteins and illustrate how the observed enrichment of these regions inintrinsic disorder confers several advantages. Such regions are moreoften phosphorylated, alternatively spliced and undergo structurechanges. They suggest that these properties allow fine-tuning the inhi-bitory properties of the segment.

47. Brayer KJ, Lynch VJ, Wagner GP: Evolution of a derived protein–protein interaction between HoxA11 and Foxo1a in mammalscaused by changes in intramolecular regulation. Proc NatlAcad Sci U S A 2011, 108:E414-E420.

48. Fuxreiter M, Simon I, Bondos S: Dynamic protein–DNArecognition: beyond what can be seen. Trends Biochem Sci2011, 36:415-423.

49. Nussinov R: How do dynamic cellular signals travel longdistances? Mol Biosyst 2012, 8:22-26.

50. Reed HC, Hoare T, Thomsen S, Weaver TA, White RA, Akam M,Alonso CR: Alternative splicing modulates Ubx proteinfunction in Drosophila melanogaster. Genetics 2010,184:745-758.

Current Opinion in Structural Biology 2013, 23:443–450

Page 8: Alternative splicing of intrinsically disordered regions ... during evolution, development and disease. Addresses 1MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge

450 Sequences and topology

51. Liu J, Perumal NB, Oldfield CJ, Su EW, Uversky VN, Dunker AK:Intrinsic disorder in transcription factors. Biochemistry 2006,45:6873-6888.

52. Dyson HJ: Roles of intrinsic disorder in protein–nucleic acidinteractions. Mol Biosyst 2012, 8:97-104.

53. Levy ED, Pereira-Leal JB, Chothia C, Teichmann SA: 3D complex:a structural classification of protein complexes. PLoS ComputBiol 2006, 2:e155.

54. Havugimana PC, Hart GT, Nepusz T, Yang H, Turinsky AL, Li Z,Wang PI, Boutz DR, Fong V, Phanse S et al.: A census of humansoluble protein complexes. Cell 2012, 150:1068-1081.

55. Marsh JA, Teichmann SA, Forman-Kay JD: Probing the diverselandscape of protein flexibility and binding. Curr Opin StructBiol 2012, 22:643-650.

56. Nishi H, Fong JH, Chang C, Teichmann SA, Panchenko AR:Regulation of protein–protein binding by coupling betweenphosphorylation and intrinsic disorder: analysis of humanprotein complexes. Mol Biosyst 2013 http://dx.doi.org/10.1039/C3MB25514J.

57. Chao LH, Stratton MM, Lee IH, Rosenberg OS, Levitz J,Mandell DJ, Kortemme T, Groves JT, Schulman H, Kuriyan J: Amechanism for tunable autoinhibition in the structure of ahuman Ca2+/calmodulin-dependent kinase II holoenzyme. Cell2011, 146:732-745.

58. Babu MM, van der Lee R, de Groot NS, Gsponer J: Intrinsicallydisordered proteins: regulation and disease. Curr Opin StructBiol 2011, 21:432-440.

59. Ungewitter E, Scrable H: Delta40p53 controls the switch frompluripotency to differentiation by regulating IGF signaling inESCs. Genes Dev 2010, 24:2408-2419.

60. Buchler NE, Cross FR: Protein sequestration generates aflexible ultrasensitive response in a genetic network. Mol SystBiol 2009, 5:272.

61. Buchler NE, Louis M: Molecular titration and ultrasensitivity inregulatory networks. J Mol Biol 2008, 384:1106-1119.

62. Wise H: The roles played by highly truncated splice variants ofG protein-coupled receptors. J Mol Signal 2012, 7:13.

63. Mosca R, Pache RA, Aloy P: The role of structural disorder in therewiring of protein interactions through evolution. Mol CellProteomics 2012, 11 M111 014969.

64. Beltrao P, Serrano L: Specificity and evolvability ineukaryotic protein interaction networks. PLoS Comput Biol2007, 3:e25.

65. Landry CR, Levy ED, Michnick SW: Weak functionalconstraints on phosphoproteomes. Trends Genet 2009,25:193-197.

66. Stein A, Aloy P: Contextual specificity in peptide-mediatedprotein interactions. PLoS ONE 2008, 3:e2524.

67. Stein A, Aloy P: Novel peptide-mediated interactions derivedfrom high-resolution 3-dimensional structures. PLoS ComputBiol 2010, 6:e1000789.

68. Gelfman S, Burstein D, Penn O, Savchenko A, Amit M, Schwartz S,Pupko T, Ast G: Changes in exon-intron structure duringvertebrate evolution affect the splicing pattern of exons.Genome Res 2012, 22:35-50.

69. Lu ZX, Jiang P, Xing Y: Genetic variation of pre-mRNAalternative splicing in human populations. Wiley Interdiscip RevRNA 2012, 3:581-592.

Current Opinion in Structural Biology 2013, 23:443–450

70. Montgomery SB, Sammeth M, Gutierrez-Arcelus M, Lach RP,Ingle C, Nisbett J, Guigo R, Dermitzakis ET: Transcriptomegenetics using second generation sequencing in a Caucasianpopulation. Nature 2010, 464:773-777.

71. Pickrell JK, Marioni JC, Pai AA, Degner JF, Engelhardt BE,Nkadori E, Veyrieras JB, Stephens M, Gilad Y, Pritchard JK:Understanding mechanisms underlying human geneexpression variation with RNA sequencing. Nature 2010,464:768-772.

72. Blekhman R, Marioni JC, Zumbo P, Stephens M, Gilad Y: Sex-specific and lineage-specific alternative splicing in primates.Genome Res 2010, 20:180-189.

73. Shan Y, Eastwood MP, Zhang X, Kim ET, Arkhipov A, Dror RO,Jumper J, Kuriyan J, Shaw DE: Oncogenic mutations counteractintrinsic disorder in the EGFR kinase and promote receptordimerization. Cell 2012, 149:860-870.

74. Zhang ZC, Chook YM: Structural and energetic basis of ALS-causing mutations in the atypical proline-tyrosine nuclearlocalization signal of the Fused in Sarcoma protein (FUS). ProcNatl Acad Sci U S A 2012, 109:12017-12021.

75. Garrick D, Samara V, McDowell TL, Smith AJ, Dobbie L, Higgs DR,Gibbons RJ: A conserved truncated isoform of the ATR-Xsyndrome protein lacking the SWI/SNF-homology domain.Gene 2004, 326:23-34.

76. Zhong Q, Simonis N, Li QR, Charloteaux B, Heuze F, Klitgord N,Tam S, Yu H, Venkatesan K, Mou D et al.: Edgetic perturbationmodels of human inherited disorders. Mol Syst Biol 2009, 5:321.

77. Liu J, Lee W, Jiang Z, Chen Z, Jhunjhunwala S, Haverty PM, Gnad F,Guan Y, Gilbert HN, Stinson J et al.: Genome and transcriptomesequencing of lung cancers reveal diverse mutational andsplicing events. Genome Res 2012, 22:2315-2327.

78. Christofk HR, Vander Heiden MG, Harris MH, Ramanathan A,Gerszten RE, Wei R, Fleming MD, Schreiber SL, Cantley LC: TheM2 splice isoform of pyruvate kinase is important for cancermetabolism and tumour growth. Nature 2008, 452:230-233.

79. Dinkel H, Michael S, Weatheritt RJ, Davey NE, Van Roey K,Altenberg B, Toedt G, Uyar B, Seiler M, Budd A et al.: ELM — thedatabase of eukaryotic linear motifs. Nucleic Acids Res 2012,40:D242-D251.

80. Dinkel H, Chica C, Via A, Gould CM, Jensen LJ, Gibson TJ, Diella F:Phospho.ELM: a database of phosphorylation sites — update2011. Nucleic Acids Res 2011, 39:D261-D267.

81. Weatheritt RJ, Jehl P, Dinkel H, Gibson TJ: iELM — a web serverto explore short linear motif-mediated interactions. NucleicAcids Res 2012, 40:W364-W369.

82. Alvarez-Erviti L, Seow Y, Yin H, Betts C, Lakhal S, Wood MJ:Delivery of siRNA to the mouse brain by systemic injection oftargeted exosomes. Nat Biotechnol 2011, 29:341-345.

83. Kumar P, Wu H, McBride JL, Jung KE, Kim MH, Davidson BL,Lee SK, Shankar P, Manjunath N: Transvascular delivery of smallinterfering RNA to the central nervous system. Nature 2007,448:39-43.

84. Spitali P, Aartsma-Rus A: Splice modulating therapies forhuman disease. Cell 2012, 148:1085-1088.

85. Metallo SJ: Intrinsically disordered proteins are potential drugtargets. Curr Opin Chem Biol 2010, 14:481-488.

86. Dunker AK, Uversky VN: Drugs for ‘protein clouds’: targetingintrinsically disordered transcription factors. Curr OpinPharmacol 2010, 10:782-788.

www.sciencedirect.com