chip-seq data processing - genes &...
TRANSCRIPT
Esnault et al
SUPPLEMENTARY MATERIALS
INVENTORY OF SUPPLEMENTARY MATERIALS
1. Supplementary Figures
Supplementary Figure S1 defines the effects of U0126, Latrunculin B and
Cytochalasin D on Ras-MAPK, Rho-actin and the Hippo signalling pathway. It
shows that U0126 specifically inhibits Ras-MAPK signalling, while LatB
inhibits Rho-actin signalling-dependent MRTF nuclear accumulation, and CD
specifically activates the MRTFs; and that the Hippo signalling pathway is
constitutively active under our assay conditions. Related to Figure 1.
Supplementary Figure S2 shows ChIP-seq control experiments including
inhibitor studies, validation by ChIP-qPCR, dimerisation of MRTFs, specificity
of MRTF and TCF binding, and motifs associated with SRF sites apparently
lacking cofactors. Related to Figures 1 and 2.
Supplementary Figure S3 displays examples of MRTF binding sites where
MRTF-B binds apparently in a SRF-independent manner. Related to Figure 2.
Supplementary Figure S4 shows that inducible SRF binding sites exhibit
MRTF-dependent nucleosome displacement in response to Rho-actin
signalling pathway. Related to Figure 3.
Supplementary Figure S5 shows the validation of the RNA-seq data by Q-
PCR and displays examples of potential ncRNA cis-regulatory effect on gene
activity. Related to Figure 4.
Esnault et al, Supplementary Materials
1
Esnault et al
Supplementary Figure S6 describes the role of SRF/MRTF in controlling
expression of genes regulating cytoskeleton remodelling, focal adhesion and
the extracellular matrix and shows that SRF deleted cells are impaired in both
actin and microtubule assembly. Related to Figure 6.
Supplementary Figure S7. Shows the validation of SRF, MRTF and TCF
association with selected genes involved in the circadian rhythm control and
confirms that MRTF can synchronise the circadian clock in response to actin
dynamics. Related to Figure 7.
2. Supplementary tables
Excel file of the supplementary tables summarising the genomic data. Related
to Figure 1, 2, 4, and 6.
Supplementary Table S1 summarises the ChIP-seq data. Related to Figures
1 and 2.
Supplementary Table S2 summarises the RNA-seq data of coding genes.
Related to Figure 4.
Supplementary Table S3 summarises the RNA-seq data of ncRNA. Related
to Figure 4.
Supplementary Table S4 summarises the relationship between SRF and
cofactor binding and the activity of associated genes. Related to Figure 4.
Supplementary Table S5 presents the SRF, MRTF and TCF signatures.
Related to figure 6.
Esnault et al, Supplementary Materials
2
Esnault et al
Supplementary Table S6 presents the gene ontology analysis. Related to
Figure 6
Supplementary Table S7 presents genes regulated by SRF/MRTF involved
in cytoskeleton control. Related to Figure 6.
Supplementary Table S8 shows a detailed gene signature enrichment
analysis. Related to Figure 6.
3. Extended Methods
4. Supplementary References
Esnault et al, Supplementary Materials
3
Esnault et al
SUPPLEMENTARY FIGURE LEGENDS
Supplementary Figure S1. Effects of inhibitors
Cells were maintained in 0.3% FCS and stimulated with 15% FCS with of
LatB (0.3µM), CD (2µM) or U0126 as indicated, or treated with drugs alone.
(A) U0126 (10µM), but not LatB (0.3µM), inhibits ERK activation following
serum stimulation, as assessed by immunoblot for phospho-ERK. As a
loading control the blot was stripped and reprobed with pan-ERK antibody.
(B) ERK inhibition by U0126 (10µM) does not prevent serum-induced nuclear
accumulation of MRTF-A, as assessed by immunofluorescence.
(C) Effects of actin-binding drugs on MRTF-A and YAP subcellular
localisation. Confocal immunofluorescence microscopy of MRTF-A (green; left
panels) or YAP1 (green; right panels) with nuclei and F-actin visualised by
DAPI (blue) and Phalloidin (red). MRTF-A is predominantly cytoplasmic in
unstimulated cells, and LatB inhibits its serum-induced nuclear accumulation;
CD alone is sufficient to induce MRTF-A nuclear localisation. YAP
localisation, which is predominantly nuclear, is unaffected by these
treatments.
(D) Quantitation of MRTF and YAP1 nuclear localisation and response to
signals in (C), with the corresponding signals for similarly-treated confluent
cell monolayers, where YAP is more cytoplasmic (Yu et al. 2012) are shown
for comparison. ArrayScan VTI and Compartmental analysis
(BioApplications). >3000 cells / condition. Mean signal is indicated by Red
bars.
Esnault et al, Supplementary Materials
4
Esnault et al
(E) Effect of serum stimulation on YAP phosphorylation, assessed by
immunoblot with Phospho-YAP (Ser127) antibody. Under our experimental
conditions - subconfluent cells grown on plastic - YAP phosphorylation is low
and unaffected by serum stimulation; in contrast, in confluent cells YAP
phosphorylation is high and decreases upon serum stimulation (see (Yu et al.
2012).
(F) Differential sensitivity of SRF target genes to Rho-actin and Ras-ERK
signalling. Endogenous MRTF targets (Acta2 and Srf) and TCF targets (Fos
and Egr1) were analysed by qRT-PCR. Expression following specific
activation of MRTF by Cytochalasin D (CD; 2µM) is also shown.
Supplementary Figure S2. ChIP-seq control and validation experiments.
(A) SRF ChIP-seq shows that LatB inhibits inducible (red) but not constitutive
(black) SRF binding, while U0126 has no effect.
(B) ChIP-qPCR validation of SRF ChIP-seq. 6 constitutive (black) and 49
inducible (red) ChIP-seq peaks. Data are means of 3 independent
experiments.
(C) MRTF and SRF signals correlate at inducible SRF sites. SRF peak
heights at the 2133 inducible SRF sites are compared to those of MRTF-A
(left panels) and MRTF-B (right panels), both expressed as percent of
maximum signal compared by scatter plot. Upper panels, high-confidence
MRTF sites (MACS p<10-5); lower panels, all MRTF sites.
(D) Serum stimulation and serum + U0126 treatment are pseudoreplicates for
MRTF ChIP-seq. Scatter-plot comparing high-confidence MRTF-A and MRTF-
Esnault et al, Supplementary Materials
5
Esnault et al
B ChIPseq signals (274 MRTF-A, 1176 MRTF-B; MACS p<10-5) from cells
stimulated with serum in the presence or absence of U0126.
(E) MRTF-A and MRTF-B dimerise. Left, NIH3T3 cell protein extracts were
immunoprecipitated with anti-MRTF-A, anti-MRTF-B, or no antibody, and
proteins detected by immunoblotting. Right, antibody specificity was
confirmed by immunoblotting of cells treated with SRF, MRTF-A or MRTF-B
specific siRNAs (Medjkane et al. 2009).
(F) MRTF-A and MRTF-B predominantly bind the same sites. Left, peak
heights for MRTF-A and MRTF-B at sites bound by both proteins, expressed
as percent of maximum, were compared by scatter plot (n=1320). Centre,
comparison of MRTF-A and MRTF-B raw readcounts at peaks called for
MRTF-B only (n=1021). Right, comparison of MRTF-A and MRTF-B raw
readcounts at peaks called for MRTF-A only (n=75).
(G) Comparison of SRF ChIP-qPCR data with MRTF-A and MRTF-B qPCR
data at 50 SRF ChIP-seq peaks and 3 negative control sites. Sites are ranked
in order of increasing SRF ChIP-seq signal. Vertical dashes indicate those
peaks called positive by ChIP-seq for SRF, MRTF-A and MRTF-B, and the
TCFs (see Figure S2H). Antibodies used for ChIP-qPCR assays are indicated
at the top. Note that at 4 peaks called as negative for MRTF-A by ChIP-seq
nevertheless validate as MRTF-A positive by ChIP-qPCR, as indicated by the
red arrows.
(H) Comparison of TCF ChIP-qPCR data with TCF peaks called by ChIP-seq.
14 SRF-positive ChIP-seq peaks which scored positive for TCF (9 SAP-1, 7
Elk-1, 6 Net; indicated in Figure S2G), and 3 negative control loci were
analysed by ChIP-qPCR. Assays are ranked in order of increasing SRF ChIP-
Esnault et al, Supplementary Materials
6
Esnault et al
seq signal. Antibodies used for ChIP-qPCR assays are indicated at the top.
Vertical dashes indicate peaks called as positive for SRF, and the different
TCFs by ChIP-seq. Red line marks the maximum negative control value.
(I) Sequence motifs associated "no-cofactor" SRF sites, classified according
to whether SRF binding was inducible and LatB-sensitive. Sequences within
100bp of each SRF summit were scanned. The spectrum of motifs associated
with each class suggests these sites may represent undetected MRTF- and
TCF-specific SRF sites. See Figure 2F.
Supplementary Figure S3. SRF-independent MRTF binding events
Apparently SRF-independent MRTF-B binding events at inducible SRF target
genes, at a constitutive gene, and in an intergenic region are shown in
orange. MRTF-A binding did not score at MACS p<10-5 at these sites.
Supplementary Figure S4. Inducible SRF binding is associated with
signal-regulated nucleosome displacement.
(A) SRF binding correlates with histone H3 displacement on the Fos and
Acta2 genes. Tracks show total H3 ChIP-seq signal in resting and serum-
stimulated cells, with inhibitors as indicated. Red bars show SRF binding
sites, black arrows highlight histone-depleted regions.
(B) SRF binding sites correlate with DNA'ase I sensitivity maxima.
Metaprofiles showing DNA'ase I cleavage per base across a 4 kb window
centred on the SRF peaks are shown. Inducible sites, red; constitutive sites,
black. DNA'ase I data at GEO accession number: GSM1003831.
Esnault et al, Supplementary Materials
7
Esnault et al
(C) Scatter plots displaying SRF-ChIP-seq readcounts against H3 ChIP-seq
signal (displayed by rank order of decreasing H3 signal; left panels,) or
DNA'ase I sensitivity (displayed by rank order of increasing cleavage signal;
right panels) in cycling NIH3T3 cells, as determined by others (right panels;
wgEncodeEM002916 (2011). Signals at inducible and constitutive SRF sites
are shown in red and black respectively. SRF ChIP-seq peak height in resting
cells correlates with DNA'ase I sensitivity maxima (left), but the correlation is
lost upon serum stimulation (right). Black line, Loess regression fitting curve
(20 value moving window). DNA'ase I data at GEO accession number:
GSM1003831.
Supplementary Figure S5. RNA-seq: functional validation and ncRNA
targets.
(A) RNA changes were assayed by qRT-PCR on 20 endogenous SRF target
genes defined by RNA-seq in the indicated culture conditions (Figure 1B).
Blue bars qRT-PCR ± SEM, analysis of 3 separate RNA preparations; red
bars, RNA-seq readcounts ± halfrange from analysis of 2 independent RNA
preparations.
(B) Comparison of RNA-seq and qRT-PCR data. Scatter plots comparing the
RNA changes quantified by qRT-PCR and RNA-seq are displayed in function
of the cell culture conditions. They show a good correlation between the two
methods of quantification (Spearman r, 0.79; p<0.0001).
(C) ncRNA targets in the vicinity of SRF target genes. Top, schematic view of
the genomic organisation around Gm15270, Gm13270, Neat1, GM10501,
Esnault et al, Supplementary Materials
8
Esnault et al
4930500J02rik and Gm1720 ncRNAs. Scale bars, 20kb. Red bars, ncRNA
transcription units; green bars, candidate SRF target genes; vertical lines,
SRF ChIP-seq peaks. Bottom, comparison of transcriptional responses of the
ncRNAs and adjacent SRF targets. ncRNA candidates for cis-regulation are
highlighted with gene expression changes indicated. For functional studies of
SRF ncRNA targets, see the following: miR-143 and miR145 (Xin et al. 2009);
miR-199a2 and miR214, (Park et al. 2011; Alexander et al. 2013); miR-21,
(Kumarswamy et al. 2011); miR-22 (Gurha et al. 2012); Tug1 and
Malat1/Neat2, (Gutschner et al. 2013).
Supplementary Figure S6. Cytoskeletal MRTF-SRF targets.
(A) MRTF-SRF signalling in cytoskeletal dynamics. MRTF-SRF target genes
(specific proteins or protein functional classes) are indicated by ovals. Left,
regulators of the actin dynamics and contractility; right, focal adhesion
components.
(B) SRF is required for assembly of F-actin and microtubules, and to maintain
nuclear morphology. F-actin and microtubules were visualised using Texas
Red-X Phalloidin and -Tubulin-Alexa488 in wildtype or SRF-deleted MEFs
under the indicated conditions. Data were quantified using ArrayScan VTI and
Compartmental analysis (BioApplications), >3000 cells / condition.
Supplementary Figure S7. SRF network and circadian clock regulation.
(A) ChIP-qPCR analysis of selected circadian SRF targets detected by ChIP-
seq.
Esnault et al, Supplementary Materials
9
Esnault et al
(B) CD treatment rapidly activates clock component gene transcription.
Analysis by qRT-PCR.
(C) Clock resetting by SRF activation. NIH3T3 cells were treated with 50%
FCS and transcripts were quantified by qRT-PCR over 36h.
(D) MRTF-induced clock resetting is SRF-dependent in MEFs. Wildtype and
SRF knockout MEFs (pooled cultures from each of 3 embryos) were treated
with 2 µM CD and transcripts quantified by qRT-PCR.
Esnault et al, Supplementary Materials
10
Esnault et al
SUPPLEMENTARY TABLES
Supplementary Table S1. SRF network ChIP-seq analysis
Summary of the ChIP-seq data from SRF, MRTF-A, MRTF-B, Sap-1, Elk-1
and Net experiments. Chromosomal location of each peak is given, together
with the identity of the nearest gene (or genes where the peak is located
within a gene feature). Each peak ID can be used on UCSC browser to
visualise the genomic area within the GEO database (instructions at
http://genome.ucsc.edu). The peak information, the summary of SRF peak
detection, the peak signal quantification for all factors in 0.3%FCS, 15%FCS,
LatB+15%FCS and U0126+15%FCS, the summary of the detection by MACS
of the factors and the relative MRTF and TCF scores are shown. The basis for
calling each peak by MACS score and/or coincidence with ChIP-seq signals
for other factors is indicated. Shading indicates SRF peaks called at MACS
p<10-5 (red) or MACS p<0.05 plus coincidence with a high-confidence MRTF
peak (green). Signals are quantified as the number of reads per 15 million.
The cofactor score normalises the cofactor signal across each condition to the
mean MRTF or TCF signals (MRTFs, mean of 15% FCS and 15%
FCS+U0126 signals; TCFs, mean across all conditions) to derive a measure
of the relative strength of MRTF and TCF signals at each SRF peak; the ratio
of the MRTF and TCF scores is also shown.
Supplementary Table S2. RNA-seq analysis: protein-coding genes
The table presents the RNA-seq analysis of NIH-3T3 cells maintained in 0.3%
FCS, stimulated with 15% FCS with or without LatB and/or U0126 treatment,
Esnault et al, Supplementary Materials
11
Esnault et al
or stimulated with CD, for all Refseq genes (release 47). For each gene,
expression was evaluated using all RNAseq reads within the gene feature, or
intronic reads only. The effect of serum or CD stimulation is summarised, and
serum-induced genes responsive to SRF-linked signals are indicated (serum-
induced, sensitive to LatB and/or U0126, or induced by CD; FDR<0.08.
Background shading indicates gene activity status: red, inducible; green,
repressed; yellow, constitutive; white, inactive. Active genes are defined as all
those which show detectable RNA-seq signal in any experimental condition.
ChIP-seq data are summarised according to whether the closest SRF site to
the gene is within the gene feature, within 70kb, or more distant. The genomic
coordinates of the SRF site closest to each gene, and its distance from the
TSS, are given (sites within a gene feature are indicated as zero). Cofactor
association is defined as those cofactors associated with the SRF peaks for
which the gene concerned is the closest; note that for genes associated with
multiple peaks, not all will be associated with that cofactor.
Supplementary Table S3. RNA-seq analysis: ncRNAs
The table presents the RNA-seq analysis of NIH-3T3 cells maintained in 0.3%
FCS, stimulated with 15% FCS with or without LatB and/or U0126 treatment,
or stimulated with CD, for all ncRNA (Ensembl release 69). The ncRNA
information section displays ncRNA coordinates, ncRNA biotype, database
source, and the identity and proximity of the nearest protein-coding gene. Red
shading indicates largest ncRNA in the database, green shading indicates
overlapping ncRNAs where multiple ncRNAs are transcribed from the same
locus. Expression and ChIP-seq peak data are displayed as in Table S2.
Esnault et al, Supplementary Materials
12
Esnault et al
Supplementary Table S4. Relationships between SRF and cofactor
binding and the activity of associated genes.
Genes are categorised according to their proximity to SRF binding events.
Direct genes exhibit SRF binding within a gene feature or within 2kb of 5'
flanking sequence. The Near (<70 kb) gene class has SRF sites within that
distance of the TSS; since SRF sites are significantly enriched within that
distance of active genes, these represent potential SRF targets. Genes in the
Far (>70 kb) class are more than 70kb from an SRF site; since there is no
significant enrichment of SRF sites at active genes in this set, we cannot infer
anything about the role of SRF in regulation of this gene class. (A) Relation
between gene expression status, SRF cofactor association, and regulatory
signal pathway. Genes are classified according to their response to serum
stimulation, sensitivity to inhibition by U0126 or LatB, induction by the MRTF
activator CD, and the cofactors associated with their nearest SRF binding
sites. See also Figure 4D. (B) Relationship between SRF peak type and gene
expression status. (i) Each SRF peak is classified according to cofactor
association, SRF binding inducibility, and the expression status of its nearest
gene; an accessory table below summarises properties of peaks that are
associated with more than one "Direct" gene. (ii) Peaks are classified
according to whether they are associated with gene regulatory events at
genes within the 70kb cutoff point. For example, 104 of the 720 peaks
associated with serum-inducible SRF-linked Direct genes are also the closest
peaks to a second serum-inducible "Near" gene (ie located within 70kb of the
site concerned).
Esnault et al, Supplementary Materials
13
Esnault et al
Supplementary Table S5. SRF, MRTF and TCF gene signatures.
The table presents SRF, MRTF, and TCF gene signatures comprising all
genes within 70kb of an SRF binding site which satisfy the criteria listed at the
top of each column. A list of all "Direct" genes satisfying the criteria is also
given for each signature. The stringent MRTF signature is defined as those
genes which both bind MRTF and are sensitive to direct inhibition or activation
of MRTF by LatB or CD.
Supplementary Table S6. Ontology analysis
Gene ontology analysis performed with DAVID on the various gene sets. P-
values below 10-3 are shown in bold red, and associated gene numbers and
enrichment factors in bold. The Near and Far serum inducible genes sets do
not exhibit significantly different ontology (Wilcoxon test p = 0.1).
Supplementary Table S7. SRF and MRTF in control of cytoskeletal gene
expression
The table displays MRTF-SRF cytoskeletal target genes, categorised
according to their functional roles. For functional studies of SRF targets
ncRNAs that may impact on the cytoskeleton, see the following: miR-143 and
miR145 (Xin et al. 2009); miR-199a2 and miR214, (Park et al. 2011;
Alexander et al. 2013); miR-21, (Kumarswamy et al. 2011); miR-22 (Gurha et
al. 2012) Tug1 and Malat1/Neat2 (Gutschner et al. 2013).
Esnault et al, Supplementary Materials
14
Esnault et al
Supplementary Table S8. Gene signature enrichment analysis
The SRF, MRTF and TCF gene signatures were compared with gene
expression databases. P-values from two-tailed Fisher test, number of genes
in common and article references are displayed. In several cases, the
inducible MRTF-SRF target gene set exhibited significant overlap with the
gene sets identified as up- and down-regulated between two experimental
conditions, possibly owing to normalisation procedures used for comparative
analysis of microarray datasets (for discussion see Loven et al. 2012). In
these cases only the upregulated set is included in the Table. For references
see (Iyer et al. 1999; Chang et al. 2004; Philippar et al. 2004; Wang et al.
2004; Kornmann et al. 2007a; Kornmann et al. 2007b; Wang et al. 2007;
Antipova et al. 2008; Hamilton and Kay 2008; Padua et al. 2008; Provenzano
et al. 2008; Boros et al. 2009a; Boros et al. 2009b; Descot et al. 2009;
Provenzano et al. 2009; Zhang et al. 2009; Costello et al. 2010; Cordenonsi et
al. 2011; Dupont et al. 2011; Bruna et al. 2012; Calvo et al. 2013; Jagannath
et al. 2013; Kwon et al. 2013; Park and Guan 2013).
Esnault et al, Supplementary Materials
15
Esnault et al
EXTENDED METHODS
Cell culture and immunofluorescence
NIH-3T3 or MEF cells at 50% confluence were serum-starved overnight (0.3%
FCS) and stimulated with 15% fetal calf serum (FCS), cytochalasin D
(Calbiochem, CD, 2M), or TPA (50ng/ml) for 30 min or as in figure legends.
For circadian clock entrainment analysis, cells were grown in 5% FCS and
stimulated with CD for 2h followed by washout or with 50% FCS. Signalling
pathways were inhibited by pretreating the cells for 30 minutes before
stimulation with latrunculin B (Calbiochem, LatB, 0.3 M) and/or U0126
(Promega, 10 μM). Immunofluorescence microscopy was performed as
described earlier (Vartiainen et al. 2007; Guettler et al. 2008). Cytoskeletal
changes were analysed using ArrayScan VTI and Compartmental analysis
(BioApplications).
Antibodies
Antibodies were: SRF (sc-335), SAP-1a (sc-13030), MRTF-A (sc-21558),
MRTF-B (sc-47282), PolII, CTD S2unP, 8WG16 (sc-56767) all from
SantaCruz; Elk-1 (Epitomics); Net, 1996 (Buchwalter et al. 2005), total H3
(ab1791, Abcam); PolII, CTD S2P (H5, covance); PolII, CTD S5P (H14,
covance); -Tubulin, Cell signalling #8058; YAP, Santa-Cruz H-125 #15407;
Phospho-YAP, Cell signalling #4911. F-actin was visualised with Texas
Red®-X Phalloidin (Invitrogen), and nuclei with DAPI.
Esnault et al, Supplementary Materials
16
Esnault et al
EMSA
Gel mobility shift probes comprised 120 bp centred on CArG motifs present in
specific SRF ChIP-seq peaks, generated by PCR and cloned using the
pENTR™⁄D-TOPO® cloning kit (Invitrogen). Binding assays were performed
as described previously (Marais et al. 1992; Murai and Treisman 2002). DNA
binding assays were in buffer D (1mM EDTA, 10mM Tris.HCl, pH 7.9, 50 mM
NaCl, 3mM DTT, 50µg/l BSA, 25µg/l poly(dIdC), 10% Ficoll 400, 5mM
Spermidine, 0.5% Bromophenol Blue with protease inhibitors (Roche)).
Reactions contained 60µg of whole cell extract from transfected NIH3T3 cells
transfected with SRF expression plasmid or pcDNA4 as a control, together
with 100-800 ng of recombinant MRTF-A123-1A (Vartiainen et al. 2007) prepared
using a recombinant baculovirus (from A.Mylona and S. Kjaer, unpublished).
Quantitation of binding assays was by phosphorimage analysis using
ImageQuant software. MRTF-A123-1A binding was estimated by quantification
material migrating more slowly than the SRF-DNA complex, which can be
supershifted using MRTF-A antibody, and compared to the yield of SRF-DNA
complex obtained in the absence of added MRTF-A123-1A.
Chromatin immunoprecipitation
ChIP was performed as described (Miralles et al. 2003), with the following
modifications: fixation was stopped by the addition of 250 mM glycine,
sonication was performed with a Bioruptor UCD 200 and recovery was
realised by using magnetic G-protein beads (Invitrogen). SYBR Green based
real-time PCR (Invitrogen) was performed using dilutions of genomic DNA
solution for calibration and to derive arbitrary abundance units. All data are
Esnault et al, Supplementary Materials
17
Esnault et al
from at least 3 independent experiments. For discussion of PolII antibody
specificity, see (Palancade and Bensaude 2003; Buratowski 2009).
Chip-Seq
DNA samples were end repaired, poly-A tailed and Illumina single end
adapters were ligated following the standard Illumina protocol with minor
adjustments. Agencourt AMPure XP beads at 0.8x ratio were used to size
select out adapter dimers after adapter ligation. The Illumina kit Phusion
enzyme was replaced by Kapa HiFi HotStart ready mix. Post PCR, AMPure
XP beads were used at a 1:1 ratio to maintain size integrity and to allow use
of the Invitrogen SizeSelect E-gel system. Samples were finally purified with
QIAquick gel extraction kit and quality controlled on the DNA 1000
BioAnalyser 2100 chip before clustering and subsequent 36bp single end
sequencing on the GAIIx analyser or on a Hi-seq 2000/1000.
ChIP-Seq Data Processing
All ChIP-Seq data sets were aligned using Eland (version pipeline 1.4) to
version NCBI37/mm9 of the mouse genome with the default settings. Raw
data and WIG files can be found online associated with the GEO Series ID
GSE45888.
Identifying ChIP-Seq Enriched Regions
We used MACS version 1.3.7.1 (Zhang et al. 2008) to identify regions
enriched over background (beads alone) in the aligned data. Default settings
were used apart from; model fold=8 and an effective genome size of
Esnault et al, Supplementary Materials
18
Esnault et al
2.43e+09. MACS threshold values of p<10-5 or p<0.05 were used as stated in
the text.
Calculating Read Density
All ChIP-Seq sample mapped read counts were normalised to 15 million total
reads and the normalised read density per base pair calculated for each
enriched region. Total H3 samples were normalised 300 million mapped
reads.
MRTF score
For all positive MRTF ChIP-seq peaks, the MRTF-A and MRTF-B signals from
the 15% FCS and 15% FCS+U0126 conditions, were each normalised using
the median MRTF-A and MRTF-B signal respectively. The MRTF score was
then defined as the mean of the normalised MRTF-A and MRTF-B signals at
each peak.
TCF score
For all positive TCF-SRF ChIP-seq peaks, the SAP-1, Elk-1, and Net signals
from each condition were normalised using the median SAP-1, Elk-1, and Net
signals respectively. The TCF score was then defined as the mean of the
normalised TCF signals at each peak.
Density plot
We calculated per nucleotide average read density profiles for the Pol II
datasets across gene loci ±5 kb. Gene lengths were standardised to 20 kb.
Samples were scaled to a total of 15 million reads. In the case of the H3 and
SRF datasets, we calculated average read density profiles centered on the
SRF ChIP binding loci (±2 kb around the SRF ChIP-seq signal summit). Here
Esnault et al, Supplementary Materials
19
Esnault et al
samples were scaled to a total of 300 million reads for total H3 and 15 million
for SRF. The same analysis was run on the imported DNase I Digital Genomic
Footprinting from GEO accession number GSM1003831 (2011). For MRTF-A
and MRTF-B datasets, we calculated average read density profiles centered
on the ChIP binding loci (±2 kb around the ChIP-seq peak summit). Here
samples were scaled to a total of 15 million reads.
Conservation plot
To visualise sequence conservation on ChIP-Seq loci, we used the Placental
phastCon30way conservation data from UCSC. We created per nucleotide
mean conservation profiles for binding loci sets. Profiles represent ±2 kb
centered on SRF ChIP-seq profile summit.
Identification of TF Binding Motifs
De novo motif discovery and known motif identification were performed using
Homer 3.10 (Heinz et al. 2010) on sequences ±100 bp from the peak summit
of enriched regions. This was performed for each of the TF datasets and also
for enriched regions that were represented in overlapping datasets. In addition
these datasets were interrogated with MEME 4.6.1 (Bailey and Elkan 1994),
which produced similar results. The SRF CArG binding consensus
(CC(AT)6GG), with mismatches as required, was used to search the selected
sequences by Fuzznuc, part of the EMBOSS suite (Rice et al. 2000).
Determination of RNA Pol II Enrichment
PolII ChIP-seq datasets were aligned to the mouse genome version
NCBI37/mm9 using Eland (version 1.4) and the number of mapped reads in
each sample was normalised to 15 million. For all transcripts annotated by
UCSC for mm9, we quantified the signal by counting the number of reads in
Esnault et al, Supplementary Materials
20
Esnault et al
500bp windows from -2Kb from the TSS to +70Kb or to the end of the gene, if
it was shorter.
RNA-Seq
Libraries were prepared using the Directional mRNA-Seq Library Prep. v1.0.
Pre-Release Protocol from Illumina with minor adjustments. To minimise the
ribosomal rRNA representation in the libraries, samples were processed
either using DSN (Evrogen JSC; serum- and CD-stimulation experiment) or
with the Ribo-zero rRNA removal kit (Epicentre; separate CD-stimulation
experiment). The Illumina kit Phusion enzyme was replaced by Kapa HiFi
HotStart ready mix which reduced the overall volume of the PCR and the ratio
for the Agencourt AMPure XP beads was adjusted accordingly. The standard
PCR cycling was also changed to match the concentration of the total RNA
from the initial QC. After passing the final QC, the libraries were subjected to
cluster formation and then 72bp single end sequencing on the GAIIx analyser.
RNA-Seq Data Processing
All RNA-Seq data were aligned to NCBI37/mm9 with BWA(version 0.5.6)
using the default settings. Raw data can be found online associated with the
GEO Series ID GSE45888.
Calculation of read density and gene expression analysis
After alignment read counts for the canonical gene features from RefSeq
gene and intron annotations, were calculated using coverageBed (bedtools
version 2.14.3), using the bam file generated by bwa as the input. The data
were normalised by using the average expression level of 6664 genes which
Esnault et al, Supplementary Materials
21
Esnault et al
do not significantly change in expression upon serum stimulation. The
differential gene and intron expression analysis was performed with DESeq
(Anders and Huber 2010) at p<0.2. Since gene changes in response to signal
were called in combination of several treatments, we simulated the false
discovery rate by using 5000 permutations and found an estimated fold
discovery rate of FDR<0.08. Figures present relative gene expression levels
across the conditions of cell growth from either all normalised gene feature
counts or normalised intronic counts.
Esnault et al, Supplementary Materials
22
Esnault et al
SUPPLEMENTARY REFERENCES
Alexander MS, Kawahara G, Motohashi N, Casar JC, Eisenberg I, Myers JA,
Gasperini MJ, Estrella EA, Kho AT, Mitsuhashi S et al. 2013.
MicroRNA-199a is induced in dystrophic muscle and affects WNT
signaling, cell proliferation, and myogenic differentiation. Cell death and
differentiation.
Anders S, Huber W. 2010. Differential expression analysis for sequence count
data. Genome biology 11(10): R106.
Antipova AA, Stockwell BR, Golub TR. 2008. Gene expression-based
screening for inhibitors of PDGFR signaling. Genome biology 9(3):
R47.
Bailey TL, Elkan C. 1994. Fitting a mixture model by expectation maximization
to discover motifs in biopolymers. Proceedings / International
Conference on Intelligent Systems for Molecular Biology ; ISMB
International Conference on Intelligent Systems for Molecular Biology
2: 28-36.
Boros J, Donaldson IJ, O'Donnell A, Odrowaz ZA, Zeef L, Lupien M, Meyer
CA, Liu XS, Brown M, Sharrocks AD. 2009a. Elucidation of the ELK1
target gene network reveals a role in the coordinate regulation of core
components of the gene regulation machinery. Genome Res 19(11):
1963-1973.
Boros J, O'Donnell A, Donaldson IJ, Kasza A, Zeef L, Sharrocks AD. 2009b.
Overlapping promoter targeting by Elk-1 and other divergent ETS-
Esnault et al, Supplementary Materials
23
Esnault et al
domain transcription factor family members. Nucleic acids research
37(22): 7368-7380.
Bruna A, Greenwood W, Le Quesne J, Teschendorff A, Miranda-Saavedra D,
Rueda OM, Sandoval JL, Vidakovic AT, Saadi A, Pharoah P et al.
2012. TGFbeta induces the formation of tumour-initiating cells in
claudinlow breast cancer. Nature communications 3: 1055.
Buchwalter G, Gross C, Wasylyk B. 2005. The ternary complex factor Net
regulates cell migration through inhibition of PAI-1 expression. Mol Cell
Biol 25(24): 10853-10862.
Buratowski S. 2009. Progression through the RNA polymerase II CTD cycle.
Molecular cell 36(4): 541-546.
Calvo F, Ege N, Grande-Garcia A, Hooper S, Jenkins RP, Chaudhry SI,
Harrington K, Williamson P, Moeendarbary E, Charras G et al. 2013.
Mechanotransduction and YAP-dependent matrix remodelling is
required for the generation and maintenance of cancer-associated
fibroblasts. Nature cell biology 15(6): 637-646.
Chang HY, Sneddon JB, Alizadeh AA, Sood R, West RB, Montgomery K, Chi
JT, van de Rijn M, Botstein D, Brown PO. 2004. Gene expression
signature of fibroblast serum response predicts human cancer
progression: similarities between tumors and wounds. PLoS biology
2(2): E7.
Cordenonsi M, Zanconato F, Azzolin L, Forcato M, Rosato A, Frasson C, Inui
M, Montagner M, Parenti AR, Poletti A et al. 2011. The Hippo
transducer TAZ confers cancer stem cell-related traits on breast cancer
cells. Cell 147(4): 759-772.
Esnault et al, Supplementary Materials
24
Esnault et al
Costello P, Nicolas R, Willoughby J, Wasylyk B, Nordheim A, Treisman R.
2010. Ternary complex factors SAP-1 and Elk-1, but not net, are
functionally equivalent in thymocyte development. J Immunol 185(2):
1082-1092.
Descot A, Hoffmann R, Shaposhnikov D, Reschke M, Ullrich A, Posern G.
2009. Negative regulation of the EGFR-MAPK cascade by actin-MAL-
mediated Mig6/Errfi-1 induction. Molecular cell 35(3): 291-304.
Dupont S, Morsut L, Aragona M, Enzo E, Giulitti S, Cordenonsi M, Zanconato
F, Le Digabel J, Forcato M, Bicciato S et al. 2011. Role of YAP/TAZ in
mechanotransduction. Nature 474(7350): 179-183.
Guettler S, Vartiainen MK, Miralles F, Larijani B, Treisman R. 2008. RPEL
motifs link the serum response factor cofactor MAL but not myocardin
to Rho signaling via actin binding. Mol Cell Biol 28(2): 732-742.
Gurha P, Abreu-Goodger C, Wang T, Ramirez MO, Drumond AL, van Dongen
S, Chen Y, Bartonicek N, Enright AJ, Lee B et al. 2012. Targeted
deletion of microRNA-22 promotes stress-induced cardiac dilation and
contractile dysfunction. Circulation 125(22): 2751-2761.
Gutschner T, Hammerle M, Diederichs S. 2013. MALAT1 - a paradigm for
long noncoding RNA function in cancer. J Mol Med (Berl).
Hamilton EE, Kay SA. 2008. SnapShot: circadian clock proteins. Cell 135(2):
368-368 e361.
Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre
C, Singh H, Glass CK. 2010. Simple combinations of lineage-
determining transcription factors prime cis-regulatory elements required
for macrophage and B cell identities. Molecular cell 38(4): 576-589.
Esnault et al, Supplementary Materials
25
Esnault et al
Iyer VR, Eisen MB, Ross DT, Schuler G, Moore T, Lee JC, Trent JM, Staudt
LM, Hudson J, Jr., Boguski MS et al. 1999. The transcriptional program
in the response of human fibroblasts to serum. Science 283(5398): 83-
87.
Jagannath A, Butler R, Godinho SI, Couch Y, Brown LA, Vasudevan SR,
Flanagan KC, Anthony D, Churchill GC, Wood MJ et al. 2013. The
CRTC1-SIK1 Pathway Regulates Entrainment of the Circadian Clock.
Cell 154(5): 1100-1111.
Kornmann B, Schaad O, Bujard H, Takahashi JS, Schibler U. 2007a. System-
driven and oscillator-dependent circadian transcription in mice with a
conditionally active liver clock. PLoS biology 5(2): e34.
Kornmann B, Schaad O, Reinke H, Saini C, Schibler U. 2007b. Regulation of
circadian gene expression in liver by systemic signals and hepatocyte
oscillators. Cold Spring Harbor symposia on quantitative biology 72:
319-330.
Kumarswamy R, Volkmann I, Thum T. 2011. Regulation and function of
miRNA-21 in health and disease. RNA biology 8(5): 706-713.
Kwon Y, Vinayagam A, Sun X, Dephoure N, Gygi SP, Hong P, Perrimon N.
2013. The Hippo signaling pathway interactome. Science 342(6159):
737-740.
Loven J, Orlando DA, Sigova AA, Lin CY, Rahl PB, Burge CB, Levens DL,
Lee TI, Young RA. 2012. Revisiting global gene expression analysis.
Cell 151(3): 476-482.
Esnault et al, Supplementary Materials
26
Esnault et al
Marais RM, Hsuan JJ, McGuigan C, Wynne J, Treisman R. 1992. Casein
kinase II phosphorylation increases the rate of serum response factor-
binding site exchange. EMBO J 11(1): 97-105.
Medjkane S, Perez-Sanchez C, Gaggioli C, Sahai E, Treisman R. 2009.
Myocardin-related transcription factors and SRF are required for
cytoskeletal dynamics and experimental metastasis. Nature cell biology
11(3): 257-268.
Miralles F, Posern G, Zaromytidou A-I, Treisman R. 2003. Actin dynamics
control SRF activity by regulation of its coactivator MAL. Cell 113: 329-
342.
Murai K, Treisman R. 2002. Interaction of serum response factor (SRF) with
the Elk-1 B box inhibits RhoA-actin signaling to SRF and potentiates
transcriptional activation by Elk-1. Mol Cell Biol 22(20): 7083-7092.
Padua D, Zhang XH, Wang Q, Nadal C, Gerald WL, Gomis RR, Massague J.
2008. TGFbeta primes breast tumors for lung metastasis seeding
through angiopoietin-like 4. Cell 133(1): 66-77.
Palancade B, Bensaude O. 2003. Investigating RNA polymerase II carboxyl-
terminal domain (CTD) phosphorylation. European journal of
biochemistry / FEBS 270(19): 3859-3870.
Park C, Hennig GW, Sanders KM, Cho JH, Hatton WJ, Redelman D, Park JK,
Ward SM, Miano JM, Yan W et al. 2011. Serum response factor-
dependent MicroRNAs regulate gastrointestinal smooth muscle cell
phenotypes. Gastroenterology 141(1): 164-175.
Esnault et al, Supplementary Materials
27
Esnault et al
Park HW, Guan KL. 2013. Regulation of the Hippo pathway and implications
for anticancer drug development. Trends in pharmacological sciences
34(10): 581-589.
Philippar U, Schratt G, Dieterich C, Muller JM, Galgoczy P, Engel FB, Keating
MT, Gertler F, Schule R, Vingron M et al. 2004. The SRF target gene
Fhl2 antagonizes RhoA/MAL-dependent activation of SRF. Molecular
cell 16(6): 867-880.
Provenzano PP, Inman DR, Eliceiri KW, Beggs HE, Keely PJ. 2008.
Mammary epithelial-specific disruption of focal adhesion kinase retards
tumor formation and metastasis in a transgenic mouse model of human
breast cancer. The American journal of pathology 173(5): 1551-1565.
Provenzano PP, Inman DR, Eliceiri KW, Keely PJ. 2009. Matrix density-
induced mechanoregulation of breast cell phenotype, signaling and
gene expression through a FAK-ERK linkage. Oncogene 28(49): 4326-
4343.
Rice P, Longden I, Bleasby A. 2000. EMBOSS: the European Molecular
Biology Open Software Suite. Trends in genetics : TIG 16(6): 276-277.
The-ENCODE-Consortium. 2011. A user's guide to the encyclopedia of DNA
elements (ENCODE). PLoS biology 9(4): e1001046.
Vartiainen MK, Guettler S, Larijani B, Treisman R. 2007. Nuclear actin
regulates dynamic subcellular localization and activity of the SRF
cofactor MAL. Science 316(5832): 1749-1752.
Wang W, Goswami S, Lapidus K, Wells AL, Wyckoff JB, Sahai E, Singer RH,
Segall JE, Condeelis JS. 2004. Identification and testing of a gene
Esnault et al, Supplementary Materials
28
Esnault et al
expression signature of invasive carcinoma cells within primary
mammary tumors. Cancer research 64(23): 8585-8594.
Wang W, Wyckoff JB, Goswami S, Wang Y, Sidani M, Segall JE, Condeelis
JS. 2007. Coordinated regulation of pathways for enhanced cell motility
and chemotaxis is conserved in rat and mouse mammary tumors.
Cancer research 67(8): 3505-3511.
Xin M, Small EM, Sutherland LB, Qi X, McAnally J, Plato CF, Richardson JA,
Bassel-Duby R, Olson EN. 2009. MicroRNAs miR-143 and miR-145
modulate cytoskeletal dynamics and responsiveness of smooth muscle
cells to injury. Genes Dev 23(18): 2166-2178.
Yu FX, Zhao B, Panupinthu N, Jewell JL, Lian I, Wang LH, Zhao J, Yuan H,
Tumaneng K, Li H et al. 2012. Regulation of the Hippo-YAP pathway
by G-protein-coupled receptor signaling. Cell 150(4): 780-791.
Zhang H, Liu CY, Zha ZY, Zhao B, Yao J, Zhao S, Xiong Y, Lei QY, Guan KL.
2009. TEAD transcription factors mediate the function of TAZ in cell
growth and epithelial-mesenchymal transition. The Journal of biological
chemistry 284(20): 13355-13362.
Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE,
Nusbaum C, Myers RM, Brown M, Li W et al. 2008. Model-based
analysis of ChIP-Seq (MACS). Genome biology 9(9): R137.
Esnault et al, Supplementary Materials
29