Using ArrayExpress

Post on 22-Dec-2015

Page 1: Using ArrayExpress. ArrayExpress is an international public repository for well-annotated microarray data, including gene expression, comparative genomic

ArrayExpress is an international public repository for well-annotated microarray data, including gene expression, comparative genomic hybridization (CGH) and chromatin-immunoprecipitation (ChIP) experiments.

ArrayExpress http://www.ebi.ac.uk/microarray-as/aer/index.html#ae-main[0]

ArrayExpress has three major goals:

1. Serve the scientific community as a repository for data supporting publications.

2. Provide easy access to high-quality data in a standard format.

3. Facilitate the sharing of microarray designs and experimental protocols.

ArrayExpress has two major components:

1. ArrayExpress experiment repository – the main database containing complete data supporting publications.

2. ArrayExpress gene expression profile data warehouse – contains gene-indexed expression profiles from a curated subset of experiments from the repository.

Search for experiments by entering ArrayExpress experiment accession numbers or keywords (e.g. RNAi, breast cancer) in the query box on the left-hand panel.

The results page offers options for sorting and filtering your results.

ID - the unique ArrayExpress accession number of the experiment.

Experiments and array designs in ArrayExpress are given unique accession numbers in the format E-XXXX-n for experiments and A-XXXX-n for array designs, where XXXX is a four-letter code for the source of the data and n is a number, e.g. E-MEXP-568, A-UHNC-18.
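
The accession format described above can be checked mechanically. The following is a minimal sketch, not an official ArrayExpress tool: the function name parse_accession and the regular expression are illustrative, and the pattern assumes that the four-letter code is always written in uppercase letters.

```python
import re

# Assumed pattern: "E" (experiment) or "A" (array design), a four-letter
# uppercase code, then a number, separated by hyphens.
ACCESSION_RE = re.compile(r"^(?P<kind>[EA])-(?P<code>[A-Z]{4})-(?P<number>\d+)$")

KINDS = {"E": "experiment", "A": "array design"}


def parse_accession(accession):
    """Return (kind, code, number), or None if the text does not match."""
    match = ACCESSION_RE.match(accession.strip())
    if match is None:
        return None
    return (
        KINDS[match.group("kind")],
        match.group("code"),
        int(match.group("number")),
    )


if __name__ == "__main__":
    # The first two accessions are the examples given in the text.
    for text in ("E-MEXP-568", "A-UHNC-18", "not-an-accession"):
        print(text, "->", parse_accession(text))
```

Returning None for anything that does not match keeps the check strict, so keywords typed into the same query box are not mistaken for accession numbers.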

Title - the curated title for the experiment

Hybs - the total number of hybridizations in the experiment

Species - the species of the samples used (can be multiple)

Date - the date that the data were loaded into ArrayExpress

Processed – a direct link to the processed data as a zip file (a brown icon indicates that the file exists)

Raw – a direct link to the raw data (a brown icon indicates that raw data exist; a grey icon indicates that they do not). A wedge-shaped icon indicates Affymetrix .CEL files.

More – a link to the ArrayExpress advanced interface where you can get subsets of each data file by gene, hybridization and QuantitationTypes (columns in the data file).

Click anywhere on an experiment row to expand it. The expanded row shows more details about the experiment and where your search term appears.

Title - curated title of the experiment

MIAME score - a score indicating how close an experiment is to full MIAME compliance, with 5 being the highest. One point each is given for:

• sufficient annotation of the associated array design

• essential sample annotation, including at least one experimental factor and the species of all samples

• raw data files for each hybridization

• final processed (normalized) data for the hybridizations in the experiment

• essential laboratory and data processing protocols
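
The scoring rule above is a simple count of satisfied criteria. The sketch below illustrates that arithmetic only; the class MiameChecklist and its field names are hypothetical and are not taken from ArrayExpress, and how ArrayExpress itself decides whether each criterion is met is not described here.

```python
from dataclasses import dataclass, fields


@dataclass
class MiameChecklist:
    """One flag per criterion listed above (hypothetical field names)."""

    array_design_annotated: bool = False
    sample_annotation_complete: bool = False
    raw_data_present: bool = False
    processed_data_present: bool = False
    protocols_present: bool = False


def miame_score(checklist):
    """Count the satisfied criteria: one point each, so 0 to 5."""
    return sum(bool(getattr(checklist, f.name)) for f in fields(checklist))


if __name__ == "__main__":
    # An experiment with everything except processed data.
    example = MiameChecklist(
        array_design_annotated=True,
        sample_annotation_complete=True,
        raw_data_present=True,
        protocols_present=True,
    )
    print(miame_score(example))
```

Because every criterion carries the same weight, two experiments with the same score can be missing different things, so the individual flags are worth inspecting as well as the total.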

Sample annotation – a link to .2columns.xls, a file listing the samples, the experimental factor values associated with these samples, and the corresponding data files

Array – the ArrayExpress accession number(s) for the array design(s) used in the experiment. Clicking on the accession number opens a new browser window showing more information about the array design in the advanced query interface.

Downloads – links to the FTP server directory containing data files and sample and hybridization information for the experiment, and to the data retrieval page for the experiment in the advanced user interface

Experiment design – links to a diagram of the sample relationships in .png and .svg format.

Protocols – a link to a page listing all the protocols used in the experiment.

Citation - details about any publications that relate to the data, including links to the online article and to the PubMed entry where available

Detailed sample annotation - a link to .sdrf.xls, which contains information about the samples and the relationships between samples, extracts, labeled extracts, hybridizations and data files.
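
Once downloaded, a sample annotation file of this kind can be used to trace each sample to its data files. The sketch below rests on several assumptions: that the file is tab-delimited text with a header row, and that it has columns named "Source Name" and "Array Data File" as in the MAGE-TAB convention. The rows in EXAMPLE_SDRF are made up for illustration and do not come from any real experiment.

```python
import csv
import io
from collections import defaultdict

# Made-up rows for illustration only; a real file would be opened from disk.
EXAMPLE_SDRF = (
    "Source Name\tHybridization Name\tArray Data File\n"
    "sample 1\thyb 1\thyb1.CEL\n"
    "sample 1\thyb 2\thyb2.CEL\n"
    "sample 2\thyb 3\thyb3.CEL\n"
)


def files_by_sample(handle, sample_col="Source Name", file_col="Array Data File"):
    """Group data file names by sample name from a tab-delimited table."""
    reader = csv.DictReader(handle, delimiter="\t")
    grouped = defaultdict(list)
    for row in reader:
        grouped[row[sample_col]].append(row[file_col])
    return dict(grouped)


if __name__ == "__main__":
    for sample, files in files_by_sample(io.StringIO(EXAMPLE_SDRF)).items():
        print(sample, "->", ", ".join(files))
```

The column names are parameters so that the same function can be pointed at other columns if a particular file uses different headers.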

Contact - the name of the experiment submitter

Design types - terms describing design types of the experiment. These can include biological, methodological and technology types e.g. disease state, strain or line, compound treatment, in-vivo, dye swap, co-expression, binding site identification.

Description - the description of the experiment as supplied by the submitter

Factor values - a list of the experimental factor values in the experiment

The four-letter code in the accession number generally indicates the source of the MAGE-ML file that was used to load the data into the ArrayExpress database. Sources include our own submission tools (MEXP for MIAMExpress and TABM for Tab2MAGE) as well as MAGE-ML submitted from other organizations or microarray data management tools. The four-letter code does not necessarily tell you which organization performed the experiment or manufactured the array design. Some experiments have also been extracted from the Gene Expression Omnibus (GEO) at the NCBI.
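
As a small illustration of the paragraph above, the four-letter code can be looked up against the submission tools it names. This is only a sketch: the function submission_source is hypothetical, the table holds just the two codes given in the text, and every other code is reported as unknown rather than guessed.

```python
# Only the two codes named in the text are listed here.
SUBMISSION_TOOLS = {
    "MEXP": "MIAMExpress",
    "TABM": "Tab2MAGE",
}


def submission_source(accession):
    """Return the submission tool implied by the accession's four-letter code."""
    parts = accession.strip().split("-")
    if len(parts) != 3:
        raise ValueError(f"unexpected accession format: {accession!r}")
    code = parts[1]
    # The code names the source of the loaded file, not necessarily the
    # organization that performed the experiment.
    return SUBMISSION_TOOLS.get(code, "other or unknown source")


if __name__ == "__main__":
    for text in ("E-MEXP-568", "A-UHNC-18"):
        print(text, "->", submission_source(text))
```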

MIAME describes the Minimum Information About a Microarray Experiment that is needed to interpret the results of the experiment unambiguously and, potentially, to reproduce the experiment.