the environmental genomics thematic programme data centre

25
The Environmental The Environmental Genomics Thematic Genomics Thematic Programme Data Centre Programme Data Centre Dawn Field, Director

Upload: ova

Post on 04-Feb-2016

23 views

Category:

Documents


0 download

DESCRIPTION

The Environmental Genomics Thematic Programme Data Centre. Dawn Field, Director. The Environmental Genomics Thematic Programme. Funded by the NERC at £16.5m Aimed at understanding the molecular basis of evolutionary change, organismal phenotype, and ecosystem function - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: The Environmental Genomics Thematic Programme Data Centre

The Environmental Genomics The Environmental Genomics Thematic Programme Data Thematic Programme Data CentreCentre

Dawn Field, Director

Page 2: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

The Environmental Genomics The Environmental Genomics Thematic ProgrammeThematic Programme Funded by the NERC at £16.5m Aimed at understanding the molecular basis of

evolutionary change, organismal phenotype, and ecosystem function

Evolutionary and Ecological theory plus Genomic technologies

Round 1 (17) funded in Sept, 2001, Round 2 to be funded in April 2003

Data Centre to be Launched October 2002 for a period of 5 years

Page 3: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

The Environmental Genomics The Environmental Genomics Thematic ProgrammeThematic Programme

Programmehttp://www.nerc.ac.uk/funding/thematics/envgen/

Data Centrehttp://envgen.nox.ac.uk/

FOR MORE INFO...

Page 4: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Data ManagementData Management

Page 5: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

The Environmental Genomics The Environmental Genomics Thematic ProgrammeThematic ProgrammeData to be generated

• 25-30 Awardees producing genomic data• non-model organisms• microbes to vertebrates• key area of overlap in data is microarray

and EST data

Page 6: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

The Data InitiativeThe Data Initiative

Must comply with NERC data policy,

balanced by science driven Awardee

requirements: Heterogeneity and complexity of genomic data Emerging standards, especially for microarray data (MIAME,

MAGE-ML) and beyond that proteomics and metabolomics) Importance of meta-data collection Emphasis on need for Local solutions Emphasis on need for bioinformatics training and skill

development

Page 7: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

The Goals of the Data CentreThe Goals of the Data Centre

The CEH Oxford EG Thematic programme Data Centre

will provide the data warehouse for the Programme

according to the requirements of the NERC Policy for

data management. Working with Bioinformatics

Partners to provide and develop specific file formats,

analysis tools, and data archiving methods will allow the

use of common software solutions that will maximise

the value of the final data holding.

Page 8: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Data Centre - MissionData Centre - Mission

The mission of the EG Data Centre at CEH Oxford is to assurethat:

• All Awardees have the means to collect and submit their data to the Centre

• We create the capacity and expertise within the Centre to collect, manage, distribute, protect, and exploit the collective data holdings

• All EG generated genomic data is eventually accessible long-term by the wider-community in an organised and add-value format

• We create the opportunity for affordable 'buy-in' options for future Research council-funded science initiatives

• Scale to meet future demand

Page 9: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Data Centre - MissionData Centre - Mission

We will implement this mission through the creation of:• A Data Centre Team with expertise in bioinformatics,

database management, and computing• A computing infrastructure (hardware and software) that

will include both centralised resources and a network of specialised computers in Awardee Labs

Page 10: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

IntegrationIntegration& &

ImplementationImplementation

Page 11: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Bioinformatics PartnershipsBioinformatics Partnerships

Silicon Genetics• http://www.silicongenetics.com/cgi/SiG.cgi/index.smf

maxD• http://bioinf.man.ac.uk/microarray/resources.html

Nembase• http://nema.cap.ed.ac.uk/nematodeESTs/nembase.html

The Centre• http://envgen.nox.ac.uk/

FOR MORE INFO...

Page 12: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre OverviewCentre Overview

Bioinformatics Software Solutions• GeneSpring/GeNet/ScriptEditor• maxD (meta-data policies)• Partial Genome Sequence Analysis Pipeline

and Database System• Bio-RedHat 7.3 (Custom Designed Linux

Distribution for Bioinformatics Research that will include GeneSpring, Edinburgh's Partial Genome Sequence Tools, and maxD),

• Bioinformatics for the PC Toolkit (Unix Emulation for tools in Bio-RedHat 7.3)

Page 13: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre OverviewCentre Overview

Location• CEH Oxford (MAN, EDIN)

Centre Team• Bioinformatician, Data Manager, Linux

Developer, Technical Administrator

• 4 Developers (MAN, EDIN)

Computational Infrastructure • Commodity Hardware running Linux,

combination of open source and commercial software

Page 14: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre OverviewCentre Overview

Community Development• Web site, Mailing Lists, Help Desk(s), Discussion

Boards, Presentations at EG Workshops

Teaching and Training• EGDC Documentation Project, Bio-RedHat

Workshops, Bioinformatics Training in the context of supported software and access to total data holdings, 8 EPSRC/BBSRC MRes /DPhil placements

Page 15: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre OverviewCentre Overview

Data Holdings• microarray data repository (GeNet)• EST sequence warehouse

Bio-IT and Knowledge-based Tangibles

• Bio-Linux• Extensive Documentation Archive (FAQ, Links,

Installation and usage documents)• Data Centre “Toolkit”

Page 16: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre Overview: FutureCentre Overview: Future

Centre

Funded labs

GeNet microarray repositoryEST sequence warehouse

GeneSpring microarrayanalysis softwareEST pipeline

Bio-Linux

Bio-PCrecipes

Software HELP files

HelpDesk

Page 17: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre Overview: Sept 2002Centre Overview: Sept 2002

Centre

Funded labs

Bio-Linux

Bio-PCrecipes

www

GeNet microarray repository

Software HELP files

HelpDesk

Page 18: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre Overview: Oct 2002Centre Overview: Oct 2002

Centre

Funded labs

GeNet microarray repository

GeneSpring microarrayanalysis software

Bio-Linux

Bio-PCrecipes

Software HELP files

HelpDesk

Page 19: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre Overview: winterCentre Overview: winter

Centre

Funded labs

GeNet microarray repositoryEST sequence warehouse

GeneSpring microarrayanalysis softwareEST pipeline

Bio-Linux

Bio-PCrecipes

Software HELP files

HelpDesk

Page 20: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre Overview: April 2003Centre Overview: April 2003

Centre

Funded labs

GeNet microarray repositoryEST sequence warehouse

GeneSpring microarrayanalysis softwareEST pipeline

Bio-Linux

Bio-PCrecipes

Software HELP files

HelpDesk

Page 21: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre Overview: Aug 2003Centre Overview: Aug 2003

Centre

Funded labs

GeNet microarray repositoryEST sequence warehouse

GeneSpring microarrayanalysis softwareEST pipeline

Bio-Linux

Bio-PCrecipes

Software HELP files

HelpDesk

Page 22: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre Overview: FutureCentre Overview: Future

Centre

Funded labs

GeNet microarray repositoryEST sequence warehouse

GeneSpring microarrayanalysis softwareEST pipeline

Bio-Linux

Bio-PCrecipes

Software HELP files

HelpDesk

Page 23: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre Overview: FutureCentre Overview: Future

Centre

Funded labs

GeNet microarray repositoryEST sequence warehouse

GeneSpring microarrayanalysis softwareEST pipeline

Bio-Linux

Bio-PCrecipes

Software HELP files

HelpDesk

Page 24: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

Centre Overview: FutureCentre Overview: Future

Centre

Funded labs

GeNet microarray repositoryEST sequence warehouse

GeneSpring microarrayanalysis softwareEST pipeline

Bio-Linux

Bio-PCrecipes

Software HELP files

HelpDesk

Page 25: The Environmental Genomics Thematic Programme Data Centre

Dawn [email protected]

SummarySummary

New Thematic, New Centre Efforts are focused on solutions that take advantage

of existing UK Bioinformatics projects, provide scalability, open access, and aim to produce added value data sets

expertise, tools, data holdings for environmental genomics researchers

Dawn Field, Director of Centre: [email protected]

Jason Snape, Science Co-ordinator: [email protected]

Fiona C. Knight, Programme Co-ordinator: [email protected]

http://envgen.nox.ac.uk/