Tam Sneddon: Revolutionizing data dissemination, organization and use


Upload: gigascience-bgi-hong-kong

Post on 28-Jan-2015


DESCRIPTION

Tam Sneddon's talk at Genome Informatics 2012 in Cambridge: Revolutionizing data dissemination, organization and use. September 7th 2012

TRANSCRIPT

Page 1: Tam Sneddon: Revolutionizing data dissemination, organization and use

www.gigadb.org: revolutionizing data dissemination, organization and use.

Tam Sneddon, BGI-Hong Kong

Now taking submissions…

Page 2: Tam Sneddon: Revolutionizing data dissemination, organization and use

Overview

Introduction

What is GigaDB, why do we want your data, and why should you submit to us?

Data Publishing

DOIs

Published datasets

New database features

Future tools: Galaxy/Cloud

Page 3: Tam Sneddon: Revolutionizing data dissemination, organization and use

www.gigasciencejournal.com www.gigadb.org

Reproducibility/Reuse

Utility/Usability

Standards/Searchability/Sharing

Data publishing/DOI

Page 4: Tam Sneddon: Revolutionizing data dissemination, organization and use

DataCite goal: “increase acceptance of research data as legitimate, citable contributions to the scholarly record”

Page 5: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 6: Tam Sneddon: Revolutionizing data dissemination, organization and use

Plants
- Chinese cabbage
- Cucumber, domestic
- Foxtail millet
- Pigeonpea
- Potato
- Sorghum

Microbes
- E. coli O104:H4 TY-2482
- T2D gut metagenome

Cell-lines
- Chinese Hamster Ovary (CHO)

Invertebrates
- Ant
  - Florida carpenter ant
  - Jerdon’s jumping ant
  - Leaf-cutter ant
- Roundworm
- Schistosoma haematobium
- Silkworm, domestic and wild

Humans
- Ancient DNA
  - Aboriginal Australian
  - Saqqaq Eskimo
- Asian individual (YH)
  - DNA methylome
  - Genome assembly
  - Transcriptome
- Cancer
  - Hepatocellular carcinoma
  - Single-cell bladder cancer
- Human exome: chronic hepatitis B infection predisposing variants

Vertebrates
- Darwin finch
- Giant panda
- Macaque
  - Chinese rhesus
  - Crab-eating
- Minipig
- Mouse methylomes
- Naked mole rat
- Penguin
  - Adelie penguin
  - Emperor penguin
- Pigeon, domestic
- Polar bear
- Sheep, domestic
- Tibetan antelope

Currently: 36 public datasets

Page 7: Tam Sneddon: Revolutionizing data dissemination, organization and use

Dataset list as on Page 6.

Currently: 36 public datasets

*14TB*

Page 8: Tam Sneddon: Revolutionizing data dissemination, organization and use

Dataset list as on Page 6.

Currently: 36 public datasets

*15 pre-publication*

Page 9: Tam Sneddon: Revolutionizing data dissemination, organization and use

Dataset list as on Page 6, with five entries highlighted: *Sorghum*, *Mouse methylomes*, *Polar bear*, *Transcriptome* (Asian individual YH) and *Single-cell bladder cancer*.

Currently: 36 public datasets

*5 citations in the references*

Page 10: Tam Sneddon: Revolutionizing data dissemination, organization and use

Dataset list as on Page 6, with *Sorghum* highlighted.

Currently: 36 public datasets

*5 citations in the references*

Complemented by data submitted to INSDC databases:
- Raw data: SRA:SRA046843
- Assemblies of 3 strains: GenBank:AHAO00000000-AHAQ00000000
- SNPs: dbSNP batch ids:1056306-10563068
- CNVs, InDels, SV: dbVar:nstd63

Page 11: Tam Sneddon: Revolutionizing data dissemination, organization and use

Dataset list as on Page 6, with *Transcriptome* (Asian individual YH) highlighted.

Currently: 36 public datasets

*5 citations in the references*

Page 12: Tam Sneddon: Revolutionizing data dissemination, organization and use

Dataset list as on Page 6, with *Polar bear* highlighted.

Currently: 36 public datasets

*5 citations in the references*

Page 13: Tam Sneddon: Revolutionizing data dissemination, organization and use

Dataset list as on Page 6, with *Mouse methylomes* and *Single-cell bladder cancer* highlighted.

Currently: 36 public datasets

*5 citations in the references*

Page 14: Tam Sneddon: Revolutionizing data dissemination, organization and use

GigaDB is a new database integrated with the GigaScience journal to meet the needs of a new generation of biological and biomedical research as it enters the era of “big-data”…

Page 15: Tam Sneddon: Revolutionizing data dissemination, organization and use


Page 16: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 17: Tam Sneddon: Revolutionizing data dissemination, organization and use


Page 18: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 19: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 20: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 21: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 22: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 23: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 24: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 25: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 26: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 27: Tam Sneddon: Revolutionizing data dissemination, organization and use

Related DOIs:
- 10.5524/100013 (is supplemented by)
- 10.5524/100014 (is supplemented by)

http://dx.doi.org/10.5524/100015
http://gigadb.org/100015
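The two URL forms on this slide suggest a simple mapping from a GigaDB DOI to its links. The following is a minimal Python sketch of that mapping, assuming the pattern shown for 10.5524/100015 holds for other datasets under the same prefix. The function name `doi_links` and the validation rule are illustrative assumptions, not part of GigaDB or DataCite.

```python
# Assumption: GigaDB dataset DOIs share the prefix seen on the slide.
GIGADB_PREFIX = "10.5524"


def doi_links(doi: str) -> dict:
    """Build the two URL forms shown on the slide for a GigaDB-style DOI.

    Hypothetical helper: only string handling, no network access.
    """
    prefix, _, suffix = doi.strip().partition("/")
    # Assumption: the suffix is a purely numeric dataset identifier.
    if prefix != GIGADB_PREFIX or not suffix.isdigit():
        raise ValueError(f"not a GigaDB-style DOI: {doi!r}")
    return {
        "resolver": f"http://dx.doi.org/{prefix}/{suffix}",
        "gigadb": f"http://gigadb.org/{suffix}",
    }


if __name__ == "__main__":
    links = doi_links("10.5524/100015")
    print(links["resolver"])
    print(links["gigadb"])
```

Keeping the DOI as the canonical identifier and deriving both links from it means a citation only needs to carry the DOI itself.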

Page 28: Tam Sneddon: Revolutionizing data dissemination, organization and use
Page 29: Tam Sneddon: Revolutionizing data dissemination, organization and use

Galaxy for GigaScience

- Bioinformatics
- Development
- Publishing
- Biomedical and bioinformatics research

Page 30: Tam Sneddon: Revolutionizing data dissemination, organization and use

Thanks to:

Laurie Goodman, Scott Edmonds, Alexandra Basford, Peter Li, Jesse Si Zhe

Shaoguang Liang (BGI-SZ), Tin-Lap Lee (CUHK), Qiong Luo (HKUST), Senghong Wang (HKUST), Yan Zhou (HKUST), Cogini

Contact us:

[email protected]@gigasciencejournal.com

Follow us:

@gigascience

facebook.com/GigaScience

blogs.openaccesscentral.com/blogs/gigablog/

www.gigadb.org