13 questions you might have about galaxy

Post on 15-Apr-2017

2.091 Views

Category:

Science

1 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Keith Bradnam

questions you may have about Galaxy13

My spoken commentary

Things that I would normally say when giving this talk to a real audience are shown in purple text…like this!

1

How does Galaxy describe Galaxy?

“Galaxy is an open, web-based platform for data intensive biomedical research”

https://galaxyproject.org

How does Galaxy describe Galaxy?

“Galaxy is an open, web-based platform for data intensive biomedical research”

https://galaxyproject.org

How does Galaxy describe Galaxy?

Existing tools that you may be familiar with may not the most appropriate for working with the extremely large datasets that genomics generates.

Microsoft Excel is not always the right tool for the job

Microsoft: “By importing FASTA data into Excel, researchers are better able to visualize and analyze information.”

https://www.flickr.com/photos/msr_redmond/

Microsoft Excel is not always the right tool for the job

Microsoft: “By importing FASTA data into Excel, researchers are better able to visualize and analyze information.”

https://www.flickr.com/photos/msr_redmond/

X

2

What is Galaxy?

My definition:

A web-based platform that provides a simplified interface to many popular bioinformatics tools.

What is Galaxy?

● Can run bioinformatics tools

● Can perform basic text manipulations

● Can generate reproducible workflows

● Can also be used as a data hub

What is Galaxy?

My definition:

A web-based platform that provides a simplified interface to many popular bioinformatics tools.

What is Galaxy?

Tools in Galaxy may only provide some of the options compared to if you ran the same tool on the command-line.

What is Galaxy?

My definition:

A web-based platform that provides a simplified interface to many popular bioinformatics tools.

What is Galaxy?

There will only ever be a subset of the currently available command-line bioinformatics tools available in Galaxy.

3

Who makes Galaxy?

Center for Comparative Genomics and Bioinformatics

Department of Biology

Who makes Galaxy?

Plus countless contributions from the Galaxy community…

Who makes Galaxy?

GitHub hosts all the code behind Galaxy and you can see who is making changes, and you can also raise bugs/problems yourself!

Who makes Galaxy?

4

How long has Galaxy been around?

Over a decade!

How long has Galaxy been around?

The first publication describing Galaxy: Sep 16th, 2005

How long has Galaxy been around?

How long has Galaxy been around?Total papers over time (cumulative, as of 2015/07)

How long has Galaxy been around?

A lot of people have used, and continue to use Galaxy. It is built up a strong community of developers and users.

5

Will Galaxy be around in 5 years?

● A good question to ask!

● Should be safe to use for analysis of your next paper or two…

● …but can always be useful to stay alert for alternatives that might emerge

Will Galaxy be around in 5 years?

Tools like iPlant offer a complementary service to Galaxy. Other tools may also appear in future.

Will Galaxy be around in 5 years?

6

How is Galaxy different from Unix?

● Unix is vastly more powerful…

● …but comes with a steeper learning curve

● For many bioinformatics tools, Galaxy is simply wrapping those tools in a friendly, graphical user interface

7

How much does Galaxy cost?

$0.00Assuming you have access to a web browser and Internet connectivity, or can install on your own computer

8

Should I learn to use Unix instead?

● If you can, yes.

● Unix has been around for almost 50 years, not going to disappear any time soon.

● You will be able to do many more powerful things if you learn to use Unix, but…

Should I learn to use Unix instead?

…consider Galaxy as yourgateway drug to learning Unix

Should I learn to use Unix instead?

Learning Galaxy will teach you some concepts which will be useful if you go on to learn Unix (or Linux).

Should I learn to use Unix instead?

● Galaxy will teach you about command-line options of bioinformatics tools

● Galaxy will show you some simple ways of processing text files

● Galaxy will teach you the idea of building reproducible workflows

9

Can Galaxy be customized?

● Yes. Many customizations can be made

● Some Unix expertise needed

● Common customizations add specific bioinformatics tools and/or prepopulate Galaxy with specific datasets

Can Galaxy be customized?

The existence of so many different public Galaxy servers is a reflection of the demand for custom Galaxy installations

Can Galaxy be customized?

Most Galaxy installations looks very similar, though the colors can also be customized somewhat.

Can Galaxy be customized?

Can Galaxy be customized?

Can Galaxy be customized?

10

Why should I learn Galaxy?

● To perform powerful bioinformatics analyses

● To perform reproducible bioinformatics analyses

● To dip your toes into the world of computing

● Because Microsoft Excel is not the answer

11

How do I find out more?

● Try the main Galaxy server:○ https://usegalaxy.org

● Galaxy wiki:○ https://wiki.galaxyproject.org

● Biostar forum○ http://biostar.usegalaxy.org

● Mailing lists

How do I find out more?

How do I find out more?

How do I find out more?

12

When shouldn’t I use Galaxy?

● People like cakes

● Some people like to bake cakes

● Others buy their cakes from the store

When shouldn’t I use Galaxy?

Your time may be precious to you. Learning some bioinformatics can be useful to appreciate what others could do for you, i.e. consider outsourcing your bioinformatics.

When shouldn’t I use Galaxy?

Paying a bioinformatics core facility to do your analysis may be a better use of your money (and time)

13

Anything else I should know?

● Galaxy will not do your work for you

● Galaxy will not write papers for you!

● Remember the bioinformatics maxim:○ Garbage in ⇒ Garbage out

Anything else I should know?

Galaxy should be treated like any other tool you use in science. It will not magically produce meaningful results for you. Also, you should document what you do to ensure that your work is reproducible by others (and by yourself).

The end

top related