big data
DESCRIPTION
Introduction to Big Data at the IFMSA preGA 2014 HammametTRANSCRIPT
��������
Data pro capita
double
every 40 months
2.5 exabyte of data was produced daily in 2012
1 exabyte = 23000000000000000000 bits (1019)
Gartner: "Big data is high
volume, high velocity, and/or
high variety information that
require new forms of
approaches to make sense of
them"
LHC200 petabyte x year
10 year to decode the first human genome, now is done in 1 week
Nasa Center for Climate Simulations stores 32 PB of data
Facebook manage data for more than 1 billion of users
New methodologies for storing,
processing, analyze and visualize
the data.
Machine Learning:
• Supervised
• Unsupervised
Cluster Analysis
Cluster Analysis
Infections diffusion patterns
DNA polymorphisms analysis
Imaging Analysis
Useful Material
Jones & al, The New Bioinformatics: Integrating Ecological Data from the Gene to the Biosphere
Dits & al, Using Cluster Analysis for Medical Resource Decision Making
Snijders & al, Big Gap of Knowledge in the Field of Internet Science
www.ibm.com/big-data