the future of big data analytics

The future of big data and analytics Ahmed Banafa


Post on 14-Jan-2017


The future of big data and analytics

Ahmed Banafa

Extensive experience in operations and management, with a research background in a variety of techniques and analysis. Taught at several universities and colleges, including the University of California, Berkeley; California State University-East Bay; San Jose State University; and University of Massachusetts. Recipient of several awards, including the Distinguished Tenured Staff Award of 2013, Business Program Instructor of the Year for 2013 and 2014, the Parthenon Award for best instructor in 2012, 2010 and 2003, and a Certificate of Honor for instructor of the year from the City and County of San Francisco. Included in the 2000 to 2001 "Who's Who in Finance and Industry."

Ahmed Banafa

Some of my Publications


Now, let's talk Big Data!

But before that, check this!

Big Data?

The simplest definition of big data is large and complex structured and unstructured data (images posted on Facebook, email, text messages, GPS signals from mobile phones, tweets, and other social media updates, etc.) that cannot be processed by traditional database tools.

Roots of Big Data

Starting from the basics

Statistics uses numbers to quantify data. Data mining uses statistics and programming languages to find patterns hidden in the data. Machine learning uses data mining to build models that predict future outcomes. Artificial intelligence uses models built by machine learning to make machines act in an intelligent way, such as playing a game or driving a car (e.g., IBM's Watson supercomputer and the driverless car by Google).

Big data analytics is the process of studying big data to uncover hidden patterns and correlations to make better decisions using technologies like NoSQL databases, Hadoop, and MapReduce. The main goal of big data analytics is to help organizations make better business decisions.
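To make the MapReduce idea mentioned above more concrete, here is a minimal single-machine sketch of the map, shuffle, and reduce steps, using a word count as the task. It is only an illustration in plain Python and does not use Hadoop; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) and the sample documents are assumptions made for this example.

```python
from collections import defaultdict


def map_phase(document):
    """Map step: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Reduce step: sum the values collected for each key."""
    return {key: sum(values) for key, values in grouped.items()}


def word_count(documents):
    """Run map over every document, then shuffle and reduce the results."""
    pairs = []
    for doc in documents:
        pairs.extend(map_phase(doc))
    return reduce_phase(shuffle(pairs))


if __name__ == "__main__":
    # Hypothetical input, chosen only to show the flow of data.
    docs = ["big data is big", "data about data"]
    print(word_count(docs))
```

In a real cluster the map calls would run in parallel on different servers and the shuffle would move data across the network; the sketch keeps everything in one process so the three steps are easy to follow.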

Three Vs of Big Data

Volume. Unstructured data streaming in from social media. Increasing amounts of sensor and machine-to-machine data being collected.

Velocity. Data is streaming in at unprecedented speed and must be dealt with in a timely manner.

Variety. Data today comes in all types of formats: structured, numeric data in traditional databases and information created from line-of-business applications.

Big Data Analytics 3.0

Analytics 1.0: BI (business intelligence).

Analytics 2.0: Used by online companies only (Google, Yahoo, Facebook, etc.).

Analytics 3.0: A new resolve to apply powerful data-gathering and analysis methods not just to a company's operations but also to its offerings, to embed data smartness into the products and services customers buy.

Attributes of Analytics 3.0:

The most important trait is that not only online firms, but virtually any type of firm in any industry, can participate in the data-driven economy.

Multiple data types: Organizations are combining large and small volumes of data, internal and external sources, and structured and unstructured formats to yield new insights in predictive and prescriptive models.

Technologies and methods are much faster: Big data technologies include a variety of hardware/software architectures, including clustered parallel servers using Hadoop/MapReduce, in-memory analytics, and so forth. All of these technologies are considerably faster than previous generations.

Integrated and embedded: built into consumer-oriented products and features.

Data science/analytics/IT teams will work together.

Chief analytics officers (CAO) are new leadership positions.

More about it

Prescriptive analytics: There have always been three types of analytics: descriptive, which reports on the past; predictive, which uses models based on past data to predict the future; and prescriptive, which uses models to specify optimal behaviors and actions. Analytics 3.0 includes all three types, but there is an increased emphasis on prescriptive analytics.
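The difference between the three types can be sketched with a small pricing example. This is only an illustration under stated assumptions: the sales history is made-up data, the demand model is a simple least-squares line, and the function names (`descriptive`, `fit_line`, `predict`, `prescriptive`) are hypothetical.

```python
from statistics import mean


def descriptive(history):
    """Descriptive: report on the past (average units sold)."""
    return mean(units for _, units in history)


def fit_line(history):
    """Predictive: fit units = intercept + slope * price by least squares."""
    prices = [price for price, _ in history]
    units = [sold for _, sold in history]
    p_bar, u_bar = mean(prices), mean(units)
    covariance = sum((p - p_bar) * (u - u_bar) for p, u in history)
    variance = sum((p - p_bar) ** 2 for p in prices)
    slope = covariance / variance
    intercept = u_bar - slope * p_bar
    return intercept, slope


def predict(model, price):
    """Predicted units sold at a given price."""
    intercept, slope = model
    return intercept + slope * price


def prescriptive(model, candidate_prices):
    """Prescriptive: pick the price with the highest predicted revenue."""
    return max(candidate_prices, key=lambda p: p * predict(model, p))


if __name__ == "__main__":
    # Hypothetical (price, units sold) observations.
    history = [(10, 100), (12, 90), (14, 80), (16, 70)]
    model = fit_line(history)
    print("average units sold:", descriptive(history))
    print("predicted units at price 15:", predict(model, 15))
    print("recommended price:", prescriptive(model, [10, 12, 14, 15, 16, 18]))
```

The descriptive step only summarizes what happened, the predictive step estimates what would happen at a new price, and the prescriptive step turns that estimate into a recommended action.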

Old and New!

Google announced its acquisition of Nest (smart home devices), a source of massive data from homes all over the United States, confirming the direction of Analytics 3.0 by an online company at the leading edge of Analytics 2.0.

Big Data has a dark side!

Dark Data

Gartner defines dark data as the information assets organizations collect, process and store during regular business activities, but generally fail to use for other purposes (for example, analytics, business relationships and direct monetizing). IDC stated that up to 90 percent of big data is dark data.

Similar to dark matter in physics, dark data often comprises most organizations' universe of information assets. Thus, organizations often retain dark data for compliance purposes only. Storing and securing data typically incurs more expense (and sometimes greater risk) than value.

Dark data is a type of unstructured, untagged and untapped data that is found in data repositories and has not been analyzed or processed. It is similar to big data but differs in how it is mostly neglected by business and IT administrators in terms of its value.

Dark data is also known as dusty data.

Dark data, unlike dark matter, can be brought to light, and so can its potential ROI. What's more, a simple way of thinking about what to do with the data, a cost-benefit analysis, can remove the complexity surrounding the previously mysterious dark data.
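One possible shape for such a cost-benefit analysis is sketched below. Everything in it is an assumption for illustration: the `DarkDataset` fields, the dataset names, the dollar figures, and the simple rule "net benefit = expected value minus storage and analysis cost" are not taken from the presentation, and a real assessment would also weigh compliance and risk.

```python
from dataclasses import dataclass


@dataclass
class DarkDataset:
    """A hypothetical dark data set with rough yearly estimates."""
    name: str
    annual_storage_cost: float
    analysis_cost: float
    expected_annual_value: float


def net_benefit(dataset):
    """Expected value minus the cost of storing and analyzing the data."""
    total_cost = dataset.annual_storage_cost + dataset.analysis_cost
    return dataset.expected_annual_value - total_cost


def triage(datasets):
    """Rank data sets by net benefit and suggest a next step for each."""
    ranked = sorted(datasets, key=net_benefit, reverse=True)
    return [
        (ds.name, net_benefit(ds),
         "analyze" if net_benefit(ds) > 0 else "review retention")
        for ds in ranked
    ]


if __name__ == "__main__":
    # Made-up figures, only to show how the ranking works.
    inventory = [
        DarkDataset("old email archive", 3000, 8000, 4000),
        DarkDataset("server logs", 2000, 5000, 12000),
    ]
    for name, benefit, action in triage(inventory):
        print(f"{name}: net benefit {benefit}, suggested action: {action}")
```

Data sets with a positive net benefit are candidates for analysis; the rest are flagged so that someone can decide whether they are still worth keeping.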

So what is the future of Big Data?

Big Data as a Service: the next big thing?

Big data as a service (BDaaS) is a term typically used to refer to services that offer analysis of large or complex data sets using cloud-hosted services. It is similar to other service models such as software as a service (SaaS) and infrastructure as a service (IaaS): specific BDaaS options help businesses handle what the IT world calls big data, that is, sophisticated aggregated data sets that provide a lot of value for today's companies.

Examples of Big Data Analytics

Network Security Needs Big Data

ZTM: The "zero trust model" is an aggressive model of network security that monitors every piece of data possible, assuming that every file is a potential threat.

The convergence of Big Data and network security is a direct product of applied Big Data, and it's a prime example of using analytics technologies to tackle a current business problem such as cyberattacks.

Google

Google processes 3.5 billion requests per day, and each request queries a database of 20 billion web pages.

Amazon

Amazon has recently obtained a patent on a system designed to ship goods to us before we have even decided to buy them: predictive despatch.