paper 6: management information system

16
Principal Investigator Co-Principal Investigator Paper Coordinator Content Writer Prof. S P Bansal Vice Chancellor Maharaja Agrasen University, Baddi Prof YoginderVerma ProVice Chancellor Central University of Himachal Pradesh. Kangra. H.P. Prof. Manu Sood Chairman, Department of Computer Science, H.P University, Summer Hill, Shimla. Paper 6: Management Information System Module 36: Big Data Ms. Vinodini Kapoor Asst. Prof, Northern India Institute of Fashion Technology Mohali, India

Upload: others

Post on 30-Oct-2021

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Paper 6: Management Information System

Principal Investigator

Co-Principal Investigator

Paper Coordinator

Content Writer

Prof. S P Bansal

Vice Chancellor

Maharaja Agrasen University, Baddi

Prof YoginderVerma

Pro–Vice Chancellor

Central University of Himachal Pradesh. Kangra. H.P.

Prof. Manu Sood

Chairman, Department of Computer Science,

H.P University, Summer Hill, Shimla.

.

Paper 6: Management Information System

Module 36: Big Data

Ms. Vinodini Kapoor

Asst. Prof, Northern India Institute of Fashion Technology

Mohali, India

Page 2: Paper 6: Management Information System

Items Description of Module

Subject Name Management

Paper Name Management Information System

Module Title Big Data

Module Id Module No.-36

Pre- Requisites Basic Infrastructure for building IS and capturing data.

Objectives To understand the utility of big data and its impact on information systems.

Keywords Velocity, volume, variety, decision trees, structured data, unstructured data, data

analytics

QUADRANT-I

Module-36 Big Data

1. Learning Outcome

2. Introduction

3. The Concept of Big Data

4. Sources of Big Data

4.1 Who uses Big Data

5. Big Data Changing the face of Information Systems

6. Benefits of capturing Big Data

7. Summary

1. Learning Outcome:

After completing this module the students will be able to:

Develop an understanding of the concept of Big Data

List various Sources of Big Data.

Discuss how various industries are utilizing Big Data.

Understand how Big Data is changing Information Systems.

Understand Benefits of Big Data.

2. Introduction

The amount of data in today’s world has been exploding, resulting in what is popularly known as Big

Data. Big data refers to our ability to collect and analyze the vast amounts of data. The ability to harness

the large realms of data is completely transforming our ability to understand the world and everything

within it.

Exhibit 1: Big Data

Image Source: https://www.linkedin.com/pulse/big-data-power-hadoop-part-1-kartik-dave

Page 3: Paper 6: Management Information System

Big Data pertains to data sets where the size is beyond the capability of any common software tools to

capture, manage, and process. It is not the amount of data that’s important but what organizations do with

the data that matters.

Exhibit 2: Bid Data Enormous Size

Image Source: http://blog.blazeclan.com/wp-content/uploads/2013/03/Untitled-300x212.png

The size of the data, along with the underlying purpose to derive benefit from it, has led to a new class of

technologies that have emerged. Organizations are on the run to accumulate, store and analyze data that

has high volume, velocity, and variety and comes from a variety of new sources. These may be in form of

social media access, log files, video, text, image and global positioning system access. These sources

exhaust the capabilities of traditional relational database management systems and galvanize a host of

new technologies, approaches, and platforms.

Interestingly, the idea behind the phrase 'Big Data' is that everything we do in our lives leaves a digital

trace (or data), which we can use and analyze. The advances in capturing and analyzing big data allow us

to decode human DNA in minutes, find cures for cancer, accurately predict human behavior, foil terrorist

attacks, pinpoint marketing efforts, and prevent diseases and so much more. This can be utilized

constructively or otherwise if not secured.

Exhibit 3: Big Data Landscape

Image

Source:https://media.licdn.com/mpr/mpr/AAEAAQAAAAAAAAd8AAAAJGJjNWRlNTc2LWQ1ZjUtNGU1Yy1iYzVjLTljNjlkNTJjOGZkY

Q.png

Page 4: Paper 6: Management Information System

The dominant Big Data technologies in use today commercially are Apache’s Hadoop and No-SQL

databases. Hadoop is a software framework for data intensive distributed applications and was inspired by

Google's MapReduce, a software framework in which an application is broken down into numerous small

parts.

3. The Concept of Big Data

Primarily, three major forces that drive the interest and growth in Big Data can be stated as follows:

Exhibit 4: Big Data Analytics

Image Source: http://d26hz1696nal5d.cloudfront.net/wp-content/uploads/2016/09/Hotels-Hospitality-Travel-BigData-Analytics-

1200x800-1.jpg

1. Enormous growth in the amount of data being generated on the internet.

2. The evolving strategy of firms to collect data from internal and external sources throughout the product

and process lifecycle

3. The phenomenal outreach of social media, mobile applications, and sensor based technologies as well

as the Internet.

All of these forces are generating a flood of data which is increasing in volume, variety and velocity.

Big Data is referred to both, the type of data being managed as well as the technology used to store and

process it. Mostly, the technologies originated from companies such as Google, Amazon, Facebook and

Linked-In, where they were developed for each company’s own use in order to analyze the massive

amounts of social media data they were dealing with.

Exhibit5: Accumulating Big Data

Image Source: http://onlinembapage.com/files/2015/06/bigdata.jpg

Page 5: Paper 6: Management Information System

Big Data is increasingly being defined by the “Three Vs.” stated in exhibit 6, which become a reasonable

test as to whether a Big Data approach is the right one to adopt for a new area of analysis. The Vs are:

Exhibit6: The Three V’s of Big Data

Image Source: http://infinitylimited.co.uk/wp-content/uploads/2015/07/Big-Data-Analytics-UK.jpg

Volume – It refers to the size of the data. With technology it’s limiting to talk about data volume

in any absolute sense, numbers get quickly outdated, volume refers to a relative sense instead. If

the data volume is at an order of magnitude or larger than anything previously encountered in

your industry, then you’re probably dealing with Big Data. In case of certain companies this

might range to the order of 10’s of terabytes or 10’s of petabytes.

Exhibit7: Volume

Image Source: https://www.mytechlogy.com/upload/by_users/Robert/311503022816Volume.jpg

Walmart is estimated to accumulate more than 2.5 petabytes of data every hour from its customer

transactions. A terabyte is one quadrillion bytes, or the equivalent of about 20 million filing

cabinets’ worth of text. An exabyte is 1,000 times that amount, or one billion gigabytes.

Page 6: Paper 6: Management Information System

Exhibit 8: Velocity

Image Source: https://www.mytechlogy.com/upload/by_users/Robert/311503022842Velocity.jpg

Velocity – It refers to the rate at which data is being received and acted upon. The moment a

broad cast message is received and user initiates a reaction. For example, a discount offered to a

customer based on location updates or traffic forecast may render useless if the user has already

crossed the geographical distance.

Exhibit9: Variety

Image Source: https://www.mytechlogy.com/upload/by_users/Robert/311503022912Variety.jpg

Variety - There are two aspects of variety one pertaining to syntax and other, semantics. This

implies the ability for the data to be categorized into a relational database easily and content

exposed for analysis. Modern tools are capable of dealing with data arriving in virtually any

format or syntax. However, they are less able to deal with semantically rich data such as free text.

Page 7: Paper 6: Management Information System

4. Sources of Big Data

It is important to understand from where does this large volume of data come from?

Exhibit10: Sources of Big Data

Image Source: http://www.ibmbigdatahub.com/infographic/where-does-big-data-come

Big data is an aggregation of quantum of data generated by machines, people, and organizations. With

automated or machine generated data we refer to real time sensors in industrial machinery or logs that

monitor and capture user behavior online, environmental sensors or personal health trackers, and many

other sense data resources.

We also refer to vast amount of social media data, profile updates, tweets, check-ins and photos. With

organizational generated data we refer to more traditional types of data, including transaction information

in databases and structured data open stored in data warehouses.

As manual activities move to the digitized platform, information and minimal cost equipment combine to

step into a space where large amounts of digital information exist on virtually any topic of interest to a

business.

From phones to shopping on online portals, social networks and communication all produce torrents of

data as a by-product of their ordinary operations.

Over a period of time data driven companies have brainstormed as to what they should do with the data

they collect. It necessarily may not entirely fit into any relational databases such as text and web logs. Big

Data offers the promise of unlocking the potential of this data and opens up new avenues for value

creation with regard to correlation of social network and purchase behavior to form a complete profile of

every customer.

Page 8: Paper 6: Management Information System

Exhibit 11: Data Sources

Image Source: https://40uu5c99f3a2ja7s7miveqgqu-wpengine.netdna-ssl.com/wp-content/uploads/2014/10/Big-Data-ecosystem-

from-data-to-decisions-IDC-click-for-full-image.jpg

Big Data, consists of huge volumes of structured, unstructured and se-structured data from both internal

and external sources from which insight and actionable intelligence is sought.

Unstructured data- refers to data that does not conform to a predefined data model. This implies

no relation model and no SQL. It is mostly anything that we don't store in a traditional Relational

database management system. Nearly, 80 to 90% of all data in the world is unstructured and this

number is rapidly growing. Unstructured data generated by people includes images, videos,

audio, internet searches, and emails.

The costs and time of the process acquisition, storage, retrieval and processing of unstructured

data may add up to quite and investment before we can start reaping value from this process. It

can be pretty hard to find the tools and people to implement such a process and reap value in the

end.

Structured Data - Each organization has distinct operation practices and business models, which

result in a variety of data generation platforms. Organizational big data come from online e

commerce transactions, government institution websites, banking or stock records, medical

records, sensors and so on.

Page 9: Paper 6: Management Information System

Almost every event can be potentially digitally stored. Organizations build and apply processes to

record and monitor business events of interest, such as registering a customer, manufacturing a

product or taking an order. These processes streamline data in a structured format which includes

transactions, reference tables, and relationships, as well as the metadata that sets its context.

Various Sources from which Big Data can be captured are listed below and showcased in exhibit 12.

Media exists in-and-out of the organization; it may connect with APIs (Application

Programmable Interface) and is moderately structured. Media files such as images, videos, audio,

flash, live streams, podcasts etc.

Business apps are structured, and using APIs you can pull data from both inside and outside the

organization. For example a CRM or SCM tool integrated with ecommerce system. Externally

using Accu-Weather update for local personalization.

Public web also deals with some very useful applications. For example, business affected by the

daily fluctuation of currency or share prices or gold rates that can be pulled from Google Trends.

Government, traffic, health care services, and other web services are all examples of Public Web

based data.

Sensor data It follows the quadruple characteristic of high velocity, volume, variety and value.

When used correctly to understand user context and predict behavior. Examples of sensors

include temperature, noise, pollution levels, traffic updates and biometrics. Further, car sensors,

traffic recording devices, office buildings, cell towers, jet engines provide information to be

analyzed for various technical aspects.

Machine log data – This refers to mobile or third party services that identify, target and convert

visitors. Event logs, Server data, application logs, call logs, mobile location, mobile app usage

refer to machine log data that can build a repository of information.

Page 10: Paper 6: Management Information System

Exhibit12: Sources of Big Data

Image Source: http://www.getelastic.com/big-data-infographic/

Social media is high velocity, high volume data that can be used to analyze reviews, brand

popularity, visitor count, customer rating target campaigns to social accounts that match the email

addresses in your customer file. Social Media refers to Twitter, Face book, LinkedIn, Blog,

YouTube, and Google+ are some among the various examples.

Docs can exist inside or outside your organization, and like archived data, doesn’t use APIs.

There refer to various xls, pdf, csv, email, word, ppt, plain text files etc.

Page 11: Paper 6: Management Information System

4.1 Who uses Big Data?

Exhibit 13: Big Data Use Cases

Image Source: https://qph.ec.quoracdn.net/main-qimg-acb88951b0b622c598599c894301f2f2

Big data affects organizations across every industry. Various applications are highlighted in exhibit 13.

Each industry can benefit from this onslaught of information.

Banking - With large amounts of information pooled in from countless sources, banks are faced

with finding new and innovative ways to manage big data. While it is imperative to understand

customers and ensure their satisfaction, it’s equally important to minimize risk and fraud while

maintaining regulatory compliance. Big data requires financial institutions to stay abreast with

advanced analytics.

Education - Academicians armed with data-driven analytics can make a significant impact on

school systems, students and curriculums. By analyzing big data, they can identify student

performance at various levels and parameters and segregate students needing more attention,

make sure students are making adequate progress, and can implement a better system for

evaluation and support of teachers and principals.

Government - When government use data analytics to gain significant ground when it comes to

managing utilities, running agencies, dealing with traffic congestion or preventing crime. But

while there are many advantages to big data, governments must also address issues of

transparency, data security and privacy.

Health Care - In healthcare everything needs to be done quickly, accurately – and, in some cases,

with enough transparency to satisfy stringent industry regulations. High degree of accuracy and

Page 12: Paper 6: Management Information System

timely information available to the medical practitioner is imperative. When big data is managed

effectively, health care providers can create patient profiles, maintain medical history.

Manufacturing - Manufacturers focus on enhancing quality and substantially reduce wastage.

Such processes are key in today’s highly competitive market. Manufacturers today apply

business analytics for implementing lean six sigma, just in time, quality control and other such

benchmarks to ensure minimum defects and maximum return.

Retail – Maintaining relationships is of paramount importance to the retail industry – and the best

way to manage is by filtering and segmenting information from big data. Retailers can know

customers better by identifying them through the database, maintain purchase history and cross

sell, manage transactions.

The benefit of big data comes from how it is used and how it is analyzed. With management focus and

creative analytical ability big data can be helpful for multiple use cases.

Introducing a New Coffee Product at Starbucks

Starbucks was introducing a new coffee product but was concerned that customers would find

its taste too strong. The morning that the coffee was rolled out, Starbucks monitored blogs,

Twitter, and niche coffee forum discussion groups to assess customers’ reactions. By mid-

morning, Starbucks discovered that although people liked the taste of the coffee, they thought

that it was too expensive. Starbucks lowered the price, and by the end of the day all of the

negative comments had disappeared.

Compare this fast response with a more traditional approach of waiting for the sales reports to

come in and noticing that sales are disappointing. A next step might be to run a focus group to

discover why. Perhaps in several weeks Starbucks would have discovered the reason and

responded by lowering the price.

Page 13: Paper 6: Management Information System

5. Big Data changing the face of Information Systems

Exhibit 14: Big Data Problem Solving

Image Source: http://istc-bigdata.org/wp-content/uploads/2012/08/ISTC-banner1.png

Reasons as to why Big Data is changing Information Systems and corporate information technology.

1. Move away from traditional RDBMS – From the start of electronic storage and processing of data the

concept of the Relational Database Management System has emerged. It is centric to most of the

computerized corporate information systems. Information systems such as ERP or CRM are well

integrated RDBMS. NoSQL helps to run data base queries to fetch information on a real time basis.

2. Unstructured data handling capability - Capability of handling both data in various formats is a

competent capability of any information system. Variety implies that Big Data is not necessarily text or

numbers (alphanumeric fields), but also unstructured data.

Exhibit15: Where does Big Data Come From?

Image Source: https://www.sas.com/en_us/insights/articles/big-data/big-data-privacy/_jcr_content/socialShareImage.img.png

3. Real Time Data Processing – Harnessing the power of big data also requires the ability to immediately

take action in case of various events. This refers to responding to a query or customer complaint, reacting

to a review or a tweet or handling negative or damaging publicity over the social media. Infact, batch

processing, nightly or weekly updates and even near real time data processing are not good enough when

dealing with data velocity as is the case with Big Data.

Page 14: Paper 6: Management Information System

4. Predictive analytics and in memory analytics - If data is being generated in a variety of formats

(structured and unstructured), in high volume and at a high velocity, only way it can be used effectively

for decision making is through the use of Predictive Analytics and in memory data analytics. Information

systems in future will have to be designed keeping this aspect in mind.

5. Most data are either user or machine generated - Most of big data is captured from multiple touch

points. These may be users/customers (such as social media data) or by machines/sensors outside the

confines or firewall of a company. This is unlike when most of the data were generated within the firewall

of a corporation (such as transaction data, inventory data or factory production data) with very little

coming from outside.

How Organizations are utilizing Big Data

When it comes to using big data for human resources, it can turn around the entire industry upside.

Big data poses critical questions that need to be asked during the hiring process. This can relate to

how people are selected and how much talent is being overlooked, big data is the solution for virtually

any HR department. One of the biggest concerns for most companies is their budget.

Employers want to ensure they are making the most of their resources and finding ways to eliminate

any extra costs. Employers have used big data for hiring talent who would be the best option for their

company. By using big data, employers can analyze applicants to find out which people are likely to

stay with the company once hired and who will enjoy their job. This is huge when it comes to

reducing employee turnover because employers can almost guarantee finding the right employees for

their company.

Xerox has been able to manage turnover using big data to review job applications. Before big data,

Xerox relied solely on candidate experience. However, this company soon discovered personality and

soft skills are more important during the hiring process. Xerox also found that more creative hires

were likely to stay longer than the analytical type of employee.

Google has an entire department dedicated to “People Operations where they use big data to recruit

and hire the best talent for their company. The team is dedicated to “people analytics,” which

measures whether a candidate is a good or bad option based on their soft skills, attitude, work ethic,

and flexibility. This data-driven approach has proven as a great way for Google to maximize their

hiring efforts and reinvent the human resources industry.

Another common way to implement big data into the hiring process is through gamification which is a

great way to improve talent acquisition and management. Through creating a game or mobile app,

you can reward candidates as they complete each step of your hiring process. Job applicants are

expected to play these games for 20 minutes, which helps the employer determine problem solving

skills, creativity, and persistence of each candidate. Gamification allows employers to paint a picture

of their candidates and determine whether the person has the potential to be a leader or innovator for

their organization.

Page 15: Paper 6: Management Information System

6. Benefits of capturing Big Data

Exhibit16: Big Data Benefits

Image Source: http://technosoftwares.com/wp-content/uploads/2016/09/big-data-350x300.png

1. Cost Reductions - Organizations that rely on cost effectiveness largely adopt big data tools primarily on

largely technical and economic criteria. Cost reduction can also be a secondary objective after others have

been achieved.

2. Time Reduction from Big Data – With processing speed of computers and high speed processors have

helped in reducing the cycle time for complex and large-scale analytical calculations from hours or even

days to minutes or seconds.

3. Building Customer Decision Trees - Big data makes it possible for companies to better understand

customers’ shopping behavior at each stage of the “consumer purchase cycle.” By analyzing online

browsing and searching histories, for example, companies can learn the alternatives customers look at

when considering buying a product, the important factors in their final purchasing decision, and how they

put together their shopping baskets—information that can help companies identify valuable up-selling and

cross-selling opportunities.

Companies can also monitor how customers talk about a product on social media, including why they

purchased it, which features they like and dislike, and what would prompt them to purchase it again.

4. Building Customer Satisfaction - A common use-case for Big Data the intelligent use of CRM. Apart from

this it is essential to manage customer experience. Companies understand customer sentiment and adapt

service delivery across our channels accordingly to offer the best possible customer experience.

Page 16: Paper 6: Management Information System

7. Summary

Big Data has been a game changer in the field of knowledge management and data mining. This

revolution has fundamentally changed how information is collected, stored, managed and consumed

thereby transforming the way we work, live and play. The use of Big Data is emerging as a crucial way

for leading companies to gain a competitive edge and outperform their counterparts. Established firms

and new entrants accumulate data or buy databases to leverage on data-driven strategies. Their aim is to

compete and capture value. Big Data helps to create multiple use cases, avenues for business and growth

opportunities and entirely new distinction of companies such as data aggregators who accumulate and

analyze industry data. Big data technologies help to provide accurate analysis, which may lead to more

concrete decision making resulting in greater operational efficiencies, cost reductions, and reduced risks

for the business. Large organizations across industries are joining the data economy. They are not keeping

traditional analytics and big data separate, but are combining them to form a new synthesis.