data warehousing and data mining

28
01/01/22 01:24 PM 1 DATA WAREHOUSING AND DATA MINING PRESENTED BY :- ANIL SHARMA B-TECH(IT)MBA-A REG NO : 3470070100 PANKAJ JARIAL BTECH(IT)MBA-A REG NO : 3470070086

Upload: lovely-professional-university

Post on 20-May-2015

55.654 views

Category:

Business


0 download

DESCRIPTION

ALL ABOUT DATA WAREHOUSING AND DATA MINING

TRANSCRIPT

Page 1: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 1

DATA WAREHOUSING AND DATA MINING

PRESENTED BY:-

ANIL SHARMA B-TECH(IT)MBA-A

REG NO : 3470070100

PANKAJ JARIALBTECH(IT)MBA-A

REG NO : 3470070086

Page 2: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 2

DATA WAREHOUSING

Data warehousing is combining data from multiple sources into one comprehensive and easily manipulated database.

The primary aim for data warehousing is to provide businesses with analytics results from data mining, OLAP, Scorecarding and reporting.

Page 3: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 3

NEED FOR DATA WAREHOUSING

Information is now considered as a key for all the works.

Those who gather, analyze, understand, and act upon information are winners.

Information have no limits, it is very hard to collect information from various sources, so we need an data warehouse from where we can get all the information.

Page 4: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 4

TODAYS BUISNESS INFORMATION

Page 5: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 5

DATA WAREHOUSING INCLUDES:-

Retrieving data

Analyzing data

Extracting data

Loading data

Transforming data

Managing data

Page 6: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 6

DATA WAREHOUSE ARCHITECTURE

Data warehousing is designed to provide an architecture that will make cooperate data accessible and useful to users.

There is no right or wrong architecture. The worthiness of the architecture can be

judge by its use, and concept behind it . Data Warehouses can be architected in

many different ways, depending on the specific needs of a business. 

Page 7: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 7

Typical Data Warehousing Environment

Page 8: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 8

An operational data store (ODS) is basically a database that is used for being an temporary storage area for a datawarehouse.

Its primary purpose is for handling data which are progressively in use.

Operational data store contains data which are constantly updated through the course of the business operations.

Page 9: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 9

ETL (Extract, Transform, Load) is used to copy data from:-

ODS to data warehouse staging area. Data warehouse staging area to data warehouse

. Data warehouse to data mart . ETL extracts data, transforms values of

inconsistent data, cleanses "bad" data, filters data and loads data into a target database. 

Page 10: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 10

The Data Warehouse Staging Area is temporary location where data from source systems is copied. 

It increases the speed of data warehouse architecture.

It is very essential since data is increasing day by day.

Page 11: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 11

The purpose of the Data Warehouse is to integrate corporate data.

The amount of data in the Data Warehouse is massive.  Data is stored at a very deep level of detail.

This allows data to be grouped in unimaginable ways.

Data Warehouses does not contain all the data in the organization ,It's purpose is to provide base that are needed by the organization for strategic and tactical decision making.  

Page 12: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 12

ETL extract data from the Data Warehouse and send to one or more Data Marts for use of users.

Data marts are represented as shortcut to a data warehouse ,to save time.

It is just an partition of data present in data warehouse.

Each Data Mart can contain different combinations of tables, columns and rows from the Enterprise Data Warehouse. 

Page 13: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 13

REASONS FOR CREATING AN DATA MART

Easy access to frequently needed data. Creates collective view by a group of users. Improves user response time. Ease of creation. Lower cost than implementing a full Data

warehouse

Page 14: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 14

DATA MINING

The non-trivial extraction of implicit, previously unknown, and potentially useful information from large databases.

– Extremely large datasets – Useful knowledge that can improve processes – Cannot be done manually

Page 15: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 15

Where Has it Come From ?

Page 16: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 16

Motivation

Databases today are huge:

– More than 1,000,000 entities/records/rows

– From 10 to 10,000 fields/attributes/variables

– Giga-bytes and tera-bytes Databases a growing at an unprecendented rate The corporate world is a cut-throat world

– Decisions must be made rapidly

– Decisions must be made with maximum knowledge

Page 17: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 17

How does data mining work?

Extract, transform, and load transaction data onto the data warehouse system.

Store and manage the data in a multidimensional database system.

Provide data access to business analysts and information technology professionals.

Analyze the data by application software. Present the data in a useful format, such as a graph

or table

Page 18: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 18

DATA MINING MEASURES

Accuracy Clarity Dirty Data Scalability Speed Validation

Page 19: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 19

Typical Applications of Data Mining

Page 20: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 20

ADVANTAGES OF DATA MINING

Engineering and Technology Medical Science Business Combating Terrorism Games Research and Development

Page 21: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 21

Engineering and Technology

In Electrical Power Engineering

- used for condition monitoring of high

voltage electrical equipment

- vibration monitoring and analysis of

transformer on-load tap-changers Education

- to concentrate their knowledge

Page 22: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 22

Medical Science

Data mining has been widely used in area of bioinformatics , genetics

DNA sequences and variability in disease susceptibility which is very important to help improve the diagnosis, prevention and treatment of the diseases

Page 23: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 23

BUSINESS

In Customer Relationship Management applications

It Translate data from customer to merchant Accurately

Distribute Business Processes Powerful Tool For Marketing

Page 24: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 24

Combating terrorism

Concept used by Interpol against terrorists for searching their records by Multistate Anti-Terrorism Information Exchange

In the Secure Flight program , Computer Assisted Passenger Pre screening System , Semantic Enhancement

Page 25: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 25

Games

for certain combinatorial games, also called table bases (e.g. for 3x3-chess)

It includes extraction of human-usable strategies

Berlekamp in dots-and-boxes and Joh Nunn in chess endgames are notable examples

Page 26: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 26

Research And Development

Helps to Develop the search algorithms It offers huge libraries of graphing and

visualisation softwares The users can easily create the models

optimally

Page 27: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 27

List of the top eight data-mining software vendors in 2008

Angoss Software Infor CRM Epiphany Portrait Software SAS G-Stat SPSS ThinkAnalytics Unica Viscovery

Page 28: DATA WAREHOUSING AND DATA MINING

04/12/23 05:59 AM 28

THANK YOU