topiciii_data_resourse_systems.pdf
TRANSCRIPT
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
1/33
Topic 3 DATA RESOURCE MANAGEMENT
MIS 601 : MANAGEMENTINFORMATION SYSTEMS
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
2/33
CONTENTS :
Data Resource ManagementTypes of Databases
Concept of a DBMS
Database Objects
DDLC
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
3/33
Types of Databases
1. Operational Databases2. Distributed Databases
3. Multimedia databases4. Data warehousing
z Data mining
z OLAP
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
4/33
Data Resource Management
It is a Managerial activity that appliesInformation Systems technologies like:-
Database management
Data Warehousing
Other Data Management tools
to the task of managing the organizationsdata resources to meet the information
needs of their business stakeholders
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
5/33
Database
A database is an integrated collection oflogically related data elements.
The data stored in a database areindependent of the application programs
using them & of the type of storagedevices on which they are stored.
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
6/33
Operational Databases
These databases store detailed dataneeded to support business processes &operations of a company.
Other names for this are :- Subject Area Databases (SADB)
Transaction Databases
Production Databases
Examples:- HR Database
Customer Database etc
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
7/33
Operational Databases
Employee Record 1 Employee Record 2
EmployeeCode
EmployeeName
Salary EmployeeCode
EmployeeName
Salary
1001 James $3500 1002 Martha $2750
HR Database
PayrollBenefits
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
8/33
Multimedia databasesMultimedia databases. Multimedia databases
can be defined as the database systems that canstore, manipulate and query informationpresented in more than one format such as text,
audio, video, graphics, and images. Themultimedia databases are of prominence in theworld of computers today and more so for the
flexibility and convenience they offer inrepresenting various forms of objects that wecome across in our everyday lives
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
9/33
Multimedia databasesThishasnecessitatedutilizationofdifferentformsforsoring
multimediaand
they
include:
ImageData:Imagesareverycommonlyfoundinmultimediadatabasesandtheirapplicationscoversimplefigures,icons,medical
imageslikeXraysetc.
VideoFiles:Thesehavebecomeveryimportantwiththeadventoftechnologieslikedistributionofvideoetc.Itisnowmoreconvenient
thanevertostoreahomevideoonapersonalcomputer.
Audiofiles:Thesefilesarebeingusedextensivelytostoreaswellasdistribute
music
and
are
even
eing shared
over
the
internet!
DocumentData:Thesearethetraditionaltextfileswhereinformationisstoredintheformoftext.Thesefilesarestill inuseand
havechangedintermsofthecapabilityofstoragesize.
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
10/33
A sample application that represents a Police application is
presented below:
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
11/33
Distributed database system
A distributed database system consists ofloosely coupled sites that share nophysical component
Database systems that run on each siteare independent of each other
Transactions may access data at one ormore sites
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
12/33
Distributed DatabasesMany Organizations replicate and distribute copies of parts
of databases to network servers at a variety of sites.
These Distributed databases can reside on network serverson the WWW on corporate intranets or extranets etc.
Distributed
Databases
Partitioned
Databases
Duplicate
Databases
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
13/33
Distributed Databases
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
14/33
Distributed Databases
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
15/33
Fragmentation
Horizontal fragmentation: each tuple of
risassigned to one or more fragments
Vertical fragmentation: the schema for relation
ris split into several smaller schemas Example : relation account with following
schema
Account-schema = (branch-name, account-number, balance)
Horizontal Fragmentation of account Relation
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
16/33
Horizontal Fragmentation ofaccount Relation
branch-name account-number balance
HillsideHillside
Hillside
A-305A-226
A-155
500336
62
account1=branch-name=Hillside(account)
branch-name account-number balance
Valleyview
ValleyviewValleyviewValleyview
A-177
A-402A-408A-639
205
100001123750
account2=branch-name=Valleyview(account)
-
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
17/33
branch-name customer-name tuple-id
HillsideHillsideValleyviewValleyviewHillsideValleyviewValleyview
LowmanCampCampKahnKahnKahnGreen
deposit1=branch-name, customer-name, tuple-id(employee-info)
1234567
account number balance tuple-id
50033620510000621123750
12
34567
A-305A-226
A-177A-402A-155A-408A-639
deposit2=account-number, balance, tuple-id(employee-info)
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
18/33
Advantages of Fragmentation Horizontal:
allows parallel processing on fragments of a relation
allows a relation to be split so that tuples are located wherethey are most frequently accessed
Vertical:
allows tuples to be split so that each part of the tuple is storedwhere it is most frequently accessed
tuple-id attribute allows efficient joining of vertical fragments
allows parallel processing on a relation Vertical and horizontal fragmentation can be mixed.
Fragments may be successively fragmented to an arbitrarydepth.
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
19/33
Data warehousing
A data warehouse stores data that havebeen extracted from the variousoperational, external & other databases of
an organization.It is a central source of data that has been cleaned,transformed & catalogued so that they can be used bymanagers & other business professionals for datamining, online analytical processing & other forms of
business analysis, market research & decision support.
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
20/33
Data warehousing
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
21/33
Case Study :Shell Exploration & Production
Various units worldwide each having their own IT resources
(collecting & processing local data)
The fuel company wanted to combine data from its ERPFinancial applications with data from its various systems to
process information on how much GAS& OILthecompany finds & collects?
Steve Much (Data warehouse team leader SHELL Scotland)faced major problems as each system had their set ofcodes.
The option of going back cleansing & integrating data inhost system wasn't an option which was feasible ???
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
22/33
Case Study :Shell Exploration & Production
Steve Mutch found a tool from KALIDOLTD (London) that analysed & mappeddata from various systems & then
combined it into one data warehouse. After 7 months of Data analysis and
mapping work data from 27 data sourcesnow came together in one 450 GBWarehouse.
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
23/33
Case Study :
Shell Exploration & Production
Benefits No single Business Unit lost control of its data. Hence all Business Heads contribute to a greater understanding
of information for the company as a whole.
After this success Mutch faced pressure fromTOP Executives tointegrate data from other applications also.
Hence theTRUE POTENTIAL of a Data warehouse is realized
at SHELL
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
24/33
Database ManagementSystems A Database Management System(DBMS) is a collection
of interrelated data (database) and a set of programs toaccess those data.
A Database Management System(DBMS) is a software
that:-
Defines a database
Stores the data
Supports a Query Language
Produces Reports
Creates data entry screens
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
25/33
Components of
Database Management Systems
Databases/ Files
Database EngineData Dictionary
Query Processor
ReportWriter
FormsGenerrator
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
26/33
Components of
Database Management Systems
Database Engine
Data Dictionary
Query Processor Report Writer
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
27/33
Components of
Database Management Systems
Form Generator
Application Generator
Security
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
28/33
CASE : Experian Automotive Experian Inc a unit of a LONDON based company. It
runs one of the largest credit reporting agencies in the
US. Experian wanted to go beyond credit checks for
automotive loans.
Experian wanted to collect vehicle data from variousMotor vehicle departments in the US, & blend it withother data such as Change of Addressrecords &then it could sell the enhanced data.
To offer these services, Experian first needed a way toextract, transfer and load data from 50 differentUSState Department of Motor Vehicles (DMV) systems
into a single database.
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
29/33
CASE : Experian Automotive This was difficult as per Ken Kauppila, VP(IT) at Experian
Automotive (California) As Each DMV had their own format for
entering data Kaupilla decided to use ETL( Extracting, Transforming & Loading)
tools to combine very large data repositories.
Using ETL EXTRACTby Evolutionary Technologies, Experian
created database that can incorporate vehicle information within48hrs of its entry into any States DMV computer.
Experian automotive database is the 10th largestautomobiledatabase in the world (16 billion rows of data)
The entire relational database is managed by just 3
IT professionals
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
30/33
Database Concepts
Th Th T f A i ti ClTh Th T f A i ti Cl
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
31/33
The Three Types of Associations among ClassesThe Three Types of Associations among Classes
Employee 1
Employee 2
Employee 3
Customer A
Customer B
Customer C
Customer D
Customer E
Employee Salary Account
Employee
Sales 1
Sales 2
Sales 3
1:1
Many:Many
1:many
Designing a Relational Database
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
32/33
Designing a Relational Database
Relational Database Model -- need toinclude a common, unique field betweentables in order to link or "relate" thedifferent tables.
Basic Definitions
Primary key field (column) that is unique for table (a data item
cannot be repeated anywhere in the field -- productnumber, employee id., etc.)
Foreign key A primary key in another table that is used to join
(connect) two tables.
-
7/29/2019 TopicIII_Data_Resourse_Systems.pdf
33/33
Phone Name Address City
312-555-1234 J ones 123 Main Chicago502-555-8876 Smith 456 Oak Glasgow602-555-9987 J uarez 887 Ribera Phoenix612-555-4325 Olsen 465 Thor Minneapolis
Customer Table
Customer Date Salesperson Total_sale502-555-8876 3/3/04 2223 157.92602-555-9987 4/4/04 8876 295.53612-555-4325 4/9/04 8876 132.94502-555-8876 5/7/04 3345 183.67
Orders Table
Relational DatabasesTables
RowsColumns
Primary keys Data types
Text
Dates & times
Numbers
Objects