topiciii_data_resourse_systems.pdf

Upload: rohan-sivaramakrishnan-iyer

Post on 03-Apr-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    1/33

    Topic 3 DATA RESOURCE MANAGEMENT

    MIS 601 : MANAGEMENTINFORMATION SYSTEMS

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    2/33

    CONTENTS :

    Data Resource ManagementTypes of Databases

    Concept of a DBMS

    Database Objects

    DDLC

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    3/33

    Types of Databases

    1. Operational Databases2. Distributed Databases

    3. Multimedia databases4. Data warehousing

    z Data mining

    z OLAP

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    4/33

    Data Resource Management

    It is a Managerial activity that appliesInformation Systems technologies like:-

    Database management

    Data Warehousing

    Other Data Management tools

    to the task of managing the organizationsdata resources to meet the information

    needs of their business stakeholders

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    5/33

    Database

    A database is an integrated collection oflogically related data elements.

    The data stored in a database areindependent of the application programs

    using them & of the type of storagedevices on which they are stored.

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    6/33

    Operational Databases

    These databases store detailed dataneeded to support business processes &operations of a company.

    Other names for this are :- Subject Area Databases (SADB)

    Transaction Databases

    Production Databases

    Examples:- HR Database

    Customer Database etc

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    7/33

    Operational Databases

    Employee Record 1 Employee Record 2

    EmployeeCode

    EmployeeName

    Salary EmployeeCode

    EmployeeName

    Salary

    1001 James $3500 1002 Martha $2750

    HR Database

    PayrollBenefits

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    8/33

    Multimedia databasesMultimedia databases. Multimedia databases

    can be defined as the database systems that canstore, manipulate and query informationpresented in more than one format such as text,

    audio, video, graphics, and images. Themultimedia databases are of prominence in theworld of computers today and more so for the

    flexibility and convenience they offer inrepresenting various forms of objects that wecome across in our everyday lives

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    9/33

    Multimedia databasesThishasnecessitatedutilizationofdifferentformsforsoring

    multimediaand

    they

    include:

    ImageData:Imagesareverycommonlyfoundinmultimediadatabasesandtheirapplicationscoversimplefigures,icons,medical

    imageslikeXraysetc.

    VideoFiles:Thesehavebecomeveryimportantwiththeadventoftechnologieslikedistributionofvideoetc.Itisnowmoreconvenient

    thanevertostoreahomevideoonapersonalcomputer.

    Audiofiles:Thesefilesarebeingusedextensivelytostoreaswellasdistribute

    music

    and

    are

    even

    eing shared

    over

    the

    internet!

    DocumentData:Thesearethetraditionaltextfileswhereinformationisstoredintheformoftext.Thesefilesarestill inuseand

    havechangedintermsofthecapabilityofstoragesize.

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    10/33

    A sample application that represents a Police application is

    presented below:

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    11/33

    Distributed database system

    A distributed database system consists ofloosely coupled sites that share nophysical component

    Database systems that run on each siteare independent of each other

    Transactions may access data at one ormore sites

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    12/33

    Distributed DatabasesMany Organizations replicate and distribute copies of parts

    of databases to network servers at a variety of sites.

    These Distributed databases can reside on network serverson the WWW on corporate intranets or extranets etc.

    Distributed

    Databases

    Partitioned

    Databases

    Duplicate

    Databases

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    13/33

    Distributed Databases

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    14/33

    Distributed Databases

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    15/33

    Fragmentation

    Horizontal fragmentation: each tuple of

    risassigned to one or more fragments

    Vertical fragmentation: the schema for relation

    ris split into several smaller schemas Example : relation account with following

    schema

    Account-schema = (branch-name, account-number, balance)

    Horizontal Fragmentation of account Relation

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    16/33

    Horizontal Fragmentation ofaccount Relation

    branch-name account-number balance

    HillsideHillside

    Hillside

    A-305A-226

    A-155

    500336

    62

    account1=branch-name=Hillside(account)

    branch-name account-number balance

    Valleyview

    ValleyviewValleyviewValleyview

    A-177

    A-402A-408A-639

    205

    100001123750

    account2=branch-name=Valleyview(account)

    -

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    17/33

    branch-name customer-name tuple-id

    HillsideHillsideValleyviewValleyviewHillsideValleyviewValleyview

    LowmanCampCampKahnKahnKahnGreen

    deposit1=branch-name, customer-name, tuple-id(employee-info)

    1234567

    account number balance tuple-id

    50033620510000621123750

    12

    34567

    A-305A-226

    A-177A-402A-155A-408A-639

    deposit2=account-number, balance, tuple-id(employee-info)

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    18/33

    Advantages of Fragmentation Horizontal:

    allows parallel processing on fragments of a relation

    allows a relation to be split so that tuples are located wherethey are most frequently accessed

    Vertical:

    allows tuples to be split so that each part of the tuple is storedwhere it is most frequently accessed

    tuple-id attribute allows efficient joining of vertical fragments

    allows parallel processing on a relation Vertical and horizontal fragmentation can be mixed.

    Fragments may be successively fragmented to an arbitrarydepth.

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    19/33

    Data warehousing

    A data warehouse stores data that havebeen extracted from the variousoperational, external & other databases of

    an organization.It is a central source of data that has been cleaned,transformed & catalogued so that they can be used bymanagers & other business professionals for datamining, online analytical processing & other forms of

    business analysis, market research & decision support.

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    20/33

    Data warehousing

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    21/33

    Case Study :Shell Exploration & Production

    Various units worldwide each having their own IT resources

    (collecting & processing local data)

    The fuel company wanted to combine data from its ERPFinancial applications with data from its various systems to

    process information on how much GAS& OILthecompany finds & collects?

    Steve Much (Data warehouse team leader SHELL Scotland)faced major problems as each system had their set ofcodes.

    The option of going back cleansing & integrating data inhost system wasn't an option which was feasible ???

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    22/33

    Case Study :Shell Exploration & Production

    Steve Mutch found a tool from KALIDOLTD (London) that analysed & mappeddata from various systems & then

    combined it into one data warehouse. After 7 months of Data analysis and

    mapping work data from 27 data sourcesnow came together in one 450 GBWarehouse.

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    23/33

    Case Study :

    Shell Exploration & Production

    Benefits No single Business Unit lost control of its data. Hence all Business Heads contribute to a greater understanding

    of information for the company as a whole.

    After this success Mutch faced pressure fromTOP Executives tointegrate data from other applications also.

    Hence theTRUE POTENTIAL of a Data warehouse is realized

    at SHELL

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    24/33

    Database ManagementSystems A Database Management System(DBMS) is a collection

    of interrelated data (database) and a set of programs toaccess those data.

    A Database Management System(DBMS) is a software

    that:-

    Defines a database

    Stores the data

    Supports a Query Language

    Produces Reports

    Creates data entry screens

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    25/33

    Components of

    Database Management Systems

    Databases/ Files

    Database EngineData Dictionary

    Query Processor

    ReportWriter

    FormsGenerrator

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    26/33

    Components of

    Database Management Systems

    Database Engine

    Data Dictionary

    Query Processor Report Writer

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    27/33

    Components of

    Database Management Systems

    Form Generator

    Application Generator

    Security

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    28/33

    CASE : Experian Automotive Experian Inc a unit of a LONDON based company. It

    runs one of the largest credit reporting agencies in the

    US. Experian wanted to go beyond credit checks for

    automotive loans.

    Experian wanted to collect vehicle data from variousMotor vehicle departments in the US, & blend it withother data such as Change of Addressrecords &then it could sell the enhanced data.

    To offer these services, Experian first needed a way toextract, transfer and load data from 50 differentUSState Department of Motor Vehicles (DMV) systems

    into a single database.

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    29/33

    CASE : Experian Automotive This was difficult as per Ken Kauppila, VP(IT) at Experian

    Automotive (California) As Each DMV had their own format for

    entering data Kaupilla decided to use ETL( Extracting, Transforming & Loading)

    tools to combine very large data repositories.

    Using ETL EXTRACTby Evolutionary Technologies, Experian

    created database that can incorporate vehicle information within48hrs of its entry into any States DMV computer.

    Experian automotive database is the 10th largestautomobiledatabase in the world (16 billion rows of data)

    The entire relational database is managed by just 3

    IT professionals

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    30/33

    Database Concepts

    Th Th T f A i ti ClTh Th T f A i ti Cl

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    31/33

    The Three Types of Associations among ClassesThe Three Types of Associations among Classes

    Employee 1

    Employee 2

    Employee 3

    Customer A

    Customer B

    Customer C

    Customer D

    Customer E

    Employee Salary Account

    Employee

    Sales 1

    Sales 2

    Sales 3

    1:1

    Many:Many

    1:many

    Designing a Relational Database

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    32/33

    Designing a Relational Database

    Relational Database Model -- need toinclude a common, unique field betweentables in order to link or "relate" thedifferent tables.

    Basic Definitions

    Primary key field (column) that is unique for table (a data item

    cannot be repeated anywhere in the field -- productnumber, employee id., etc.)

    Foreign key A primary key in another table that is used to join

    (connect) two tables.

  • 7/29/2019 TopicIII_Data_Resourse_Systems.pdf

    33/33

    Phone Name Address City

    312-555-1234 J ones 123 Main Chicago502-555-8876 Smith 456 Oak Glasgow602-555-9987 J uarez 887 Ribera Phoenix612-555-4325 Olsen 465 Thor Minneapolis

    Customer Table

    Customer Date Salesperson Total_sale502-555-8876 3/3/04 2223 157.92602-555-9987 4/4/04 8876 295.53612-555-4325 4/9/04 8876 132.94502-555-8876 5/7/04 3345 183.67

    Orders Table

    Relational DatabasesTables

    RowsColumns

    Primary keys Data types

    Text

    Dates & times

    Numbers

    Objects