Presentation - Oracle Exadata as a Research Platform

Upload: kinankazuki104

Post on 03-Jun-2018


TRANSCRIPT

  • 8/12/2019 Presentation - Oracle Exadata as a Research Platform

    1/36

    Oracle Exadata as a Research Platform

    2/36

    Science: A Product of Data Analysis

    [...] mission or the collection of data. Rather, [...] understanding of that data.

    Philosophy of the NASA Science Mission Directorate (SMD)

    3/36

    Oracle's R&D Presence

    National Ignition Facility (Fusion and Laser Research): Database, SecureFiles, Orchestration and Middleware, Virtualization, Data Guard, Grid Control, Storage Management, Partitioning

    CERN / Large Hadron Collider: Database, Streams, Data Guard, Grid Control, Storage Management, Partitioning

    Max Planck Institute: Database, SecureFiles, Data Guard, Grid Control, Storage Management, Partitioning

    NBII.gov (National Biological Information Infrastructure): Middleware, Portal, Spatial
    http://www.nbii.gov/portal/server.

    Jet Propulsion Lab: Database, Grid Control, Partitioning, Storage Management

    4/36

    Future of Scientific Computing and Analysis

    Data Intensive + Collaborative

    Data Intensive Collaborative Science

    5/36

    Data Intensive Collaborative Science

    Drivers: Cost, Complexity, Knowledge Base, Interdependence, Collaboration

    Enablers: Web 2.0, Network Capacity, Virtualization / Grid Technologies, Moore's Law, Standards (JSR/JCR)

    Oracle

    6/36

    Data Challenges for Science

    Stewardship: the long-term preservation of data for anticipated and unanticipated uses

    Integrity/Provenance: data is complete, accurate, verifiable and, if possible, reproducible

    Accessibility: availability of research data to researchers other than those who generate the data, when the data is needed

    [...] appropriate manner, in a verifiable manner, by the appropriate people or resources

    7/36

    Use Cases for Data Sharing

    Reanalysis
    Secondary Analysis
    Replication
    Verification

    3rd party reanalysis using existing initial data.

    8/36

    Subsequent Analysts
    Scientific Community
    Funding Agencies and Foundations

    9/36

    Obstacles to Data Sharing

    Human:
    Lack of Foresight
    Fear of Conflicting Conclusions
    Breach of Confidentiality
    Greater Influence
    Compromising of Potential Profits

    Systematic:
    Origination Rules
    Lack of Standards: Classifying, Archiving, Documenting, Metadata

    10/36

    Lack of Institutional IT Support
    Informal Data Sharing Mechanisms
    Lack of Expertise

    11/36

    12/36

    Research organizations need to efficiently store, analyze, and manage all data

    Structured / Semi-Structured / Unstructured
    XML, PDF
    Database / Filesystem

    The simplicity and performance of file systems make it attractive to store file data in file systems, while keeping relational data in the DB

    13/36

    Problem with File Systems (bfiles)

    Many applications manipulate both files and relational data
    Rich user experience, compliance, business integration

    This split compromises the value of the data.
    Difficulty merging data
    Legacy of Stove-Piped Data
    Disjoint security and auditing models
    Changes cannot be made atomically
    Backup and recovery are fragmented
    Search across relational data and files is difficult
    Space management is complicated
    Separate interfaces and protocols
    Application architecture more complex

    14/36

    Integrating Unstructured Data

    Database 11g
    RFID
    3D
    Binary XML
    Images
    SecureFiles
    DBFS

    15/36

    Disparate Data Types

    Dataset Category          Examples                        Data Type
    Optics Metrology          Optics Measurements             XML, Other
    Production checklists     LRU manufacturing checklist     XLS
    Calibration               EngNode Sensitivity, Cal ATP    XML, Other
    OI Inspection             DMS, IMS, CIM, VIDAR labs       Images (jpeg, GIF)
    OI Inspection Online      FODI, PODI, LOIS                Images (jpeg, GIF)
    Auto Alignment            AA Samples                      Images
    Target Diagnostic Raw     SXI, Dante, FABS                HDF5, Other
    Laser Diagnostics Raw     Energy Node, ISP Cal            HDF5, Other
    [...] Analysis Results    Analyzed Data                   [...], Other
    Operations                Environmental                   Scalar

    16/36

    Database Filesystems

    Bridge the Gap between Filesystems and Relational Database Systems
    Maintain Filesystem Performance
    Leverage multiple access methods
    Single Security Mechanism
    Unified Administrative Tools
    Data Pedigree
    Unified Architecture and Skillsets
    Leverage Institutional Resources for IT
    Enabling Collaboration around Data
    Optimized for Data Access

    Filesystems / Databases

    17/36

    Database Filesystems

    DBFS is a filesystem in the database: it uses the database for storage and brings all of database technology to filesystems

    Fuse Client

    DBFS implements the file system interfaces:
    2 methods (getpath, list) for a read-only filesystem
    5 methods for a filesystem with read and write support
    15 methods for a fully functional POSIX filesystem

    The DBFS interface is extensible for easily defining special-purpose filesystems

    DBFS can surface one or more DB tables as a filesystem, or a single table through multiple filesystems

    Example: a CheckImages table can have 2 filesystems on it:
    /CheckImages_by_customer/CustomerName/check.jpg
    /CheckImages_by_date/2008/September/check.jpg
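The CheckImages example can be made concrete with a small sketch. This is not DBFS or any Oracle API: the `CheckImage` record, the `ReadOnlyView` class, and the sample row are hypothetical, and only the two method names (`getpath`, `list`) and the path layout are taken from the slide. It simply shows how the same rows might be exposed through two different read-only directory trees.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class CheckImage:
    """One row of a hypothetical CheckImages table."""
    customer: str
    year: int
    month: str
    filename: str
    content: bytes


class ReadOnlyView:
    """A read-only 'filesystem' over table rows, exposing only getpath and list."""

    def __init__(self, root: str, rows: List[CheckImage],
                 path_of: Callable[[CheckImage], str]) -> None:
        # Map each row to a full path under this view's root directory.
        self._files: Dict[str, CheckImage] = {
            f"/{root}/{path_of(row)}": row for row in rows
        }

    def getpath(self, path: str) -> bytes:
        """Return the file content stored at an exact path."""
        return self._files[path].content

    def list(self, directory: str) -> List[str]:
        """Return the sorted names of the immediate children of a directory."""
        prefix = directory.rstrip("/") + "/"
        children = {
            p[len(prefix):].split("/", 1)[0]
            for p in self._files if p.startswith(prefix)
        }
        return sorted(children)


# A single (made-up) row, surfaced through two views of the same data.
rows = [CheckImage("CustomerName", 2008, "September", "check.jpg", b"\xff\xd8")]

by_customer = ReadOnlyView(
    "CheckImages_by_customer", rows, lambda r: f"{r.customer}/{r.filename}")
by_date = ReadOnlyView(
    "CheckImages_by_date", rows, lambda r: f"{r.year}/{r.month}/{r.filename}")
```

Both views return the same bytes for the same row; only the directory layout differs, which is the point the slide makes about one table backing multiple filesystems.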

    18/36

    Database Filesystems built on SecureFiles

    A new database feature designed to break the performance barrier keeping file data out of databases
    Similar to LOBs but much faster, and with more capabilities
    Transparent encryption (with Advanced Security Option)
    Compression, deduplication (with Advanced Compression Option)
    Preserves the security, reliability, and scalability of the database
    Superset of LOB interfaces allows easy migration from LOBs
    Enables consolidation of file data with associated relational data

    Single security model
    Single view of data
    Single management of data
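De-duplication is listed among the SecureFiles capabilities. The slides do not say how it is implemented, so the following is only a generic illustration of one common approach, content hashing, in which identical payloads are stored once and referenced by digest. The `DedupStore` class and the file names are hypothetical.

```python
import hashlib
from typing import Dict


class DedupStore:
    """Toy content-addressed store: identical payloads are kept only once."""

    def __init__(self) -> None:
        self._blobs: Dict[str, bytes] = {}   # digest -> payload, stored once
        self._names: Dict[str, str] = {}     # file name -> digest

    def put(self, name: str, payload: bytes) -> None:
        digest = hashlib.sha256(payload).hexdigest()
        self._blobs.setdefault(digest, payload)  # keep the first copy only
        self._names[name] = digest

    def get(self, name: str) -> bytes:
        return self._blobs[self._names[name]]

    def stored_bytes(self) -> int:
        """Total bytes physically held, after de-duplication."""
        return sum(len(b) for b in self._blobs.values())


store = DedupStore()
store.put("run1/image.jpg", b"same pixels")
store.put("run2/image.jpg", b"same pixels")   # duplicate content, no extra storage
store.put("run3/image.jpg", b"other pixels")
```

Three names are stored, but only two payloads occupy space; every name still reads back its full content.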

    19/36

    SecureFiles Detail

    Base Table: Oracle table holding metadata plus locator columns similar to a b-file pointer.

    Delta Update
    Management
    Encryption
    Compression
    De-duplication
    Inode Management
    IO Management
    Space Management

    20/36

    Pedigree with a database filesystem

    3/19/2010 20

    21/36

    22/36

    Oracle Exadata

    Oracle Exadata provides a midrange capacity computing platform that can meet the needs of many data-intensive scientific programs at a cost much lower than traditional scientific platforms. When combined with additional [...], [...] intensive and IO-intensive scientific program requirements.

    23/36

    Capacity Computing: [...] expensive clusters of systems to run parallel problems requiring modest computational power

    Capability Computing: Using the most powerful supercomputers to solve the largest and most demanding problems with the intent to minimize time to solution

    24/36

    Modern databases have much to offer in the realm of [...] data

    Spatial Data Analysis
    Text Mining of Unstructured Content

    25/36

    Some of the native data mining techniques and algorithms available

    Technique               Algorithms
    Classification          Logistic Regression, Naive Bayes, Support Vector Machine, Decision Tree
    Regression              Multiple Regression
    Attribute Importance    Minimum Description Length
    Clustering              Enhanced K-Means, Orthogonal Partitioning Clustering
    Association             Apriori
    Feature Extraction      Nonnegative Matrix Factorization

    26/36

    Sun Oracle Database Machine Hardware

    Complete, Preconfigured, Tested for Performance
    Database Servers
    InfiniBand Switches
    Ethernet Switch
    Precabled
    Keyboard, Video, Mouse (KVM)
    Power Distribution Units (PDUs)

    Ready to Deploy
    Plug in power
    Connect to Network

    27/36

    28/36

    Sun Fire X4170 Database Reference Server

    Processors           2 Quad-Core Intel Xeon E5540 Processors (2.53 GHz)
    Memory               72 GB
    Local Disks          4 x 146 GB 10K RPM SAS Disks
    Disk Controller      Disk Controller HBA with 512 MB Battery Backed Cache
    Network              2 InfiniBand 4X QDR (40 Gb/s) Ports (Dual-port HCA)
                         4 Embedded Gigabit Ethernet Ports
    Remote Management    1 Ethernet port (ILOM)
    Power                Redundant

    29/36

    Sun Oracle Exadata Storage Servers

    Processors           2 Quad-Core Intel Xeon E5540 Processors (2.53 GHz)
    Memory               24 GB
    Disks                12 x 600 GB 15K RPM SAS
                         OR 12 x 2 TB 7.2K RPM SATA
    Flash                4 x 96 GB Sun Flash Accelerator F20 PCIe Cards
    Disk Controller      Disk Controller HBA with 512 MB Battery Backed Cache
    Network              2 InfiniBand 4X QDR (40 Gb/s) Ports (Dual-port HCA)
                         4 Embedded Gigabit Ethernet Ports
    Remote Management    1 Ethernet port (ILOM)
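The disk and flash figures above translate into per-server raw capacity with simple arithmetic. The sketch below works it out, assuming decimal units (1 TB = 1000 GB). The numbers are raw totals only; usable capacity after mirroring and formatting would be lower and is not stated on the slide.

```python
# Per-server figures taken from the slide; decimal units (1 TB = 1000 GB) are assumed.
DISKS_PER_SERVER = 12
SAS_DISK_GB = 600
SATA_DISK_GB = 2000
FLASH_CARDS = 4
FLASH_CARD_GB = 96


def raw_capacity_gb(disks: int, disk_gb: int) -> int:
    """Raw (unmirrored, unformatted) disk capacity of one storage server in GB."""
    return disks * disk_gb


sas_gb = raw_capacity_gb(DISKS_PER_SERVER, SAS_DISK_GB)
sata_gb = raw_capacity_gb(DISKS_PER_SERVER, SATA_DISK_GB)
flash_gb = FLASH_CARDS * FLASH_CARD_GB

print(f"SAS:   {sas_gb / 1000:.1f} TB raw per server")
print(f"SATA:  {sata_gb / 1000:.1f} TB raw per server")
print(f"Flash: {flash_gb} GB per server")
```

The SATA option trades spindle speed (7.2K vs 15K RPM) for a little over three times the raw capacity per server.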

    30/36

    InfiniBand Network

    Unified InfiniBand Network
    Storage Network
    External Connectivity (optional)

    High Performance, Low Latency Network
    [...] Gb/s bandwidth per link ([...] Gb/s each direction)
    SAN-like Efficiency (Zero copy, buffer reservation)
    Simple manageability like IP network

    Protocols
    Zero-copy Zero-loss Datagram Protocol (ZDP, RDSv3)
    Linux Open Source, Low CPU overhead (Transfer 3 GB/s with 2% CPU usage)
    Internet Protocol over InfiniBand (IPoIB)
    Looks like normal Ethernet to host software (tcp/ip, udp, http, ssh, ...)
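As a rough sanity check on the quoted 3 GB/s transfer figure, the following sketch relates it to the payload rate of a 4X QDR link. The slides give only the 40 Gb/s figure; the per-lane signaling rate of 10 Gb/s and the 8b/10b line encoding are standard properties of InfiniBand QDR that are assumed here, not taken from the presentation.

```python
LANES = 4                      # "4X" link width
LANE_SIGNAL_GBPS = 10.0        # QDR signaling rate per lane (assumed, standard QDR)
ENCODING_EFFICIENCY = 8 / 10   # 8b/10b line encoding (assumed, standard QDR)

signal_gbps = LANES * LANE_SIGNAL_GBPS           # 40 Gb/s, the figure quoted on the slides
data_gbps = signal_gbps * ENCODING_EFFICIENCY    # payload gigabits per second, one direction
data_gbytes_per_s = data_gbps / 8                # payload gigabytes per second

quoted_transfer_gbytes_per_s = 3.0               # "Transfer 3 GB/s" from the slide
utilization = quoted_transfer_gbytes_per_s / data_gbytes_per_s

print(f"payload rate: {data_gbps:.0f} Gb/s = {data_gbytes_per_s:.0f} GB/s per direction")
print(f"3 GB/s is {utilization:.0%} of that payload rate")
```

Under these assumptions the quoted transfer rate sits at about three quarters of what a single link direction can carry, before any protocol overhead is considered.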

    31/36

    InfiniBand Network

    Uses Sun Datacenter 36-port Managed QDR (40 Gb/s) InfiniBand switches
    Runs subnet manager and automatically discovers network topology
    Only one subnet manager active at a time
    2 leaf switches to connect individual server IB ports
    1 spine switch in Full Rack for scaling out to additional Racks

    Database Server and Exadata Servers
    Each server has Dual-port QDR (40 Gb/s) IB HCA
    Performance is limited by PCIe bus, so active-active not needed
    Connect one port from the HCA to one leaf switch and the other port to the second leaf switch for redundancy
    Connections pre-wired in the Factory

    32/36

    Scaling Out to Multiple Full Racks

    Single InfiniBand Network
    Switch to a Fat Tree Topology
    Valid up to 8 Racks
    Every leaf node inter-connected with every spine switch
    Spine switches not connected with other spine switches
    Database and Exadata Server cabling unchanged.

    Inter-rack cabling done at installation time
    Up to 3 Racks: Extra cables already included with each machine
    Greater than 3 Racks
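The fat-tree rule described on this slide (every leaf connects to every spine, spines never connect to each other) can be sketched as follows. The switch naming scheme and the `fat_tree_links` function are hypothetical; the 2 leaf switches and 1 spine switch per full rack, and the 8-rack limit, come from the slides. The sketch counts distinct leaf-to-spine pairs, not physical cables, since the number of cables per pair is not given.

```python
from itertools import product
from typing import List, Tuple


def fat_tree_links(racks: int) -> List[Tuple[str, str]]:
    """Leaf-to-spine connections for `racks` full racks (2 leaves, 1 spine each)."""
    if not 1 <= racks <= 8:
        raise ValueError("the slides describe this topology as valid up to 8 racks")
    leaves = [f"rack{r}-leaf{i}" for r in range(1, racks + 1) for i in (1, 2)]
    spines = [f"rack{r}-spine" for r in range(1, racks + 1)]
    # Every leaf connects to every spine; spines never connect to each other.
    return list(product(leaves, spines))


for n in (1, 2, 3, 8):
    print(n, "racks ->", len(fat_tree_links(n)), "leaf-to-spine connections")
```

With 2 leaves and 1 spine per rack, the number of leaf-to-spine pairs grows as 2 x racks x racks, which helps explain why inter-rack cabling becomes a planning item beyond a few racks.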

    33/36

    InfiniBand Network: External Connectivity

    External connectivity ports for
    Connect to more Exadata servers for on-disk backup
    Connect to media servers for Tape backup
    Data Loading
    Client/Application Access

    Validated InfiniBand cable lengths
    Up to 5 m Passive Copper 4X QDR QSFP cables
    Up to 50 m Fiber Optic 4X QDR QSFP cables (more expensive)

    Use available ports on the two Leaf switches
    12 in the Full Rack (6 per leaf switch)
    48 in the Quarter Rack (24 per leaf switch)
    32 in the Single Server Configuration

    34/36

    External Connectivity: Ethernet

    Per Database Machine

    Admin Access
    1 port from Admin Ethernet switch
    1 port from KVM Switch
    [...], KVM or Ethernet switch provided and the ILOM and management ports are connected to data center network

    Database/Client/Application Access
    Minimum 1 port per X4170
    2 more Ethernet ports per X4170 available
    Can use them for bonded client/application access or for [...]

    35/36

    Conclusion

    [...] knowledge and new discoveries.

    Oracle has a number of features which can benefit the scientific community and ease the burden of [...] pedigree, data management, and analysis

    Using a database filesystem will enable data-intensive collaborative science.

    As new discoveries are made and data volumes [...], [...] system that is not only capable of managing the pedigree of that data, but can also serve as a knowledge [...].

    Exadata provides an ideal platform for program consolidation and scientific collaboration

    36/36

    http://search.oracle.com

    or

    http://www.oracle.com/