cs 4400 database systems

15

Upload: others

Post on 18-Dec-2021

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: CS 4400 Database Systems
Page 2: CS 4400 Database Systems

CS4400DatabaseSystems

Meeting2:DatamanagementsystemsBrandonMyers

UniversityofIowa

Page 3: CS 4400 Database Systems

DatabasesandDatabasemanagementsystems(DBMS)• Examplesofdatabases

• ExamplesofDBMSs

Page 4: CS 4400 Database Systems

Anexample:onlinemusicstreamingservice• Whatdatamustitcontain?

• Whatcapabilitiesareneeded?

Page 5: CS 4400 Database Systems

Summaryofdatamanagementrequirements1.Abletodescribereal-worldentitiesintermsofstoreddata2.Persistentlystorelargedatasets3.Efficientlyquery&update4.Changestructure(e.g.,addattributes)5.Concurrencycontrol:enablesimultaneousupdates6.Crashrecovery7.Securityandintegrity,provenance

DBMSprovidesthesesothatuserscanfocusonapplicationlogic

Page 6: CS 4400 Database Systems

Peopleanddatabases

1. Appdeveloper:writesprogramsthatupdateandquerythedataintheDB

2. DBdesigner:modelsthedatabychoosingtablesandtheirattributes

3. DBadmin(“DBA”):operatesthedatabase,diagnosesperformanceproblems

4. Dataanalyst:datamining(inferringusefulinformation),dataintegration(combiningdisparatedata)

5. DBMSimplementer:buildstheDBMS

In4400we’lltrytogiveyousomeexperienceinalloftheseroles,although4and5arehugetopicsthatdemandtheirowncourses

Page 7: CS 4400 Database Systems

Datastructuresanddatabases

• InCS2230(orequivalent)wasallaboutdatastructures

• Whatisthedifferencebetweenadatabaseandadatastructure?

Page 8: CS 4400 Database Systems

Howdowerepresentourdataforthepurposesofmakingqueriesandupdates?

Page 9: CS 4400 Database Systems

Howdowerepresentourdataforthepurposesofmakingqueriesandupdates?

DATAMODEL,theinterfacetoyourdata

Wehaveabunchofstudents,eachwithanameandamajor.WhatmightbeagoodDATAMODELforthisdataset?

Page 10: CS 4400 Database Systems

Howdowerepresentourdataforthepurposesofmakingqueriesandupdates?

DATAMODEL,theinterfacetoyourdata

• e.g.,Relational• semi-structured,specificallyXML,JSON• graph• key-value

Page 11: CS 4400 Database Systems

Now,therelationaldatamodel

(seetheboard)

Page 12: CS 4400 Database Systems

SQLandSQLite

(seethenotesinthe.sql file)

Page 13: CS 4400 Database Systems

Datawarehousestodatalakes• Conventionally,businesseswouldhave:

1. Businessoperationssupportedby:aDBMSfortransactions(e.g.,sales,supplychainorders)

2. Businessintelligencesupportedby:aDBMSforstoringastructuredandindexedarchiveofrecentandhistoricaldata(thinklibrary)calledadatawarehouse.Employeesanalyzedthedatatoinformdecisions.

• Today,companieslikeMicrosoftrefertodatalakes,replacingthecarefullymaintaineddatabasesofadatawarehousewithenormousquantitiesofrawdata

• Whenthedataneedstobeanalyzed,itistransformedwithparallelprocessingsystems

• in4400we’llexploresemi-structureddata,parallelprocessing,andnon-relationalsystems(“NoSQL”)

From Chaos to Order | by Wiertz Sébastien

Page 14: CS 4400 Database Systems

Whattodonow

• HW1,whichisdue1/25,11:59pm• startearlybecauseitinvolvesanewtool:sqlite

• LookforthecoursepoliciessurveyintheAnnouncementsofICON,ifyouhaven’ttakenit

Page 15: CS 4400 Database Systems

Attribution

• SomeslidesinspiredorquotedfromUWCSE344• Peopleanddatabases• Datawarehousestodatalakes• Summaryofdatamanagementrequirements• https://courses.cs.washington.edu/courses/cse344/