Data Warehousing at The University at Albany

Wendell G. Lorang, Director of Institutional Research, University at Albany, SUNY

AIRPO Winter 2006 Conference, Albany, New York


Page 1: Data Warehousing at The University at Albany Wendell G. Lorang Director of Institutional Research University at Albany, SUNY AIRPO Winter 2006 Conference

Data Warehousing at The University at Albany

Wendell G. Lorang

Director of Institutional Research

University at Albany, SUNY

AIRPO Winter 2006 Conference

Albany, New York

Page 2: Overview

UAlbany implemented a Student Records data warehouse that denormalizes the data from over 60 PeopleSoft tables into seven tables. More than 100 queries are available to departments to use to obtain the information they need.

Page 3: Presentation

• Our PeopleSoft Implementation

• Problem Description

• The Solution

• The Issues

Page 4: PeopleSoft at UAlbany

1996: Project Began

March 1999: Undergrad Prospects

June 2000: Human Resources

November 2000: Undergraduate and Graduate Admissions

March 2001: Orientation and Summer Planning Conferences

Page 5: PeopleSoft at UAlbany

June 2003: Student Records

January 2004: Financial Aid

May 2005: Full Student Financials

Page 6: The Problem – A Common Question

“Can I get a list of all of the grad students in my department who are currently registered?”

Page 7: The Problem – A Common Question

What data fields are needed?

Term, Empl ID, Name, Career, Department, Program, Registered Credits, Full/Part Time, Admit Term, Statute of Limitations Term, Advisor, Last Term Registered, Total Credits

Page 8: The Solution - Using Query Tool

• Getting the 13 fields requires linking to 9 tables.

• Logic to find the most recent program/plan that has not been discontinued is too complicated for the query tool.
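The "most recent program/plan that has not been discontinued" rule is easier to state in ordinary code than in a point-and-click query tool. The Python sketch below is one reading of that rule, not the logic UAlbany actually used: for each program it keeps the student's latest row, drops programs whose latest row is a discontinuation, and returns the most recent survivor. The `ProgramRow` fields, the `DISC` action code, and the sample program names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class ProgramRow:
    emplid: str
    program: str
    effective_date: str  # ISO "YYYY-MM-DD", so string order equals date order
    action: str          # hypothetical codes; "DISC" marks a discontinuation


def current_program(rows: List[ProgramRow], emplid: str) -> Optional[str]:
    """Return the most recent program whose latest row is not a discontinuation."""
    latest: Dict[str, ProgramRow] = {}
    for row in rows:
        if row.emplid != emplid:
            continue
        seen = latest.get(row.program)
        if seen is None or row.effective_date > seen.effective_date:
            latest[row.program] = row  # keep only the newest row per program
    live = [r for r in latest.values() if r.action != "DISC"]
    if not live:
        return None
    return max(live, key=lambda r: r.effective_date).program


rows = [
    ProgramRow("001", "MA-HIST", "2003-09-01", "MATR"),
    ProgramRow("001", "MA-HIST", "2004-05-15", "DISC"),
    ProgramRow("001", "PHD-HIST", "2004-09-01", "MATR"),
    ProgramRow("002", "MS-CSI", "2003-09-01", "MATR"),
    ProgramRow("002", "MS-CSI", "2005-01-10", "DISC"),
    ProgramRow("003", "BA-ENG", "2003-09-01", "MATR"),
    ProgramRow("003", "BS-BIO", "2004-09-01", "MATR"),
    ProgramRow("003", "BS-BIO", "2005-01-10", "DISC"),
]

print(current_program(rows, "001"))
```

Student 003 shows why a plain "latest row" lookup is not enough: the newest row belongs to a discontinued program, so the function falls back to the older program that is still live.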

Page 9: Criteria

Page 10: Generated SQL

Page 11: The Problem

• Too many PeopleSoft tables, and users don’t know the table structure

• Need to join many tables to get data

• Queries run slowly against tables because of joins and the security tree

Page 12: What Are We Going to DO???

Write SQRs?

Write Queries?

Buy a reporting product?

Page 13: THE SOLUTION

A CUSTOM UNIVERSITY AT ALBANY DATA WAREHOUSE

Page 14: The Solution – Student Data Warehouse

• Primary goals

    • User Access

    • Answer many varied questions/needs

    • Real-time management/monitoring tool

Page 15: Design and Implementation

• Design led by Institutional Research

• Met with departments and worked with programmer to define the table organization

• Defined the fields to be included

• Defined the population of students to be included

• IR tested

Page 16: The Solution – Student Data Warehouse

Data NOT included:

• Scores (ACT, SAT, high school & transfer GPAs, GRE, etc.)

• Student Billing data

• Financial Aid

• Racial/ethnic data

• Disability information

• Student’s SSN

Page 17: The Issues

• What students to include?

• How many semesters to include?

• How to define registered?

• How to handle addresses?

Page 18: The Issues

What students to include?

Any student who was registered or got a grade (even a W or Z) is included.

In addition, all active (eligible to register) students are included.
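The population rule above reduces to a three-way "or". A minimal Python sketch follows, assuming each student-term record carries a registration indicator, an optional grade, and an active indicator; the function and parameter names are hypothetical.

```python
from typing import Optional


def include_student(was_registered: bool, grade: Optional[str], is_active: bool) -> bool:
    """Apply the population rule to one student in one term."""
    # Any recorded grade counts, including W and Z; blank strings do not.
    got_grade = grade is not None and grade.strip() != ""
    return was_registered or got_grade or is_active


# A student who withdrew and holds only a W grade is still included.
print(include_student(False, "W", False))
```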

Page 19: The Issues

What semesters to include?

• Current

• Future (for Advance Registration)

• Past (comparative purposes)

Page 20: The Issues

How to Define Registered?

a) Registered Flag showing that a student was registered at some point in the semester but subsequently withdrew (with a grade) from all courses

b) Presently Registered Flag indicates that the student is currently (as of the day before) registered
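The two flags can be derived from a student's enrollment rows for a term. The Python sketch below takes one reading of the slide: the Registered flag is set whenever any enrollment row exists, so it stays set after a full withdrawal, while the Presently Registered flag needs at least one row that is still enrolled in the latest snapshot. The `Enrollment` class and the `E`/`W` status codes are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Enrollment:
    course: str
    status: str  # hypothetical: "E" = enrolled, "W" = withdrew with a grade


def registration_flags(enrollments: List[Enrollment]) -> Tuple[bool, bool]:
    """Return (registered, presently_registered) for one student and term."""
    registered = len(enrollments) > 0
    presently_registered = any(e.status == "E" for e in enrollments)
    return registered, presently_registered


withdrew_all = [Enrollment("HIS 601", "W"), Enrollment("HIS 602", "W")]
print(registration_flags(withdrew_all))
```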

Page 21: The Issues

How to Handle Addresses?

Developed concept of Priority Address -- the student’s ‘closest’ address

If a student lives on campus, that is their Priority Address; if not on campus, but locally, that is the Priority Address; otherwise, their permanent address is their Priority Address.
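The Priority Address rule is an ordered fallback: campus first, then local, then permanent. A minimal Python sketch, assuming each student's addresses arrive as a mapping keyed by address type; the type codes `CAMPUS`, `LOCAL`, and `PERMANENT` and the sample addresses are hypothetical.

```python
from typing import Dict, Optional

# Hypothetical address-type codes, listed from closest to farthest.
PRIORITY_ORDER = ("CAMPUS", "LOCAL", "PERMANENT")


def priority_address(addresses: Dict[str, str]) -> Optional[str]:
    """Return the first non-empty address in priority order, or None."""
    for kind in PRIORITY_ORDER:
        value = addresses.get(kind)
        if value:
            return value
    return None


student = {"LOCAL": "12 Elm St, Albany NY", "PERMANENT": "9 Oak Ave, Buffalo NY"}
print(priority_address(student))
```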

Page 22: The Issues

• Last Enrollment Term

• Deceased Students

• Managing Size of the Data Warehouse

• FERPA

Page 23: The Solution – Student Data Warehouse

• Active students for past and future semesters

• Academic program information

• Student and Course Enrollments

• Course information

Page 24: The Solution – A Student Data Warehouse

• Data on majors (demographics, academic performance, registered vs. not-registered, etc.)

• Demographic and academic characteristics of students in classes

• Output to prepare labels

Page 25: The Solution – Student Data Warehouse

• Data from more than 60 PeopleSoft tables are placed in seven tables

• Tables reside in the same instance as our Production database

• Queries are quick – no performance or response time issues
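Denormalizing means doing the joins once, when the warehouse table is built, so that later queries read a single wide table. The Python sketch below illustrates the idea with an in-memory SQLite database. The `src_*` and `dw_student` tables, their columns, and the sample rows are hypothetical and far smaller than the real PeopleSoft or UADW tables.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical normalized source tables.
    CREATE TABLE src_term (emplid TEXT, term TEXT, credits REAL);
    CREATE TABLE src_names (emplid TEXT, name TEXT);
    CREATE TABLE src_prog (emplid TEXT, program TEXT, department TEXT);

    INSERT INTO src_term VALUES ('001', 'Fall 2005', 9), ('002', 'Fall 2005', 3);
    INSERT INTO src_names VALUES ('001', 'Lee, Ana'), ('002', 'Roy, Sam');
    INSERT INTO src_prog VALUES ('001', 'PHD', 'History'), ('002', 'MS', 'Biology');

    -- One wide table built once, so users never write the joins themselves.
    CREATE TABLE dw_student AS
    SELECT t.term, t.emplid, n.name, p.program, p.department, t.credits
    FROM src_term t
    JOIN src_names n ON n.emplid = t.emplid
    JOIN src_prog p ON p.emplid = t.emplid;
""")

# A department query now touches one table and needs no joins.
history = conn.execute(
    "SELECT name, program, credits FROM dw_student WHERE department = ? ORDER BY name",
    ("History",),
).fetchall()
print(history)
```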

Page 26: The Solution – Student Data Warehouse

• Refreshed daily based on parameters

• Parameters (terms to be included) can be reset each day

• DW currently includes Fall 2003 through Fall 2006
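A parameter-driven refresh can be pictured as a rebuild that keeps only the terms named in the day's parameter list. The Python sketch below assumes a full rebuild on every run, which the slides do not confirm; the `refresh` function and the row layout are hypothetical.

```python
from typing import Dict, Iterable, List

Row = Dict[str, str]


def refresh(source_rows: Iterable[Row], terms: Iterable[str]) -> List[Row]:
    """Rebuild the warehouse contents from scratch for the configured terms."""
    wanted = set(terms)
    return [dict(row) for row in source_rows if row["term"] in wanted]


source = [
    {"emplid": "001", "term": "Fall 2003"},
    {"emplid": "001", "term": "Spring 2003"},
    {"emplid": "002", "term": "Fall 2006"},
]

# Day 1: the parameters cover two terms.
day1 = refresh(source, ["Fall 2003", "Fall 2006"])
# Day 2: the parameters are reset to one term, so the next rebuild shrinks.
day2 = refresh(source, ["Fall 2006"])
print(len(day1), len(day2))
```

Dropping an old term from the parameter list is also one simple way to manage the size of the warehouse.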

Page 27: Queries

• Over 200 public queries written by IR

• 90 queries primarily for academic departments

• Return data elements depending on requirements

• Prompts for fields such as term, department, plan, advisor, course

• Can link to other PeopleSoft tables
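A prompted query is a fixed query whose filter values are supplied at run time. The Python sketch below answers the earlier question about currently registered graduate students in a department, using SQLite named parameters to stand in for PeopleSoft Query prompts. The `dw_student` table, its columns, the `GRAD` career code, and the sample rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dw_student (
        term TEXT, emplid TEXT, name TEXT, career TEXT,
        department TEXT, presently_registered INTEGER
    );
    INSERT INTO dw_student VALUES
        ('Fall 2005', '001', 'Lee, Ana', 'GRAD', 'History', 1),
        ('Fall 2005', '002', 'Roy, Sam', 'GRAD', 'History', 0),
        ('Fall 2005', '003', 'Kim, Jo', 'UGRD', 'History', 1),
        ('Fall 2005', '004', 'Diaz, Eva', 'GRAD', 'Biology', 1);
""")


def registered_grads(conn, term, department):
    """Run the fixed query with the two prompted values bound as parameters."""
    sql = """
        SELECT emplid, name FROM dw_student
        WHERE term = :term AND department = :department
          AND career = 'GRAD' AND presently_registered = 1
        ORDER BY name
    """
    return conn.execute(sql, {"term": term, "department": department}).fetchall()


print(registered_grads(conn, "Fall 2005", "History"))
```

Binding the prompted values as parameters keeps the query text fixed, which fits users who can run a query but not edit it.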

Page 28: Queries

• Most academic departments have “run only” query security

• Most changes to existing queries or new queries are handled by IR

Page 29: Documentation

• What is the Data Warehouse?

• Terms in the Data Warehouse

• Data Warehouse Translation Values

• Data Warehouse Public Queries Log

• Data Warehouse Data Dictionary

• Running a Query

• Downloading Query Results to a CSV Text File

• Printing labels using PeopleSoft Query

• Creating Pivot Tables in Microsoft Excel

Page 30: Once it’s done, it’s not done

• Training

• Documentation

• User Evaluation

• New queries

• Support

• Tweaking

Page 31: What Else?

• Added a table for degrees awarded to the Student Data Warehouse in Summer 2004

• Graduate Admissions Data Warehouse implemented in Summer 2004

• Undergraduate Admissions Data Warehouse being implemented in January 2006

Page 32: Student Demographics

Data Warehouse Table: UADW_STDNT_DEMO

PeopleSoft Tables (16 Tables): STDNT_CAR_TERM, PERSONAL_DATA, NAMES, CITIZENSHIP, TERM_TBL, STDNT_ENRL, ACAD_PROG, RESIDENCY_OFF, CITIZEN_PSSPRT, STDNT_GRPS_HIST, VISA_PMT_DATA, VISA_PERMIT_TBL, COUNTRY_TBL, STATE_NAMES_TBL, UASA_CSC_TBL, XLAT

Page 33: Student Addresses

Data Warehouse Table: UADW_STDNT_ADDR

PeopleSoft Tables (7 Tables): STDNT_CAR_TERM, ACAD_PROG, PERSONAL_PHONE, PERSONAL_DATA, TERM_TBL, ADDRESSES, EMAIL_ADDRESSES

Page 34: Student Academic Info

Data Warehouse Table: UADW_STDNT_ACAD

PeopleSoft Tables (21 Tables): STDNT_CAR_TERM, TERM_TBL, ADM_APPL_PROG, ACAD_STDNT_ACTN, ACAD_PROG, ACAD_PLAN, ACAD_SUBPLAN, ADM_APPL_DATA, STDNT_ENRL, ENRL_REQ_DETAIL, ACAD_PROG_TBL, STDNT_ADVR_HIST, UASR_AVN_TBL, ACAD_PLAN_TBL, XLAT, UASR_STDNT_STAT, EXT_ORG_TBL, EXT_DEGREE, ACAD_PROG_OWNER, ACAD_PLAN_OWNER, NAMES

Page 35: Student Attributes

Data Warehouse Table: UADW_STDNT_ATTR

PeopleSoft Tables (4 Tables): STDNT_CAR_TERM, ACAD_PROG, STDNT_ATTR_DTL, TERM_TBL

Page 36: Student Groups

Data Warehouse Table: UADW_STDNT_GRPS

PeopleSoft Tables (4 Tables): STDNT_CAR_TERM, ACAD_PROG, STDNT_GRPS_HIST, TERM_TBL

Page 37: Student Enrollments

Data Warehouse Table: UADW_STDNT_ENRL

PeopleSoft Tables (6 Tables): STDNT_CAR_TERM, ACAD_PROG, TERM_TBL, STDNT_ENRL, ENRL_REQ_DETAIL, CLASS_TBL

Page 38: Course Characteristics

Data Warehouse Table: UADW_COURSE

PeopleSoft Tables (8 Tables): CLASS_TBL, CRSE_OFFER, CLASS_INSTR, CRSE_ATTRIBUTES, NAMES, PERS_NID, TERM_TBL, CRSE_CATALOG

Page 39: Statistics

UADW_STDNT_DEMO: 50,802

UADW_STDNT_ADDR: 50,802

UADW_STDNT_ACAD: 227,843

UADW_STDNT_ENRL: 366,789

UADW_COURSE: 39,292

* As of September 2005

Page 40: But what about ??

• Official ‘third week’ data

• Financial Aid data

• Faculty/Staff data

Page 41: QUESTIONS?

Page 42: Contacts

Marybeth Salmon
Director, University Applications Development
University at Albany, State University of New York
E-mail: [email protected]

Shahnaz Sadeghi
Technical Lead – Data Warehouse
PeopleSoft Administrator
University at Albany, State University of New York
E-mail: [email protected]

Wendell Lorang
Director of Institutional Research
University at Albany, State University of New York
E-mail: [email protected]