education leadership data analytics keynote presentation

113
Education Leadership Data Analytics: Integrating Education Leadership, Data Science, and Evidence-based Improvement Cycles in Schools Alex J. Bowers, Ph.D. Associate Professor of Education Leadership Teachers College, Columbia University New York, New York, USA http://www.tc.columbia.edu/faculty/ab3764/ http://www.tc.columbia.edu/elda/ Twitter: @Alex_J_Bowers ORCiD ID: 0000-0002-5140-6428 Any opinions, findings, and conclusions or recommendations are those of the author and do not necessarily reflect the views of funding agencies NSF EHR DGE #1560720 NSF CISE IIS #1546653 NSF EHR DRL #1444621

Upload: others

Post on 06-Feb-2022

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Education Leadership Data Analytics Keynote Presentation

Education Leadership Data Analytics: Integrating Education Leadership, Data Science, and Evidence-based

Improvement Cycles in Schools

Alex J. Bowers, Ph.D.

Associate Professor of Education Leadership

Teachers College, Columbia University

New York, New York, USA

http://www.tc.columbia.edu/faculty/ab3764/

http://www.tc.columbia.edu/elda/

Twitter: @Alex_J_Bowers

ORCiD ID: 0000-0002-5140-6428

Any opinions, findings, and conclusions

or recommendations are those of the

author and do not necessarily reflect

the views of funding agencies

NSF EHR DGE #1560720

NSF CISE IIS #1546653

NSF EHR DRL #1444621

Page 2: Education Leadership Data Analytics Keynote Presentation

Overview: Education Leadership Data Analytics (ELDA)

• AI in education, data analytics, and education data science. Hype versus reality.• What is Education Leadership Data Analytics (ELDA) and why should anyone care?

• Data science, and education leadership training• Bowers, A.J., Bang, A., Pan, Y., Graves, K.E. (2019) Education Leadership Data Analytics (ELDA): A White Paper Report on the 2018 ELDA

Summit. Teachers College, Columbia University: New York, NY. https://doi.org/10.7916/d8-31a0-pt97

• Bowers, A.J. (2017) Quantitative Research Methods Training in Education Leadership and Administration Preparation Programs as Disciplined

Inquiry for Building School Improvement Capacity. Journal of Research on Leadership Education, 12(1), p. 72-96. http://doi.org/10.7916/D8K93H16

• Agasisti, T., Bowers, A.J. (2017) Data Analytics and Decision-Making in Education: Towards the Educational Data Scientist as a Key Actor in

Schools and Higher Education Institutions. In Johnes, G., Johnes, J., Agasisti, T., López-Torres, L. (Eds.) Handbook of Contemporary Education

Economics (184-210). Cheltenham, UK: Edward Elgar Publishing. https://doi.org/10.7916/D8PR95T2

• What components of a statewide longitudinal data system are needed for that system to best meet the

respective needs of superintendent, principal and teacher leaders?• Interaction and small group discussion

• Pattern visualization of student course interactions:• Lee, J., Recker, M., Bowers, A.J., Yuan, M. (2016). Hierarchical Cluster Analysis Heatmaps and Pattern Analysis: An Approach for Visualizing

Learning Management System Interaction Data. A poster presented at the annual International Conference on Educational Data Mining (EDM),

Raleigh, NC: June 2016. http://www.educationaldatamining.org/EDM2016/proceedings/paper_34.pdf

• Calculating predictor accuracy for early warning systems• Bowers, A.J., Sprott, R., Taff, S.A. (2013) Do we Know Who Will Drop Out? A Review of the Predictors of Dropping out of High School: Precision,

Sensitivity and Specificity. The High School Journal.96(2), 77-100. http://hdl.handle.net/10022/AC:P:21258

• Bowers, A.J., Zhou, X. (2019) Receiver Operating Characteristic (ROC) Area Under the Curve (AUC): A Diagnostic Measure for Evaluating the

Accuracy of Predictors of Education Outcomes. Journal of Education for Students Placed at Risk. Open Access R Markdown:

https://doi.org/10.7916/D8K94RDD

Understanding How Educators Use Instructional Data Warehouses (IDWs) (or not) NSF collaboration:

• Building community and capacity in data use: Nassau BOCES and Teachers College collaborative• Building Community and Capacity for Data-Intensive Evidence-Based Decision Making in Schools and Districts. National Science Foundation, NSF

DGE-1560720: http://www.nsf.gov/awardsearch/showAward?AWD_ID=1560720

• Do teachers and administrators have different perceptions of data use in their schools and IDW use?• https://www.tc.columbia.edu/articles/2020/january/how-education-data-can-help-teachers-raise-their-game/

Page 3: Education Leadership Data Analytics Keynote Presentation

Growing interest in AI in Education

Page 4: Education Leadership Data Analytics Keynote Presentation

$ B

illio

ns

Year

Record Yearly US Venture Capital Investment in Educational Technology Companies

2019 was the

highest on record

$1.7 Billion

Source:

EdSurge Jan 15, 2020

EdSurge Dec 19, 2017

Page 5: Education Leadership Data Analytics Keynote Presentation

But what is AI in Education?

Page 6: Education Leadership Data Analytics Keynote Presentation

Robots looking past chalk boards?

But what is AI in Education?

Page 7: Education Leadership Data Analytics Keynote Presentation

StatisticsData

ModelOutcome

The Difference between Statistics and Machine Learning

(Known beforehand)

(Identified at the end)

Page 8: Education Leadership Data Analytics Keynote Presentation

StatisticsData

ModelOutcome

Machine

Learning

Data

OutcomeModel

The Difference between Statistics and Machine Learning

(Known beforehand)

(Identified at the end)

(Known beforehand)

(Identified at the end)

Page 9: Education Leadership Data Analytics Keynote Presentation

StatisticsData

ModelOutcome

Machine

Learning

Data

OutcomeModel

The Difference between Statistics and Machine Learning

Lots more Data

Predict

Outcome

(Known beforehand)

(Identified at the end)

(Known beforehand)

(Identified at the end) This is the Power of AI.

Page 10: Education Leadership Data Analytics Keynote Presentation

Chen, Wenlin, "Learning with Scalability and Compactness" (2016). Engineering and Applied Science Theses & Dissertations. 155. https://openscholarship.wustl.edu/eng_etds/155

Rapid improvements over the last few years in machine learning and deep learning neural networks driven initially through access to ImageNet datasets

Page 11: Education Leadership Data Analytics Keynote Presentation

76% probability cat (2017)

The Change in Easy to Implement Neural Networks for Image Recognition Over Just 1.5 years

Data stored in local driveAnalysis done on local machine (bare metal)

Page 12: Education Leadership Data Analytics Keynote Presentation

76% probability cat (2017)Analysis on my computer

98% probability frog (2019) Data stored in the cloudAnalysis done with Docker, Keras, Tensorflow

The Change in Easy to Implement Neural Networks for Image Recognition Over Just 1.5 years

Page 13: Education Leadership Data Analytics Keynote Presentation

The problem

• Image recognition is something humans do well already.

• What about AI for things we’re not very good at? Such as outcome prediction in education?

• This talk:

• Bringing together learning analytics, machine learning, and education leadership for evidence-based improvement

• K-12 education dropout risk prediction

• Different types of dropouts? Typology analytics

• Data dashboards, educator data use, and school improvement

Page 14: Education Leadership Data Analytics Keynote Presentation

Early Warning Prediction/Dashboards in K-12 Education

Panorama Illuminate

Infinite Campus

Page 15: Education Leadership Data Analytics Keynote Presentation

https://doi.org/10.1177/2332858416685534

One school district

65,000 students

670 teachers

73 schools

Relationship of Instructional Data Warehouse (IDW) “Instructional Clicks” to Student Achievement

Weak relationship with elementary reading

No relationship with elementary or junior high math, or junior high reading over 3 years

Page 16: Education Leadership Data Analytics Keynote Presentation

https://files.eric.ed.gov/fulltext/ED573814.pdf https://doi.org/10.1080/19345747.2019.1615156

Early Warning System Dashboard Randomized Controlled ExperimentsMain Finding = No effect

2017

2019

Page 17: Education Leadership Data Analytics Keynote Presentation

https://ies.ed.gov/ncee/edlabs/regions/midwest/pdf/REL_2016207.pdf

Machine Learning for High School Predictors of GraduationModels not generalizable = Must re-analyze in three different districts

“In short, the best indicators of failure to graduate on time for one district and grade level may not be the best for other districts and grade levels”

Page 18: Education Leadership Data Analytics Keynote Presentation

Issues in Education Predictive Analytics & Early Warning Systems• Early Warning Systems (EWS) and Indicators (EWI) randomized controlled trials have to date

shown little effect on student persistence.– Admittedly, the evidence is sparse.

• Early evidence suggests that machine learning prediction algorithms may not generalize across contexts well.

• To date, interventions that target student K-12 persistence don’t show much impact (Freeman & Simonsen, 2015)

• So what does a data analyst in education do?

Central issues for Education Leadership Data Analytics (ELDA) (and today’s talk):

• Build capacity of people who can talk to both the machines and to people: Education Data Science

• Build stronger collaborations between dashboard and analytics providers and educators, figuring out what data will actually support instructional improvement.

• Increase the accuracy of prediction and dashboard systems.

• Compare what educators say they want with data versus what they actually click on.

• Build collaborations between education systems, researchers, and analytics vendors.

Page 19: Education Leadership Data Analytics Keynote Presentation

EducationLeadership

Data Science &Data Analytics

Evidence-BasedImprovement

Cycles

Education Leadership Data Analytics (ELDA)

Page 20: Education Leadership Data Analytics Keynote Presentation
Page 21: Education Leadership Data Analytics Keynote Presentation

We have reason to fear that the multitude of books which grows every day in a

prodigious fashion will make the following centuries fall into a state as barbarous

as that of the centuries that followed the fall of the Roman Empire.

- Adrien Baillet, 1685

Adrien Baillet, Jugemens des sçavans sur les principaux ouvrages des auteurs (Paris,1685), I, avertissement au lecteur, sig. avij

verso; see Françoise Waquet, “Pour une éthique de la réception: les Jugemens des livres en général d’Adrien Baillet (1685),”

XVIIe siècle, 159 (1988), 157-74.

Cited in: Blair, Ann. 2003. Reading strategies for coping with information overload, ca.1550-1700. Journal of the History of Ideas

64, no. 1: 11-28. http://doi.org/10.2307/3654293

Worries about too much information are not new!

Page 22: Education Leadership Data Analytics Keynote Presentation

Schutt, R., & O'Neil, C. (2013). Doing Data Science:

Straight Talk from the Frontline. Cambridge, MA:

O'Reilly.

You’re dealing with Big Data when you’re

working with data that doesn’t fit into your

computer unit. Note that makes it an evolving

definition: Big Data has been around for a long

time. . . . Today, Big Data means working with

data that doesn’t fit in one computer. p.24

Big Data:• Volume – Scale of Data

• Variety – Different forms of Data

• Velocity – Analysis of Data

• Veracity – Uncertainty of Data

Page 23: Education Leadership Data Analytics Keynote Presentation

“Big” Analysis in Education Leadership

Page 24: Education Leadership Data Analytics Keynote Presentation

Schools have used algorithms

for a long time

Page 25: Education Leadership Data Analytics Keynote Presentation

What is Data Science?

Page 26: Education Leadership Data Analytics Keynote Presentation

Drew Conway ODSC East 2018 – “Data scientists are people who can write on glass backwards”

What is Data Science?

Page 27: Education Leadership Data Analytics Keynote Presentation

Schutt, R., & O'Neil, C. (2013). Doing Data Science:

Straight Talk from the Frontline. Cambridge, MA:

O'Reilly.

A data scientist is someone who knows how to

extract meaning from and interpret data, which

requires both tools and methods from statistics

and machine learning, as well as being

human… Once she gets the data into shape, a

crucial part is exploratory data analysis, which

combines visualization and data sense… She’ll

communicate with team members, engineers,

and leadership in clear language and with data

visualizations so that even if her colleagues are

not immersed in the data themselves, they will

understand the implications. p.16

Page 28: Education Leadership Data Analytics Keynote Presentation

https://www.quora.com/What-do-you-do-as-a-data-scientist?no_redirect=1

The 26 Steps of Doing Data Science

Page 29: Education Leadership Data Analytics Keynote Presentation

Statistician

Data Scientist

http://flowingdata.com/2013/12/18/data-scientist-surpasses-statistician-on-google-trends/

December 18, 2013

Data scientist surpasses statistician on Google Trends

Page 30: Education Leadership Data Analytics Keynote Presentation
Page 31: Education Leadership Data Analytics Keynote Presentation
Page 32: Education Leadership Data Analytics Keynote Presentation

http://tc.columbia.edu/cps/evidence

Page 33: Education Leadership Data Analytics Keynote Presentation

Data Science & Analytics in Leadership Preparation

Bowers, A.J. (2017) Quantitative Research Methods Training in Education Leadership and Administration

Preparation Programs as Disciplined Inquiry for Building School Improvement Capacity. Journal of Research

on Leadership Education, 12(1), p. 72-96.

Journal Version: http://doi.org/10.1177/1942775116659462

Open Access Version: http://doi.org/10.7916/D8K93H16

Page 34: Education Leadership Data Analytics Keynote Presentation

Training School System Leaders for Four Different Data Analytic Roles

Bowers, A.J. (2017) Quantitative Research Methods Training in Education Leadership and Administration Preparation Programs as

Disciplined Inquiry for Building School Improvement Capacity. Journal of Research on Leadership Education, 12(1), p. 72-96.

http://doi.org/10.1177/1942775116659462

Page 35: Education Leadership Data Analytics Keynote Presentation

EducationLeadership

Data Science &Data Analytics

Evidence-BasedImprovement

Cycles

Education Leadership Data Analytics (ELDA)

Page 36: Education Leadership Data Analytics Keynote Presentation

EducationLeadership

Data Science &Data Analytics

Evidence-BasedImprovement

Cycles

Education Leadership Data Analytics (ELDA)

Make visible data that have heretofore gone unseen, unnoticed,

and therefore unactionable. (p.ix) Bienkowski, Feng, Means, (2012)

Page 37: Education Leadership Data Analytics Keynote Presentation

Bowers, A.J., Bang, A., Pan, Y., Graves, K.E. (2019) Education Leadership Data Analytics (ELDA): A

White Paper Report on the 2018 ELDA Summit. Teachers College, Columbia University: New York, NY.

https://doi.org/10.7916/d8-31a0-pt97

Education Leadership Data Analytics Summit 2018 White Paper Report

April Bang

Yilin Pan

Kenny Graves

Page 38: Education Leadership Data Analytics Keynote Presentation

Education Leadership Data Analytics Summit 2018 White Paper Report

ELDA Definition:

Education Leadership Data Analytics (ELDA) practitioners work collaboratively with

schooling system leaders and teachers to analyze, pattern, and visualize previously

unknown patterns and information from the vast sets of data collected by schooling

organizations, and then integrate findings in easy to understand language and

digital tools into collaborative and community-building evidence-based improvement

cycles with stakeholders.

Identified Needs/Challenges:

• Collaborative partnerships with schooling organizations

• Capacity-building and training infrastructure

• Focus on equity and algorithmic fairness

• Data privacy and security

• Open and accessible data and tools using F.A.I.R. data standards

Page 39: Education Leadership Data Analytics Keynote Presentation
Page 40: Education Leadership Data Analytics Keynote Presentation

Agasisti, T., Bowers, A.J. (2017) Data Analytics and Decision-Making in Education: Towards the Educational

Data Scientist as a Key Actor in Schools and Higher Education Institutions. In Johnes, G., Johnes, J., Agasisti,

T., López-Torres, L. (Eds.) Handbook of Contemporary Education Economics (184-210). Cheltenham, UK:

Edward Elgar Publishing. https://doi.org/10.7916/D8PR95T2

Data Analytics & Education Data Science

Tommaso Agasisti

Politecnico di Milano, Italy

Page 41: Education Leadership Data Analytics Keynote Presentation

Agasisti, T., Bowers, A.J. (2017) Data Analytics and Decision-Making in Education: Towards the Educational Data Scientist as a Key Actor in

Schools and Higher Education Institutions. In Johnes, G., Johnes, J., Agasisti, T., López-Torres, L. (Eds.) Handbook of Contemporary Education

Economics (184-210). Cheltenham, UK: Edward Elgar Publishing. https://doi.org/10.7916/D8PR95T2

Page 42: Education Leadership Data Analytics Keynote Presentation

Potential recommendations or

decisions with algorithms:

The code must be open

access! Taxpayers paid for it.

Ca

utio

nC

au

tio

nC

au

tio

nAnalysis

Ethics

Agasisti, T., Bowers, A.J. (2017) Data Analytics and Decision-Making in Education: Towards the Educational Data Scientist as a Key Actor in

Schools and Higher Education Institutions. In Johnes, G., Johnes, J., Agasisti, T., López-Torres, L. (Eds.) Handbook of Contemporary Education

Economics (184-210). Cheltenham, UK: Edward Elgar Publishing. https://doi.org/10.7916/D8PR95T2

Page 43: Education Leadership Data Analytics Keynote Presentation

Overview: Education Leadership Data Analytics (ELDA)

• AI in education, data analytics, and education data science. Hype versus reality.• What is Education Leadership Data Analytics (ELDA) and why should anyone care?

• Data science, and education leadership training• Bowers, A.J., Bang, A., Pan, Y., Graves, K.E. (2019) Education Leadership Data Analytics (ELDA): A White Paper Report on the 2018 ELDA

Summit. Teachers College, Columbia University: New York, NY. https://doi.org/10.7916/d8-31a0-pt97

• Bowers, A.J. (2017) Quantitative Research Methods Training in Education Leadership and Administration Preparation Programs as Disciplined

Inquiry for Building School Improvement Capacity. Journal of Research on Leadership Education, 12(1), p. 72-96. http://doi.org/10.7916/D8K93H16

• Agasisti, T., Bowers, A.J. (2017) Data Analytics and Decision-Making in Education: Towards the Educational Data Scientist as a Key Actor in

Schools and Higher Education Institutions. In Johnes, G., Johnes, J., Agasisti, T., López-Torres, L. (Eds.) Handbook of Contemporary Education

Economics (184-210). Cheltenham, UK: Edward Elgar Publishing. https://doi.org/10.7916/D8PR95T2

• What components of a statewide longitudinal data system are needed for that system to best meet the

respective needs of superintendent, principal and teacher leaders?• Interaction and small group discussion

• Pattern visualization of student course interactions:• Lee, J., Recker, M., Bowers, A.J., Yuan, M. (2016). Hierarchical Cluster Analysis Heatmaps and Pattern Analysis: An Approach for Visualizing

Learning Management System Interaction Data. A poster presented at the annual International Conference on Educational Data Mining (EDM),

Raleigh, NC: June 2016. http://www.educationaldatamining.org/EDM2016/proceedings/paper_34.pdf

• Calculating predictor accuracy for early warning systems• Bowers, A.J., Sprott, R., Taff, S.A. (2013) Do we Know Who Will Drop Out? A Review of the Predictors of Dropping out of High School: Precision,

Sensitivity and Specificity. The High School Journal.96(2), 77-100. http://hdl.handle.net/10022/AC:P:21258

• Bowers, A.J., Zhou, X. (2019) Receiver Operating Characteristic (ROC) Area Under the Curve (AUC): A Diagnostic Measure for Evaluating the

Accuracy of Predictors of Education Outcomes. Journal of Education for Students Placed at Risk. Open Access R Markdown:

https://doi.org/10.7916/D8K94RDD

Understanding How Educators Use Instructional Data Warehouses (IDWs) (or not) NSF collaboration:

• Building community and capacity in data use: Nassau BOCES and Teachers College collaborative• Building Community and Capacity for Data-Intensive Evidence-Based Decision Making in Schools and Districts. National Science Foundation, NSF

DGE-1560720: http://www.nsf.gov/awardsearch/showAward?AWD_ID=1560720

• Do teachers and administrators have different perceptions of data use in their schools and IDW use?• https://www.tc.columbia.edu/articles/2020/january/how-education-data-can-help-teachers-raise-their-game/

Page 44: Education Leadership Data Analytics Keynote Presentation

What do teachers, principals and superintendents say they want in a

longitudinal data system?

Brocato, K., Willis, C., Dechert, K. (2014) Longitudinal Data Use: Ideas for

District, Building and Classroom Leaders. In A.J. Bowers, A.R. Shoho, B. G.

Barnett (Eds.) Using Data in Schools to Inform Leadership and Decision

Making (p.97-120). Charlotte, NC: Information Age Publishing Inc.

Page 45: Education Leadership Data Analytics Keynote Presentation

Brocato, K., Willis, C., Dechert, K. (2014) Longitudinal

Data Use: Ideas for District, Building and Classroom

Leaders. In A.J. Bowers, A.R. Shoho, B. G. Barnett

(Eds.) Using Data in Schools to Inform Leadership and

Decision Making (p.97-120). Charlotte, NC:

Information Age Publishing Inc.

Page 46: Education Leadership Data Analytics Keynote Presentation

Brocato, K., Willis, C., Dechert, K. (2014) Longitudinal

Data Use: Ideas for District, Building and Classroom

Leaders. In A.J. Bowers, A.R. Shoho, B. G. Barnett

(Eds.) Using Data in Schools to Inform Leadership and

Decision Making (p.97-120). Charlotte, NC:

Information Age Publishing Inc.

What components of a statewide longitudinal

data system are needed for that system to best

meet the respective needs of superintendent,

principal and teacher leaders?

Page 47: Education Leadership Data Analytics Keynote Presentation

Superintendents

TeachersPrincipals

Longitudinal Data System

What components of a longitudinal data system are needed for that

system to best meet the respective needs of superintendent, principal,

and teacher leaders?

Page 48: Education Leadership Data Analytics Keynote Presentation

Discuss with One or Two People Around You…

Imagine the average teacher in your schools and channel their thoughts…

1. If you had a single webpage only you could visit to get quick information about students or schools, what information would the page contain?

2. In an ideal world, what information should the longitudinal data tool provide to teachers to help them make better decisions?

Repeat for the average principal and then repeat again for the Superintendent.

Page 49: Education Leadership Data Analytics Keynote Presentation

Discuss your responses with the people around you:

1. What were the similarities and differences between your anwers for teachers?

2. What about for principals?

3. What about for the superintendent?

4. If you were to think about a Venn Diagram of your answers.

- What are the overlaps?

- What is unique for the superintendent, principals or teachers?

Discuss with the People Around You…

Page 50: Education Leadership Data Analytics Keynote Presentation

Brocato, K., Willis, C., Dechert, K. (2014) Longitudinal

Data Use: Ideas for District, Building and Classroom

Leaders. In A.J. Bowers, A.R. Shoho, B. G. Barnett

(Eds.) Using Data in Schools to Inform Leadership and

Decision Making (p.97-120). Charlotte, NC:

Information Age Publishing Inc.

What components of a statewide longitudinal

data system are needed for that system to best

meet the respective needs of superintendent,

principal and teacher leaders?

Page 51: Education Leadership Data Analytics Keynote Presentation

Superintendents (n=55)

Individual Student/Teacher

• Teacher data: performance, attendance, past experience

• Who rides the bus

• Record of parent conferences

• Test scores for students as well as whether or not they have met growth

Comparative

• All district’s comparative test data, district demographics

• Access test data, budget information, comparing per pupil expenditures, demographic data for the local area as compared to the region and state

• Student growth from year to year, demographic group growth

Page 52: Education Leadership Data Analytics Keynote Presentation

Principals (n=45)

Teacher Data

• Teachers’ past performance and years of experience

• Background of teachers and records showing student test scores broken down by teacher

• Student and teacher achievement and growth data, teacher evaluation information, student attendance and dropout data, staff attendance, accountability information

Student Data

• Achievement information on teachers based on student performance

• Background of teachers being hired; teacher performance data

• Are we able to keep high performing teachers in our district?

Page 53: Education Leadership Data Analytics Keynote Presentation

Teachers (n=193)

Brocato, Willis & Dechert (2014)

Page 54: Education Leadership Data Analytics Keynote Presentation

Figure 1. Summary of eight themes and

corresponding frequency percentage in the

“more frequent” level of category

What components of a longitudinal data system are needed for that system to

best meet the respective needs of superintendent, principal, and teacher

leaders?

Brocato, K., Willis, C., Dechert, K. (2014) Longitudinal Data Use: Ideas for District,

Building and Classroom Leaders. In A.J. Bowers, A.R. Shoho, B. G. Barnett (Eds.)

Using Data in Schools to Inform Leadership and Decision Making (p.97-120). Charlotte,

NC: Information Age Publishing Inc.

Page 55: Education Leadership Data Analytics Keynote Presentation

Overview: Education Leadership Data Analytics (ELDA)

• AI in education, data analytics, and education data science. Hype versus reality.• What is Education Leadership Data Analytics (ELDA) and why should anyone care?

• Data science, and education leadership training• Bowers, A.J., Bang, A., Pan, Y., Graves, K.E. (2019) Education Leadership Data Analytics (ELDA): A White Paper Report on the 2018 ELDA

Summit. Teachers College, Columbia University: New York, NY. https://doi.org/10.7916/d8-31a0-pt97

• Bowers, A.J. (2017) Quantitative Research Methods Training in Education Leadership and Administration Preparation Programs as Disciplined

Inquiry for Building School Improvement Capacity. Journal of Research on Leadership Education, 12(1), p. 72-96. http://doi.org/10.7916/D8K93H16

• Agasisti, T., Bowers, A.J. (2017) Data Analytics and Decision-Making in Education: Towards the Educational Data Scientist as a Key Actor in

Schools and Higher Education Institutions. In Johnes, G., Johnes, J., Agasisti, T., López-Torres, L. (Eds.) Handbook of Contemporary Education

Economics (184-210). Cheltenham, UK: Edward Elgar Publishing. https://doi.org/10.7916/D8PR95T2

• What components of a statewide longitudinal data system are needed for that system to best meet the

respective needs of superintendent, principal and teacher leaders?• Interaction and small group discussion

• Pattern visualization of student course interactions:• Lee, J., Recker, M., Bowers, A.J., Yuan, M. (2016). Hierarchical Cluster Analysis Heatmaps and Pattern Analysis: An Approach for Visualizing

Learning Management System Interaction Data. A poster presented at the annual International Conference on Educational Data Mining (EDM),

Raleigh, NC: June 2016. http://www.educationaldatamining.org/EDM2016/proceedings/paper_34.pdf

• Calculating predictor accuracy for early warning systems• Bowers, A.J., Sprott, R., Taff, S.A. (2013) Do we Know Who Will Drop Out? A Review of the Predictors of Dropping out of High School: Precision,

Sensitivity and Specificity. The High School Journal.96(2), 77-100. http://hdl.handle.net/10022/AC:P:21258

• Bowers, A.J., Zhou, X. (2019) Receiver Operating Characteristic (ROC) Area Under the Curve (AUC): A Diagnostic Measure for Evaluating the

Accuracy of Predictors of Education Outcomes. Journal of Education for Students Placed at Risk. Open Access R Markdown:

https://doi.org/10.7916/D8K94RDD

Understanding How Educators Use Instructional Data Warehouses (IDWs) (or not) NSF collaboration:

• Building community and capacity in data use: Nassau BOCES and Teachers College collaborative• Building Community and Capacity for Data-Intensive Evidence-Based Decision Making in Schools and Districts. National Science Foundation, NSF

DGE-1560720: http://www.nsf.gov/awardsearch/showAward?AWD_ID=1560720

• Do teachers and administrators have different perceptions of data use in their schools and IDW use?• https://www.tc.columbia.edu/articles/2020/january/how-education-data-can-help-teachers-raise-their-game/

Page 56: Education Leadership Data Analytics Keynote Presentation

Variables

Pers

on

s

Cattell’s Data Box (1966)

Cattell, R. B. (1966). The data box: its ordering of total resources in terms of possible relational

systems. 67-128. Handbook of Multivariate Experimental Psychology.

Boker, Steven (2017). Dynamical systems analysis in the context of statistical methods and research

design. Keynote address, Modern Modeling Methods Conference, Storrs CT: May 22, 2017.

Page 57: Education Leadership Data Analytics Keynote Presentation

Variables

Pers

on

s

CliniciansOccasions x Variables

“diagnosis”

Page 58: Education Leadership Data Analytics Keynote Presentation

Variables

Pers

on

s

Variables

Pers

on

s

CliniciansOccasions x Variables

“diagnosis”

EducatorsPersons x Occasions

“growth”

Page 59: Education Leadership Data Analytics Keynote Presentation

Variables

Pers

on

s

Variables

Pers

on

s

Variables

Pers

on

s

CliniciansOccasions x Variables

“diagnosis”

EducatorsPersons x Occasions

“growth”

PolicymakersPersons x Variables

“interventions”

Page 60: Education Leadership Data Analytics Keynote Presentation

Lee, J., Recker, M., Bowers, A.J., Yuan, M. (2016). Hierarchical Cluster Analysis Heatmaps and Pattern

Analysis: An Approach for Visualizing Learning Management System Interaction Data. A poster presented at

the annual International Conference on Educational Data Mining (EDM), Raleigh, NC: June 2016.

http://www.educationaldatamining.org/EDM2016/proceedings/paper_34.pdf

Page 61: Education Leadership Data Analytics Keynote Presentation

• Canvas Learning Management System LMS Data

• Mid-sized University

• Undergrad freshman mathematics course, taught completely online as a required course

• n=139 students

• Clickstream LMS logfile data

• Features are # of pageviews

Method:Hierarchical Cluster Analysis (HCA) Heatmaps

Clusters together students with similar longitudinal data patterns and visualizes similarities and differences Bowers (2007, 2010)

Page 62: Education Leadership Data Analytics Keynote Presentation

Cluster Analysis Heatmaps of Canvas Interaction DataPattern by Content

Student are rows

Course features

are columns

Annotation column is final grade:

from red (high) to dark (low)

Page 63: Education Leadership Data Analytics Keynote Presentation

Cluster Analysis Heatmaps of Canvas Interaction DataPattern by Content Pattern by Number of Interactions per Week

Student are rows

Course features

are columns

Page 64: Education Leadership Data Analytics Keynote Presentation

Cluster Analysis Heatmaps of Canvas Interaction DataPattern by Content Pattern by Number of Interactions per Week

Page 65: Education Leadership Data Analytics Keynote Presentation

Accurately Predicting K-12 Schooling Outcomes

• Dropout risk– Accuracy, precision, sensitivity & specificity

• Early Warning Indicators (EWI) / Early Warning Systems (EWS)

• Data driven decision making (3DM)

• Targeted and data-informed resource allocation to improve schooling outcomes

Ryan Sprott

Sherry Taff

Xiaoliang Zhou

Bowers, A.J., Sprott, R., Taff, S.A. (2013) Do we Know Who Will Drop Out? A Review of the Predictors of

Dropping out of High School: Precision, Sensitivity and Specificity. The High School Journal, 96(2), 77-100.

Journal Version: 10.1353/hsj.2013.0000 Open Access Version: https://doi.org/10.7916/D86W9N4X

Bowers, A.J., Sprott, R. (2012) Examining the Multiple Trajectories Associated with Dropping Out of High

School: A Growth Mixture Model Analysis. The Journal of Educational Research, 105(3), 176-195.

Journal Version: 10.1080/00220671.2011.552075 Open Access Version: https://doi.org/10.7916/D8FR06R0

Bowers, A.J., Zhou, X. (2019) Receiver Operating Characteristic (ROC) Area Under the Curve (AUC): A

Diagnostic Measure for Evaluating the Accuracy of Predictors of Education Outcomes. Journal of Education for

Students Placed at Risk, 24(1) p. 20-46. Journal Version: https://doi.org/10.1080/10824669.2018.1523734

Open Access Version: https://doi.org/10.7916/d8-nc5k-3m53

Online Supplement R Code: https://doi.org/10.7916/D8K94RDD

Page 66: Education Leadership Data Analytics Keynote Presentation

Issues of Accuracy in Predicting High School Dropout

• Dropping out of high school in the U.S. is associated with a multitude of negative outcomes.

• However, other than a select number of demographic and background variables, we know little about the accuracy of current dropout predictors that are school-related.

• Current predictions accurately predict only about 50-60% of students who actually dropout.

• According to Gleason & Dynarski (2002), accurate prediction of who will dropout is a resource and efficiency issue:– A large percentage of students are mis-identified as at-risk.

– A large percentage of students who are at-risk are never identified.

• Current reporting of dropout “flags” across the literature is haphazard.– Almost none report accuracy

– Many report specificity or sensitivity, but rarely both

• A dropout flag may be highly precise, in that almost all of the students with the flag dropout, but may not be accurate since the flag may identify only a small proportion of the dropouts.

Page 67: Education Leadership Data Analytics Keynote Presentation

Re-analyzing Past Dropout Flags for Accuracy,

Precision, Sensitivity and Specificity

• Literature search.• Queried multiple databases:

– JSTOR, Google Scholar, EBSCO, Educational Full Text Wilson Web

• Studies were included that:– Were published since 1979– Examined High School dropout– Examined school-wide characteristics and included all students– A focus on the student level– Reported frequencies for re-analysis

• Initially yield 6,434 overlapping studies– 301 studies were read in full– 140 provided school-wide samples and quantifiable data– 36 articles provided enough data for accuracy re-calculations– Yield 110 separate dropout flags

• Relative Operating Characteristic (ROC) – Hits versus False-Alarms

Page 68: Education Leadership Data Analytics Keynote Presentation

Dropout Graduate

Dropout a

True-positive

(TP)

Correct

b

False-positive

(FP)

Type I Error

a+b

Graduate c

False-negative

(FN)

Type II Error

d

True-negative

(TN)

Correct

c+d

a+c b+d a+b+c+d=N

Event

Pre

dic

tor

Precision = a/(a + b) Positive Predictive Value

True-Positive Proportion = a/(a + c) Sensitivity

True-Negative Proportion = d/(b + d) Specificity

False-Positive Proportion = b/(b + d) 1-Specificity

Event table for calculating dropout contingency proportions

Fawcett (2004); Hanley & McNeil (1982); Swets (1988); Zwieg & Campbell (1993)

“Hits”

“False Alarms”

Page 69: Education Leadership Data Analytics Keynote Presentation

0

0.2

0.4

0.6

0.8

1.0

Tru

e-p

ositiv

e p

roport

ion (

Sensitiv

ity)

0 0.2 0.4 0.6 0.8 1.0

False-positive proportion (1-Specificity)

Perfect prediction

Better prediction

Worse prediction

Low attendance

An example of the true-positive proportion plotted against the false-positive

proportion for Balfanz et al. (2007) comparing the relative operating characteristics

(ROC) of each dropout flag.

Page 70: Education Leadership Data Analytics Keynote Presentation

0

0.2

0.4

0.6

0.8

1.0

Tru

e-p

ositiv

e p

roport

ion (

Sensitiv

ity)

0 0.2 0.4 0.6 0.8 1.0

False-positive proportion (1-Specificity)

Relative operating characteristics (ROC) of all dropout flags reviewed, plotted

as the true-positive proportion against the false-positive proportion. Numbers

refer to dropout indicator IDs.

Bowers, A.J., Sprott, R., Taff, S.A. (2013) Do we Know Who Will Drop Out? A Review of the Predictors of Dropping out of High School: Precision, Sensitivity and Specificity. The High School

Journal, 96(2), 77-100. Journal Version: 10.1353/hsj.2013.0000 Open Access Version: https://doi.org/10.7916/D86W9N4X

Page 71: Education Leadership Data Analytics Keynote Presentation
Page 72: Education Leadership Data Analytics Keynote Presentation

0

0.2

0.4

0.6

0.8

1.0

Tru

e-p

ositiv

e p

roport

ion (

Sensitiv

ity)

0 0.2 0.4 0.6 0.8 1.0

False-positive proportion (1-Specificity)

Relative operating characteristics (ROC) of all dropout flags reviewed, plotted

as the true-positive proportion against the false-positive proportion. Numbers

refer to dropout indicator IDs.

Bowers, A.J., Sprott, R., Taff, S.A. (2013) Do we Know Who Will Drop Out? A Review of the Predictors of Dropping out of High School: Precision, Sensitivity and Specificity. The High School Journal, 96(2), 77-100. doi:10.1353/hsj.2013.0000

Page 73: Education Leadership Data Analytics Keynote Presentation

Knowles, Jared (2015): Of Needles and Haystacks: Building an Accurate Statewide Dropout Early Warning System in

Wisconsin. Journal of Educational Data Mining, 7(3), p. 18-67.

http://www.educationaldatamining.org/JEDM/index.php/JEDM/article/view/JEDM082

Page 74: Education Leadership Data Analytics Keynote Presentation

Knowles, Jared (2015): Of Needles and Haystacks: Building an Accurate Statewide Dropout Early Warning System in

Wisconsin. Journal of Educational Data Mining, 7(3), p. 18-67.

http://www.educationaldatamining.org/JEDM/index.php/JEDM/article/view/JEDM082

Page 75: Education Leadership Data Analytics Keynote Presentation

Models Tested:

Lasso

Subset Selection

Least Squares

Generalized Linear Models

Generalized Additive Models

KNN

Trees

Bagging boosting

Support Vector Machines

Ensembles

Knowles, Jared (2015): Of Needles and Haystacks: Building an Accurate Statewide Dropout Early Warning System in

Wisconsin. Journal of Educational Data Mining, 7(3), p. 18-67.

http://www.educationaldatamining.org/JEDM/index.php/JEDM/article/view/JEDM082

Page 76: Education Leadership Data Analytics Keynote Presentation

Models Tested:

Lasso

Subset Selection

Least Squares

Generalized Linear Models

Generalized Additive Models

KNN

Trees

Bagging boosting

Support Vector Machines

Ensembles

Knowles, Jared (2015): Of Needles and Haystacks: Building an Accurate Statewide Dropout Early Warning System in

Wisconsin. Journal of Educational Data Mining, 7(3), p. 18-67.

http://www.educationaldatamining.org/JEDM/index.php/JEDM/article/view/JEDM082

Page 77: Education Leadership Data Analytics Keynote Presentation

Growth mixture model for the simultaneous estimation of latent trajectory classes

using non-cumulative GPA from the first three semesters of high school.

GPA 9S1 GPA 9S2 GPA 10S1

Intercepts Slopes

High School

Dropout

Behavior & Structure:

Student:

Extracurricular

Retained

Negative Behavior

School:

Student-Teacher Ratio

Academic Press

Small School

Large School

Extra-Large School

Demographics:

Student:

Female

African American

Asian

Hispanic

Non-Traditional Family

SES

School:

Urban

Rural

% Students Free Lunch

Latent

Trajectory

Classes

C

Bowers, A.J., Sprott, R. (2012) Examining the Multiple Trajectories Associated with Dropping Out of High School: A Growth

Mixture Model Analysis. The Journal of Educational Research, 105(3), 176-195.

Journal Version: 10.1080/00220671.2011.552075 Open Access Version: https://doi.org/10.7916/D8FR06R0

Page 78: Education Leadership Data Analytics Keynote Presentation

A:

B:

0

1

2

3

4

9S1 9S2 10S1Semester

Non-C

um

ula

tive G

PA

Mid-DecreasingLow-IncreasingMid-AchievingHigh-Achieving

0

1

2

3

4

9S1 9S2 10S1

Semester

0

1

2

3

4

9S1 9S2 10S1

0

1

2

3

4

0

1

2

3

4

9S1 9S2 10S1

Semester

0

Non

-cum

ula

tive

GP

A

1

2

3

4

9S1 9S2 10S1

Semester

0

1

2

3

4

9S1 9S2 10S1

Semester

0

1

2

3

4

9S1 9S2 10S1

0

1

2

3

4

9S1 9S2 10S1

Semester

0

1

2

3

4

9S1 9S2 10S1

Semester

0

1

2

3

4

9S1 9S2 10S1

0

1

2

3

4

9S1 9S2 10S1

Semester

Low-Increasing: 13.8%Mid-Decreasing: 10.8% Mid-Achieving: 56.5% High-Achieving: 18.9%

Dropout: 39.7% 52.1% 7.0% 1.2%

Longitudinal Non-Cumulative GPA Trajectories in the First Three

Semesters of High School

n=5,400

Bowers, A.J., Sprott, R. (2012) Examining the Multiple Trajectories Associated with Dropping Out of High School: A Growth

Mixture Model Analysis. The Journal of Educational Research, 105(3), 176-195.

Journal Version: 10.1080/00220671.2011.552075 Open Access Version: https://doi.org/10.7916/D8FR06R0

Mid-decreasing & Low-increasing accounted for:

24.6% of the sample

91.1% of the dropouts

Page 79: Education Leadership Data Analytics Keynote Presentation

Bowers, A.J. (2019) Towards Measures of Different and Useful Aspects of Schooling:

Why Schools Need Both Teacher Assigned Grades and Standardized Assessments.

In Brookhart, S., McMillian, T. (Eds.) Classroom Assessment as Educational

Measurement. National Council on Measurement in Education (NCME) Book Series.

Routledge.

Bowers, A.J. (2019) Report Card Grades and Educational Outcomes. In Guskey, T.,

Brookhart, S. (Eds.) What We Know About Grading: What Works, What Doesn't, and

What's Next, (p.32-56). Alexandria, VA: Association for Supervision and Curriculum

Development.

Brookhart, S., Guskey, T., Bowers, A.J., McMillan, J. Smith, L. Smith, J., Welsh, M.

(2016) One Hundred Years of Grading Research: Meaning and Value in the Most

Common Educational Measure. Review of Educational Research, 86(4), p. 803-848.

Journal: http://doi.org/10.3102/0034654316672069

Open Access: http://dx.doi.org/10.7916/D8NV9JQ0

Bowers, A.J. (2011) What’s in a Grade? The Multidimensional Nature of What

Teacher Assigned Grades Assess in High School. Educational Research & Evaluation,

17(3), 141-159. Journal: https://doi.org/10.1080/13803611.2011.597112

Open Access: https://doi.org/10.7916/D8WM1QF6

Bowers, A.J. (2009) Reconsidering Grades as Data for Decision Making: More than

Just Academic Knowledge. Journal of Educational Administration, 47(5), 609-629.

http://doi.org/10.1108/09578230910981080

Page 80: Education Leadership Data Analytics Keynote Presentation

0

0.2

0.4

0.6

0.8

1.0

Tru

e-p

ositiv

e p

roport

ion (

Sensitiv

ity)

0 0.2 0.4 0.6 0.8 1.0

False-positive proportion (1-Specificity)

Relative operating characteristics (ROC) of all dropout flags reviewed, plotted

as the true-positive proportion against the false-positive proportion. Numbers

refer to dropout indicator IDs.

Bowers, A.J., Sprott, R., Taff, S.A. (2013) Do we Know Who Will Drop Out? A Review of the Predictors of Dropping out of High School: Precision, Sensitivity and Specificity. The High School

Journal, 96(2), 77-100. Journal Version: 10.1353/hsj.2013.0000 Open Access Version: https://doi.org/10.7916/D86W9N4X

Page 81: Education Leadership Data Analytics Keynote Presentation

Bowers, A.J., Zhou, X. (2019) Receiver Operating Characteristic (ROC) Area Under the Curve (AUC): A Diagnostic Measure for

Evaluating the Accuracy of Predictors of Education Outcomes. Journal of Education for Students Placed at Risk, 24(1) p. 20-46.

Journal Version: https://doi.org/10.1080/10824669.2018.1523734

Open Access Version: https://doi.org/10.7916/d8-nc5k-3m53

Online Supplement R Code: https://doi.org/10.7916/D8K94RDD

Xiaoliang Zhou

Page 82: Education Leadership Data Analytics Keynote Presentation

Receiver Operating Characteristic (ROC) Area Under the Curve (AUC) to Evaluate the Accuracy of Predictors

for College Enrollment and Postsecondary STEM Degree

Page 83: Education Leadership Data Analytics Keynote Presentation

Contributions & Open QuestionsFor K-12 & Higher Ed Prediction Algorithms• Visualize the data and pattern analytics

– Can we see the individuals in the data and patterns?– Are there typology subgroups?

• Use accuracy comparison metrics and visualizations– ROC AUC, precision-recall, kappa

• Publish the code with example walk-throughs so that education data analysts can work with your algorithms.

• Do these accuracy metrics for predictors replicate across multiple district and state contexts?

• How to provide accurate predictors to teachers and leaders for their decision making?

Page 84: Education Leadership Data Analytics Keynote Presentation

Overview: Education Leadership Data Analytics (ELDA)

• AI in education, data analytics, and education data science. Hype versus reality.• What is Education Leadership Data Analytics (ELDA) and why should anyone care?

• Data science, and education leadership training• Bowers, A.J., Bang, A., Pan, Y., Graves, K.E. (2019) Education Leadership Data Analytics (ELDA): A White Paper Report on the 2018 ELDA

Summit. Teachers College, Columbia University: New York, NY. https://doi.org/10.7916/d8-31a0-pt97

• Bowers, A.J. (2017) Quantitative Research Methods Training in Education Leadership and Administration Preparation Programs as Disciplined

Inquiry for Building School Improvement Capacity. Journal of Research on Leadership Education, 12(1), p. 72-96. http://doi.org/10.7916/D8K93H16

• Agasisti, T., Bowers, A.J. (2017) Data Analytics and Decision-Making in Education: Towards the Educational Data Scientist as a Key Actor in

Schools and Higher Education Institutions. In Johnes, G., Johnes, J., Agasisti, T., López-Torres, L. (Eds.) Handbook of Contemporary Education

Economics (184-210). Cheltenham, UK: Edward Elgar Publishing. https://doi.org/10.7916/D8PR95T2

• What components of a statewide longitudinal data system are needed for that system to best meet the

respective needs of superintendent, principal and teacher leaders?• Interaction and small group discussion

• Pattern visualization of student course interactions:• Lee, J., Recker, M., Bowers, A.J., Yuan, M. (2016). Hierarchical Cluster Analysis Heatmaps and Pattern Analysis: An Approach for Visualizing

Learning Management System Interaction Data. A poster presented at the annual International Conference on Educational Data Mining (EDM),

Raleigh, NC: June 2016. http://www.educationaldatamining.org/EDM2016/proceedings/paper_34.pdf

• Calculating predictor accuracy for early warning systems• Bowers, A.J., Sprott, R., Taff, S.A. (2013) Do we Know Who Will Drop Out? A Review of the Predictors of Dropping out of High School: Precision,

Sensitivity and Specificity. The High School Journal.96(2), 77-100. http://hdl.handle.net/10022/AC:P:21258

• Bowers, A.J., Zhou, X. (2019) Receiver Operating Characteristic (ROC) Area Under the Curve (AUC): A Diagnostic Measure for Evaluating the

Accuracy of Predictors of Education Outcomes. Journal of Education for Students Placed at Risk. Open Access R Markdown:

https://doi.org/10.7916/D8K94RDD

Understanding How Educators Use Instructional Data Warehouses (IDWs) (or not) NSF collaboration:

• Building community and capacity in data use: Nassau BOCES and Teachers College collaborative• Building Community and Capacity for Data-Intensive Evidence-Based Decision Making in Schools and Districts. National Science Foundation, NSF

DGE-1560720: http://www.nsf.gov/awardsearch/showAward?AWD_ID=1560720

• Do teachers and administrators have different perceptions of data use in their schools and IDW use?• https://www.tc.columbia.edu/articles/2020/january/how-education-data-can-help-teachers-raise-their-game/

Page 85: Education Leadership Data Analytics Keynote Presentation

National Science

FoundationNSF DGE #1560720

Any opinions, findings, and conclusions or recommendations

are those of the author and do not necessarily reflect the views

of funding agencies

Building Community and Capacity for Data-Intensive Evidence-Based Decision

Making in Schools and Districts - Teacher Data Use Survey (TDUS)

Page 86: Education Leadership Data Analytics Keynote Presentation

A Collaborative Partnership with Nassau County BOCES to Build Community and Capacity around Data Use and Evidence-Based Improvement Cycles

Nassau County Long Island New York:

• 56 School Districts

• Over 200,000 students• 379 schools

Page 87: Education Leadership Data Analytics Keynote Presentation

Data use in schools:

• Evidence-based decision making in schools and districts is a growing area of research and practice.

• Teachers, school and district leaders have a wealth of knowledge about how they use data to inform their practice.

• Growing consensus in the research that training teachers and leaders how to use data to inform evidence-based improvement cycles can impact instructional improvement.

However:

• Much of this research does not take advantage of the complex and rich data-sources available to schools and districts.

• Teachers and administrators may disagree on what data is most accessible, useful and informative for instructional improvement.

• To date, the research has considered educator data use as a single dimension of high to low. Could there be subgroups of teacher and leader data use that may help inform professional development, evidence-based conversations and instructional improvement?

Project Background

Boudett, City, & Murnane (2013); Bowers (2017); Bowers, Shoho, & Barnett (2014), Cho & Wayman (2015); Cosner (2014); Datnow & Park (2014); Farley-Ripple & Buttram

(2015); Halverson (2010); Marsh (2012); Mandinach (2012); Schildkamp, Poortman, Luyten, & Ebbeler (2017); Wayman, Midgley, & Stringfield (2006); Wayman, Shaw,&Cho 2017

Page 88: Education Leadership Data Analytics Keynote Presentation

Purpose & Goals of the Overall Project

Purpose of the project:

To build a researcher-practitioner community around data-intensive evidence-based decision making through bringing together large and diverse school district datasets and then applying recent innovations from the data sciences to pilot and test data analytics with teachers and administrators.

Goals of this four year project:

1. Learn how data are used across districts and schools in Nassau County.

2. Understand how big data and data science techniques can help provide new opportunities for evidence-based improvement cycles.

3. Privilege the innovation in Nassau County, nationally, as a test-bed for innovation in evidence use to improve instruction.

4. Collaboratively build and test new analysis and data visualizations for schools using open-access code to address the issues educators see as most important, then release this code nationally to build a foundation for future innovations.

Page 89: Education Leadership Data Analytics Keynote Presentation

Survey:

Teachers and administrators

asking them what they say they do

with data, n=4,941.

• Four Statistically Significantly

Different Types of Responders

(LCA)

Qualitative Interviews:

40 educators from the survey.

About 10 in each of the four LCA

groups.

Clickstream Logfiles:

All clicks in the Instructional Data

Warehouse (IDW) > 100,000+ clicks

• Four Statistically Significantly

Different Types of Responders

(LCA)

Collaborative Workshop

Gather 50+ educators and 20+

education data scientists for a

2-day collaborative “datasprint”

to co-design data visualizations

that educators want using the

data that matters to them.

Four Phase Mixed Methods Collaborative Project

Today: Breakout Session 12:15-1:25:

Measuring the Mindset of Educators

Elizabeth Young, Nassau BOCES

Sarah Weeks, Teachers College

Today: Breakout Session 1:35-2:35:

Clicks in Context

Robert Feihel, Nassau BOCES

Aaron Hawn, Teachers College & Penn

Today: Breakout Session 12:25-1:25:

Working with Cognos 11 Visualizations

Jeff Davis, Nassau BOCES

Page 90: Education Leadership Data Analytics Keynote Presentation

20202018-201920172016

Grant awarded Preplanning

TDUS into online form

Meeting with pilot districts

TDUS: Teacher Data Use Survey

Pilot TDUS

Meeting with pilot

districts

All districts invited to participate

Full administration of TDUS

Data processing completed

Teacher & Administrator Interviews

2018-2019 n=40

Data analysis of how the Instructional

Data Warehouse (IDW) is used (or not)

2-Day workshop to pilot IDW tools and

build new tools in collaboration with

teachers and administrators

Publish e-book, journal articles,

R code datapipe, R code dashboards

Page 91: Education Leadership Data Analytics Keynote Presentation

Data Use Theory of Action

Data use theory of action - adapted from Ikemoto & Marsh (2007); Mandinach, Honey, Light, & Brunner

(2008); Marsh (2012); Schildkamp & Kuiper (2010); Schildkamp, Poortman, & Handelzalts (2016)

Page 92: Education Leadership Data Analytics Keynote Presentation

Data Use Theory of Action

Data use theory of action - adapted from Ikemoto & Marsh (2007); Mandinach, Honey, Light, & Brunner

(2008); Marsh (2012); Schildkamp & Kuiper (2010); Schildkamp, Poortman, & Handelzalts (2016)

This Project

Page 93: Education Leadership Data Analytics Keynote Presentation

PurposeThe purpose of this study is to investigate the extent to which there

are significantly different subgroups of responders to the Teacher Data

Use Survey (TDUS) from across 56 school districts to examine a

typology of school and district leaders, teachers and instructional

support staff perceptions of data use.

Wayman, Wilkerson, Cho, Mandinach, & Supovitz (2016)

Page 94: Education Leadership Data Analytics Keynote Presentation

Research Questions

• To what extent are there significantly different subgroups of responders to the Teacher Data Use Survey?

• To what extent are teachers, school and district administrators, and instructional support staff’s perceptions on how teacher use data associated with membership in these subgroups of responders?

• Do these different types vary by how they click in the IDW dashboard system?

• What data do educators say are most accessible and useful to their practice?

Page 95: Education Leadership Data Analytics Keynote Presentation

Teacher Data Use Survey (TDUS)

• The TDUS is created and validated by the U.S. Department of Education Institute of Education

Sciences, Wayman and colleagues (2016).

• The TDUS collects the perceptions of the data use by teachers in schools around issues of

accessibility of data, frequency of data use, usefulness of data, actions with data, competence,

attitudes, collaboration, organizational supports and leadership.

• Survey Versions: Teachers, Administrators (e.g. principals), Instructional support staff (e.g.

instructional coaches)

• Purpose: Provide a survey instrument that enables district and school leaders to learn more

about:

– How teachers use data

– Teachers attitude toward data

– Teacher perception of supports for using data

– How teacher collaborate to use data

• Survey Administration: The full survey administration was completed at the end of May 2017.

Page 96: Education Leadership Data Analytics Keynote Presentation

https://ies.ed.gov/ncee/edlabs/regions/appalachia/pdf/REL_2017166.pdf

Page 97: Education Leadership Data Analytics Keynote Presentation

Three different survey forms:

• School Leaders

• Teachers

• Support staff

https://ies.ed.gov/ncee/edlabs/regions/appalachia/pdf/REL_2017166.pdf

Page 98: Education Leadership Data Analytics Keynote Presentation

Teacher Data Use Survey (TDUS):

• Online (emailed) survey link to all teachers, administrators (school and district) and instructional support staff for the entire county.

– About 20,000 people

• Received 4,941 responses (25% response overall rate)– 3,783 full survey responses for dashboard reporting

– Used multiple imputation for analytics (FIML)

• Some districts had as high as 65% response rate, many had less than 15% response rate, and a few less than 10% response rate.

– Modal response rate by districts was 22%.

• By position:– 3,776 teachers

– 405 administrators (building-level and district)

– 760 instructional support staff

Results Reporting:

• TDUS dashboards generated overall, by district and by building– Dashboards generated in R. Open source code available by spring 2018.

• Meetings with superintendents, principals and teachers to discuss results

• Latent Class Analysis (LCA) (clustering) of the responses to ask to what extent is there one or more than one type of data user across these school districts?

Survey Administration, Sample, & Method

Bowers &Sprott (2012); Boyce & Bowers (2016); Jung & Wickrama (2008); Lanza & Collins (2012); Masyn (2003); Muthén (2002, 2004); Muthén &

Asparouhov (2002, 2008); Samuelsen & Raczynski (2013); Urick & Bowers (2014)

Page 99: Education Leadership Data Analytics Keynote Presentation

Teachers Administrators

Self-Reported Position Titles:

Page 100: Education Leadership Data Analytics Keynote Presentation
Page 101: Education Leadership Data Analytics Keynote Presentation
Page 102: Education Leadership Data Analytics Keynote Presentation

Model of the Latent Class Analysis (LCA) of Responders to Teacher

Data Use Survey (TDUS)

Page 103: Education Leadership Data Analytics Keynote Presentation

Findings

Page 104: Education Leadership Data Analytics Keynote Presentation

Four Significantly Different Subgroups of Responders to the TDUS:

Latent Class Analysis (LCA)

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

Fre

qu

ency

of

Sta

te D

ata

Use

Fre

qu

ency

of

Ben

chm

ark

Data

Use

Fre

qu

ency

of

Dis

tric

t D

ata

Use

Fre

qu

ency

of

Cla

ssro

om

Data

Use

Sta

te D

ata

Use

fuln

ess

Ben

chm

ark

Data

Use

fuln

ess

Dis

tric

t D

ata

Use

fuln

ess

Cla

ssro

om

Data

Use

fuln

ess

Act

ion

wit

h S

tate

Data

Act

ion

wit

h B

ench

mark

Da

ta

Act

ion

wit

h D

istr

ict

Data

Act

ion

wit

h C

lass

room

Data

Com

pet

ence

in

Usi

ng D

ata

Data

's E

ffec

tiven

ess

for

Ped

agogy

Att

itu

des

tow

ard

Data

Fre

qu

ency

of

Coll

ab

ora

tive

Mee

tin

gs

Coll

ab

ora

tive T

eam

Tru

st

Coll

ab

ora

tive T

eam

Act

ion

s

Su

pp

ort

for

Data

Use

Pri

nci

pal

Lea

der

ship

Com

pu

ter

Data

Syst

ems

Frequency of Data Use Data Usefulness Action with Data Attitudes

Toward Data

Collaboration Organizational Support

Aver

ag

e p

rop

ort

ion

by g

rou

p

Group 4 (24.2%)High Data Use with

Collaboration & Action

Group 2 (27.8%)Data with low collaboration

Group 1 (22.5%)Individual Classroom Focus

Group 3 (25.5%) Data Forward

Page 105: Education Leadership Data Analytics Keynote Presentation

Instructional Data Warehouse Assessment Click Actions

1

23

4

Administrators n=135

Page 106: Education Leadership Data Analytics Keynote Presentation

Instructional Data Warehouse Assessment Click Actions

1

23

4

Administrators n=135 Teachers n=147

Page 107: Education Leadership Data Analytics Keynote Presentation

Perception of Availability

Desired

(low) (high)

(lo

w)

(hig

h)

Academic Data Types Identified in Teacher Follow-up Interviews

Low/Low High/Low

Low/High High/High

Page 108: Education Leadership Data Analytics Keynote Presentation

Perception of Availability

Desire

d NY State 3-8 Math & ELA Test

Regents Exam

NYSITELL/NYSESLAT

International Baccalaureate Exams & Papers

AP Exams

Graduation

SAT or ACT

Current Grades

Prior Grades

Prior Teacher Anecdotal/Reflection - Acad

District or BOCES Benchmark

Curriculum Provided Assessment

School/Grade Level Benchmark

Teacher Observation

Teacher Created Assessment

Portfolio of Student Work

Homework

ELL Teacher Data/ Note Special Education Testing/IEP

Gross Motor/Fine Motor Development

Commercial Reading & Math Assessments:

i-Ready

NWEAMAP Math & ELA

Right Reasons Technology

Imagine Learning

QRI

STAR

Brigance

Running Records

Fountas & Pinnel

DRA

Dibbles

Castle Learning

AimsWeb

(low) (high)

(lo

w)

(hig

h)

Academic Data Types Identified in Teacher Follow-up Interviews

Page 109: Education Leadership Data Analytics Keynote Presentation

Perception of Availability

Desire

d NY State 3-8 Math & ELA Test

Regents Exam

NYSITELL/NYSESLAT

International Baccalaureate Exams & Papers

AP Exams

Graduation

SAT or ACT

Current Grades

Prior Grades

Prior Teacher Anecdotal/Reflection - Acad

District or BOCES Benchmark

Curriculum Provided Assessment

School/Grade Level Benchmark

Teacher Observation

Teacher Created Assessment

Portfolio of Student Work

Homework

ELL Teacher Data/ Note Special Education Testing/IEP

Gross Motor/Fine Motor Development

Commercial Reading & Math Assessments:

i-Ready

NWEAMAP Math & ELA

Right Reasons Technology

Imagine Learning

QRI

STAR

Brigance

Running Records

Fountas & Pinnel

DRA

Dibbles

Castle Learning

AimsWeb

(low) (high)

(lo

w)

(hig

h)

????

????

Academic Data Types Identified in Teacher Follow-up Interviews

Page 110: Education Leadership Data Analytics Keynote Presentation

Conclusion• There are at least four different types of responders to the TDUS

– The four subgroups value and collaborate around data in very different ways.

– Dashboards plus the LCA subgroup analysis provides opportunities for potential targeted

district capacity building and professional development around data use and evidence-based

improvement cycles

– For the highest responders on the Teacher Data Use Survey

• Principals click the most while teachers click the least in the Instructional Data Warehouse (IDW).

– The more available the data type, the more it is desired.

Find out more at the breakout sessions this afternoon!• Measuring the Mindset of Educators: Attitudes and Perceptions of Data Use in Schools and Districts: 12:25-

1:25 Albany room

– Elizabeth Young & Sarah Weeks

• Working with Cognos 11 Visualizations: 12:25-1:25 Salon B

– Jeff Davis

• Clicks in Context: Getting to Insight and Action from Data Warehouse Activity Logs: 1:35-2:35 Salon C

– Robert Feihel & Aaron Hawn

Page 111: Education Leadership Data Analytics Keynote Presentation

http://tc.columbia.edu/cps/evidence

Page 112: Education Leadership Data Analytics Keynote Presentation

Overview: Education Leadership Data Analytics (ELDA)

• AI in education, data analytics, and education data science. Hype versus reality.• What is Education Leadership Data Analytics (ELDA) and why should anyone care?

• Data science, and education leadership training• Bowers, A.J., Bang, A., Pan, Y., Graves, K.E. (2019) Education Leadership Data Analytics (ELDA): A White Paper Report on the 2018 ELDA

Summit. Teachers College, Columbia University: New York, NY. https://doi.org/10.7916/d8-31a0-pt97

• Bowers, A.J. (2017) Quantitative Research Methods Training in Education Leadership and Administration Preparation Programs as Disciplined

Inquiry for Building School Improvement Capacity. Journal of Research on Leadership Education, 12(1), p. 72-96. http://doi.org/10.7916/D8K93H16

• Agasisti, T., Bowers, A.J. (2017) Data Analytics and Decision-Making in Education: Towards the Educational Data Scientist as a Key Actor in

Schools and Higher Education Institutions. In Johnes, G., Johnes, J., Agasisti, T., López-Torres, L. (Eds.) Handbook of Contemporary Education

Economics (184-210). Cheltenham, UK: Edward Elgar Publishing. https://doi.org/10.7916/D8PR95T2

• What components of a statewide longitudinal data system are needed for that system to best meet the

respective needs of superintendent, principal and teacher leaders?• Interaction and small group discussion

• Pattern visualization of student course interactions:• Lee, J., Recker, M., Bowers, A.J., Yuan, M. (2016). Hierarchical Cluster Analysis Heatmaps and Pattern Analysis: An Approach for Visualizing

Learning Management System Interaction Data. A poster presented at the annual International Conference on Educational Data Mining (EDM),

Raleigh, NC: June 2016. http://www.educationaldatamining.org/EDM2016/proceedings/paper_34.pdf

• Calculating predictor accuracy for early warning systems• Bowers, A.J., Sprott, R., Taff, S.A. (2013) Do we Know Who Will Drop Out? A Review of the Predictors of Dropping out of High School: Precision,

Sensitivity and Specificity. The High School Journal.96(2), 77-100. http://hdl.handle.net/10022/AC:P:21258

• Bowers, A.J., Zhou, X. (2019) Receiver Operating Characteristic (ROC) Area Under the Curve (AUC): A Diagnostic Measure for Evaluating the

Accuracy of Predictors of Education Outcomes. Journal of Education for Students Placed at Risk. Open Access R Markdown:

https://doi.org/10.7916/D8K94RDD

Understanding How Educators Use Instructional Data Warehouses (IDWs) (or not) NSF collaboration:

• Building community and capacity in data use: Nassau BOCES and Teachers College collaborative• Building Community and Capacity for Data-Intensive Evidence-Based Decision Making in Schools and Districts. National Science Foundation, NSF

DGE-1560720: http://www.nsf.gov/awardsearch/showAward?AWD_ID=1560720

• Do teachers and administrators have different perceptions of data use in their schools and IDW use?• https://www.tc.columbia.edu/articles/2020/january/how-education-data-can-help-teachers-raise-their-game/

Page 113: Education Leadership Data Analytics Keynote Presentation

Contributors to the Education Leadership Data Analytics Research Group:

April Bang Aaron Hawn Yilin Pan Sarah Weeks Yi Zhang

Kenny Graves Elizabeth Monroe Burcu Peckan Yihan Zhao Sen Zhang

Eric Ho Xinyu Ni Luronne Vaval Seulgi Kang Xiaoliang Zhou

Thank you!Email: [email protected]

Faculty Webpage: http://www.tc.columbia.edu/faculty/ab3764/

Research Group Website: http://www.tc.columbia.edu/elda/

Leading with Evidence in Schools Online PD Course: http://tc.columbia.edu/cps/evidence