evaluation of extended learning opportunities summer title

53
February 2019 Evaluation of Extended Learning Opportunities Summer Title I Enrichment Program (ELO STEP) in 2017 2018 PREPARED BY: Juan Carlos Davila Valencia, M.S. Shahpar Modarresi, Ph.D. Program Evaluation Unit

Upload: others

Post on 24-Mar-2022

1 views

Category:

Documents


0 download

TRANSCRIPT

February 2019

Evaluation of Extended Learning

Opportunities—Summer Title I

Enrichment Program (ELO STEP) in

2017–2018

P R E P A R E D B Y :

Juan Carlos Davila Valencia, M.S.

Shahpar Modarresi, Ph.D.

Program Evaluation Unit

850 Hungerford Drive

Rockville, Maryland 20850

240-740-3000

Dr. Jack R. Smith Superintendent of Schools

Dr. Janet S. Wilson Associate Superintendent, Office of Shared Accountability

Published for the Office of Shared Accountability

Copyright © 2019 Montgomery County Public Schools, Rockville, Maryland

February 2019 Office of Shared Accountability Program Evaluation Unit

Evaluation of Extended Learning Opportunities Summer Title I Enrichment Program (ELO STEP) in

2017–2018 Juan Carlos Davila Valencia, M.S., and Shahpar Modarresi, Ph.D.

Purpose of the Study

During summer 2018, the Extended Learning Opportunities—Summer Title I Enrichment Program (ELO STEP) in Montgomery

County Public Schools (MCPS) provided high achievers from low-income families, entering Grades 3, 4, and 5, easy access to

advanced programming in Curriculum 2.0, and center programs especially focused on advanced mathematics and science.

Using a non-equivalent comparison group design, this evaluation examined the outcomes of the ELO STEP program by comparing

the fall 2018 MAP-R and MAP-M scores of participants (664 students who attended 19 or more days out of 25 days of instruction) to

nonparticipants with similar characteristics (1,004 students) after controlling for pre-existing differences.

Grade 3. Positive effect for ESOL recipients, all Other Student Groups, and Non-FARMS Black or African American.

Grade 4. Positive effect for ESOL recipients and all Non-FARMS groups, but a negative effect for FARMS All Other Student

Groups and FARMS Black or African American. Grade 5. Positive effect for all fifth graders, ESOL recipients, all Hispanic/Latino groups, and FARMS Black or African American.

Figure I. Effect size of fall 2018 MAP-M scores for ELO STEP participants vs. nonparticipants by grade level. Note. Only groups with practically significant differences are shown. *p < .05, **p < .01.

Grade 3. Positive effect for ESOL recipients and FARMS Black or African American, but a negative effect for the Monitoring Group.

Grade 4. Positive effect for Non-FARMS Hispanic/Latino, but a negative effect for FARMS All Other Student Groups.

Grade 5. Positive effect for ESOL recipients and FARMS Hispanic/Latino, but a negative effect for the Monitoring Group.

Figure II. Effect size of fall 2018 MAP-R scores for ELO STEP participants vs. nonparticipants by grade level. Note. Only groups with practically significant differences are shown. No results were statistically significant for reading.

Selected Recommendations

Continue with the use of the instructional mathematics practices but explore the reasons for the small negative effect size in Grade

4 participants who received FARMS services (excluding Hispanic/Latino).

Examine the ELO STEP curriculum and make it more interdisciplinary, as well as increase participation via outreach activities.

Include a formative evaluation component that incorporates program stakeholders’ feedback and experiences.

Administer a test, before and after the program, assessing all content areas and skills the program intends to enrich.

0.160.24 0.30

0.410.30

0.38* 0.44*0.34

-0.19 -0.17

0.17** 0.150.24

0.54**

0.25**

-0.35

0.00

0.35

0.70

All ESOL Non-FARMS All

Other Student

Groups

(Monitoring

Group)

Non-FARMS

Black or African

American

Non-FARMS

Hispanic/Latino

FARMS All

Other Student

Groups

FARMS Black

or African

American

FARMS

Hispanic/Latino

Eff

ect

size

Grade 3

Grade 4

Grade 5

0.22

-0.34

0.330.24

-0.54

0.40

-0.20

0.17

-0.80

-0.30

0.20

0.70

ESOL Non-FARMS All

Other Student Groups

(Monitoring Group)

Non-FARMS

Hispanic/Latino

FARMS All Other

Student Groups

FARMS Black or

African American

FARMS

Hispanic/Latino

Eff

ect

size

Grade 3

Grade 4

Grade 5

Were there differences in fall 2018 mathematics performance between ELO STEP participants and nonparticipants?

Were there differences in fall 2018 reading performance between ELO STEP participants and nonparticipants?

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit i ELO STEP Evaluation

Table of Contents

Executive Summary ................................................................................................................................... iv

Background ................................................................................................................................................. 1

Overview of ELO STEP in MCPS.............................................................................................................. 1

Previous Evaluation of ELO STEP ......................................................................................................... 3

Literature Review........................................................................................................................................ 4

Achievement gap among high achievers ................................................................................................ 4

Efforts across the country ....................................................................................................................... 5

Efforts in MCPS ...................................................................................................................................... 6

Evaluation Scope and Questions ................................................................................................................. 6

Methodology ............................................................................................................................................... 7

Evaluation Design ................................................................................................................................... 7

Sample..................................................................................................................................................... 8

Analytical Procedures ............................................................................................................................. 9

Strengths and Limitations ..................................................................................................................... 11

Results ....................................................................................................................................................... 13

Evaluation Question 1. What were the demographic characteristics of students who participated in

the summer 2018 ELO STEP by gender, race/ethnicity, services received, and focus group status? .. 13

Evaluation Question 2. After controlling for pre-existing differences among students, were there

differences in reading and mathematics performance in the fall of 2018 among students receiving the

ELO STEP program compared to their peers who were invited but did not participate? Did these

differences vary by students receiving ESOL services and student focus groups? .............................. 16

Discussion and Conclusion ....................................................................................................................... 28

Recommendations ................................................................................................................................. 30

Acknowledgements ................................................................................................................................... 31

References ................................................................................................................................................. 32

Appendix A ............................................................................................................................................... 36

Appendix B ............................................................................................................................................... 37

Appendix C ............................................................................................................................................... 38

Appendix D ............................................................................................................................................... 39

Appendix E ............................................................................................................................................... 40

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit ii ELO STEP Evaluation

List of Tables

Table 1. Characteristics of 2018 ELO STEP Participants and Nonparticipants ......................................... 9

Table 2. Characteristics of 2018 ELO STEP Participants by Grade (Gender, Race/Ethnicity, Services

Received, and Focus Group) ............................................................................................................. 14

Table 3. Adjusted Means of Grade 5 MAP-M RIT Scores for ELO STEP Participants and their Peers 18

Table 4. Adjusted Means of Grade 5 MAP-R RIT Scores for ELO STEP Participants and their Peers 19

Table 5. Adjusted Means of Grade 4 MAP-M RIT Scores for ELO STEP Participants and their Peers 20

Table 6. Adjusted Means of Grade 4 MAP-R RIT Scores for ELO STEP Participants and their Peers .. 20

Table 7. Adjusted Means of Grade 3 MAP-M RIT Scores for ELO STEP Participants and their Peers . 21

Table 8. Adjusted Means of Grade 3 MAP-R RIT Scores for ELO STEP Participants and their Peers .. 22

Table 9. Statistical Significance and Effect Sizes for ELO STEP Participants vs. Comparison Students in

Math .................................................................................................................................................. 27

Table 10. Statistical Significance and Effect Sizes for ELO STEP Participants vs. Comparison Students

in Reading ......................................................................................................................................... 28

Table B1. Schedules of 2018 ELO STEP and Enrichment Activities ...................................................... 37

Table C1. FY 2018 Budget for ELO STEP .............................................................................................. 38

Table D1. 2018 ELO STEP Program Sites and Bell times ....................................................................... 40

Table E1. Mean and Standard Deviation of Spring 2018 MAP-P/M (Math) in Grade 2 for ELO STEP

Participants and their Comparison Group ......................................................................................... 40

Table E2. Mean and Standard Deviation of Spring 2018 MAP-P/M (Math) in Grade 3 for ELO STEP

Participants and their Comparison Group ......................................................................................... 40

Table E3. Mean and Standard Deviation of Spring 2018 MAP-P/M (Math) in Grade 4 for ELO STEP

Participants and their Comparison Group ......................................................................................... 41

Table E4. Percentage of Students Meeting the Spring 2018 AP-PR Benchmark in Grade 2 for ELO

STEP Participants and their Comparison Group .............................................................................. 41

Table E5. Mean and Standard Deviation of Spring 2018 MAP-R (Reading) in Grade 3 for ELO STEP

Participants and their Comparison Group ......................................................................................... 42

Table E6. Mean and Standard Deviation of Spring 2018 MAP-R (Reading) in Grade 4 for ELO STEP

Participants and their Comparison Group ......................................................................................... 42

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit iii ELO STEP Evaluation

List of Figures

Figure 1. Evaluation Design of the ELO STEP Program ........................................................................... 7

Figure 2. Gender distribution of ELO STEP participants by grade. ........................................................ 14

Figure 3. Race/Ethnicity distribution of ELO STEP participants by grade. ............................................ 14

Figure 4. Distribution of Services Received by ELO STEP participants during the school year 2017–

2018 by grade. ................................................................................................................................... 15

Figure 5. Distribution of ELO STEP participants in focus and monitoring group by grade. ................... 15

Figure 6. Distribution of ELO STEP participants in each of the focus groups by grade. ........................ 16

Figure 7. Effect size of fall MAP-M scores for all ELO STEP participants. ........................................... 23

Figure 8. Effect size of fall MAP-M scores for all ELO STEP participants receiving ESOL services. .. 23

Figure 9. Effect size of fall MAP-M scores for ELO STEP participants by focus group. ....................... 24

Figure 10. Effect size of fall MAP-R scores for all ELO STEP participants. .......................................... 24

Figure 11. Effect size of fall MAP-R scores for all ELO STEP participants receiving ESOL services. . 25

Figure 12. Effect size of fall MAP-R scores for ELO STEP participants by focus group. ...................... 26

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit iv ELO STEP Evaluation

Executive Summary

In response to a nationwide opportunity gap for many students who are qualified but underrepresented in

intervention programs offering challenging academic content, Montgomery County Public Schools

(MCPS) offer the Extended Learning Opportunities—Summer Title I Enrichment Program (ELO STEP).

The program is designed to provide high achievers from low-income families easy access to advanced

programming in Curriculum 2.0, and center programs that are especially focused on mathematics and

science. At the request of the Chief Academic Officer, the Office of Shared Accountability (OSA)

conducted an evaluation of ELO STEP in MCPS during the summer of 2018. This report presents the

mathematics and reading outcomes associated with the program for those students who were served by

ELO STEP.

Overview

In the summer of 2018, ELO STEP was offered to 22 Title I elementary school students in 14 sites with

an average class size of 14 students. A total of 1,824 students entering Grades 3, 4, and 5 were invited to

the program, and 785 of them (43%) attended at least one day of instruction. The program benefited

participants by providing free breakfast, lunch, transportation, and rigorous learning opportunities five

days a week for 25 days, from July 9 to August 10, 2018.

Evaluation Design and Methodology

Nowadays, the school district accountability efforts have been challenged to develop procedures that

determine the outcomes of their intervention programs on student achievement within the typical

implementation setting (lack of randomization in selection and assignment of students to an intervention

program). The critical issue in outcome evaluations is to determine whether or not a program produces

levels of effects above what would have occurred without the program (Rossi & Freeman, 1993). Using

a non-equivalent comparison group design, a frequently used type of quasi-experimental design, this

outcome evaluation examined the program’s effectiveness by comparing the academic performance of

ELO STEP students in mathematics and reading to their peers who were invited to participate but did not.

The use of a comparison group for interpreting findings in this evaluation increases our confidence that

observed students’ outcomes are the result of participation in ELO STEP instead of a function of these

students’ initial abilities.

Outcome measures for addressing the program’s effectiveness included the Measures of Academic

Progress in Reading (MAP-R) and Mathematics (MAP-M). Analyses were done by grade, content area,

and disaggregated by students’ English for Speakers of Other Languages (ESOL) status as well as

students’ focus groups. Appropriate analytical procedures, including advanced statistical analyses, were

conducted based on the nature of the evaluation questions and the characteristics of the data. The General

Linear Model (GLM) parameterization procedure, the general analysis procedures, was used to test the

significant differences between adjusted means. Statistical analyses were supplemented with the

computation of effect sizes to infer conclusions, meaningful for educational settings. A summary of

findings for each of the evaluations questions is listed below.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit v ELO STEP Evaluation

Findings

Evaluation Question 1. What were the demographic characteristics of students who participated in

the summer 2018 ELO STEP by gender, race/ethnicity, services received, and focus group status?

According to the program records, 730 students participated in the ELO STEP program, attending at least

19 days of instruction (out of 25 days). There were also slightly more male than female participants (52%

vs. 48%), around 50% of attendees were Hispanic/Latino, and 30% were Black or African American.

Regarding services received in 2017–2018, about 67% of the participants received Free and Reduced-price

Meals System (FARMS) services, and 23% of them received ESOL instruction.

The disaggregation of data by focus groups revealed that approximately 41% of participants were

Hispanic/Latino receiving FARMS services, followed by Black or African American receiving FARMS

services (19%), Non-FARMS All Other Student Groups (12%), and Non-FARMS Black or African

American (12%). The rest of the focus groups’ percentages were less than 10%. When the data were

analyzed across grades, similar distributions by gender, race/ethnicity, service receipt measures, and focus

group were found.

Evaluation Question 2. After controlling for pre-existing differences among students, were there

differences in reading and mathematics performance in the fall of 2018 among students receiving the

ELO STEP program compared to their peers who were invited but did not participate? Did these

differences vary by students receiving ESOL services and student focus groups?

The benefits of the ELO STEP program varied by content area and by student characteristics, with a more

positive effect on mathematics than reading. It should be noted that even though ELO STEP did not focus

on developing students’ reading skills, the program’s daily schedule included some literacy activities

related to informational texts.

Mathematics. The mathematics outcomes of the ELO STEP participants, as compared with the

nonparticipants, were both statistically (p < 0.01) and practically (ES = 0.17) significant for those entering

Grade 5, as measured by fall MAP-M.

Comparative analyses, for mathematics, found that participants who received ESOL services performed

better in mathematics across all grade levels than those who did not. The effect size analyses revealed

practically significant differences for all grades (ES = 0.16 for Grade 3, ES = 0.30 for Grade 4, and ES =

0.15 for Grade 5).

When the data were disaggregated by student focus groups, statistically or practically significant results

associated with participation in ELO STEP varied across grade levels (Figure A1). Focus groups with the

largest effect sizes (and also significant at 0.05 level) included fourth graders in the Monitoring Group (ES

= 0.38), Non-FARMS Black or African American in Grade 4 (ES = 0.44), FARMS Black or African

American in Grade 5 (ES = 0.54), and FARMS Hispanic/Latino in Grade 5 (ES = 0.25). Other focus groups

with positive and practically significant differences in the comparative analyses are shown in bold (Figure

A1 below).

On the other hand, parallel effect size analyses revealed that fourth graders in two comparison groups

outperformed their ELO STEP counterparts: (a) FARMS All Other Student Groups (ES = -0.19); and (b)

FARMS Black or African American (ES = -0.17).

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit vi ELO STEP Evaluation

Reading. Overall, the advanced analyses did not provide sufficient evidence to associate the program with

a positive reading performance of ELO STEP participants for all students across grades, except for ESOL

students.

Disaggregation of performance data by ESOL status revealed that the program positively affected the

reading outcomes for all participants in Grade 3 (ES = 0.22) and Grade 5 (ES = 0.40). These findings

indicate that the impact of ELO STEP on reading performance of ESOL students is practically significant

and meaningful in an educational setting.

The analyses by focus group status (Figure A2), indicate that the program has affected the reading

performance of participants differently. For instance, analyses find a practically significant difference

between participants versus nonparticipants in favor of nonparticipants for FARMS All Other Student

Groups in Grade 4 (ES = -0.54) and Non-FARMS All Other Student Groups in Grade 3 (-0.34). In

contrast, three focus groups across all grade levels benefited from the program, such as FARMS Black or

African American in Grade 3 (ES = 0.33) followed by Non-FARMS Hispanic/Latino in Grade 4 (ES =

0.24) and FARMS Hispanic/Latino in Grade 5 (ES = 0.17).

0.240.30

0.10

0.41

-0.06

0.04

0.38*0.44*

0.34

-0.19 -0.17

0.13

0.010.09

0.24

-0.09

0.54**

0.25**

-0.30

-0.15

0.00

0.15

0.30

0.45

0.60

Non-FARMS All

Other Student

Groups

(Monitoring Group)

Non-FARMS Black

or African

American

Non-FARMS

Hispanic/Latino

FARMS All Other

Student Groups

FARMS Black or

African American

FARMS

Hispanic/Latino

Eff

ect

size

Grade 3 Grade 4 Grade 5

-0.34

0.00 0.000.09

0.33

-0.04-0.04

0.130.24

-0.54

0.07 0.12

-0.20

-0.05 -0.07

0.09 0.060.17

-0.75

-0.55

-0.35

-0.15

0.05

0.25

0.45

Non-FARMS All

Other Student

Groups

(Monitoring Group)

Non-FARMS Black

or African

American

Non-FARMS

Hispanic/Latino

FARMS All Other

Student Groups

FARMS Black or

African American

FARMS

Hispanic/Latino

Eff

ect

size

Grade 3 Grade 4 Grade 5

Figure A1. Effect size of fall 2018 MAP-M scores for ELO STEP participants by focus group.

Note. Bold indicates a practically significant difference. *p < 0.05, and **p < 0.01

Figure A2. Effect size of fall 2018 MAP-R scores for ELO STEP participants by focus group.

Note. Bold indicates a practically significant difference. None of the effect sizes was statistically significant.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit vii ELO STEP Evaluation

Conclusion

After the Every Student Succeeds Act (ESSA) passed in 2015, states are more encouraged to focus on all

students, not only on struggling or at-risk children but also on students with different skill sets (Wright,

2017). ELO STEP is one example of these initiatives that intended to close the achievement gap between

high achievers from low-income families and their affluent peers. This program, during summer 2018,

provided a structured curriculum in math, science, and information literacy continuously for five weeks

to 785 students from 21 Title I schools, with an average class size of 14 students per section.

The outcome analyses provided some empirical evidence indicating that ELO STEP improved the

academic achievement of above-average students who were impacted by poverty. Comparative analyses

found more evidence of effects in mathematics than in reading, and the curriculum structure and schedule

might be the reasons for this discrepancy. According to 2018 ELO STEP daily schedule, math was taught

for 75 minutes whereas literacy activities (information literacy) were provided for 30 minutes.

In the case of particular student subgroups, results from this evaluation were mixed. This evaluation found

a variety of results, where the direction of the program’s outcome differed based on participants’

characteristics, such as grade level, race/ethnicity, and FARMS status. An overall summary of the positive

and practically significant effects of the program by student focus groups are included below:

Black or African American. Comparative analyses (ELO STEP participants and nonparticipants)

revealed that Black or African American participants in Grades 3 and 4, who did not receive

FARMS services, performed better in mathematics, as well as Black or African American

participants in Grade 5 who received FARMS services. For reading, only Black or African

American in Grade 3 who received FARMS services performed better than their peers in the

comparison group.

Hispanic/Latino. Similar comparative analyses found that Hispanic/Latino participants in Grade

4 who did not receive FARMS services performed better than their peers (nonparticipants) in math.

Better performance was observed for Hispanic/Latino participants in Grade 5 regardless of their

FARMS status using parallel analyses. In reading, the comparison of the two groups of students

(participants and nonparticipants) found that Hispanic/Latino participants in Grade 4 who did not

receive FARMS services performed better than their peers. The same comparison found similar

results for Hispanic/Latino participants in Grade 5 who did receive FARMS services.

FARMS recipients. For mathematics, the comparative analyses indicated that participants who

were not FARMS recipients, regardless of their grade level, tended to perform better in

mathematics than participants who received FARMS services. For reading, no overall conclusion

can be inferred by students’ FARMS status.

ESOL recipients. Comparative analyses, for mathematics, found that participants who received

ESOL services performed better in mathematics across all grades than their nonparticipants peers.

For reading, the same conclusion was observed but only for participants in Grade 3 and 5.

In closing, establishing the criteria of success and failure is very difficult in evaluating the impact of any

intervention programs. Borus (1979) asserts that the list of outcomes is always incomplete since only

those criteria of success, which can be measured, are considered. The effectiveness of ELO STEP in this

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit viii ELO STEP Evaluation

evaluation was assessed only by examining the performance of students in the area of reading and

mathematics through only one measure of students’ achievement (MAP test scores). Other program goals,

such as critical thinking, knowledge of science concepts, or students’ attitudes to complex tasks, cannot

be addressed by the analysis.

Recommendations

Based on the findings of this evaluation, the following recommendations are suggested:

Continue with the use of the instructional mathematics practices for ELO STEP for future positive

effects in mathematics. However, it is important to explore the reasons why the program had a

small negative effect size in Grade 4 participants who received FARMS services but who were not

Hispanic/Latino, even though these results were not statistically significant.

Examine the ELO STEP curriculum and make it more interdisciplinary. Similar to the

recommendation proposed in the first evaluation of ELO STEP (2016), the shorter instructional

time in literacy might be the reason why there was no apparent effect of the program on students’

overall reading performance.

Provide opportunity for and encourage teachers to conduct Action Research (AR) in their

classroom, which will allow them to collect data from their students via multiple measures, identify

patterns, collaborate with their peers, and think about different ways to improve instruction

delivery, student learning, and engagement. Guidelines and example about how to put AR in

practice can be included in the ELO STEP training that takes place before the program starts.

Increase the number of students participating in ELO STEP via outreach activities. Among the

1,824 students invited to the program, only less than the half (43%) chose to participate.

Include in future studies a formative evaluation component that incorporates students, teachers,

and parents’ feedback regarding their experience in the program. Program stakeholders’

experiences can help the administration and program staff identifying what components or

practices in the program are effective or need to be improved. Also, extend this recommendation

to parents of students who were invited but decided not to participate in order to understand the

reasons for their decision.

Replicate the study over time using different student population to see if the observed findings are

stable.

Ask parents or guardians to sign a confirmation form at the time of registration for the ELO STEP

program to commit attendance of their child or children, similar to what is done for the ELO SAIL

program.

Administer a test, before and after the program, assessing all the content areas and skills that the

program intends to enrich (e.g., critical thinking, knowledge of science concepts, students’

attitudes to complex tasks, etc.). Results from these tests can provide indicators, relevant and

specific to the ELO STEP program, which can be used to measure the impact of the intervention.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 1 ELO STEP Evaluation

Evaluation of Extended Learning Opportunities—Summer Title I Enrichment

Program (ELO STEP) in 2017–2018

Juan Carlos Davila Valencia, M.S., and Shahpar Modarresi, Ph.D.

Background

In 2015, the divisions of Accelerated and Enriched Instruction (AEI), Consortia Choice and Application

Program Services (DCCAPS), and Title I and Early Childhood Programs and Services (DTECPS) of the

Montgomery County Public Schools (MCPS) designed and implemented the Extended Learning

Opportunities—Summer Title I Enrichment Program (ELO STEP). The goal of this program is to provide

the traditionally underrepresented students with access to advanced programming in Curriculum 2.0, and

center programs.

Nationwide, summer learning programs have emerged as a promising way to address the opportunity gap

between underprivileged and privileged students since low-income parents may not afford any

investments in their children’s academic experiences (Kaushal et. al., 2011). Currently, many school

districts offer mandatory summer programs to students at risk of grade retention, but fewer districts offer

summer learning programs to a broader population of students as a means in boosting academic

performance (Augustine et. al, 2013). In the past several years, the number of summer intervention

programs across MCPS has grown, but more needs to be learned about their effects on student

achievement. At the request of the Chief Academic Officer, the Office of Shared Accountability (OSA)

conducted an evaluation of ELO STEP (a summer program) in MCPS. The ultimate goal of any program

is to be effective and to produce quality outcomes. The critical issue in outcome evaluations is whether

or not a program produces levels of effects above what would have occurred without the program (Rossi

& Freeman, 1993). Using a non-equivalent control group design, this evaluation estimated the immediate

outcome of the program on student academic performance in mathematics and reading. The findings

present the effects associated with the summer 2018 implementation of the ELO STEP program at MCPS.

Overview of ELO STEP in MCPS

In response to nation-wide opportunity gaps and in order to decrease excellence gaps within MCPS, the

ELO STEP program in Title I schools was designed to offer accelerated and enriched instructional

opportunities to students impacted by poverty who may not have access to enriching summer programs

otherwise. The broad objectives of the ELO STEP program include: (a) offer advanced pathways for

students; (b) engage students in complex texts and tasks aligned to Common Core; (c) nurture critical

thinking skills; (d) stimulate students through a rigorous environment; (e) allow students to take risks in

academic situations; and (f) expose students to specialized instructional materials that are recommended

by the National Association for Gifted Children (Montgomery County Public Schools, 2018).

The program was first offered in July 2015 to incoming third-grade students who were enrolled in a Title

I school and met specific selection criteria. The targeted students for the program were above average

students attending Title I schools. By the summer of 2018, the ELO STEP was extended to rising Grades

3, 4, and 5 students who were enrolled in a Title I school and met specific selection criteria. The goal was

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 2 ELO STEP Evaluation

to nurture critical and creative thinking skills through engaging, hands-on, rigorous instruction, using the

infrastructure from an existing MCPS summer program, the Extended Learning Opportunities Summer

Adventures in Learning (ELO SAIL) program. The 2018 ELO STEP program was offered during July 9

– August 10, 2018 (25 days of instruction) at 14 sites, and provided free breakfast, lunch, and

transportation to and from school (Lang, 2018).

Student eligibility and selection. Students in Grades 3, 4, and 5 from Title I schools who met the following

criteria during 2017–2018 were invited to participate in the 2018 ELO STEP.

1. Met one of the InView subtests at or above the 50th percentile: (a) Analogies; (b) Quantitative; (c)

Sequencing;

2. Were on or above the target Instructional Reading Level at the end of Marking Period 2;

3. Participated in Grade 2 mathematics enrichment as indicated on the Marking Period 2 report card;

4. Earned a score at or above the 50th percentile for the Grade 2 fall Measures of Academic Progress

– Primary Grades (MAP-P) for mathematics (applied to Grade 2 students only);

5. Earned a score at or above the 50th national percentile on fall or winter Measures of Academic

Progress –Reading (MAP-R) (applied to Grade 3 and 4 students only);

6. Earned a score at or above the 50th national percentile on fall or winter Measures of Academic

Progress –Mathematics (MAP-M) (applied to Grade 3 and 4 students only).

In addition, schools were asked to consider students who met the following criteria: (1) received Free and

Reduced-price Meals System (FARMS) services; (2) received schools advocacy (in cases where a student

did not meet the criteria stated above, principals and/or school staff members could have advocated for

students by completing a School Advocacy Form) (Appendix A); and (3) received instruction in Jacob’s

Ladder and/or William and Mary Curricula during 2017–2018.

Staff and training. AEI hired ELO STEP professional staff from among teachers in MCPS. Teachers

who were interested in ELO STEP teaching opportunities were asked to apply through the online MCPS

Careers System during April 3–7, 2018. AEI collaborated with the Elementary Integrated Curriculum

(EIC) team to avoid scheduling conflicts between the required summer training and the ELO STEP

summer work schedules. Also, one 2.5 hour training session was provided to Grade 3, 4, and 5 teachers

before the program started.

Curriculum and Class Activities. The overall academic outcome of ELO STEP was to engage students

in a deeper study of concepts and topics based on their assigned grade level. AEI developed the structure

of the ELO STEP curriculum, whereas DTECPS developed the extended part (offered to some participants

only). ELO STEP was provided in all 14 sites from 9:00 a.m. to 1:00 p.m. and offered instruction in math

and science. Even though ELO STEP did not focus on developing reading skills, this program included

some literacy activities related to informational texts. The extended part or enrichment activities during

the afternoon were provided only in nine sites to some participants from 1:00 p.m. to 3:30 p.m. and

instruction included digital literacy and coding based on existing curricula from Code.org and

Commonsense.org. Schedules of the ELO STEP program and the extended part can be found in

Appendix B.

Program Operations and Cost. As mentioned before, the 2018 ELO STEP was designed and

implemented by three offices (AEI, DCCAPS, and DTECPS), and participants received free breakfast,

lunch, and transportation to and from school. According to DTECPS, the cost of the program was

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 3 ELO STEP Evaluation

$505,621 including teachers and paraeducators’ salaries, meetings, staff benefits, and instructional

materials, but excluding transportation and meal costs (Appendix C). Also, in case substitute teachers

were needed due to illness or another emergency, a list of experienced teachers was provided to each ELO

STEP site. However, if teachers from the list were not available, schools were asked to follow the same

substitute procedures used during the school year.

Participating Students and Schools. In 2018, 1,824 students from 22 Title I Elementary Schools were

invited to participate in ELO STEP across 14 sites. Among them, 785 students (43%) did participate (at

least attended one day or more), and 1,039 (57%) did not. Also, 78 students who were not initially invited

did participate. Program staff reported an average class size of 14 students. The full list of the ELO STEP

sites and their hours of instruction can be found in Appendix D. All school sites were asked to monitor

students’ attendance accurately and contact parents/guardians of students who missed the first three days

of the program to determine their plans.

Previous Evaluation of ELO STEP

The Office of Shared Accountability (OSA) evaluated the first implementation of ELO STEP in the

summer of 2015 (Zhao et al., 2016). The first section of this evaluation was formative and provided

information regarding the experiences of students, teachers, and parents about the components of the

program and how it might be improved. The second section of the evaluation comprised an outcome study

on how effective the program was in terms of improving fall academic performance of ELO STEP students

(in reading and mathematics) compared to their peers who were invited to participate but did not.

The implementation evaluation revealed that almost all of ELO STEP students had positive educational

experiences, indicating that their math and science teachers made those subjects interesting. ELO STEP

teachers also communicated that the program: (a) had a well-designed curriculum; (b) encompassed an

instructional content that supported students’ learning needs; and (c) was successful in helping students

engage in enriched and accelerated academic work. Similar to teachers, parents also provided positive

feedback regarding the program (Zhao et al., 2016).

The outcome evaluation showed empirical evidence that ELO STEP intervention reinforced the academic

achievement of students impacted by poverty. In mathematics, the benefits of the program were observed

in all of the four groups used in the analysis (all students, Black or African American, Hispanic/Latino,

and students receiving FARMS services). In reading, the benefits of the program were only observed in

two of the four groups (Hispanic/Latino and students receiving FARMS services). The largest ELO STEP

effect was observed among Hispanic/Latino students in both reading and mathematics, followed by Black

or African American students but only in mathematics (Zhao et al., 2016).

The 2015 evaluation report (Zhao et al., 2016) concluded that ELO STEP equalized opportunities for low-

income students to access challenging content and significantly improved the mathematics performance

of all students who participated. Moreover, according to the authors, the use of the following practices

likely contributed to the success of the program: (a) the use of multiple measures in selecting the students;

(b) the presence of enriched curriculum for nurturing critical and creative thinking skills through engaging,

hands-on, and rigorous instruction; (c) the presence of teachers’ professional development; (d) the

presence of expanded learning time outside of the normal school day; and (e) the equalized opportunities

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 4 ELO STEP Evaluation

between more affluent and less affluent families by providing summer learning to students impacted by

poverty.

Literature Review

Achievement gap among high achievers

A large body of research has shown that the achievement gaps are evident between disadvantaged high-

achieving students and their affluent peers. For instance, in their study, Peters et al. (2012) argued that

high-achieving students from low-income families fell out of the high achievement group in elementary

and high school faster compared to their higher-income peers. Similarly, Smarick (2013) remarked that

students in high poverty schools were underrepresented among high achievers and their proportion

declined over time. In their analyses, Plucker et al. (2013) showed that even though there were state and

federal efforts to close these achievement gaps among high-achieving students, these gaps closed slowly

or in many cases grew over the past generation. Moreover, the same authors reported that the highest-

performing American students were disproportionally White and wealthy, and the gap between White and

non-White had widened over the years.

One of the possible reasons why low-income students experienced different summer achievement

trajectories relates to the limited opportunities they have to practice academic skills over the summer.

Some studies have found that low-income students spend more time watching television during the

summer than the students from wealthier backgrounds, and low-income students are also less likely to

have enriching nonacademic experiences, such as sports, compared to their affluent peers (Augustine et

al., 2016). Hence, summer learning initiatives have emerged as an alternative to equalize opportunities

and create access for low-income students to put in practice skills learned during the school year. For

instance, one randomized experimental study examined the short and long-run effects of voluntary

summer programs in five districts and found strong evidence of short term gains in mathematics

(Augustine et al., 2016). Additionally, the same study remarked that students with high attendance in

voluntary summer programs after two consecutive summers outperformed their peers in mathematics and

English Language Arts as well as exhibited stronger social-emotional skills (Augustine et al., 2016).

Similarly, Cooper et al. (2000) in their meta-analysis found that summer programs led to favorable

outcomes on mathematics assessments than on reading assessments.

As documented by the literature, most of the summer programs, even though they are usually mandatory,

have targeted students at risk of grade retention and not high achievers (Augustine et al., 2013). According

to Olszewski-Kubulius et al. (2012), during the No Child Left Behind (NCLB) era, there has been extra

attention on raising the achievement of low performers in detriment of high performers since teacher’s

time and attention have been concentrated on struggling students. Teachers perceived that low-achieving

students received significantly more attention and represented their schools’ top priority (Loveless et al.,

2008). The passing of Every Student Succeeds Act (ESSA) in 2015 addressed this particular issue by

giving the authority to the states over their accountability regimes, and therefore encouraging them to

focus on all pupils (Wright, 2017). States, for example, can give substantial weight to measures of

academic growth for all students from one year to another, or use average scale scores or performance

indexes to measure academic growth beyond proficiency (Wright, 2017).

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 5 ELO STEP Evaluation

Many studies on achievement gaps have been conducted during the NCLB era, suggesting that the focus

on low achievers compared to high achievers was apparent. Even the results of state-level research

mirrored the trends observed in national studies, revealing that gaps increased over time more for advanced

students than those with low levels of achievement (Zhao et al., 2016). However, recent research is

demonstrating that the focus has switched to an emphasis on all students. For instance, according to the

study conducted by the Thomas B. Fordham Institute of all the state accountability plans (including

Washington, D.C.) as of October 2017, the country is doing better in signaling that every child is

important, regardless of their achievement level (Wright, 2017). Twenty-two states and Washington, D.C.

(45% of all states) ranked “strong” on this objective and 14 states (27%), including Maryland, ranked

“medium.”

Efforts across the country

Many initiatives have been implemented across the country to tackle the issue of the achievement gap.

Olszewski-Kubilius et al. (2012) provided a list of nine successful programs aimed at low-income and

high achievers, run by schools, school districts, and non-school organizations. Also, based on literature

and expert opinions, there are nine components that ensure a quality summer learning program: (1) smaller

class size; (2) differentiated instruction; (3) high-quality instruction; (4) aligned school-year and summer

curricula; (5) engaging and rigorous programming; (6) maximized participation and attendance; (7)

sufficient duration; (8) involved parents; and (9) evaluation of effectiveness (McCombs et al., 2011).

One of the programs highlighted by Olszewski-Kubilius et al. (2012) was Project NEXUS – Linking

Middle Schools to College Success, implemented by the Maryland State Department of Education

(MSDE) from 2005 to 2008. The goal of this program was to expand opportunities for students from low-

income families to gain access to challenging coursework and, therefore, to be prepared for the rigors of

higher education (Olszewski-Kubilius et al., 2012). The following positive outcomes were observed after

the program ended: (a) the number of middle school students taking Pre-AP courses in NEXUS schools

increased by more than 12%; (b) the number of high school students enrolled in AP courses increased by

23%; (c) the number of AP tests taken increased by 21%; and (d) an increase of 5% was observed in test

takers scoring 3-5 on AP exams (Olszewski-Kubilius et at., 2012).

Another pertinent program is the Young Scholars Program, implemented by the Fairfax County Public

Schools (FCPS). This program first identifies and nurtures advanced academic potential in students from

historically underrepresented groups. Then, school administrators, teachers, and advanced academic

resource teachers work together to prepare those identified young learners for more challenging and

rigorous courses (FCPS, 2018). The selection of students includes individuals from a diverse cultural,

ethnic, and linguistic background who are unlikely to be considered for gifted programs (Olszewski-

Kubilius et al., 2012). Positive outcomes were also observed during this program: (a) the number of Black

or African American, and Hispanic/Latino enrolled in the program increased by 21% from 2010 to 2012;

and (b) the number of young scholars identified for secondary advanced academics increased by more

than 200% from 2010 to 2011 (Olszewski-Kubilius et at., 2012).

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 6 ELO STEP Evaluation

Efforts in MCPS

Besides ELO STEP, which mainly targets low-income high-achieving students, MCPS also offers other

summer programs that target students from low-income families. One of these programs is the Extended

Learning Opportunities Summer Adventures in Learning (ELO SAIL), which provides literacy and

mathematics instruction to Title I students entering kindergarten, Grade 1, or Grade 2 in the fall following

the program. The goal of this program is to prevent summer loss and prepare students for the next grade

level by focusing on reading, writing, and mathematics, as well as arts and sciences in some participating

schools. This program started in 2002 as a 4-week program in 17 Title I schools, and it expanded to a 5-

week program in 21 Title I schools by the summer of 2018. The most recent evaluation conducted by

Cooper-Martin et al. (2016) showed that this program had mixed results with a strong positive academic

outcome on the targeted population in the fall than at the end of the school year. For reading, the authors

reported that positive gains were observed mainly on kindergarteners compared to Grade 1 and Grade 2

students; however, these results differed from earlier evaluations (Cooper-Martin et al., 2016). Also,

positive outcomes in reading were observed in first and second graders who participated in previous ELO

SAIL, but not in the summer of 2015. For mathematics, these positive gains were mainly limited to Grade

1 and Grade 2 students, and these results were also observed in previous evaluations of the program,

except for 2012 (Cooper-Martin et al., 2016). The analyses focused on high-poverty or underrepresented

populations and found positive results on at least some measures for FARMS or ESOL recipients (Cooper-

Martin et al., 2016). The Program Evaluation Unit of the Office of Shared Accountability (OSA) is

currently conducting an evaluation of the outcome of the 2018 summer ELO SAIL, and the findings will

be shared when the study is completed.

Another initiative that MCPS started to implement during the summer of 2018 is the Summer Unleash

Potential (Summer UP) program. The goal of this program is to prevent summer learning loss and enhance

students’ social-emotional skills by providing high-quality instructional and enrichment activities to

students from low-income families. The Summer UP program targeted elementary and middle school

students at 43 focus schools, which are schools that do not meet the level of poverty for a Title I

designation but have a high percentage of students receiving FARMS services. The duration of this

program was five days a week for four weeks, 6.5 hours per day. A little more than 600 students from

Grades 3 to 8 and with academic need were identified as eligible to participate. The Program Evaluation

Unit of OSA is currently conducting an evaluation of this program, and the results will be shared when

available.

Evaluation Scope and Questions

The purpose of this study was to assess the outcome in terms of academic achievement of the students

enrolled in the 2018 ELO STEP program by addressing the following questions:

1. What were the demographic characteristics of students who participated in the summer 2018 ELO

STEP by gender, race/ethnicity, services received, and focus group status?

2. After controlling for pre-existing differences among students, were there differences in reading and

mathematics performance in the fall of 2018 among students receiving the ELO STEP program

compared to their peers who were invited but did not participate? Did these differences vary by

students receiving ESOL services and student focus groups?

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 7 ELO STEP Evaluation

Methodology

This section includes the evaluation design, measures, data sources, analytical sample, data analysis

procedures, and strengths and limitations associated with this study.

Evaluation Design

In conducting the evaluation of an intervention program, an experimental design with random assignment

is an ideal sampling method to create experimental and control groups. The students in the program would

receive a treatment (e.g., ELO STEP), whereas no specific treatment would be given to those students in

the control group. The procedure of random assignment is a powerful technique for controlling all known

and “unknown” extraneous (or covariate) variables because it makes both groups very similar at the time

of program delivery. Unfortunately, randomized methods are not often feasible in educational settings

due to ethical, practical, and logistical issues (Modarresi et al., 2007). Consequently, this evaluation

employed a non-equivalent comparison group design, a frequently used type of quasi-experimental design,

as recommended by many researchers (Bordens & Abbott, 2008; Fraenkel & Wallen, 2009; Shadish, Cook

& Campbell, 2002). The design (Figure 1 below) compares the outcomes for the group of students who

participated in the program with a comparison group of students who did not participate but were similar

on observable characteristics (Institute of Education Sciences, 2016). The emphasis in this design is to

maximize internal validity of the study by controlling for confounding variables.

Figure 1. Evaluation Design of the ELO STEP Program

Pre-program ELO STEP Post-program

ELO STEP student group O1 => X => O2

Comparison group (Non-STEP) O1 => C => O2

O1 – Spring 2018 test scores for Grades 2, 3, and 4 in mathematics and reading

X – Five weeks of ELO STEP program treatment from July 9 to August 10, 2018

C – Non-STEP (no ELO STEP treatment)

O2 – Fall 2018 test scores for Grades 3, 4, and 5 in mathematics and reading

Pre-program Measures. Pre-program measures for addressing the second evaluation question included

the Measures of Academic Progress – Reading (MAP-R), the Measures of Academic Progress –

Mathematics (MAP-M), the Assessment Program in Primary Reading (MCPS AP-PR), and the Measures

of Academic Progress – Primary Grades (MAP-P). Specifically, Rasch Unit (RIT) scores from the spring

2018 administration of the MAP-R and MAP-M were used as pre-program measures in reading and math

for rising Grade 4 and Grade 5 students. MCPS has been administering the MAP-R in Grades 3–8 since

2004. For the rising Grade 3 students, the pre-program measure was the spring 2018 MCPS AP-PR in

reading and MAP-P in Mathematics since Grade 2 students in MCPS do not take MAP-R or MAP-M.

MAP-R, MAP-M, and MAP-P are computer adaptive achievement tests developed by the Northwest

Education Association (NWEA, 2008 and 2011). MAP-R measures five reading areas, such as word

recognition and vocabulary, reading comprehension–literal, reading comprehension–

inferential/interpretive, reading comprehension-evaluation, and literary response and analysis. MAP-M,

on the other hand, measures algebraic concepts, computation, geometry, measurement, number sense and

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 8 ELO STEP Evaluation

numeration, statistics, and probability and graphing (NWEA, 2011). RIT scale scores on MAP-R and

MAP-M were used for advanced analyses since the equal-interval property of the RIT scale scores makes

them appropriate for various statistical purposes. For the rising Grade 3 students, the pre-program

measures in mathematics were RIT scores in mathematics from the spring 2018 administration of

MAP-P because Grade 2 students in MCPS do not take MAP-M.

Post-program (Outcome) Measures. RIT scores in MAP-R and MAP-M from fall 2018 for Grades 3, 4,

and 5 were used as outcome measures for reading and mathematics respectively to examine the ELO STEP

program effect on student academic performance.

Student Characteristics Data Sources. The measures for student characteristics included gender,

race/ethnicity, services received, and student focus groups status. The receipt of services referred to the

English for Speakers of Other Languages (ESOL), Free and Reduced-priced Meal Systems (FARMS),

and special education services. These measures were extracted from MCPS data files. Student attendance

at the ELO STEP program was recorded daily by staff at each of the program sites and provided to the

evaluators.

Sample

ELO STEP participants. The participant group comprised all students who met the attendance threshold,

defined as the students who attended the program for 19 or more days (out of 25 days). This evaluation

used the cutoff of 19 days based on an analysis of the distribution of participants and days of attendance.

The attendance threshold decision made by evaluators was supported by the literature that suggests that

the relationship between program attendance and estimated treatment effects for similar programs tend to

be between 15 and 25 days or more of attendance (Augustine et al., 2016). Additionally, this sample of

students did not differ based on the hours of instruction received (morning vs. extended hours). The

analyses only considered one single treatment group with one single level of treatment in order to obtain

a larger sample size and more statistical power. In total, the number of ELO STEP attendees was 730,

including those who were not initially invited but participated. However, for addressing the second

research question, the students who were initially invited to the program (e.g., met certain criteria) and

attended for 19 or more days were included in the analysis (664 students).

Nonparticipants (comparison group). The second sample of students were nonparticipants, defined as

those students who met the selection criteria and were invited to ELO STEP but did not attend any day of

the program. Excluded from the comparison sample (nonparticipant) were students who did not

participate in ELO STEP but attended a different summer program (e.g., Building Educated Leaders for

Life or BELL). The total number of nonparticipants (comparison group) was 1,004 students.

Table 1 below presents the demographic characteristics of both groups. The group of nonparticipants was

approximately similar to the attendees in general, although some differences (greater than five percentage

points) were found for certain student characteristics. For example, the proportion of Black or African

American was slightly higher among the participants compared to the nonparticipants (30% vs. 21.5%),

and the proportion of participants “Non-FARMS All Other Students” (Monitoring Group) was slightly

lower among the participants compared to the nonparticipants (12.3% vs. 20.8%).

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 9 ELO STEP Evaluation

Table 1. Characteristics of 2018 ELO STEP Participants and Nonparticipants

Participants

(19 or more days of

attendance)

Nonparticipants

N % N %

Total 730 100.0 1,004 100.0

Grade level as of fall 2017

Grade 3 267 36.6 391 38.9

Grade 4 214 29.3 308 30.7

Grade 5 249 34.1 305 30.4

Gender

Female 351 48.1 464 46.2

Male 379 51.9 540 53.8

Race/ethnicity

Asian 90 12.3 99 9.9

Black or African American 222 30.4 216 21.5

Hispanic/Latino 362 49.6 490 48.8

White -- -- 160 15.9

Receipt of services during the school year 2017–2018

ESOL 167 22.9 219 21.8

FARMS 488 66.8 589 58.7

Focus groups

Non-FARMS All Other Student Groups (Monitoring Group) 90 12.3 209 20.8

Non-FARMS Black or African American 86 11.8 89 8.9

Non-FARMS Hispanic/Latino 66 9.0 117 11.7

FARMS All Other Student Groups 56 7.7 89 8.9

FARMS Black or African American 136 18.6 127 12.6

FARMS Hispanic/Latino 296 40.5 373 37.2

Note. Results are not reported (--) for groups with fewer than 10 students. For groups of between 10 and 20

students, only the percentage rate is reported only if the percentage rate is ≤ 95% or ≥ 5%.

Analytical Procedures

Descriptive analyses, tests of significance, and effect size measures were used to assess ELO STEP effects

in this study. Specifically, this study was conducted via the following analytical techniques.

Descriptive statistics. The characteristics of ELO STEP participants were analyzed using descriptive

statistics. Analyses were done at each of the grade levels and within each grade. The data were

disaggregated and described by student characteristics (gender and race/ethnicity), service receipt

measures, and student focus groups or ESOL status. Bivariate analyses, such as chi-square tests, were

used where appropriate and applicable, especially to examine the outcome differences between

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 10 ELO STEP Evaluation

participants and nonparticipants before the exposure to the program (Appendix E). This analysis revealed

that participants and nonparticipants performed in general at a similar level on all the pre-program

measures (spring 2018), except for math in Grade 3 and reading in Grade 4. A similar overall conclusion

was obtained when the analysis was disaggregated by focus group, with the exception of “Non-FARMS

All Other Student Groups” (or Monitoring Group). As a result, math and reading scores were included in

the statistical models, where appropriate, to control for students’ prior academic achievement.

Advanced statistical models. The goal of the advanced analyses was to compare the achievement of

students in the ELO STEP group to their counterparts in the comparison group to assess the effectiveness

of the program. Empirically, selection bias is an important problem in utilizing quasi-experimental data

to evaluate the impact of any programs (Shadish, Cook & Campbell, 2002). In gauging the efficacy of an

intervention program, the analysis of covariance (ANCOVA) is commonly used as an analytical procedure

on the data collected from quasi-experimental designs to adjust for selection bias. The procedure assesses

the effects of a program while controlling for covariates, particularly a pretest (initial ability measure) that

might confound the effects of a program (Campbell & Stanley, 1963; Judd, Smith, & Kidder, 1991; Kirk,

1995; Institute of Education Sciences, 2016). In this evaluation, ANCOVA was used to test for significant

differences between the two groups of students’ mean RIT scores on MAP tests (outcome measures) after

adjusting for covariates and control variables to increase the statistical power. This procedure improves

precision by removing some of the variability in the outcome measures that were not due to the program’s

effects. A description of step-by-step analytical procedures is presented below.

To balance the nonequivalent groups (ELO STEP and comparison groups), propensity scores based on

gender, race/ethnicity, FARMS status, and participation in the English for Speakers of Other Languages

(ESOL) and special education programs were computed by using logistic regression models as suggested

by the literature (Luellen, Shadish, & Clark, 2005). Propensity scores represent “selection bias” or the

likelihood that a student with certain pre-existing characteristics would be enrolled either in the ELO

STEP program or included in the comparison groups. The propensity scores were then divided into five

categories (quintiles) and incorporated as categorical covariates in the statistical models to control for

selection bias (Rosenbaum & Rubin, 1982, 1983, 1984, 1985; Austin, 2011).

In addition to utilizing propensity scores, a measure of initial abilities of students (spring test scores) was

included in each of the statistical models as a covariate, where available and appropriate, to further reduce

the selection bias inherent in the quasi-experimental design. The independent variable in each of the

statistical models was a dummy variable created to represent the status of the students’ experience (1 =

ELO STEP group, and 0 = comparison group). Analyses were done at grade level as well as subject level.

In addition to the propensity scores (quintiles), the covariate (student spring test scores), the dummy

variable representing students’ status (participation or not), and the interaction between them were

included in each of the statistical models. The coefficient of this product was used to test for non-

parallelism or interaction. The General Linear Model (GLM) parameterization procedure, the general

analysis procedures, was used to test the significant differences between adjusted means. The analysis for

each of the outcome measures was conducted separately.

Effect size. The statistical analyses were supplemented with the computation of effect size statistics to

judge whether the observed differences between student groups (ELO STEP vs. comparison) were large

enough to be of practical significance to educators. A small mean difference between the treatment and

comparison group may be statistically significant when a sampling error is small due to a large sample

size. Conversely, a relatively large mean difference may not be statistically significant when a sampling

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 11 ELO STEP Evaluation

error is large due to a small sample size. Therefore, several researchers (e.g., Kline, 2004; Rossi, Lipsey,

& Freeman, 2004; Carver, 1993; Levin, 1993; Thompson, 1995; American Psychological Association,

2001) suggest that it would be prudent to report the results of statistical significance tests as well as

estimates of the practical significance (e.g., the effect size).

One of the most common effect size measures is the standardized mean difference, Cohen’s d, by which

an effect size (ES) of 0.20 is considered small, an ES of at least 0.50 is considered medium, and an ES of

0.80 or greater is considered large (Cohen, 1988). However, a study examining evaluations of 346

education programs for at-risk children reported that the average effect size, adjusted for characteristics,

was d = 0.12 (Borman et al., 2002). In addition, Lipsey and Wilson (1993) in their meta-analyses of

school improvement programs concluded that educational treatment effects of d = 0.10 to d = 0.20 should

not be interpreted as trivial. In a more recent article, Lipsey et al., (2012) noted that “Cohen’s broad

categories of small, medium and large are clearly not tailored to the effects of intervention studies in

education.” The authors argued that another basis for assessing the practical significance of a program

effect would be to compare it with the effects found for similar programs with similar study samples and

outcome measures. In their review of studies of intervention programs, Lipsey et al. (2012) concluded

that the mean of 227 reported effect sizes of interventions that focus on curriculum or broad instructional

programs across intervention programs studies is 0.13, and the median ES is 0.08. Therefore, in this

evaluation, an ES of 0.15 was considered an appropriate level for the threshold for a small practically

significant effect, d = 0.50 the threshold for a medium effect, and d = 0.80 the threshold for a large effect.

Effect Size Formula. In their article, Lipsey et al. (2012) indicated that adjusted mean values better

estimate the program’s effect by reducing any bias from the baseline differences and thus are the best

choices for use in a representation of that effect. The authors continued by stating:

When that representation involves the standard deviations, however, their values should not be

adjusted for the influence of the covariates. In virtually all such instances, the standard deviations are

used as estimates of the corresponding population standard deviations on the outcome variables

without consideration for the particular covariates that may have been used in estimating the difference

on the means (Lipsey et al., 2012).

Consequently, this study used the following formula to calculate the effect sizes: 𝐸𝑆 = (𝑀𝑡 −𝑀𝑐)/𝑆𝐷.

The 𝑀𝑡 and 𝑀𝑐 are the adjusted group means for the ELO STEP group (treatment) and the comparison

group respectively, and 𝑆𝐷 is the standard deviation of the pooled outcome scores (Borman, et al., 2002;

Thompson, 2002; Lipsey et al., 2012). The ES index is scale invariant or metric-free and can be used to

interpret the patterns of outcome measures’ differences between groups of students and across different

measures (Bloom et al., 2008; Lipsey et al, 2012). The Cohen’s d index has been shown to be upwardly

biased when the sample size is small (Institute of Education Sciences, 2016); therefore, caution should be

exercised in the interpretation of effect sizes when study samples are small.

Strengths and Limitations

Strengths. The findings presented in this report are based on a sound evaluation design and appropriate

analyses. The current evaluation used a non-equivalent comparison group design to investigate the

impacts of ELO STEP on the students’ academic achievement. It is important to note that isolating and

showing the effects of any program on students’ achievement is not an easy task in educational settings.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 12 ELO STEP Evaluation

The authors employed two control techniques for improving the internal validity of the findings and for

estimating a less biased effect of the ELO STEP program.

Control by study design. The key component of the quasi-experimental design is the use of

appropriate comparison groups when evaluating a program’s outcomes. In this evaluation, those

students who met the ELO STEP selection criteria in terms of achievement measures but did not

attend the program were used to be included in the comparison groups in all grades studied in this

report. The use of a comparison group for interpreting findings increases our confidence that

observed students’ outcomes are the result of participation in the program instead of a function of

students’ initial abilities.

Control by statistical techniques. A major problem with employing the quasi-experimental design

in educational settings is that the two groups of students may have important pre-existing

differences that may influence their achievement after exposure to an intervention, and this will

consequently threaten the internal validity of the findings (Gay & Airasian, 2000). Prominent

researchers (Campbell & Stanley, 1963; Judd, Smith, & Kidder, 1991) argued that in order to

observe the true effects of treatment in quasi-experimental designs, ANCOVA should be

conducted to control for confounding variables. Therefore, ANCOVA procedures were utilized

in this study to evaluate the outcomes of the ELO STEP program.

Limitations. As mentioned previously, this evaluation relied on a quasi-experimental design to address

the effectiveness of ELO STEP. However, many researchers argue that only an experimental design can

control each of the sources of internal validity (e.g., selection bias, maturation, history, attrition) in a study

(Campbell & Stanley1963; Babbie, 1992; Judd, Smith, & Kidder, 1991; Hedrick et al., 1993).

Consequently, causal conclusions about the impact of ELO STEP program on the reading and mathematics

achievement of students may not be inferred from the analyses presented in this report.

Also, some practical and logistical issues occurred during or after the implementation of ELO STEP. For

instance, the sample size of participants was reduced when some students in the treatment group

transferred to another school system for the 2018–2019 school year, and when some students had to be

excluded from the analysis due to lack of complete test scores data in math or reading, either before or

after the program. Moreover, this evaluation measured the effectiveness of ELO STEP by using students’

scores in MAP-M and MAP-R from spring and fall 2018. However, the gap in time between the end of

the summer program and the post-program test administration during the school year could have allowed

other factors, different from the program, to influence students’ performance. In the case of fall 2018

assessments, the window to administer MAP-R or MAP-M was almost two months (from September 11

to November 2, 2018). Participants who took these tests at the end of the assessment period rather than

at the beginning were more likely to be exposed to other factors, such as more instruction days, not

necessarily attributed to the program. Additionally, the effectiveness of this program was mainly assessed

by the academic performance of students in both reading and mathematics. Other program effects (e.g.,

having better critical skills, engagement, etc.) were not addressed by this study.

In closing, it is important to note that only one measure of students’ mathematics or reading achievement

(MAP test scores) was used to examine the effectiveness of ELO STEP. The overall results may change

if multiple measures were used in the outcome analyses.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 13 ELO STEP Evaluation

Results

Results are shown in the order of the evaluation questions.

Evaluation Question 1. What were the demographic characteristics of students who participated in

the summer 2018 ELO STEP by gender, race/ethnicity, services received, and focus group status?

Table 2 shows the characteristics of participants who attended at least 19 of the 25 days of instruction.

According to ELO STEP records, 730 students attended the program. There were slightly more male than

female participants (52% vs. 48%), and around 50% of attendees were Hispanic/Latino, and 30% were

Black or African American. Regarding services received in 2017–2018, about 67% of the participants

received FARMS services, and 23% of them received ESOL instruction.

Table 2. Characteristics of 2018 ELO STEP Participants by Grade (Gender, Race/Ethnicity, Services

Received, and Focus Group)

Total Grade 3 Grade 4 Grade 5

N % N % N % N %

Total 730 100.0 267 100.0 214 100.0 249 100.0

Gender

Female 351 48.1 122 45.7 104 48.6 125 50.2

Male 379 51.9 145 54.3 110 51.4 124 49.8

Race/ethnicity

Asian 90 12.3 26 9.7 25 11.7 39 15.7

Black or African American 222 30.4 82 30.7 69 32.2 71 28.5

Hispanic/Latino 362 49.6 135 50.6 111 51.9 116 46.6

White -- ≤ 5.0 -- 5.6 -- -- -- 6.4

Receipt of services in 2017–2018

ESOL 167 22.9 84 31.5 48 22.4 35 14.1

FARMS 488 66.8 183 68.5 141 65.9 164 65.9

Special education -- ≤ 5.0 -- 7.1 -- -- -- --

Focus groups

Non-FARMS All Other Student Groups

(Monitoring Group) 90 12.3 33 12.4 25 11.7 32 12.9

Non-FARMS Black or African American 86 11.8 28 10.5 28 13.1 30 12.0

Non-FARMS Hispanic/Latino 66 9.0 23 8.6 -- 9.3 23 9.2

FARMS All Other Student Groups 56 7.7 -- 6.4 -- -- 30 12.0

FARMS Black or African American 136 18.6 54 20.2 41 19.2 41 16.5

FARMS Hispanic/Latino 296 40.5 112 41.9 91 42.5 93 37.3

Note. Results are not reported (--) for groups with fewer than 10 students. For groups of between 10 and 20 students,

only the percentage rate is reported.

Table 2 above also shows participants’ characteristics disaggregated by grade. In general, almost similar

distributions were observed across grades, especially the distribution of Hispanic/Latino and Black or

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 14 ELO STEP Evaluation

African American participants, as well as the distribution of participants receiving FARMS services. Also,

the distribution of participants in each of the focus groups did not differ much across grades.

By Gender. Figure 2 shows the distribution of ELO STEP participants by gender and grade. Among third

graders, there was a slightly higher participation in ELO STEP of male students compared to female.

However, this difference decreased for higher grades; the gender distribution of participants in Grade 5

was almost the same.

By Race/Ethnicity. Figure 3 displays the distribution of ELO STEP participants by race/ethnicity and

grade. Across all grades, there was a high participation of Hispanic/Latino students, followed by Black

or African American. The percentages of Hispanic/Latino and Black or African American participants

were almost similar across all grades, and participation of students with “Other” race was slightly higher

in Grade 5 compared to Grades 3 and 4.

45.754.348.6 51.450.2 49.8

0

20

40

60

80

100

Female Male

Per

centa

ge

of

Par

tici

pan

ts

Grade 3 Grade 4 Grade 5

5.6

30.7

50.6

13.1

--

32.2

51.9

14.56.4

28.5

46.6

18.5

0

20

40

60

80

100

White Black or African

American

Hispanic/Latino OtherPer

centa

ge

of

Par

tici

pan

ts

Grade 3 Grade 4 Grade 5

Figure 2. Gender distribution of ELO STEP participants by grade.

Figure 3. Race/Ethnicity distribution of ELO STEP participants by grade.

Note. “Other” includes American Indian or Alaskan Native, Asian, and with two or more races. Results are not

reported (--) for groups with fewer than 10 students.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 15 ELO STEP Evaluation

By Services Received. Figure 4 shows the distribution of the services received by ELO STEP participants

during the 2017–2018 school year by grade. The percentage of participants receiving FARMS services

was almost similar across all grades; however, the percentage of participants receiving ESOL instruction

and special education was especially higher in Grade 3 in comparison to the other grades.

By Focus Groups. On average, 12% of ELO STEP attendees in each of the grades were neither Black or

African American nor Hispanic/Latino and not receiving FARMS services (also defined as “Monitoring

Group”), whereas the rest of participants (on average 88%) were categorized as other focus groups (Figure

5).

Figure 6 shows the distribution of ELO STEP participants by each focus group, excluding the Monitoring

Group. The percentage of participants in each of the focus groups was almost similar across grades, with

a maximum variation that did not exceed more than five percentage points. The only focus group with a

31.5

68.5

7.1

22.4

65.9

--

14.1

65.9

--0

20

40

60

80

100

ESOL FARMS Special Education

Per

centa

ge

of

Par

tici

pan

ts

Grade 3 Grade 4 Grade 5

87.6

12.4

88.3

11.7

87.1

12.9

0

20

40

60

80

100

Other Focus Groups Monitoring Group

Per

centa

ge

of

Par

tici

pan

ts

Grade 3 Grade 4 Grade 5

Figure 4. Distribution of Services Received by ELO STEP participants during

the school year 2017–2018 by grade.

Note. Results are not reported (--) for groups with fewer than 10 students.

Figure 5. Distribution of ELO STEP participants in focus and monitoring

group by grade.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 16 ELO STEP Evaluation

higher variation by grade was observed among “FARMS All Other Student Groups” (6.4% of third graders

versus 12% of fifth graders).

Evaluation Question 2. After controlling for pre-existing differences among students, were there

differences in reading and mathematics performance in the fall of 2018 among students receiving

the ELO STEP program compared to their peers who were invited but did not participate? Did

these differences vary by students receiving ESOL services and student focus groups?

As mentioned in the Methodology section, the program’s outcome was examined by comparing the

MAP-R and MAP-M scores from fall 2018 between ELO STEP participants and their comparison group.

Analyses were conducted separately by grade, for all students and by focus group, and effect sizes (ES)

were calculated to show if the program effect was practically significant.

Grade 5

Mathematics. Table 3 presents the adjusted means on fall MAP-M for the ELO STEP and comparison

groups. After adjusting for the effects of student characteristics and pre-test scores, the calculated adjusted

mean difference (1.64) for all Grade 5 students between the two groups (attendees and non-attendees) was

statistically (p < 0.01) and practically (ES = 0.17) significant in favor of the ELO STEP group.

In the case of participants receiving English for Speakers of Other Languages (ESOL) services, also a

positive and practical (ES = 0.15) significant difference was found, suggesting that participants in ELO

STEP who received ESOL services in the 2017–2018 school year performed better on fall MAP-M than

their nonparticipant peers (Table 3).

Disaggregation of the data by focus group showed that differences between participants and

nonparticipants were statistically (p < 0.01) and practically (|ES| ≥ 0.15) significant for only two of the six

comparisons made (Black or African American, and Hispanic/Latino students receiving FARMS services)

10.5 8.66.4

20.2

41.9

13.19.3

--

19.2

42.5

12.09.2

12.016.5

37.3

0

10

20

30

40

50

Non-FARMS Black or

African American

Non-FARMS

Hispanic/Latino

FARMS All Other

Student Groups

FARMS Black or

African American

FARMS

Hispanic/Latino

Per

cenat

ge

of

Par

tici

pan

ts

Grade 3 Grade 4 Grade 5

Figure 6. Distribution of ELO STEP participants in each of the focus groups by grade.

Note. Results are not reported (--) for groups with fewer than 10 students.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 17 ELO STEP Evaluation

in favor of the ELO STEP group. For these two focus groups, participants significantly outperformed

their nonparticipant peers on fall MAP-M. In the case of Hispanic/Latino participants who did not receive

FARMS services, the adjusted mean difference (2.43) was practically (ES = 0.24) but not statistically (p

> 0.05) significant, suggesting that the magnitude of difference was meaningful in educational settings.

For the rest of focus groups and the Monitoring Group, the analyses revealed neither a statistical nor a

practical significance between the participants and nonparticipants.

Table 3. Adjusted Means of Grade 5 MAP-M RIT Scores for ELO STEP Participants and their Peers

Means of MAP-M RIT Scores in Fall 2018 ELO STEP Program Effect

ELO STEP Participants Nonparticipants ELO STEP vs. non-ELO

STEP

N

Original

Mean

(SD)

Adjusted

Mean N

Original

Mean

(SD)

Adjusted

Mean

Adjusted

Mean

Difference

St.

Error

Effect

Size

(ES)

All Grade 5 Students 199 225.31

(8.71) 225.77 282

224.62

(10.35) 224.13 1.64** 0.51 0.17

English for Speakers

of Other Languages

(ESOL) participants

19 221.58

(6.65) 222.29 40

221.55

(6.93) 221.29 1.01 1.31 0.15

Non-FARMS

All Other Student

Groups

(Monitoring Group)

31 230.00

(8.39) 230.81 61

231.31

(10.17) 230.74 0.07 1.31 0.01

Non-FARMS Black

or African American 28

222.46

(8.64) 224.10 29

224.90

(10.86) 223.20 0.90 1.62 0.09

Non-FARMS

Hispanic/Latino 21

222.00

(9.30) 224.04 36

222.86

(10.52) 221.61 2.43 1.62 0.24

FARMS All Other

Student Groups 23

227.26

(10.17) 225.37 34

224.97

(9.32) 226.20 -0.83 1.49 -0.09

FARMS Black or

African American 32

224.97

(8.46) 223.50 28

217.64

(7.05) 218.86 4.64** 1.31 0.54

FARMS

Hispanic/Latino 64

224.84

(7.37) 224.89 94

222.76

(9.24) 222.72 2.17** 0.77 0.25

SD = Standard Deviation. *p < 0.05 and **p < 0.01

Bold indicates a practical significant difference (|ES| ≥ 0.15)

Reading. Table 4 shows that for all Grade 5 students, the adjusted mean difference (-0.08) was neither

statistically (p > 0.05) nor practically (ES = -0.01) significant. The findings suggest that, after adjusting

for the effects of student characteristics and pre-test scores, ELO STEP students as a group performed

similarly to their peers in the comparison group as measured by fall 2018 MAP-R.

Table 4 also shows the results for participants receiving ESOL services, who reported a positive and

practical (ES = 0.40) significant difference.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 18 ELO STEP Evaluation

The analysis by focus group did not find any statistically (p > 0.05) significant difference between ELO

STEP participants and nonparticipants in each of the comparisons made as measured by fall 2018 MAP-

R. However, the effect size analyses showed that the comparisons between the two groups (Monitoring

Group vs. Comparison group) and one of the five focus groups (FARMS Hispanic/Latino vs. Comparison)

showed practical significance. The Monitoring Group presented a negative adjusted mean difference

(-2.16), which was practically significant (ES = -0.20), suggesting that nonparticipants as a group

performed better than their participant peers. A contrasting result was found among the Hispanic/Latino

receiving FARMS services; the adjusted mean difference between the two was positive (1.54) and

practically significant (ES = 0.17).

Table 4. Adjusted Means of Grade 5 MAP-R RIT Scores for ELO STEP Participants and their Peers

Means of MAP-R RIT Scores in Fall 2018 ELO STEP Program

Effect

ELO STEP Participants Nonparticipants ELO STEP vs. non-ELO

STEP

N

Original

Mean

(SD)

Adjusted

Mean N

Original

Mean

(SD)

Adjusted

Mean

Adjusted

Mean

Difference

St.

Error

Effect

Size

(ES)

All Grade 5

Students 203

216.07

(8.73) 216.89 282

217.94

(10.93) 216.97 -0.08 0.59 -0.01

English for Speakers

of Other Languages

(ESOL) participants

19 209.47

(8.05) 211.43 39

208.74

(9.37) 207.88 3.55 2.20 0.40

Non-FARMS

All Other Student

Groups

(Monitoring Group)

30 220.84

(9.70) 222.73 60

226.05

(11.09) 224.89 -2.16 1.30 -0.20

Non-FARMS Black

or African American 29

216.97

(6.43) 217.93 28

220.11

(9.66) 218.31 -0.38 1.59 -0.05

Non-FARMS

Hispanic/Latino 21

212.10

(11.94) 214.28 37

216.43

(11.14) 215.12 -0.84 2.20 -0.07

FARMS All Other

Student Groups 25

213.92

(9.60) 215.67 34

216.24

(8.37) 214.88 0.79 1.87 0.09

FARMS Black or

African American 33

218.00

(7.75) 216.70 28

214.71

(7.45) 216.24 0.46 1.33 0.06

FARMS

Hispanic/Latino 65

214.52

(6.90) 215.11 95

214.06

(9.97) 213.58 1.54 0.98 0.17

SD = Standard Deviation. *p < 0.05 and **p < 0.01

Bold indicates a practical significant difference (|ES| ≥ 0.15)

Grade 4

Mathematics. Table 5 presents the adjusted means on fall MAP-M for the ELO STEP and comparison

groups after controlling for effects of student characteristics and initial differences on pre-program

measures. The calculated adjusted mean difference (0.93) for all Grade 4 students between the two groups

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 19 ELO STEP Evaluation

(participants and nonparticipants) was neither statistically (p > 0.05) nor practically (ES = 0.11)

significant. However, in the case of participants receiving ESOL services, a positive and practical (ES =

0.30) significant difference was found (Table 5).

When the data were disaggregated by focus group, the comparison between participants and

nonparticipants revealed mixed results. For the first two comparisons (Table 5), the adjusted mean

differences were both statistically (p < 0.05) and practically significant (|ES| ≥ 0.15) for the Monitoring

Group and Non-FARMS Black or African American students, revealing that participants performed better

on fall MAP-M than their nonparticipant peers. Moreover, the adjusted mean difference (Table 5) for

Non-FARMS Hispanic/Latino was not statistically but practically significant in favor of ELO STEP

group. Conversely, for all participants receiving FARMS (excluding Hispanic/Latino), the adjusted mean

differences were not statistically but practically (|ES| ≥ 0.15) significant in favor of the comparison group.

Table 5. Adjusted Means of Grade 4 MAP-M RIT Scores for ELO STEP Participants and their Peers

Means of MAP-M RIT Scores in Fall 2018 ELO STEP Program

Effect

ELO STEP Participants Nonparticipants ELO STEP vs. non-ELO

STEP

N

Original

Mean

(SD)

Adjusted

Mean N

Original

Mean

(SD)

Adjusted

Mean

Adjusted

Mean

Difference

St.

Error

Effect

Size

(ES)

All Grade 4

Students 195

211.40

(8.52) 211.92 291

211.30

(8.58) 210.99 0.93 0.58 0.11

English for Speakers

of Other Languages

(ESOL) participants

43 209.91

(6.64) 210.44 67

207.66

(7.46) 208.29 2.14 1.19 0.30

Non-FARMS

All Other Student

Groups

(Monitoring Group)

22 214.14

(8.70) 218.77 61

215.98

(10.17) 215.00 3.77* 1.66 0.38

Non-FARMS Black

or African American 27

213.85

(8.12) 214.73 20

212.85

(8.29) 211.20 3.53* 1.35 0.44

Non-FARMS

Hispanic/Latino 19

209.89

(9.23) 210.39 36 207.86

208.06

(6.30) 2.52 1.58 0.34

FARMS All Other

Student Groups 9

210.89

(5.58) 210.58 14

211.57

(8.92) 212.05 -1.47 1.97 -0.19

FARMS Black or

African American 37

211.42

(6.88) 210.64 37

211.14

(9.37) 212.06 -1.42 1.22 -0.17

FARMS

Hispanic/Latino 81

210.25

(9.23) 210.56 123

209.68

(7.11) 209.51 1.05 0.94 0.13

SD = Standard Deviation. *p < 0.05 and **p < 0.01.

Bold indicates a practical significant difference (|ES| ≥ 0.15)

Note. Due to a small sample, results for "FARMS All Other Student Groups" were validated via t-tests.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 20 ELO STEP Evaluation

Reading. Table 6 shows that the adjusted mean difference of all Grade 4 students (-0.07) was neither

statistically (p > 0.05) nor practically (ES = -0.01) significant, suggesting that ELO STEP students as a

group performed similarly to their peers in the comparison group as measured by fall 2018 MAP-R. Also,

for participants receiving ESOL services, neither practical nor statistically significant difference was

found (Table 6).

Results by focus group were mixed and in both directions (either negative or positive); however, none of

the comparisons among groups were statistically significant (p > 0.05). The calculated effect sizes from

the focus group analyses (ELO STEP group vs. comparison group) revealed that two of the six

comparisons made were practically significant (|ES| ≥ 0.15). In the case of Hispanic/Latino participants

not receiving FARMS services, the analysis showed that they performed better on fall MAP-R than their

nonparticipant peers (ES = 0.24). Conversely, in the case of all FARMS recipients (excluding Black or

African American and Hispanic/Latino), reading performance of participants was lower compared to

nonparticipants.

Table 6. Adjusted Means of Grade 4 MAP-R RIT Scores for ELO STEP Participants and their Peers

Means of MAP-R RIT Scores in Fall 2018 ELO STEP Program

Effect

ELO STEP Participants Nonparticipants ELO STEP vs. non-ELO

STEP

N

Original

Mean

(SD)

Adjusted

Mean N

Original

Mean

(SD)

Adjusted

Mean

Adjusted

Mean

Difference

St.

Error

Effect

Size

(ES)

All Grade 4

Students 195

208.70

(9.21) 209.18 290

209.52

(10.04) 209.25 -0.07 0.62 -0.01

English for Speakers

of Other Languages

(ESOL) participants

43 203.51

(8.51) 204.09 67

202.13

(8.28) 202.91 1.18 1.46 0.14

Non-FARMS

All Other Student

Groups

(Monitoring Group)

22 211.50

(9.66) 215.01 61

216.27

(10.10) 215.42 -0.41 1.82 -0.04

Non-FARMS Black

or African American 27

210.63

(10.87) 211.68 20

211.90

(10.87) 210.24 1.44 1.80 0.13

Non-FARMS

Hispanic/Latino 19

209.58

(10.83) 210.20 36

208.25

(9.73) 207.83 2.37 1.84 0.24

FARMS All Other

Student Groups 9

205.44

(11.98) 204.85 14

210.00

(8.74) 210.27 -5.42 3.16 -0.54

FARMS Black or

African American 37

209.08

(9.32) 209.28 37

208.89

(9.68) 208.65 0.64 1.62 0.07

FARMS

Hispanic/Latino 81

207.27

(7.49) 207.25 122

206.28

(8.56) 206.29 0.96 0.86 0.12

SD = Standard Deviation. *p < 0.05 and **p < 0.01.

Bold indicates a practical significant difference (|ES| ≥ 0.15)

Note. Due to a small sample, results for "FARMS All Other Student Groups" were validated via t-tests.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 21 ELO STEP Evaluation

Grade 3

Mathematics. Table 7 presents the adjusted means on fall MAP-M for the ELO STEP and comparison

groups after controlling for student characteristics and initial differences on pre-program measures. The

calculated adjusted mean difference (0.95) for all Grade 3 students between the two groups (participants

and nonparticipants) was statistically (p < 0.05) but not practically (|ES| ≤ 0.15) significant in favor of the

ELO STEP group. However, in the case of participants receiving ESOL services, a positive and practical

(ES = 0.16) significant difference was found (Table 7).

When the analysis was disaggregated by focus group, none of the adjusted mean differences between the

two groups of students was statistically significant (p > 0.05). However, the calculated effect sizes from

the focus group analyses (ELO STEP group vs. comparison group) revealed that three of the six

comparisons made were practically significant with ES greater than 0.15: (a) Non-FARMS All Other

Student Groups (Monitoring Group); (b) Non-FARMS Black or African American; and (c) FARMS All

Other Student Groups. In the case of these three particular subgroups, participants performed better than

their nonparticipant peers on fall MAP-M. For the rest of focus groups, neither a statistical nor a practical

significance was found for adjusted mean differences between attendees and non-attendees.

Table 7. Adjusted Means of Grade 3 MAP-M RIT Scores for ELO STEP Participants and their Peers

Means of MAP-M RIT Scores in Fall 2018 ELO STEP Program Effect

ELO STEP Participants Nonparticipants ELO STEP vs. non-ELO

STEP

N

Original

Mean

(SD)

Adjusted

Mean N

Original

Mean

(SD)

Adjusted

Mean

Adjusted

Mean

Difference

St.

Error

Effect

Size

(ES)

All Grade 3 Students 250 198.63

(7.90) 198.94 342

198.09

(8.61) 197.99 0.95* 0.46 0.11

English for Speakers

of Other Languages

(ESOL) participants

78 195.46

(6.65) 195.58 95

193.90

(7.05) 194.49 1.09 0.81 0.16

Non-FARMS

All Other Student

Groups

(Monitoring Group)

32 205.16

(7.09) 204.63 59

202.07

(9.93) 202.49 2.14 1.29 0.24

Non-FARMS Black

or African American 26

201.35

(6.15) 200.67 33

197.70

(8.60) 198.36 2.31 1.18 0.30

Non-FARMS

Hispanic/Latino 23

199.74

(8.71) 198.96 37

197.86

(7.38) 198.16 0.81 1.52 0.10

FARMS All Other

Student Groups --

201.18

(8.10) 202.15 32

198.88

(9.62) 198.39 3.76 2.14 0.41

FARMS Black or

African American 48

197.35

(8.36) 197.70 52

198.60

(8.42) 198.24 -0.54 1.03 -0.06

FARMS

Hispanic/Latino 104

195.90

(6.61) 196.27 129

196.06

(7.55) 195.97 0.30 0.70 0.04

SD = Standard Deviation. *p < 0.05 and **p < 0.01.

Bold indicates a practical significant difference (|ES| ≥ 0.15)

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 22 ELO STEP Evaluation

Reading. For all Grade 3 students, Table 8 shows that the adjusted mean difference (0.04) on fall

MAP-R for participants and nonparticipants was neither statistically (p > 0.05) nor practically (ES = 0)

significant, suggesting that ELO STEP third graders as a group performed similarly to their peers in the

comparison group as measured by fall 2018 MAP-R. However, for participants receiving ESOL services,

a positive and practical (ES = 0.22) significant difference was found (Table 8).

Similar to the previous analysis by focus groups for other grades, results were mixed. None of the adjusted

mean differences in each of the six comparisons made was statistically significant (p > 0.05). However,

the calculated effect sizes from the focus group analyses (ELO STEP group vs. comparison group)

revealed that two of the six comparisons made were practically significant (|ES| ≥ 0.15). The adjusted

mean difference between the two groups of third graders (ELO STEP vs. comparison group) in the

Monitoring Group was practically significant (ES = -0.34) in favor of the comparison group. Conversely,

third graders who were Black or African American and received FARMS services had a positive (3.53)

and practically significant (ES = 0.33) adjusted mean difference.

Table 8. Adjusted Means of Grade 3 MAP-R RIT Scores for ELO STEP Participants and their Peers

Means of MAP-R RIT Scores in Fall 2018 ELO STEP Program Effect

ELO STEP Participants Nonparticipants ELO STEP vs. non-ELO

STEP

N

Original

Mean

(SD)

Adjusted

Mean N

Original

Mean

(SD)

Adjusted

Mean

Adjusted

Mean

Difference

St.

Error

Effect

Size

(ES)

All Grade 3

Students 252

197.70

(11.04) 198.19 357

198.52

(12.24) 198.15 0.04 0.91 0.00

English for Speakers

of Other Languages

(ESOL) participants

79 193.30

(11.17) 193.47 96

191.23

(10.12) 191.09 2.38 1.61 0.22

Non-FARMS

All Other Student

Groups

(Monitoring Group)

32 202.69

(10.38) 202.69 71

206.52

(11.63) 206.52 -3.83 2.41 -0.34

Non-FARMS Black

or African American 26

199.04

(9.50) 199.13 34

199.18

(11.97) 199.11 0.02 2.87 0.00

Non-FARMS

Hispanic/Latino 23

198.57

(13.58) 198.56 38

198.61

(10.87) 198.61 -0.04 3.18 0.00

FARMS All Other

Student Groups 17

199.88

(9.50) 199.83 32

198.81

(11.70) 198.84 0.99 3.15 0.09

FARMS Black or

African American 48

200.04

(9.70) 199.95 52

196.33

(11.59) 196.42 3.53 2.13 0.33

FARMS

Hispanic/Latino 106

194.26

(10.96) 194.30 130

194.75

(11.49) 194.72 -0.42 1.47 -0.04

SD = Standard Deviation. *p < 0.05 and **p < 0.01

Bold indicates a practical significant difference (|ES| ≥ 0.15)

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 23 ELO STEP Evaluation

Comparison of the Program Outcome in Mathematics and Reading across Grades

The outcome of the ELO STEP program on student reading and mathematics performance, measured by

effect sizes, is presented below.

Mathematics. The effect sizes in mathematics ranged from 0.11 for third and fourth graders to 0.17 for

fifth graders, as Figure 7 shows below. There was evidence of program benefits in mathematics for

participants entering Grade 5 since the effect size associated to their adjusted mean difference was

statistically (p < 0.01) and practically significant (ES = 0.17), suggesting that the magnitude of the

difference was meaningful in educational settings.

When the analysis focused on the effect of ELO STEP in specific subgroups, positive results in

mathematics performance were observed across all grades for participants receiving ESOL instruction

during the 2017–2018 school year. As Figure 8 shows below, the calculated effect sizes for ELO STEP

participants receiving ESOL services were practically (|ES| ≥ 0.15) significant for all grades.

0.11 0.11

0.17**

0.00

0.10

0.20

0.30

Eff

ect

size

Grade 3 Grade 4 Grade 5

0.16

0.30

0.15

0.00

0.10

0.20

0.30

Eff

ect

size

Grade 3 Grade 4 Grade 5

Figure 7. Effect size of fall MAP-M scores for all ELO STEP participants.

Note. Bold indicates a practically significant difference. *p < 0.05, and **p < 0.01

Figure 8. Effect size of fall MAP-M scores for all ELO STEP participants

receiving ESOL services.

Note. Bold indicates a practically significant difference.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 24 ELO STEP Evaluation

The program’s outcome in mathematics by grade level and focus group, measured by effect sizes, are

illustrated in Figure 9. All the focus groups resulted in at least one grade with a practically significant

adjusted mean difference between attendees and non-attendees. The focus group with the largest effect

sizes was Black or African American fifth graders who received FARMS services and who participated

in ELO STEP. This group had a medium effect size (ES = 0.54), which was statistically significant

(p < 0.01), suggesting that they performed better on fall MAP-M compared to their nonparticipant peers.

Other subgroups with positive effect sizes, but not all statistically significant, are in bold in the first

quadrant of Figure 9 below. Also, two subgroup analyses, both fourth graders, revealed a negative but

small effect size, which was not statically significant (p > 0.05): (a) FARMS All Other Student Groups;

and (b) FARMS Black or African American.

Reading. As Figure 10 shows below, the effect sizes associated with the ELO STEP program in reading

were almost zero for fourth and fifth graders, and zero for third graders. As a result, regardless of the

grade level, ELO STEP participants performed as well as their nonparticipant peers on fall MAP-R.

0.240.30

0.10

0.41

-0.06

0.04

0.38*0.44*

0.34

-0.19 -0.17

0.13

0.010.09

0.24

-0.09

0.54**

0.25**

-0.30

-0.15

0.00

0.15

0.30

0.45

0.60

Non-FARMS All

Other Student

Groups

(Monitoring Group)

Non-FARMS Black

or African

American

Non-FARMS

Hispanic/Latino

FARMS All Other

Student Groups

FARMS Black or

African American

FARMS

Hispanic/Latino

Eff

ect

size

Grade 3 Grade 4 Grade 5

-0.01 -0.01-0.02

-0.01

0.00

0.01

0.02

Eff

ect

size

Grade 4 Grade 5

Figure 9. Effect size of fall MAP-M scores for ELO STEP participants by focus group.

Note. Bold indicates a practically significant difference. *p < 0.05, and **p < 0.01

Figure 10. Effect size of fall MAP-R scores for all ELO STEP participants.

Note. Grade 3 reported an overall effect size of zero.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 25 ELO STEP Evaluation

However, positive results in reading performance were observed, at least for two grades (Grades 3 and 5),

for participants receiving ESOL instruction during the 2017–2018 school year. Figure 11 shows the

calculated effect sizes for participants receiving ESOL services, practically (|ES| ≥ 0.15) significant for

Grades 3 and 5.

The outcome of the ELO STEP program in reading, as measured by effect sizes, were disaggregated by

focus group in Figure 12. All focus groups reported at least one grade with a practically significant

adjusted mean difference between attendees and non-attendees, with the exception of Non-FARMS Black

or African American students. Regardless of the grade level and participation in ELO STEP, this

particular subgroup of students performed the same on fall MAP-R. The group with the largest effect

size, although not statistically significant, was fourth graders from “All Other Student Groups” receiving

FARMS services who had a medium negative effect size (-0.54), suggesting that participants’ performance

on fall MAP-R was poorer than their nonparticipant peers. Smaller negative effect sizes, not statistically

significant, were also found for third and fifth graders in the Monitoring Group. These findings revealed

that participants from racial/ethnic groups other than Black or African American and Hispanic/Latino

performed poorer on reading than their nonparticipant peers.

Noteworthy positive effect sizes, none of them statistically significant, were also observed in the following

subgroups: (a) fourth graders Non-FARMS Hispanic/Latino; (b) third graders FARMS Black or African

American; and (c) fifth graders FARMS Hispanic/Latino (Figure 12). These findings revealed that no

overall positive conclusion could be inferred; however, positive effects of the program on reading

performance can be associated only to specific grade levels and student characteristics.

0.22

0.14

0.40

0.00

0.10

0.20

0.30

0.40

Eff

ect

size

Grade 3 Grade 4 Grade 5

Figure 11. Effect size of fall MAP-R scores for all ELO STEP participants

receiving ESOL services.

Note. Bold indicates a practically significant difference.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 26 ELO STEP Evaluation

Summary

The outcomes of the ELO STEP program varied by content area and by grade, with more positive findings

in mathematics than in reading. As Table 9 summarizes below, in mathematics, the benefits of ELO STEP

were evident (statistically and practically significant) for fifth graders overall, particularly for Black or

African American and Hispanic/Latino students receiving FARMS services, as well as for all fourth

graders not receiving FARMS services (excluding Hispanic/Latino). For some other subgroups, positive

and practical (|ES| ≥ 0.15) significant adjusted mean differences were found (Table 9), including: (a) third

graders FARMS All Other Student Groups, Non-FARMS Black or African, and Monitoring Group; and

(b) fourth and fifth graders Non-FARMS Hispanic/Latino students.

In the case of participants receiving ESOL services, also positive and practical (|ES| ≥ 0.15) significant

differences (but not statistically significant) were found across all grades, suggesting that participants in

ELO STEP who received ESOL services in the 2017–2018 school year performed better on fall MAP-M

than their nonparticipant peers.

Conversely, a small and negative practical significance results were found for fourth graders receiving

FARMS services (excluding Hispanic/Latino), indicating that nonparticipants in Grade 4 did better on fall

MAP-M than participants; however, these results were not statistically significant (p > 0.05), as Table 9

shows below.

-0.34

0.00 0.000.09

0.33

-0.04-0.04

0.13

0.24

-0.54

0.070.12

-0.20

-0.05 -0.07

0.09 0.06

0.17

-0.75

-0.55

-0.35

-0.15

0.05

0.25

0.45

Non-FARMS All

Other Student

Groups

(Monitoring Group)

Non-FARMS Black

or African

American

Non-FARMS

Hispanic/Latino

FARMS All Other

Student Groups

FARMS Black or

African American

FARMS

Hispanic/Latino

Eff

ect

size

Grade 3 Grade 4 Grade 5

Figure 12. Effect size of fall MAP-R scores for ELO STEP participants by focus group.

Note. Bold indicates a practically significant difference. None of the effect sizes was statistically significant.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 27 ELO STEP Evaluation

Table 9. Statistical Significance and Effect Sizes for ELO STEP Participants vs. Comparison Students in

Math

Group Measure Grade 3 Grade 4 Grade 5

All Students MAP-M *

d = 0.11

d = 0.11

**

d = 0.17

English for Speakers of Other Languages (ESOL)

participants MAP-M

d = 0.16

d = 0.30

d = 0.15

Non-FARMS All Other Student Groups

(Monitoring Group) MAP-M

d = 0.24

*

d = 0.38

d = 0.01

Non-FARMS Black or African American MAP-M

d = 0.30

*

d = 0.44

d = 0.09

Non-FARMS Hispanic/Latino MAP-M

d = 0.10

d = 0.34

d = 0.24

FARMS All Other Student Groups MAP-M

d = 0.41

d = -0.19

d = -0.09

FARMS Black or African American MAP-M

d = -0.06

d = -0.17

**

d = 0.54

FARMS Hispanic/Latino MAP-M

d = 0.04

d = 0.13

**

d = 0.25

*p < 0.05, **p < 0.01

Note. Bold indicates a practical significant difference (|ES| ≥ 0.15)

Shaded cells indicate a positive effect, statistically and practically significant.

In reading, even though the outcome of ELO STEP was not statistically significant for any of the grades

or by student subgroups, some differences were practically significant, which is relevant in educational

settings (Table 10). The benefits of ELO STEP were not evident for all students, but four subgroups had

practically significant differences: (a) third and fifth graders receiving ESOL services; (b) fifth graders

FARMS Hispanic/Latino; (c) fourth graders Non-FARMS Hispanic/Latino; and (d) third grader FARMS

Black or African American students.

Conversely, negative practical significance differences were found for some subgroups, indicating that

nonparticipants did better on fall MAP-M than participants: (a) fifth and third graders in the Monitoring

group, and (b) fourth graders FARMS All Other Student Groups.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 28 ELO STEP Evaluation

Table 10. Statistical Significance and Effect Sizes for ELO STEP Participants vs. Comparison Students

in Reading

Group Measure Grade 3 Grade 4 Grade 5

All Students MAP-R

d = 0.00

d = -0.01

d = -0.01

English for Speakers of Other Languages (ESOL)

participants MAP-R

d = 0.22

d = 0.14

d = 0.40

Non-FARMS All Other Student Groups

(Monitoring Group) MAP-R

d = -0.34

d = -0.04

d = -0.20

Non-FARMS Black or African American MAP-R

d = 0.00

d = 0.13

d = -0.05

Non-FARMS Hispanic/Latino MAP-R

d = 0.00

d = 0.24

d = -0.07

FARMS All Other Student Groups MAP-R

d = 0.09

d = -0.54

d = 0.09

FARMS Black or African American MAP-R

d = 0.33

d = 0.07

d = 0.06

FARMS Hispanic/Latino MAP-R

d = -0.04

d = 0.12

d = 0.17

*p < 0.05, **p < 0.01

Note. Bold indicates a practical significant difference (|ES| ≥ 0.15)

Shaded cells indicate a positive effect, statistically and practically significant.

Discussion and Conclusion

Decision makers in K-12 education have been facing higher public expectations, uncertain budgets, and

demanding constituencies in the last few years. There is an urgent need for sound evaluative information

to help school leaders to make decisions based on evaluative information regarding the expansion,

continuation, and improvement of intervention programs in the school system.

After the Every Student Succeeds Act (ESSA), passed in 2015, states are more encouraged to focus on all

students, not only on struggling or at-risk children but also on students with different skill sets (Wright,

2017). ELO STEP is one example of these initiatives that intended to close the achievement gap between

high achievers from low-income families and their affluent peers by providing accelerated and enrichment

instructional opportunities. As stated by the literature, the achievement gap problem among high

achievers has been largely documented, suggesting that students in high poverty schools tend to be

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 29 ELO STEP Evaluation

underrepresented among high achievers (Smarick, 2013) and, sometimes, these gaps close slowly or grow

over the past generation (Plucker at al., 2013).

The literature has also remarked on a number of features that are associated with effective summer

programs. Most of those features are aligned with the characteristics of the 2018 ELO STEP: (a) structured

instruction (in reading, writing, or mathematics) consistent with state and local content standards and

matched with students’ academic needs; (b) adequate intensity and duration of instruction (three hours a

day, five days per week, for five to six weeks); (c) lower student-to-adult ratios than those observed in the

regular school year; and (d) academic content supplemented with enrichment activities (Augustine et al.,

2013). For instance, the 2018 ELO STEP has provided a structured curriculum in math, science, and

information literacy in the morning, and supplemented it with academic content for some students (such

as digital literacy and coding) during the afternoon. Also, the curriculum was provided continuously for

five weeks to 785 students from 21 Title I schools, with an average class size of 14 students per section.

Findings from this evaluation indicated that the ELO STEP program might be used as a potential

mechanism to improve outcomes for above average students. Similar to the findings from the first

evaluation of ELO STEP in the summer of 2015, the overall outcome of the 2018 ELO STEP was more

evident in mathematics than in reading, and particularly for Grade 5 participants. A possible reason for

this positive effect of performance in mathematics might be attributed to the curriculum structure and

schedule. As indicated in Appendix B, the 2018 ELO STEP curriculum included math, which was taught

daily for 75 minutes. However, literacy activities (information literacy) were only offered 30 minutes

each day.

Regarding particular student subgroups, results from this evaluation were mixed, whereas the first

evaluation (Grade 3 only) found exclusively positive effect sizes in mathematics and reading for all the

student subgroups analyzed (race/ethnicity and FARMS status). The current evaluation found a variety

of results by student subgroups, with only one similarity with the first outcome evaluation of ELO STEP:

Hispanic/Latino participants, regardless of their grade level, performed better in mathematics than their

nonparticipant peers. For other subgroups in the current evaluation, positive and practically significant

effects varied depending on the race/ethnicity, grade, and service status of participants:

Black or African American. Comparative analyses (ELO STEP participants and nonparticipants)

revealed that Black or African American participants in Grades 3 and 4, who did not receive

FARMS services, performed better in mathematics, as well as Black or African American

participants in Grade 5 who received FARMS services. For reading, only Black or African

American participants in Grade 3 who received FARMS services performed better than their peers

in the comparison group.

Hispanic/Latino. Similar comparative analyses found that Hispanic/Latino participants in Grade

4 who did not receive FARMS services performed better than their peers (nonparticipants) in math.

Better performance was observed for Hispanic/Latino participants in Grade 5 regardless of their

FARMS status using parallel analyses. In reading, the comparison of the two groups of students

(participants and nonparticipants) found that Hispanic/Latino participants in Grade 4 who did not

receive FARMS services performed better than their peers. The same comparison found similar

results for Hispanic/Latino participants in Grade 5 who did receive FARMS services.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 30 ELO STEP Evaluation

FARMS recipients. For mathematics, the comparative analyses indicated that participants who

were not FARMS recipients, regardless of their grade level, tended to perform better in

mathematics than participants who received FARMS services. For reading, no overall conclusion

can be inferred by students’ FARMS status.

ESOL recipients. Comparative analyses, for mathematics, found that participants who received

ESOL services performed better in mathematics across all grades than participants who did not.

For reading, the same conclusion was observed but only for participants in Grade 3 and 5.

In closing, establishing the criteria of success and failure is very difficult in evaluating the impact of any

intervention programs. Borus (1979) asserts that the list of outcomes is always incomplete since only

those criteria of success, which can be measured, are considered. The effectiveness of ELO STEP in this

evaluation was assessed only by examining the performance of students in the area of reading and

mathematics through only one measure of students’ achievement (MAP test scores). Other program goals,

such as critical thinking, knowledge of science concepts, or students’ attitudes to complex tasks, cannot

be addressed by the analysis due to the lack of existing measures.

Recommendations

Based on the findings of this evaluation, the following recommendations are suggested:

Continue with the use of the instructional mathematics practices for ELO STEP for future positive

effects in mathematics. However, it is important to explore the reasons why the program had a

small negative effect size in Grade 4 participants who received FARMS services but who were not

Hispanic/Latino, even though these results were not statistically significant.

Examine the ELO STEP curriculum and make it more interdisciplinary. Similar to the

recommendation proposed in the first evaluation of ELO STEP (2016), the shorter instructional

time in literacy might be the reason why there was no apparent effect of the program on students’

overall reading performance.

Provide opportunity for and encourage teachers to conduct Action Research (AR) in their

classroom, which will allow them to collect data from their students via multiple measures, identify

patterns, collaborate with their peers, and think about different ways to improve instruction

delivery, student learning, and engagement. Guidelines and example about how to put AR in

practice can be included in the ELO STEP training that takes place before the program starts.

Increase the number of students participating in ELO STEP via outreach activities. Among the

1,824 students invited to the program, only less than the half (43%) chose to participate.

Include in future studies a formative evaluation component that incorporates students’, teachers’,

and parents’ feedback regarding their experience in the program. Program stakeholders’

experiences can help the administration and program staff identifying what components or

practices in the program are effective or need to be improved. Also, extend this recommendation

to parents of students who were invited but decided not to participate in order to understand the

reasons for their decision.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 31 ELO STEP Evaluation

Replicate the study over time using different student population to see if the observed findings are

stable.

Ask parents or guardians to sign a confirmation form at the time of ELO STEP registration

committing their child or children to attend, similar to what is done for the ELO SAIL program.

Administer a test, before and after the program, assessing all the content areas and skills that the

program intends to enrich (e.g., critical thinking, knowledge of science concepts, students’

attitudes to complex tasks, etc.). Results from these tests can provide indicators, relevant and

specific to the ELO STEP program, which can be used to measure the impact of the intervention.

Acknowledgements

The authors would like to thank Ms. Natalie Wolanin, logistics support specialist, Program Evaluation

Unit (PEU), for her careful and thoughtful review of the report; Ms. Nichelle Owens, supervisor, Division

of Title I and Early Childhood Programs and Services, for her collaboration in providing pertinent

information about the 2018 ELO STEP program and feedback to the draft report; and Mrs. Kurshanna

Dean, supervisor, Division of Accelerated and Enriched Instruction, for her feedback to the draft report as

well.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 32 ELO STEP Evaluation

References

American Psychological Association. (2001). Publication manual of the American Psychological

Association. Washington, DC: Author. Austin P.C, 2011.

Augustine, C., McCombs, J. S., Schwartz, H. L., & Zakaras, L. (2013). Getting to Work on Summer

Learning: Recommended Practices for Success, Santa Monica, Calif.: RAND Corporation, RR-

366-WF.

Augustine, C., McCombs, J. S., Pane, J., Schwartz, H., Schweig, J., McEachin, A., & Siler-Evans, K.

(2016). Learning from Summer: Effects of Voluntary Summer Learning Programs on Low-Income

Urban Youth, Santa Monica, CA: RAND Corporation, RR-1557-WF.

Austin, P. C. (2011). An introduction to propensity-score methods for reducing confounding in

observational studies. Multivariate Behavioral Research. Vol. 46.

Babbie, Earl. (1992). The practice of social research, 6th Edition. Belmont, CA: Wadsworth.

Bloom, H. S., Hill, C. J., Black, A. R., & Lipsey, M. W. (2008). Performance trajectories and performance

gaps as achievement effect-size benchmarks for educational interventions. Journal of Research on

Educational Effectiveness, 1(4), 289-328.

Bordens, K. S. & Abbott, B. B. (2008). Research design and methods: A process approach, 7th Edition.

New York, NY: McGraw-Hill.

Borman, G. D., Hewes, G. M., Overman, L. T., & Brown, S. (2003). Comprehensive school reform and

achievement: A meta-analysis. Review of Educational Research, 73(2), 125-230.

Borus, ME. (1979). Measuring the impact of employment-related social programs. W.E.UPJOHN

Institute for Employment research.

Campbell, D. T. & Stanley, J. C. (1963). Experimental and Quasi-experimental designs for research.

Chicago, IL: Rand McNally College Publishing.

Carver, R. P. (1993). The case against statistical significance testing, revisited. Journal of Experimental

Education, 61(4), 287–292.

Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd Ed.). Hillsdale, NJ: Lawrence

Earlbaum Associates.

Cooper, H., Charlton, K., Valentine, J., Muhlenbruck, L., & Borman, G. (2000). Making the Most of

Summer School: A Meta-Analytic and Narrative Review. Monographs of the Society for Research

in Child Development, 65(1), I-127

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 33 ELO STEP Evaluation

Cooper-Martin, E., & Zhao, H. (2016). Impact of the Extended Learning Opportunities Summer

Adventures in Learning (ELO SAIL) Program on Student Academic Performance: Part 2, Results

for Summer 2015. Rockville, MD: Montgomery County Public Schools.

Fairfax County Public Schools. (2018). Young Scholars (K-12). Retrieved November 26, 2018 from FCPS

website. https://www.fcps.edu/index.php/academics/middle-school-academics-7-8/advanced-

academics/young-scholars-k-12

Fraenkel, J. R., & Wallen, N. E. (2009). How to design and evaluate research in education. New York,

NY: McGraw-Hill.

Gay, L. R. & Airasian, P. W. (2000). Educational research: Competencies for analysis and application

(6th ed.). Englewood Cliffs, NJ: Prentice Hall.

Hedrick, T. E., Bickman, L., & Rog, D. J. (1993). Applied research design: A practical guide. Newbury

Park, CA: Sage Publications, Inc.

Institute of Education Sciences (2016). What Works Clearinghouse: Procedures and Standards Handbook

Version 3.0. Retrieved December 4, 2018 from:

http://ies.ed.gov/ncee/wwc/DocumentSum.aspx?sid=19

Judd C. M., Smith, E. R. & Kidder, L. H. (1991). Research methods in social relations. San Francisco,

CA: Holt, Rinehart and Winston, Inc.

Kaushal, N., Magnuson, K. A., & Waldfogel, J. (2011). “How Is Family Income Related to Investments

in Children’s Learning?” in Greg J. Duncan and Richard J. Murnane, eds., Whither Opportunity?

Rising Inequality and the Uncertain Life Chances of Low-Income Children, New York: Russell

Sage.

Kirk, R. E. (1995). Experimental design: Procedures for the behavioral sciences. New York, NY:

Brooks/Cole Publishing Company.

Kline, R. (2004). Beyond significance testing: Reforming data analysis methods in behavioral research.

Washington, DC: American Psychological Association.

Lang, E. (2018). Memorandum to principals of Title I schools, April 12, 2018. 2018 Extended Learning

Opportunities Summer Title I Enrichment Program. Rockville, MD: Montgomery County Public

Schools.

Levin, J. R. (1993). Statistical significance testing from three perspectives. Journal of Experimental

Education, 61(4), 378–381.

Lipsey, M.W., & Wilson, D.B. (1993). The efficacy of psychological, educational, and behavioral

treatment. Confirmation from meta-analysis. American Psychologist, 48, 1181-1209.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 34 ELO STEP Evaluation

Lipsey, M. W., Puzio, K., Yun, C., Hebert, M. A., Steinka-Fry, K., Cole, M. W., Roberts, M., Anthony,

K. S., & Busick, M. D. (2012). Translating the statistical representation of the effects of education

interventions into more readily interpretable forms (NCSER 2013-3000). Washington, DC:

National Center for Special Education Research, Institute of Education Sciences, U.S. Department

of Education.

Loveless, T., Parkas, S., & Duffett, A. (2008). High-Achieving Students in the Era of NCLB. Washington,

DC: Thomas B. Fordham Institute. https://eric.ed.gov/?id=ED501703

Luellen J. K., Shadish W. R., & Clark M. H. (2005). Propensity scores: An introduction and experimental

test. Evaluation Review, 29, 530-558. doi:10.1177/0193841X0575596.

Modarresi S., Connolly. F., & Merchlinsky, S. (2007). Applying Quasi-experimental Design to the Impact

Evaluation of a Professional Development Program: Design Issues & Methodological Challenges.

The Annual Meeting of the American Educational Research Association, Chicago, IL.

Montgomery County Public Schools. (2018). ELO STEP. Retrieved November 26, 2018, from MCPS

website https://www.montgomeryschoolsmd.org/departments/dtecps/title1/elo-step.aspx

Northwest Evaluation Association. (2008). RIT scale norms for use with Measures of Academic Progress.

Lake Oswego, OR: Northwest Evaluation Association.

Northwest Evaluation Association (2011). Technical manual for Measures of Academic Progress and

Measures of Academic Progress for Primary Grades. Lake Oswego, OR: Northwest Evaluation

Association.

Olszewski-Kubilius, P. & Clarenbach, J. (2012). Unlocking emergent talent: Supporting high achievement

of low-income, high-ability students. Washington, DC: National Association for Gifted Children.

http://www.jkcf.org/assets/1/7/Unlocking_Emergent_Talent.pdf

Peters, S., & Gentry, M. (2012). Group-Specific Norms and Teacher-Rating Scales: Implications for

Underrepresentation. Journal of Advanced Academics, 23(2), 125-144.

Plucker, J., Hardesty, J., & Burroughs, N. (2013). Talent on the sidelines: Excellence gaps and America’s

persistent talent underclass. Center for Education Policy Analysis at the Neag School of Education

University of Connecticut.

Rosenthal, R., & Rubin, D. B. (1982). A simple, general purpose display of magnitude of experimental

effect. Journal of Educational Psychology, 74, 166–169.

Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the propensity score in observational studies

for causal effects. Biometrika, 70, 41–45.

Rosenbaum, P. R., & Rubin, D. B. (1984). Reducing bias in observational studies using subclassification

on the propensity score. Journal of the American Statistical Association, 79, 561–524.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 35 ELO STEP Evaluation

Rosenbaum, P.R., & Rubin, D. B. (1985). Constructing a control group using multivariate matched

sampling that incorporates the propensity score. The American Statistician, 39, 33–38.

Rossi. P. & Freeman, H. E. (1993). Evaluation: A systematic approach. Beverly Hills, CA: Sage. Sage

Publications.

Rossi, P. H., Lipsey, Mark W., & Freeman, H. E. (2004). Evaluation: A systematic approach (7th edition.).

Thousand Oaks, CA: SAGE Publications, Inc.

Shadish, W. R., Cook, T. D., & Campbell, D. T. (2002). Experimental and Quasi-Experimental Designs

for Generalized Causal Inference, Houghton Mifflin Company, M.A.: Boston.

McCombs, J., Augustine, C., Schwartz, H., Bodilly, S., McInnis, B., Lichter, D., & Cross, A. (2011).

Making Summer Count: How Summer Programs Can Boost Children's Learning. Santa Monica,

CA: RAND Corporation.

Smarick, A. (2013). Closing America's High-achievement Gap: A Wise Giver's Guide to Helping Our

Most Talented Students Reach Their Full Potential. Washington, DC: The Philanthropy

Roundtable.

Thompson, B. (2002). “Statistical,” “practical,” and “clinical”: How many kinds of significance do

counselors need to consider? Journal of Counseling & Development, 80(1), 64–71.

White, W. (1906). Reviews before and after vacation. American Education, 185-188.

Wright, B. & Petrilli, M. (2017). Rating the Ratings: An Analysis of the 51 ESSA Accountability Plans.

Washington, DC: Thomas B. Fordham Institute.

Zhao, H., Modarresi, S., & Jang, S. (2016). Evaluation of Extended Learning Opportunities Summer Title

I Enrichment Program (ELO-STEP) in 2015–2016. Rockville, MD: Montgomery County Public

Schools.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 36 ELO STEP Evaluation

Appendix A

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 37 ELO STEP Evaluation

Appendix B

Table B1. Schedules of 2018 ELO STEP and Enrichment Activities

ELO STEP Schedule

Time Content Area / Activity

9:00 - 9:15 Breakfast

9:15 - 9:30 Class Meeting or Activator

9:30 - 10:45 Math

10:45 - 11:45 Science

11:45 - 12:15 Lunch

12:15 - 12:45 Information Literacy

12:45 - 1:00 Reflection / Closure / Dismissal

Extended Schedule (Enrichment Activities)

Time Content Area / Activity

1:00 - 1:15 Transition

1:15 - 2:15 Digital Literacy

2:15 - 3:15 Coding

3:15 - 3:30 Reflection / Closure / Dismissal

Note. Five of the 14 sites did not provide enrichment activities.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 38 ELO STEP Evaluation

Appendix C

Table C1. FY 2018 Budget for ELO STEP

Number of Staff Cost

Teachers’ salaries, setup and professional

meeting

69 $422,370.00

Paraeducators’ salaries 9 $16,167.94

Staff benefits $35,083.04

Instructional materials $32,000.00

Total STEP cost $505,620.97

Source: Received from DTECPS on Sep. 19, 2018

Note. The ELO STEP budget excludes transportation and meal cost.

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 39 ELO STEP Evaluation

Appendix D

Table D1. 2018 ELO STEP Program Sites and Bell Times

Title I Elementary Schools ELO STEP Site Bell Times

Arcola Arcola 9:00 - 3:30

Cresthaven Cresthaven 9:00 - 1:00

Capt. James E. Daly Capt. James E. Daly 9:00 - 1:00

Gaithersburg South Lake 9:00 - 3:30

Georgian Forest Georgian Forest 9:00 - 3:30

Harmony Hills Georgian Forest 9:00 - 3:30

Highland Sargent Shriver at Viers Mill 9:00 - 3:30

Jackson Road Cresthaven 9:00 - 3:30

Kemp Mill Arcola 9:00 - 3:30

JoAnn Leleck at Broad Acres JoAnn Leleck at Broad Acres 9:00 - 3:30

New Hampshire States Oak View 9:00 - 1:00

Oak View Oak View 9:00 - 1:00

Rolling Terrace Rolling Terrace 9:00 - 1:00

Roscoe R. Nix Cresthaven 9:00 - 3:30

Sargent Shriver Sargent Shriver at Viers Mill 9:00 - 3:30

South Lake South Lake 9:00 - 3:30

Summit Hall Summit Hall 9:00 - 3:30

Twinbrook Twinbrook 9:00 - 1:00

Washington Grove Summit Hall 9:00 - 3:30

Watkins Mill Watkins Mill 9:00 - 3:30

Weller Road Weller Road 9:00 - 3:30

Wheaton Woods Wheaton Woods 9:00 - 3:30

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 40 ELO STEP Evaluation

Appendix E

Table E1. Mean and Standard Deviation of Spring 2018 MAP-P/M (Math) in Grade 2 for ELO STEP

Participants and their Comparison Group

Before ELO STEP intervention

(Spring 2018)

Participants vs.

Nonparticipants N Mean

Standard

Deviation

Mean

Difference p-value

All Students Participants 256 210.4 11.5 0.86 0.37

Nonparticipants 381 211.3 12.0

Non-FARMS All Other Student

Groups (Monitoring Group)

Participants 32 218.8 12.2 -2.19 0.44

Nonparticipants 78 216.6 13.9

Non-FARMS Black or African

American

Participants 26 214.5 12.4 -3.24 0.36

Nonparticipants 36 211.2 14.3

Non-FARMS Hispanic/Latino Participants 23 211.1 9.1 -1.41 0.62

Nonparticipants 40 209.7 11.6

FARMS All Other Student Groups Participants 17 211.6 9.9 2.32 0.46

Nonparticipants 33 213.9 10.8

FARMS Black or African American Participants 50 210.2 11.5 2.01 0.35

Nonparticipants 56 212.3 10.6

FARMS Hispanic/Latino Participants 108 206.8 10.4 1.02 0.43

Nonparticipants 138 207.8 9.8

Table E2. Mean and Standard Deviation of Spring 2018 MAP-P/M (Math) in Grade 3 for ELO STEP

Participants and their Comparison Group

Before ELO STEP intervention

(Spring 2018)

Participants vs.

Nonparticipants N Mean

Standard

Deviation

Mean

Difference p-value

All Students Participants 198 210.7 7.0 1.37* 0.05

Nonparticipants 302 212.1 8.4

Non-FARMS All Other Student

Groups (Monitoring Group)

Participants 22 210.9 5.6 5.17** 0.00

Nonparticipants 62 216.1 9.9

Non-FARMS Black or African

American

Participants 28 210.3 9.9 2.26 0.35

Nonparticipants 22 212.6 6.0

Non-FARMS Hispanic/Latino Participants 20 209.5 6.9 1.26 0.48

Nonparticipants 37 210.8 6.1

FARMS All Other Student Groups Participants 9 213.7 7.5 -0.73 0.82

Nonparticipants 16 212.9 7.7

FARMS Black or African American Participants 37 212.0 6.3 -2.59 0.20

Nonparticipants 39 209.4 10.7

FARMS Hispanic/Latino Participants 82 210.2 6.4 0.95 0.33

Nonparticipants 126 211.1 7.2

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 41 ELO STEP Evaluation

Table E3. Mean and Standard Deviation of Spring 2018 MAP-P/M (Math) in Grade 4 for ELO STEP

Participants and their Comparison Group

Before ELO STEP intervention

(Spring 2018)

Participants vs.

Nonparticipants N Mean

Standard

Deviation

Mean

Difference

p-

value

All Students Participants 208 225.0 8.9 1.05 0.22

Nonparticipants 298 226.1 10.2

Non-FARMS All Other Student

Groups (Monitoring Group)

Participants 31 231.0 9.0 1.77 0.39

Nonparticipants 64 232.8 9.4

Non-FARMS Black or African

American

Participants 29 222.6 7.9 3.88 0.12

Nonparticipants 30 226.4 10.5

Non-FARMS Hispanic/Latino Participants 21 221.7 8.6 3.41 0.18

Nonparticipants 40 225.1 9.6

FARMS All Other Student

Groups

Participants 26 227.3 9.4 -2.68 0.26

Nonparticipants 35 224.6 9.0

FARMS Black or African

American

Participants 34 224.4 8.0 -3.10 0.13

Nonparticipants 29 221.3 7.9

FARMS Hispanic/Latino Participants 67 223.8 8.2 0.17 0.91

Nonparticipants 100 224.0 10.1

Table E4. Percentage of Students Meeting the Spring 2018 AP-PR Benchmark in Grade 2 for ELO

STEP Participants and their Comparison Group

Before ELO STEP intervention

(Spring 2018)

Participants vs.

Nonparticipants N

% Met

Benchmark p-value

All Students Participants 257 91.8 0.59

Nonparticipants 382 90.6

Non-FARMS All Other Student Groups

(Monitoring Group)

Participants 32 87.5 0.96

Nonparticipants 78 87.2

Non-FARMS Black or African American Participants 26 92.3 0.65

Nonparticipants 36 88.9

Non-FARMS Hispanic/Latino Participants 23 87.0 0.95

Nonparticipants 40 87.5

FARMS All Other Student Groups Participants 17 88.2 0.74

Nonparticipants 33 84.8

FARMS Black or African American Participants 50 92.0 0.63

Nonparticipants 56 89.3

FARMS Hispanic/Latino Participants 109 94.5 0.67

Nonparticipants 139 95.7

Montgomery County Public Schools Office of Shared Accountability

Program Evaluation Unit 42 ELO STEP Evaluation

Table E5. Mean and Standard Deviation of Spring 2018 MAP-R (Reading) in Grade 3 for ELO STEP

Participants and their Comparison Group

Before ELO STEP intervention

(Spring 2018)

Participants vs.

Nonparticipants N Mean

Standard

Deviation

Mean

Difference

p-

value

All Students Participants 198 206.6 9.1 1.47 0.09

Nonparticipants 301 208.1 9.8

Non-FARMS All Other Student

Groups (Monitoring Group)

Participants 22 208.4 7.7 5.53* 0.03

Nonparticipants 62 213.9 10.7

Non-FARMS Black or African

American

Participants 28 209.3 11.0 3.31 0.26

Nonparticipants 22 212.6 8.9

Non-FARMS Hispanic/Latino Participants 20 206.2 11.5 1.58 0.54

Nonparticipants 37 207.8 7.8

FARMS All Other Student

Groups

Participants 9 206.3 9.8 -1.15 0.76

Nonparticipants 16 205.2 8.1

FARMS Black or African

American

Participants 37 207.0 8.8 0.62 0.76

Nonparticipants 39 207.6 8.4

FARMS Hispanic/Latino Participants 82 205.1 8.1 -0.19 0.88

Nonparticipants 125 204.9 9.2

Table E6. Mean and Standard Deviation of Spring 2018 MAP-R (Reading) in Grade 4 for ELO STEP

Participants and their Comparison Group

Before ELO STEP intervention

(Spring 2018)

Participants vs.

Nonparticipants N Mean

Standard

Deviation

Mean

Difference

p-

value

All Students Participants 207 215.0 9.1 2.19** 0.01

Nonparticipants 296 217.2 9.8

Non-FARMS All Other Student

Groups (Monitoring Group)

Participants 30 221.5 8.8 3.18 0.15

Nonparticipants 62 224.7 10.4

Non-FARMS Black or African

American

Participants 29 215.7 9.7 4.11 0.08

Nonparticipants 30 219.8 9.0

Non-FARMS Hispanic/Latino Participants 21 213.3 9.9 3.44 0.16

Nonparticipants 40 216.7 8.5

FARMS All Other Student

Groups

Participants 26 211.2 8.5 3.51 0.13

Nonparticipants 35 214.7 9.0

FARMS Black or African

American

Participants 34 216.8 8.9 -3.96 0.06

Nonparticipants 29 212.9 7.3

FARMS Hispanic/Latino Participants 67 212.9 8.0 1.20 0.36

Nonparticipants 100 214.1 8.5