Background

This report details the findings from an efficacy study on the STEMscopes Science 5th grade curriculum in Georgia. We used a post-facto quasi-experimental design (QED) that utilizes a matched control group to evaluate the potential associations between STEMscopes Science and science achievement for 5th graders in urban school districts in Georgia. We focused on urban districts as many educators and researchers in the science education community feel that research focused on urban students’ science achievement is a priority that needs attention (Fraser-Abder, Atwater, & Lee, 2006). We defined urban districts as districts that fall in midsized or large cities based on the National Center of Education Statistic’s classifications. The evidence in this study is consistent with the “Every Student Succeeds Act’s (ESSA) Tier Two evidence. QEDs with matched samples attempt to overcome the hurdle of “non-random” assignment.

In addition to examining the potential relationship between STEMscopes Science implementation and science achievement for all urban students, we also considered achievement in several sub-groups of students. Specifically, previous results in the field (Morgan, Farkas, Hillemeier, & Maczuga, 2016) suggest there are science achievement gaps among students who are considered minority groups relative to their non-minority peers, as well as achievement gaps related to socio-economic status. However, past STEMscopes reports suggest that STEMscopes may have a stronger association with standardized test passing scores among females, minorities (particularly black/African American students) as well as students considered economically disadvantaged. We evaluated the association between STEMscopes Science and district passing rates among several sub-groups of students: females & males, African Americans, students who recieved special education services, and economically disadvantaged students.

Thus, within the current report we considered: 1) potential group differences in science achievement (operationalized by the percent of students in a given school who pass their science standardized test) for STEMscopes versus non-STEMscopes schools in urban schools districts in Georgia. Overall, we hypothesized that even with the more stringent matched control group QED design, schools that purchased and used STEMscopes during the 2021-2022 school year would include a higher percent of students, on average, that pass on the 2022 Georgia Milestones Science assessment than schools that did not purchase STEMscopes (i.e., “non-STEMscopes schools”). 2) We also anticipated significant associations between STEMscopes Science and school 5th grade passing rates on the Georgia Milestones among subgroups of students.

Results

To examine the effectiveness of STEMscopes Science to increase Georgia urban schools 5th grade passing rates on the Georgia Milestone Assessment, we conducted multiple regression analyses with 144 matched schools from urban school districts in Georgia. Our first analysis focused on predicting 2022 Georgia Milestones 5th grade school passing rates (outcome) with a binary variable indicating whether a school was a STEMscopes school or non-STEMscopes school, and covariates (see methods). Results were significant (non-STEMscopes M = 18.96, STEMscopes M = 25.36, b = 6.40, p < 0.05, ES = 0.50), see Figure 1 for comparison of rates between STEMscopes schools and non-STEMscopes schools and Table 1 for model parameters. In addition, there were also significant findings among sub-groups of students such that: among economically disadvantaged students, there was a positive association between STEMscopes Science and Georgia Milestones passing rates (non-STEMscopes M = 15.89, STEMscopes M = 22.23, b =6.34, p<.05, ES = 0.46). There was also a significant finding among African American students (non-STEMscopes M = 14.87, STEMscopes M = 19.34, b =4.47, p<.05, ES =0.32), however the state data included too much missing data for other racial/ethnic sub- populations (greater than 50% missing), so no other groups were analyzed. There were also signficant findings among students classified as recieving special education services (non- STEMscopes M = 6.01, STEMscopes M = 11.70, b = 5.69, p<.05, ES = 0.43). Finally, there was a significant finding among females (non-STEMscopes M = 19.03, STEMscopes M = 25.93, b = 6.90, p<.05, ES = 0.50), and for males (non-STEMscopes M = 19.05, STEMscopes M = 24.98, b = 5.93, p<.05, ES = 0.41). These findings suggest that STEMscopes is effective across many sub-populations of students, as well as when all students are considered.

Table 1

Full model comparing non-STEMscopes and STEMscopes school passing rate, all students


Model parameters Estimate (b) Standard error p-value
Intercept 18.96 1.24 <.05
STEMscopes curriculum district 6.40 1.73 <.05
Baseline school 4th grade math passing rate  10.71 1.29 <.05
District Size 0.93 0.92 0.32
Percent economically disadvantaged students  -0.75 0.93 <.05
Percent of LEP students 0.92 0.87 0.30
Percent of special education students  -0.34 0.96 0.73
Percent of gifted and talented students 4.48 1.40 <.05
Percent of Asian students 0.41 1.21 0.74

Methods

In this section, details are provided about study procedures including the data sources, variables used, and participating districts

Data Sources
Data for this study came from three sources. First, schools that purchased and used STEMscopes for 5th grade in the 2021 - 2022 school year were identified through the STEMscopes analytics platform. Within the analytics reports, we used the number of 5th grade scopes (or units) accessed as a metric of use. We were interested specifically in schools that used STEMscopes Science in a full curriculum capacity. We defined a school as fully using the curriculum if they accessed at least ~80% of the curriculum.

Second, school demographic data and school performance on the 2020-2021 Georgia Milestones Assessment were accessed through the Georgia Department of Education website. We used the 2020 - 2021 Georgia Milestones school summary data for 4th grade and focused on the school level “proficient and above” on the Georgia Milestones mathematics test as a baseline measure of academic achievement. Specifically, the state of Georgia creates proficiency benchmarks in all academic content and identifies students as “Beginners,” “Developing,” “Proficient” and “Distinguished.” The percentage of students who are classified as proficient and distinguished is used by the state as the district passing rate. We use the 2020-2021 math school passing rate for 4th grade because a Georgia Milestones science test is not administered in 4th grade. We wanted to ensure it was (approximately) the same students contributing scores to a school’s passing rate. Since the math and science components of the Georgia Milestones assessment correlate highly (r = 0.90 in 2020-2021 5th grade), this is an appropriate way to ensure that there were no baseline differences across STEMscopes and non-STEMscopes matched schools in prior academic achievement.

We also downloaded 2021-2022 school year school enrollment data including race/ethnicity percentages data, student program data including percentage data for students considered economically disadvantaged, Limited English Proficient (LEP), students who were eligible to receive special education services, as well as gifted and talented program students. We focused on these covariates as previous research has indicated they are associated with science achievement. These variables were used to match STEMscopes and non-STEMscopes schools (see participants section below for details on matching). Once matching was complete and baseline analyses were conducted (see baseline equivalence), we downloaded the 2022 Georgia Milestones End-of-Course assessments by grade from the govenor’s office website. We used this third data source because files on the govenor’s website included score breakout by student sub- populations. We used the percent of students in each school who were proficient and abovegrade level standard (passing rate) on the spring 2022 Georgia Milestones as the outcome variable(s).

Missing Data
If a variable (e.g., program category/sub-population) had less than 10 students, the data was removed by the state and identified “TFS” (too few students), which is included instead in order to protect student privacy under the Family Educational Rights and Privacy Act (FERPA). This leads to missingness by design in both covariate and outcome variables. We handled this missing data in two ways. For any covariate variable used to match schools, we used multiple imputation by chained equations via R-studio’s “MICE” package with the “CART” imputation method (see Van Buuren and Groothuis-Oudshoorn, 2011). We use MICE procedures during this step to ensure complete data for matching procedures via R-Studio's “Match-it” package, although few covariates included missing data (math 2021 passing rate n = 23 [~12% of total unmatched sample], and percent of gifted and talented students n = 24 missing [~12%]). Once data were matched, in all final analyses we used R-Studio’s “Lavaan” package which uses full information maximum likelihood procedures to handle missing outcome data. Many of the student sub-populations had extensive missing data. We only ran analyses among sub-population data that included over 50% of the sample.

Participants
Before collecting data, we used the National Center of Education Statistics (NCES) website to classify “urban districts.” Specifically, we focused on districts that were in “midsize” or “large” cities via NCES classifications. Six districts fit this classification and these districts included 197 schools. In the 2021-2022 school year, the overall number of Georgia urban schools that purchased and used STEMscopes (in any capacity) for 5th grade was 86 out of 197 schools (~44%). Of these 86 schools, 72 met the criterion of schoolwide usage of at least 80% of 5th grade science units (scopes) accessed and were eligible for the study.

As mentioned previously, to match schools we used R-Studio’s “Match-it” package with the “nearest neighbor” method. We included 11 school level variables to match data: 5 race/ethnicity variables representing the percent of the school population that fit each category: White/Caucasian, Hispanic/Latinx, Black/African American, Asian, and two or more races/ethnicities; as well as the total number of students enrolled in a school, the percent of students that qualified for special education or gifted and talented programs, the percent of Limited English Proficieny students, and the percent of economically disadvantaged students. Finally, schools were matched based on ‘baseline’ academic achievement as indexed by the Georgia Milestones 2021 4th grade math passing rate for a given school. All 72 STEMscopes schools were matched, resulting in a final sample size of 144 urban schools.

Baseline Equivalence
For all covariate variables (the variables used for matching) including baseline academic achievement, there were no significant differences between matched groups (see Table 2). However, the What Works Clearinghouse (WWC) standards require that baseline differences with a Hedge’s G effect size greater than 0.05 must be controlled for statistically. Following the advice of Stuart, 2010, we included all covariates that were not collinear in the final analyses as a complementary approach to matching, and a more stringent test of effects. Several race/ethnicity variables were correlated at or above -/+0.70. Specifically, the percent of students that were Black/African American was highly negatively correlated both with the percent of students who reported being White/Caucasian (r = -0.91) and with the percent of students who reported two or more ethnicities (r = -0.70). The percent of students that was Hispanic/Latinx was highly correlated with the percent of students who reported being Limited English Proficient (r = 0.94). With this in mind, we present models above excluding these highly correlated variables (however, we re-ran models that separately included each of race/ethnicity percent variables along with the other not collinear covariates and the pattern of findings was the same). Inclusion of these covariates satisfies the WWC standard as several variables had effect sizes greater than 0.05.

Table 2

Baseline comparison of matched STEMscopes and non-STEMscopes districts


Model parameters Sample Total NON-STEMscopes STEMscopes t-value p-value Effect Size
4th grade math pass rate ‘21 18.42 18.72 18.13 0.19 0.85 0.03
School Size 486 487 485 0.06 0.96 0.01
Percent economically disadvantaged students 59.60 58.81 60.40 0.23 0.82 0.04
Percent Black/African American students 72.56 72.32 72.81 0.12 0.91 0.02
Percent Latino/Hispanic students 8.66 8.93 8.39 0.33 0.74 0.05
Percent Asian students 1.14 1.01 1.26 0.65 0.51 0.11
Percent White/Caucasian students 13.24 13.35 13.14 0.07 0.94 0.01
Percent Two or more races students 4.21 4.18 4.24 0.11 0.91 0.02
Percent of Limited English Proficient (LEPs) 5.13 5.39 4.86 0.37 0.71 0.06
Percent of special education students 10.77 10.70 10.84 0.21 0.84 0.03
Percent of gifted and talented students 5.62 5.82 5.42 0.39 0.69 0.07

Conclusion

These findings provide evidence of the efficacy of the STEMscopes Science 5th grade curriculum. The STEMscopes Science curriculum was associated with ~4 -7 point increases in the average percent of schools’ student passing rates relative to schools that did not use STEMscopes and in consideration of different sub-groups of students. The effect sizes associated with these increases ranged from 0.32 - 0.50, indicating medium effects based on field standards for psychological research (e.g., Brydges, 2019). Typically Hedges G effects near 0.15-0.20 are considered small, ~0.35 -0.50 are considered medium and > ~0.70 are considered large. As a practical measure of effect size, we multiplied the overall average estimated change in the percent of students passing for STEMscopes schools (versus non-STEMscopes schools) times the total number of students tested in STEMscopes schools. Given that schools were closely matched regarding enrollment, this provides a rough estimate of the number of students who more likely passed their Georgia Milestone 2022 5th grade science assessment in relation to their school using STEMscopes Science. Specifically, we estimate 2,717 additional students passed the 2022 Georgia Milestone science achievement test in STEMscopes schools.

Work Cited

Brydges, C. R. (2019). Effect size guidelines, sample size calculations, and statistical power in gerontology. Innovation in aging, 3(4).
Fraser-Abder, P., Atwater, M., & Lee, O. (2006). Research in urban science education: An essential journey. Journal of Research in Science Teaching, 43(7), 599–606.
Morgan, P.L., Farkas, G., Hillemeier, M. M., and Maczuga, S. (2016). Science achievement gaps begin very early, persist, and are largely explained by modifiable factors. Educational Researcher, 45, 18-35.
Stuart, E. A. (2010). Matching methods for causal inference: A review and a look forward. Statistical science: a review journal of the Institute of Mathematical Statistics, 25, 1.