<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Archiving and Interchange DTD v2.3 20070202//EN" "archivearticle.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="systematic-review">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2022.873995</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Systematic Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Determining an Evidence Base for Particular Fields of Educational Practice: A Systematic Review of Meta-Analyses on Effective Mathematics and Science Teaching</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Knogler</surname> <given-names>Maximilian</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/925153/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Hetmanek</surname> <given-names>Andreas</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1721803/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Seidel</surname> <given-names>Tina</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/925211/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Educational Sciences, TUM School of Social Sciences and Technology, Technical University of Munich</institution>, <addr-line>Munich</addr-line>, <country>Germany</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Cheng Yong Tan, The University of Hong Kong, Hong Kong SAR, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Peter Verkoeijen, Erasmus University Rotterdam, Netherlands; Parul Acharya, Columbus State University, United States</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Maximilian Knogler <email>maximilian.knogler&#x00040;tum.de</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Educational Psychology, a section of the journal Frontiers in Psychology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>25</day>
<month>04</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>873995</elocation-id>
<history>
<date date-type="received">
<day>11</day>
<month>02</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>25</day>
<month>03</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Knogler, Hetmanek and Seidel.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Knogler, Hetmanek and Seidel</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The call for evidence-based practice in education emphasizes the need for research to provide evidence for particular fields of educational practice. With this systematic literature review, we summarize and analyze aggregated effectiveness information from 41 meta-analyses published between 2004 and 2019 to inform evidence-based practice in a particular field. In line with target specifications in education that are provided for a certain school subject <italic>and</italic> educational level, we developed and adopted a selection heuristic for filtering aggregated effect sizes specific to both science and mathematics education <italic>and</italic> the secondary student population. The results include 78 context-specific aggregated effect sizes based on data from over one million students. The findings encompass a multitude of different teaching strategies, most of which offer a measurable advantage over alternatives. The findings demonstrate that context-specific effect size information may often differ from more general effect size information on teaching effectiveness, and that adherence to quality standards varies across the sampled meta-analyses. Thus, although meta-analytic research has strongly developed over the last few years, providing context-specific and high-quality evidence still needs to be a focus in the field of secondary mathematics and science teaching and beyond.</p></abstract>
<kwd-group>
<kwd>meta-analyses</kwd>
<kwd>systematic review</kwd>
<kwd>evidence-based/evidence-informed practice</kwd>
<kwd>Science Technology Engineering Mathematics (STEM)</kwd>
<kwd>teaching effectiveness</kwd>
</kwd-group>
<contract-sponsor id="cn001">Bundesministerium f&#x000FC;r Bildung und Forschung<named-content content-type="fundref-id">10.13039/501100002347</named-content></contract-sponsor>
<counts>
<fig-count count="2"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="113"/>
<page-count count="25"/>
<word-count count="16799"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Educational science is a comparatively young and dynamic research field. Despite ongoing discussions on the merits and demerits of research in this field, it is remarkable how research activities and applied methodologies have developed over the last few decades (Hedges, <xref ref-type="bibr" rid="B42">2018</xref>). For example, recent years have witnessed a surge of empirical studies on teaching and its associations with learning (Seidel and Shavelson, <xref ref-type="bibr" rid="B88">2007</xref>; Schneider and Preckel, <xref ref-type="bibr" rid="B79">2017</xref>). Simultaneously, there is a greater demand from policymakers that educational policy and practice must be guided by evidence of effectiveness (e.g., No Child Left Behind Act, <xref ref-type="bibr" rid="B67">2002</xref>; Every Student Succeeds Act, <xref ref-type="bibr" rid="B30">2015</xref>).</p>
<p>Due to these developments, it is increasingly imperative for educators as well as policymakers to obtain reliable and accessible information on &#x0201C;what works&#x0201D; in education. Yet, given the proliferation of educational research output and potential evidence that stems from diverse disciplines and methodologies, this is a challenging task. In order to address this challenge and to render the best available evidence usable as a resource, the question of how these research findings can be selected and organized in a specific evidence base is paramount.</p>
<p>Through this systematic review, we address the need for research to provide evidence for evidence-based practice with regard to particular fields of educational practice. Determining such an evidence base is a multistep process. In a first step, we identify secondary mathematics and science teaching as a particular field of educational practice. Here, we highlight the fact that teaching goals at schools are specified for a certain subject and educational level (e.g., Common Core State Standards; Next Generation Science Standards), and conclude that effectiveness information specific to both of these categories is best suited for informing effective teaching. In a second step, we then develop a heuristic for selecting the best available evidence for informing decisions within this particular field of practice. In a third step, we operationalize and apply the selection heuristic and analyze the findings by describing the state of accumulated knowledge relevant for this field. Finally, we provide some reflections and suggestions for the further development of this evidence base.</p>
<sec>
<title>Evidence for Particular Fields of Educational Practice: Secondary Mathematics and Science Teaching</title>
<p>There has been a growing consensus in numerous countries regarding the general importance as well as specific goals of science and mathematics education (OECD, <xref ref-type="bibr" rid="B68">2019</xref>), which has resulted in the development of (national) educational standards (e.g., Common Core State Standards; Next Generation Science Standards). These standards identify concepts, ideas, and practices that must be emphasized in schools and provide clear normative criteria for successful education in these subjects. It must be noted that educational standards do not merely provide orientation but they are also a core instrument in standards-based reforms aimed at improving educational outcomes. It is on the basis of certain standards that student achievement is assessed and that educators are held accountable for ensuring that their students meet the standard requirements. Importantly, however, standards do not specify effective means for teachers to attain these goals with their students. Thus, identifying effective teaching strategies is one of the hallmark tasks of empirical educational research (Shavelson and Towne, <xref ref-type="bibr" rid="B90">2002</xref>; Mayer, <xref ref-type="bibr" rid="B63">2004</xref>; Hattie, <xref ref-type="bibr" rid="B40">2009</xref>).</p>
<p>Over the last decade, research in science and mathematics education has been particularly productive in terms of collecting high-quality empirical information regarding effective teaching in these subjects (Cheung et al., <xref ref-type="bibr" rid="B13">2017</xref>; Hedges, <xref ref-type="bibr" rid="B42">2018</xref>; Lin et al., <xref ref-type="bibr" rid="B58">2019</xref>). However, the rapid development in scholarship in STEM education has produced an enormous number of studies published in a wide range of journals (Li et al., <xref ref-type="bibr" rid="B57">2020</xref>). The research underlying these studies is complex, as it covers different subjects, grade levels, and student outcomes, among others, and relies on a multitude of different qualitative and quantitative methodological approaches (Brown, <xref ref-type="bibr" rid="B9">2012</xref>; Li et al., <xref ref-type="bibr" rid="B57">2020</xref>). Consequently, this body of empirical research remains rather fragmented, and for educators, it remains unclear what kind of research and which research outlets to consult in order to find out which teaching strategies<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref> they can employ to ensure that students will succeed in meeting the set standards (Kloser, <xref ref-type="bibr" rid="B51">2014</xref>; Cheung et al., <xref ref-type="bibr" rid="B13">2017</xref>). In other words, there is a clear mismatch between the specific, agreed-upon, and easy-to-access information available on binding standards and targets of teaching and a growing body of scientific literature, which includes complex information on the effectiveness of practices related to reaching those targets. Moreover, compared to the consensus on goals, there seems to be far less consensus on effective strategies.
This lack of consensus is considered one of the main obstacles in addressing calls for evidence-based teaching and in the further advancement of teacher preparation and professional development (Grossman et al., <xref ref-type="bibr" rid="B37">2009</xref>; Windschitl et al., <xref ref-type="bibr" rid="B110">2012</xref>; Kloser, <xref ref-type="bibr" rid="B51">2014</xref>; Lynch et al., <xref ref-type="bibr" rid="B60">2019</xref>), as well as in improving the outcomes of education in general (Cohen et al., <xref ref-type="bibr" rid="B17">2018</xref>). Consequently, the current situation in mathematics and science education both enables and requires working on an evidence base for this particular field of practice.</p>
</sec>
<sec>
<title>Selecting Evidence for a Particular Field of Educational Practice</title>
<p>Determining an evidence base for a particular field of practice is a process of information selection based on well-considered specifications. Some of these specifications are substantive; they define the field of practice for which research findings can serve as warrants in evaluation, decision-making, reflection, and so on (see Cain et al., <xref ref-type="bibr" rid="B11">2019</xref>). Other specifications are methodological; they define the research method that generated the finding (and thus determines its weight as a warrant). In the ensuing paragraphs, we further elaborate on substantive and methodological specifications with regard to the aim of this systematic review&#x02014;that is, to identify an evidence base for secondary mathematics and science education.</p>
<p>Substantive specifications follow the logic of effective practice (in teaching) including its goals&#x02014;for example, in terms of educational standards. In simple terms, this logic can be stated in the following manner: an effective teaching strategy X leads to changes in a learning outcome Y in population Z. As highlighted above, educational standards are specific regarding Y and Z (and non-specific regarding X). Standards define learning outcomes in certain subjects and for certain levels of schooling, which in our case are outcomes related to mathematics and science education on the secondary schooling level. Empirical studies in educational research specify all three parameters (and many more). Thus, for establishing an evidence base on effective mathematics and science teaching for the secondary population, it is important to identify research on effective strategies that includes outcomes related to mathematics and science education on the secondary level. This is already a strong limiting factor compared to the vast sources of potentially relevant information. Nevertheless, the resulting evidence base still includes a diverse set of learning outcomes (knowledge, specific and generic skills, attitudes, etc.). Moreover, the evidence base also includes a diverse set of teaching strategies (e.g., inquiry-based teaching), which are often linked to specific outcomes and have previously been categorized on the level of practices (e.g., Bisra et al., <xref ref-type="bibr" rid="B6">2018</xref>), interventions (e.g., Donker et al., <xref ref-type="bibr" rid="B27">2014</xref>), and programs (e.g., Cheung and Slavin, <xref ref-type="bibr" rid="B15">2013</xref>). Thus, although these substantive specifications considerably narrow the scope of eligible research, there is still a lot of diversity in selected evidence, which further calls for a systematic organization of findings in order to support their inclusion for an evidence base.</p>
<p>Methodological specifications result from the properties of the underlying research paradigm (i.e., educational/teaching effectiveness research) and the methodological prerequisites underlying claims for effectiveness. Thus, while the substantive specifications generally define the parameters (X, Y, and Z), methodological specifications pertain to the relationships among these parameters. Again, simply stated, the applied research methodology must support both claims for causality (X causes Y) and claims for (causal) generalizability (X causes Y and this is true for Z). There is considerable consensus that claims for causality are best supported by experimental research (e.g., Shadish et al., <xref ref-type="bibr" rid="B89">2002</xref>), which is characterized by high internal or statistical-conclusion validity. Internal validity depends on a number of factors (type of experimental design, assignment procedure, fidelity of implementation, elimination of experimental confounds, etc.), which often are not optimally realized in teaching effectiveness research (Slavin, <xref ref-type="bibr" rid="B95">2008</xref>, <xref ref-type="bibr" rid="B96">2020</xref>). However, a more general weakness of the experimental approach is the generalizability of this causal relationship (causal generalizability), as most experiments rely on non-representative samples (e.g., convenience samples) of populations and replications are rare. Both aspects reduce the external validity of a study, and the extrapolation of findings from a study to an inference population is often not warranted. Therefore, in the general field of psychology, researchers have proposed measures to increase causal validity in research (e.g., Staines, <xref ref-type="bibr" rid="B99">2008</xref>), and these have been echoed in educational research (e.g., Robinson et al., <xref ref-type="bibr" rid="B74">2013</xref>). 
With regard to primary studies, authors have encouraged researchers to better address factors that increase internal validity (Shavelson and Towne, <xref ref-type="bibr" rid="B90">2002</xref>; Robinson et al., <xref ref-type="bibr" rid="B74">2013</xref>), which led to a broader implementation of more rigorous research designs in education (Hedges, <xref ref-type="bibr" rid="B42">2018</xref>). Moreover, causal generalizability increases when an effect is found to be present in more than one study (conceptual replication). The effects of the same or of a similar intervention from multiple studies lead to the aggregation of effect sizes. Aggregated effect size estimates are superior to individual studies with respect to replication probability (e.g., Hedges, <xref ref-type="bibr" rid="B41">2013</xref>), and they enable correction of the distorting effects of different error types (e.g., sampling error, measurement error) that often produce the illusion of conflicting findings. Thus, from a methodological perspective, effectiveness claims for the field of teaching are currently best supported by aggregated findings from experimental research. With regard to the process of research to practice transfer, Schraw and Patall (<xref ref-type="bibr" rid="B81">2013</xref>, p. 364) also more generally argue that &#x0201C;good practice does not always follow directly from good research, but usually is mediated by synthesis of findings.&#x0201D; Hence, in order to identify the best available evidence for this particular field of practice, we propose considering both substantive and methodological specifications by pooling aggregated effect sizes from experimental research on teaching effectiveness that are specific regarding outcomes and the inference population.</p>
</sec>
<sec>
<title>The Present Review</title>
<p>With this systematic review of meta-analyses, we aim to make a valuable contribution toward creating an evidence base in a particular field of educational practice. While recent systematic reviews of meta-analytic research provide broad and inclusive summaries (e.g., Hattie, <xref ref-type="bibr" rid="B40">2009</xref>; Schneider and Preckel, <xref ref-type="bibr" rid="B79">2017</xref>), this review seeks to harness the power of focus with regard to the scope and content of analysis. In order to match the level of specificity of educational goals and standards that are both domain <italic>and</italic> schooling-level specific, we seek to develop an evidence base on effective teaching strategies in mathematics and science subjects for secondary student populations. This also takes into consideration that context variables (such as domain and schooling-level) can have considerable impact on the effectiveness of particular teaching strategies (e.g., Seidel and Shavelson, <xref ref-type="bibr" rid="B88">2007</xref>; Dignath and Buttner, <xref ref-type="bibr" rid="B26">2008</xref>; Dunlosky et al., <xref ref-type="bibr" rid="B28">2013</xref>; Donker et al., <xref ref-type="bibr" rid="B27">2014</xref>). Due to its strict focus and selection criteria, our approach is limited in that it cannot utilize the full range of knowledge provided by a broader selection of meta-analyses in the field of teaching effectiveness and by the single studies cited therein. Moreover, although we highlight this selective information as particularly relevant for an evidence base, we do acknowledge that there are also other forms of evidence that can or must inform decision-making such as multiple types of data (e.g., Howe, <xref ref-type="bibr" rid="B49">2009</xref>; Windschitl et al., <xref ref-type="bibr" rid="B110">2012</xref>; Dunlosky et al., <xref ref-type="bibr" rid="B28">2013</xref>; Kloser, <xref ref-type="bibr" rid="B51">2014</xref>).
Overall, this review closes a gap by providing and analyzing effectiveness information for evidence-based practice specifically in a particular field of educational practice.</p>
<p>For systematic selection and analysis, we developed a selection heuristic which enabled us to filter all meta-analyses that provide at least one aggregated effect size specific to mathematics and science domains and the secondary student population. Our research interest was fourfold. First, we were interested in the number of aggregated effect sizes that are specific to the context of secondary mathematics and science teaching and the particular foci and design of published meta-analyses that provide this information. To this end, we extracted all aggregated effect size estimates matching our selection criteria and described the design of the meta-analyses. Second, we wanted to know to what extent context-specific effect size estimates (for the secondary mathematics and science population) differ from more general effect size estimates (overall effects) reported in selected meta-analyses on teaching effectiveness. If overall effects do not differ from context-specific effects, this may provide some indication that overall effects can provide some orientation for judging the effectiveness of teaching strategies, particularly when more specific effect estimates are not available. Third, in a bottom-up approach, we identified major types of teaching strategies and categorized all aggregated effect sizes from our selected sample into coherent categories (such as inquiry learning or self-regulated learning). This categorization offers a clear and integrated summary of effectiveness information that is both reliable and relevant for the context of secondary mathematics and science education. It enables educators and researchers in the field of effective mathematics and science teaching to estimate the state of accumulated knowledge, which they can use to further advance work in this field. Fourth, we wanted to analyze the extent to which meta-analyses in the field of mathematics and science teaching currently meet standards for high-quality meta-analytic research.
For this purpose, we identified established quality criteria from the literature and rated meta-analyses in our sample against these criteria. Findings regarding quality can help to further raise the standard for meta-analyses in educational effectiveness research and thus contribute to a more transparent and reliable evidence base.</p>
</sec>
</sec>
<sec sec-type="methods" id="s2">
<title>Methods</title>
<sec>
<title>Search and Selection</title>
<p>Until May 2019, we systematically searched databases and relevant individual educational review and science and mathematics education journals. We utilized a search string that combined the term &#x0201C;meta-analysis&#x0201D; with further specifications such as &#x0201C;learning,&#x0201D; &#x0201C;teaching,&#x0201D; &#x0201C;teaching effectiveness,&#x0201D; &#x0201C;STEM subjects,&#x0201D; &#x0201C;mathematics,&#x0201D; &#x0201C;science,&#x0201D; &#x0201C;biology,&#x0201D; &#x0201C;physics,&#x0201D; &#x0201C;chemistry,&#x0201D; &#x0201C;secondary education population,&#x0201D; and &#x0201C;student learning outcomes.&#x0201D; We used several approaches to locate relevant literature: database searches (Web of Science, Scopus, ERIC, PsycINFO, and Psych Index), hand-searches of top review and (science and mathematics) education journals, and an ancestry approach in which we scanned the reference lists of identified publications for further relevant publications. All details on the databases, search strings, and the complete list of hand-searched journals are provided in the supplement (see <xref ref-type="supplementary-material" rid="SM1">Supplementary Material S1</xref>). The selection process comprised two steps. First, the first two authors scanned titles and abstracts for relevance (agreement: Cohen&#x00027;s kappa = 0.65; disagreements were resolved by discussion). Second, we assessed the full texts of the remaining publications in detail against the following eligibility criteria:</p>
<list list-type="order">
<list-item><p>The study is a meta-analysis, that is, it averaged at least two standardized effect sizes obtained from different samples.</p></list-item>
<list-item><p>The meta-analysis analyzed studies<xref ref-type="fn" rid="fn0002"><sup>2</sup></xref> on teaching effectiveness, which include interventions that manipulated an independent variable.</p></list-item>
<list-item><p>The meta-analysis included a student-level outcome measure as a dependent variable.</p></list-item>
<list-item><p>The meta-analysis reported at least one separate effect size specific for secondary education AND mathematics and science subjects.<xref ref-type="fn" rid="fn0003"><sup>3</sup></xref></p></list-item>
<list-item><p>The search filter of the meta-analysis was not explicitly limited to a specific subgroup of students (e.g., students with special needs, low socioeconomic status, gifted students, at-risk students).</p></list-item>
<list-item><p>The meta-analysis was published in a peer-reviewed journal.</p></list-item>
<list-item><p>The meta-analysis was published in or after the year 2004 (cut-off year of inclusion in previous research syntheses: Seidel and Shavelson (<xref ref-type="bibr" rid="B88">2007</xref>) and Hattie (<xref ref-type="bibr" rid="B40">2009</xref>)).</p></list-item>
<list-item><p>The report was available in English.</p></list-item>
</list>
<p>We double-coded each study (Cohen&#x00027;s kappa = 0.63 to 1.00; mean = 0.77), and inconsistencies were resolved by discussion. In case of missing or insufficient information, we contacted the first authors. <xref ref-type="fig" rid="F1">Figure 1</xref> depicts the details of the selection process.</p>
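<p>As an illustration of the agreement statistic reported here, the following minimal Python sketch computes Cohen&#x00027;s kappa for two coders who rated the same items. It is not the authors&#x00027; analysis script; the function name <monospace>cohens_kappa</monospace> and the example ratings are hypothetical and serve only to show the calculation.</p>
<preformat><![CDATA[
```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters who coded the same items (hypothetical helper)."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(ratings_a)
    # Observed agreement: share of items on which both raters chose the same code.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Chance agreement: product of the raters' marginal proportions, summed over codes.
    count_a = Counter(ratings_a)
    count_b = Counter(ratings_b)
    expected = sum(count_a[code] * count_b[code] for code in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical code throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical screening decisions, not data from this review.
coder_1 = ["include", "include", "exclude", "exclude", "include", "exclude"]
coder_2 = ["include", "exclude", "exclude", "exclude", "include", "exclude"]
kappa = cohens_kappa(coder_1, coder_2)
```
]]></preformat>
<p>In this hypothetical example, the coders agree on five of six items, and chance agreement is 0.5, so kappa corrects the raw agreement rate downward.</p>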
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>PRISMA flow diagram.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-873995-g0001.tif"/>
</fig>
</sec>
<sec>
<title>Data Extraction, Coding, and Analysis</title>
<sec>
<title>Procedures</title>
<p>For data extraction and coding, we created an extensive coding manual. All sections of the manual build on existing literature (details in the descriptions below) and underwent a cyclical process of testing, coder training, reliability checks, and adaptation. Using the final version of the manual, the first two authors coded all sampled meta-analyses. Agreement rates were checked for each item, and inconsistencies were resolved by discussion. The complete coding manual is pre-registered and, together with the <xref ref-type="supplementary-material" rid="SM1">Supplementary Material</xref>, available on the Open Science Framework (<ext-link ext-link-type="uri" xlink:href="https://osf.io/9n99n/?view_only=bb30c83e9bf34d73a79138ddcf91da5c">https://osf.io/9n99n/?view_only=bb30c83e9bf34d73a79138ddcf91da5c</ext-link>).</p>
</sec>
<sec>
<title>Extraction of Effect Sizes</title>
<p>Generally, we extracted effect sizes based on random-effects models (Hedges and Vevea, <xref ref-type="bibr" rid="B44">1998</xref>), including 95% confidence intervals (CI) and the underlying number of primary effect sizes (k). In line with the goal of this systematic review, we extracted all effect sizes specific to both subject-domain (i.e., mathematics, science) and schooling level (i.e., secondary students from middle and high school), as well as overall effects reported in the selected meta-analyses. We consider these specific effect sizes to provide the best available estimate for the context-specific effectiveness of a particular teaching strategy. In order to extract these specific effect sizes, we followed the heuristic depicted in <xref ref-type="table" rid="T1">Table 1</xref>. Meta-analyses that fulfill our eligibility criteria fall into four categories, depending on their focus of investigation. Meta-analyses belonging to the first category investigate mathematics and science interventions within the secondary student population. These meta-analyses only include primary studies conducted with secondary students in mathematics and science education. All effect sizes included in these meta-analyses are automatically specific and, thus, were extracted. Meta-analyses in the remaining categories are more inclusive (i.e., different educational levels and/or subject domains) and thus use standard methods such as subgroup-analysis or meta-regression (Borenstein et al., <xref ref-type="bibr" rid="B8">2011</xref>) to test for generalizability to the context of secondary mathematics and science education. Thus, in meta-analyses of categories 2&#x02013;4, the extraction of effect sizes can be restricted by a statistically significant moderator effect.
For example, if a meta-analysis in category 2 yielded a statistically significant moderating effect of level of schooling, we extracted only the effect size(s) relevant for the secondary level, as only these are specific to both mathematics and science and secondary students. However, if a meta-analysis in category 2 yielded a statistically non-significant moderation by schooling level, we inferred that all effects are robust with regard to the level of schooling. Consequently, we extracted all effect sizes reported in this meta-analysis. The first two authors double-coded each meta-analysis that met the above criteria. The rate of agreement was 92%, and the remaining differences were discussed and resolved.</p>
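<p>The extraction heuristic can also be read as a simple decision rule. The following Python sketch is one possible encoding of the nine codes listed in <xref ref-type="table" rid="T1">Table 1</xref>; it is an illustration rather than the procedure actually used by the coders, and the function name <monospace>extraction_rule</monospace>, its parameters, and the label constants are hypothetical.</p>
<preformat><![CDATA[
```python
from typing import Optional, Tuple

# Hypothetical labels for the four possible extraction decisions in Table 1.
ALL = "all effect sizes"
SECONDARY = "secondary-level effect size(s)"
MATH_SCIENCE = "mathematics and science effect size(s)"
EXCLUDED = "no effect size (publication excluded)"


def extraction_rule(category: int,
                    schooling_sig: Optional[bool] = None,
                    domain_sig: Optional[bool] = None) -> Tuple[int, str]:
    """Return (code, extraction decision) for a meta-analysis, following Table 1.

    category      -- focus of the meta-analysis (1 to 4)
    schooling_sig -- True if schooling level is a significant moderator
    domain_sig    -- True if subject domain is a significant moderator
    """
    if category == 1:
        # Already restricted to secondary mathematics and science.
        return 1, ALL
    if category == 2:
        if schooling_sig is None:
            raise ValueError("category 2 needs the schooling-level moderator result")
        return (2, SECONDARY) if schooling_sig else (3, ALL)
    if category == 3:
        if domain_sig is None:
            raise ValueError("category 3 needs the subject-domain moderator result")
        return (4, MATH_SCIENCE) if domain_sig else (5, ALL)
    if category == 4:
        if schooling_sig is None or domain_sig is None:
            raise ValueError("category 4 needs both moderator results")
        # Keys are (domain_sig, schooling_sig), in the row order of Table 1.
        rows = {
            (False, False): (6, ALL),
            (True, True): (7, EXCLUDED),
            (True, False): (8, MATH_SCIENCE),
            (False, True): (9, SECONDARY),
        }
        return rows[(domain_sig, schooling_sig)]
    raise ValueError("category must be 1, 2, 3, or 4")
```
]]></preformat>
<p>For instance, under these assumptions, a category 4 meta-analysis with a significant subject-domain moderator and a non-significant schooling-level moderator maps to code 8.</p>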
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Heuristic for extracting effect sizes specifically for secondary mathematics and science teaching.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Category</bold></th>
<th valign="top" align="left"><bold>Focus of meta-analysis</bold></th>
<th valign="top" align="left"><bold>Moderating effects results</bold></th>
<th valign="top" align="left"><bold>Extraction of effect sizes</bold></th>
<th valign="top" align="center"><bold>Code</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="left">Mathematics and science interventions within secondary student population</td>
<td/>
<td valign="top" align="left">All effect sizes extracted</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="left">Mathematics and science interventions with schooling level as moderator</td>
<td valign="top" align="left">Schooling level sign.</td>
<td valign="top" align="left">Secondary level effect size extracted</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Schooling level n.s.</td>
<td valign="top" align="left">All effect sizes extracted</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left">3</td>
<td valign="top" align="left">Secondary school interventions with subject domain as moderator</td>
<td valign="top" align="left">Subject domain sign.</td>
<td valign="top" align="left">Mathematics and science effect size(s) extracted</td>
<td valign="top" align="center">4</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Subject domain n.s.</td>
<td valign="top" align="left">All effect sizes extracted</td>
<td valign="top" align="center">5</td>
</tr>
<tr>
<td valign="top" align="left">4</td>
<td valign="top" align="left">Teaching interventions with subject domain and schooling level as moderators</td>
<td valign="top" align="left">Subject domain n.s. &#x0002B; schooling level n.s.</td>
<td valign="top" align="left">All effect sizes extracted</td>
<td valign="top" align="center">6</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Subject domain sign. &#x0002B; schooling level sign.</td>
<td valign="top" align="left">No effect size extracted (publication excluded)</td>
<td valign="top" align="center">7</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Subject domain sign. &#x0002B; schooling level n.s.</td>
<td valign="top" align="left">Mathematics and science effect size(s) extracted</td>
<td valign="top" align="center">8</td>
</tr>
<tr>
<td/>
<td/>
<td valign="top" align="left">Subject domain n.s. &#x0002B; schooling level sign.</td>
<td valign="top" align="left">Secondary level effect size extracted</td>
<td valign="top" align="center">9</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>sign., significant; n.s., not significant; Code: listed for matching information with <xref ref-type="table" rid="T2">Table 2</xref></italic>.</p>
</table-wrap-foot>
</table-wrap>
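<p>The heuristic in <xref ref-type="table" rid="T1">Table 1</xref> can be read as a lookup from a meta-analysis category and its moderator results to an extraction decision. The following Python sketch is purely illustrative and was not part of the original coding procedure; the names RULES and extraction_rule, as well as the short action labels, are hypothetical.</p>
<preformat><![CDATA[
```python
from typing import Optional, Tuple

# Keys: (category, subject domain significant?, schooling level significant?)
# None means the moderator is not applicable for that category.
# Values: (code as listed in Table 1, what to extract). Labels are illustrative.
RULES = {
    (1, None, None): (1, "all"),
    (2, None, True): (2, "secondary"),
    (2, None, False): (3, "all"),
    (3, True, None): (4, "math_science"),
    (3, False, None): (5, "all"),
    (4, False, False): (6, "all"),
    (4, True, True): (7, "none"),  # publication excluded
    (4, True, False): (8, "math_science"),
    (4, False, True): (9, "secondary"),
}


def extraction_rule(
    category: int,
    domain_sig: Optional[bool] = None,
    level_sig: Optional[bool] = None,
) -> Tuple[int, str]:
    """Return (code, action) for one meta-analysis, following Table 1."""
    key = (category, domain_sig, level_sig)
    if key not in RULES:
        raise ValueError(f"No rule in Table 1 for {key}")
    return RULES[key]


if __name__ == "__main__":
    # Category 2 with a significant schooling-level moderator
    print(extraction_rule(2, level_sig=True))
```
]]></preformat>
<p>Encoding the rules as a table rather than nested conditions keeps the sketch in one-to-one correspondence with the rows of Table 1, so combinations that the table does not list raise an error instead of being silently assigned a code.</p>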
</sec>
<sec>
<title>Comparison of Overall and Specific Effect Sizes</title>
<p>Because the meta-analyses in our sample report both specific aggregated effect sizes (often based on a subset of the primary data) and overall effects (based on all primary data), we analyzed the extent to which overall effects differ from specific effects. The aim was to determine whether overall effects generally provide good orientation in cases in which more specific effect estimates are not available. To compare specific and overall effects, we extracted all reported overall effect sizes and analyzed the difference between the overall and the specific effect sizes. We thereby distinguished between four levels of difference (see, e.g., Fan et al., <xref ref-type="bibr" rid="B32">2017</xref>): (0) no difference: the numeric values of the two point estimates of the statistical means are identical; (1) weak level of difference: the numeric values of the two point estimates of the statistical means are not identical; (2) moderate level of difference: at least one point estimate of the statistical means is not encompassed by the 95% confidence interval of the other mean; (3) high level of difference: the 95% confidence intervals of the two point estimates of the means do not overlap.</p>
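<p>The four levels of difference described above can be sketched as a small classifier. This is a minimal illustration under stated assumptions, not the procedure used for coding: the function name difference_level is hypothetical, estimates are compared after rounding to two decimals, confidence-interval bounds are treated as inclusive, and a comparison with a missing interval is only resolved as level 0 or 1. Codes assigned by hand may differ in borderline cases.</p>
<preformat><![CDATA[
```python
from typing import Optional, Tuple

Interval = Tuple[float, float]  # (lower, upper) bound of a 95% CI


def _contains(ci: Interval, value: float) -> bool:
    """True if value lies within ci; bounds are inclusive (an assumption)."""
    return ci[0] <= value <= ci[1]


def difference_level(
    es_overall: float,
    ci_overall: Optional[Interval],
    es_specific: float,
    ci_specific: Optional[Interval],
) -> int:
    """Classify the difference between two point estimates as level 0 to 3."""
    # Level 0: identical numeric values (as reported, two decimals).
    if round(es_overall, 2) == round(es_specific, 2):
        return 0
    # Without both intervals only levels 0 and 1 can be told apart
    # (an assumption of this sketch).
    if ci_overall is None or ci_specific is None:
        return 1
    # Level 3: the two confidence intervals do not overlap.
    overlap = ci_overall[0] <= ci_specific[1] and ci_specific[0] <= ci_overall[1]
    if not overlap:
        return 3
    # Level 2: at least one estimate lies outside the other's interval.
    if not _contains(ci_specific, es_overall) or not _contains(ci_overall, es_specific):
        return 2
    # Level 1: the estimates differ, but each lies within the other's interval.
    return 1


if __name__ == "__main__":
    # Two overlapping intervals, each containing the other's estimate
    print(difference_level(0.52, (0.43, 0.61), 0.45, (0.24, 0.66)))
```
]]></preformat>
<p>The non-overlap check comes before the containment check because intervals that do not overlap also satisfy the level 2 condition, so the order ensures the highest applicable level is returned.</p>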
</sec>
<sec>
<title>Analysis of Context-Specific Effectiveness</title>
<p>As a next step in the analysis, the selected effectiveness information was categorized and summarized in a meaningful manner. A particular challenge was posed by the heterogeneity of the study characteristics. Although almost all sampled meta-analyses are based exclusively on experimental research to determine the effectiveness of educational interventions on student achievement, our sample shows considerable variation in many parameters that have been shown to influence effect sizes (e.g., Slavin and Madden, <xref ref-type="bibr" rid="B94">2011</xref>; de Boer et al., <xref ref-type="bibr" rid="B22">2014</xref>; Cheung and Slavin, <xref ref-type="bibr" rid="B14">2016</xref>). This simultaneous variation in several parameters, particularly in research methodology (e.g., sampling, group assignment, comparison condition, outcome measure, effect size calculation), complicates comparing and contrasting results across different meta-analyses. The resulting complexity of effect size comparisons, highlighted in the literature (see, e.g., Coe, <xref ref-type="bibr" rid="B16">2002</xref>; Hill et al., <xref ref-type="bibr" rid="B47">2008</xref>; Ferguson, <xref ref-type="bibr" rid="B33">2009</xref>; Dunlosky et al., <xref ref-type="bibr" rid="B28">2013</xref>; Belland et al., <xref ref-type="bibr" rid="B5">2017</xref>; Schneider and Preckel, <xref ref-type="bibr" rid="B79">2017</xref>; Simpson, <xref ref-type="bibr" rid="B93">2018</xref>), does not favor rank-ordering effect sizes on a single scale in terms of their magnitude. Thus, instead of providing rank orders, we grouped all aggregated effect sizes into coherent categories with regard to meta-analytic design, teaching strategies, and learning outcomes (see <xref ref-type="table" rid="T2">Table 2</xref>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Effectiveness summary.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>References</bold></th>
<th valign="top" align="center"><bold>Code</bold></th>
<th valign="top" align="center"><bold>Quality</bold></th>
<th valign="top" align="left"><bold>Type of effect size</bold></th>
<th valign="top" align="left"><bold>Independent variable (overall effect)</bold></th>
<th valign="top" align="left"><bold>Dependent variable (overall effect)</bold></th>
<th valign="top" align="center"><bold>k</bold></th>
<th valign="top" align="center"><bold>ES</bold></th>
<th valign="top" align="center"><bold>CI -/&#x0002B;</bold></th>
<th valign="top" align="left"><bold>Independent variable (specific effect)</bold></th>
<th valign="top" align="left"><bold>Dependent variable (specific effect)</bold></th>
<th valign="top" align="center"><bold>k</bold></th>
<th valign="top" align="center"><bold>ES</bold></th>
<th valign="top" align="center"><bold>CI -/&#x0002B;</bold></th>
<th valign="top" align="center"><bold>ES diff</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="center" colspan="15"><bold>Effectiveness of individual strategies</bold></td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Inquiry-based and project-based learning</bold></td>
</tr>
<tr>
<td valign="top" align="left">Furtak et al. (<xref ref-type="bibr" rid="B34">2012</xref>)</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">45%</td>
<td valign="top" align="left">Glass&#x00027; d</td>
<td valign="top" align="left">Inquiry-based science teaching</td>
<td valign="top" align="left">Science achievement</td>
<td valign="top" align="center">37</td>
<td valign="top" align="center">0.50</td>
<td valign="top" align="center">0.27; 0.73</td>
<td valign="top" align="left">Inquiry-based science teaching</td>
<td valign="top" align="left">Science achievement</td>
<td valign="top" align="center">37</td>
<td valign="top" align="center">0.50</td>
<td valign="top" align="center">0.27; 0.73</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Lazonder and Harmsen (<xref ref-type="bibr" rid="B54">2016</xref>)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">74%</td>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Guidance in inquiry-based learning</td>
<td valign="top" align="left">Learning activities</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">0.66</td>
<td valign="top" align="center">0.44; 0.88</td>
<td valign="top" align="left">Guidance in inquiry-based learning</td>
<td valign="top" align="left">Learning activities</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">0.66</td>
<td valign="top" align="center">0.44; 0.88</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">3</td>
<td/>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Guidance in inquiry-based learning</td>
<td valign="top" align="left">Performance success</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.71</td>
<td valign="top" align="center">0.52; 0.90</td>
<td valign="top" align="left">Guidance in inquiry-based learning</td>
<td valign="top" align="left">Performance success</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.71</td>
<td valign="top" align="center">0.52; 0.90</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">3</td>
<td/>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Guidance in inquiry-based learning</td>
<td valign="top" align="left">Learning outcomes</td>
<td valign="top" align="center">60</td>
<td valign="top" align="center">0.50</td>
<td valign="top" align="center">0.37; 0.62</td>
<td valign="top" align="left">Guidance in inquiry-based learning</td>
<td valign="top" align="left">Learning outcomes</td>
<td valign="top" align="center">60</td>
<td valign="top" align="center">0.50</td>
<td valign="top" align="center">0.37; 0.62</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Chen and Yang (<xref ref-type="bibr" rid="B12">2019</xref>)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">71%</td>
<td valign="top" align="left">Hedges&#x02018; g</td>
<td valign="top" align="left">Project-based learning</td>
<td valign="top" align="left">Academic achievement</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">0.71</td>
<td valign="top" align="center">0.67; 0.75</td>
<td valign="top" align="left">Project-based learning</td>
<td valign="top" align="left">Academic achievement</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.64</td>
<td valign="top" align="center">0.54; 0.75</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Game-based learning</bold></td>
</tr>
<tr>
<td valign="top" align="left">Wouters et al. (<xref ref-type="bibr" rid="B111">2013</xref>)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">69%</td>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Game-based learning</td>
<td valign="top" align="left">Learning</td>
<td valign="top" align="center">77</td>
<td valign="top" align="center">0.29</td>
<td valign="top" align="center">0.17; 0.42</td>
<td valign="top" align="left">Game-based learning</td>
<td valign="top" align="left">Learning in biology</td>
<td valign="top" align="center">28</td>
<td valign="top" align="center">0.11</td>
<td valign="top" align="center">&#x02212;0.11; 0.33</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Game-based learning</td>
<td valign="top" align="left">Learning in math</td>
<td valign="top" align="center">16</td>
<td valign="top" align="center">0.17</td>
<td valign="top" align="center">0.07; 0.28</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">6</td>
<td/>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Game-based learning</td>
<td valign="top" align="left">Motivation</td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">0.26</td>
<td valign="top" align="center">&#x02212;0.03; 0.56</td>
<td/>
<td valign="top" align="left">Motivation</td>
<td valign="top" align="center">31</td>
<td valign="top" align="center">0.26</td>
<td valign="top" align="center">&#x02212;0.03; 0.56</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">6</td>
<td/>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Game-based learning</td>
<td valign="top" align="left">Retention</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.36</td>
<td valign="top" align="center">Not reported</td>
<td/>
<td valign="top" align="left">Retention</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.36</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Wouters et al. (<xref ref-type="bibr" rid="B111">2013</xref>)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">69%</td>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Instructional support in GBL</td>
<td valign="top" align="left">Learning outcomes</td>
<td valign="top" align="center">107</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="left">Instructional support in GBL</td>
<td valign="top" align="left">Learning outcomes in biology</td>
<td valign="top" align="center">35</td>
<td valign="top" align="center">0.59</td>
<td valign="top" align="center">0.38; 1.76</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Instructional support in GBL</td>
<td valign="top" align="left">Learning outcomes in math</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.40</td>
<td valign="top" align="center">0.10; 1.19</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">Tokac et al. (<xref ref-type="bibr" rid="B105">2019</xref>)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">69%</td>
<td valign="top" align="left">Hedges&#x00027;d</td>
<td valign="top" align="left">Game-based learning</td>
<td valign="top" align="left">Mathematics achievement</td>
<td valign="top" align="center">39</td>
<td valign="top" align="center">0.13</td>
<td valign="top" align="center">0.02; 0.24</td>
<td valign="top" align="left">Game-based learning</td>
<td valign="top" align="left">Mathematics achievement</td>
<td valign="top" align="center">39</td>
<td valign="top" align="center">0.13</td>
<td valign="top" align="center">0.02; 0.24</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Self-regulated learning/learning strategies training</bold></td>
</tr>
<tr>
<td valign="top" align="left">Dignath and B&#x000FC;ttner (<xref ref-type="bibr" rid="B26">2008</xref>)</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">56%</td>
<td valign="top" align="left">Weighted ES</td>
<td valign="top" align="left">SRL training characteristics</td>
<td valign="top" align="left">Performance</td>
<td valign="top" align="center">357</td>
<td valign="top" align="center">0.69</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="left">SRL training characteristics</td>
<td valign="top" align="left">Performance math secondary</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">0.23</td>
<td valign="top" align="center">0.07; 0.38</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">de Boer et al. (<xref ref-type="bibr" rid="B22">2014</xref>)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">73%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Attributes of interventions</td>
<td valign="top" align="left">Academic performance (math and science)</td>
<td valign="top" align="center">95</td>
<td valign="top" align="center">0.66</td>
<td valign="top" align="center">0.56; 0.76</td>
<td valign="top" align="left">Attributes of interventions</td>
<td valign="top" align="left">Academic performance (math and science)</td>
<td valign="top" align="center">95</td>
<td valign="top" align="center">0.66</td>
<td valign="top" align="center">0.56; 0.76</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Donker et al. (<xref ref-type="bibr" rid="B27">2014</xref>)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">69%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">SRL instruction</td>
<td valign="top" align="left">Academic performance (math and science)</td>
<td valign="top" align="center">180</td>
<td valign="top" align="center">0.66</td>
<td valign="top" align="center">0.56; 0.76</td>
<td valign="top" align="left">SRL instruction</td>
<td valign="top" align="left">Academic performance math</td>
<td valign="top" align="center">44</td>
<td valign="top" align="center">0.66</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Academic performance science</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">0.73</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">Bisra et al. (<xref ref-type="bibr" rid="B6">2018</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">56%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Self-explanation prompts</td>
<td valign="top" align="left">Cognitive learning outcomes</td>
<td valign="top" align="center">69</td>
<td valign="top" align="center">0.55</td>
<td valign="top" align="center">0.45; 0.65</td>
<td valign="top" align="left">Self-explanation prompts</td>
<td valign="top" align="left">Cognitive learning outcomes</td>
<td valign="top" align="center">69</td>
<td valign="top" align="center">0.55</td>
<td valign="top" align="center">0.45; 0.65</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Lee et al. (<xref ref-type="bibr" rid="B55">2018</xref>)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">45%</td>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Metacognitive training</td>
<td valign="top" align="left">Algebraic reasoning</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">0.97</td>
<td valign="top" align="center">0.88; 1.06</td>
<td valign="top" align="left">Metacognitive training</td>
<td valign="top" align="left">Algebraic reasoning</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">0.97</td>
<td valign="top" align="center">0.88; 1.06</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Zheng (<xref ref-type="bibr" rid="B113">2016</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">60%</td>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">SRL scaffolds in computer-based learning environments</td>
<td valign="top" align="left">Academic performance</td>
<td valign="top" align="center">29</td>
<td valign="top" align="center">0.44</td>
<td valign="top" align="center">0.23; 0.65</td>
<td valign="top" align="left">SRL scaffolds in computer-based learning environments</td>
<td valign="top" align="left">Academic performance</td>
<td valign="top" align="center">29</td>
<td valign="top" align="center">0.44</td>
<td valign="top" align="center">0.23; 0.65</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Educational technology: software/individualized learning</bold></td>
</tr>
<tr>
<td valign="top" align="left">Li and Ma (<xref ref-type="bibr" rid="B56">2010</xref>)</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">74%</td>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Computer technology</td>
<td valign="top" align="left">Math achievement</td>
<td valign="top" align="center">85</td>
<td valign="top" align="center">0.28</td>
<td valign="top" align="center">0.13; 0.43</td>
<td valign="top" align="left">Computer technology</td>
<td valign="top" align="left">Math achievement</td>
<td valign="top" align="center">37</td>
<td valign="top" align="center">0.61</td>
<td valign="top" align="center">0.43; 0.79</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left">Cheung and Slavin (<xref ref-type="bibr" rid="B15">2013</xref>)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">74%</td>
<td valign="top" align="left">Weighted ES</td>
<td valign="top" align="left">Technology applications</td>
<td valign="top" align="left">Math achievement</td>
<td valign="top" align="center">74</td>
<td valign="top" align="center">0.16</td>
<td valign="top" align="center">0.11; 0.20</td>
<td valign="top" align="left">Technology applications</td>
<td valign="top" align="left">Math achievement</td>
<td valign="top" align="center">74</td>
<td valign="top" align="center">0.16</td>
<td valign="top" align="center">0.11; 0.20</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Ma et al. (<xref ref-type="bibr" rid="B61">2014</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">62%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Intelligent tutoring systems</td>
<td valign="top" align="left">Learning outcomes</td>
<td valign="top" align="center">107</td>
<td valign="top" align="center">0.41</td>
<td valign="top" align="center">0.34; 0.48</td>
<td valign="top" align="left">Intelligent tutoring systems</td>
<td valign="top" align="left">Learning outcomes</td>
<td valign="top" align="center">107</td>
<td valign="top" align="center">0.41</td>
<td valign="top" align="center">0.34; 0.48</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Steenbergen-Hu and Cooper (<xref ref-type="bibr" rid="B100">2013</xref>)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">74%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Intelligent tutoring systems</td>
<td valign="top" align="left">Math learning</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">&#x02212;0.10; 0.12</td>
<td valign="top" align="left">Intelligent tutoring systems</td>
<td valign="top" align="left">Math learning</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">&#x02212;0.10; 0.12</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Gerard et al. (<xref ref-type="bibr" rid="B35">2015</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">57%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Automated adaptive guidance</td>
<td valign="top" align="left">Academic achievement</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">0.23; 0.45</td>
<td valign="top" align="left">Automated adaptive guidance</td>
<td valign="top" align="left">Academic achievement</td>
<td valign="top" align="center">24</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">0.23; 0.45</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">6</td>
<td/>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Advanced vs. simple adaptive guidance</td>
<td valign="top" align="left">Academic achievement</td>
<td valign="top" align="center">29</td>
<td valign="top" align="center">0.27</td>
<td valign="top" align="center">0.15; 0.38</td>
<td valign="top" align="left">Advanced vs. simple adaptive guidance</td>
<td valign="top" align="left">Academic achievement</td>
<td valign="top" align="center">29</td>
<td valign="top" align="center">0.27</td>
<td valign="top" align="center">0.15; 0.38</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Belland et al. (<xref ref-type="bibr" rid="B5">2017</xref>)</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">86%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Computer-based scaffolding</td>
<td valign="top" align="left">Cognitive outcomes</td>
<td valign="top" align="center">333</td>
<td valign="top" align="center">0.46</td>
<td valign="top" align="center">0.37; 0.55</td>
<td valign="top" align="left">Computer-based scaffolding</td>
<td valign="top" align="left">Cognitive outcomes: middle school</td>
<td valign="top" align="center">108</td>
<td valign="top" align="center">0.37</td>
<td valign="top" align="center">0.28; 0.48</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Computer-based scaffolding</td>
<td valign="top" align="left">Cognitive outcomes: secondary school</td>
<td valign="top" align="center">53</td>
<td valign="top" align="center">0.48</td>
<td valign="top" align="center">0.35; 0.60</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Educational technology: hardware/mobile learning</bold></td>
</tr>
<tr>
<td valign="top" align="left">Sung et al. (<xref ref-type="bibr" rid="B101">2016</xref>)</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">62%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Integrating mobile devices with teaching</td>
<td valign="top" align="left">Academic achievement</td>
<td valign="top" align="center">108</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">0.43; 0.61</td>
<td valign="top" align="left">Integrating mobile devices with teaching</td>
<td valign="top" align="left">Academic achievement: secondary school</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">0.45</td>
<td valign="top" align="center">0.24; 0.66</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">Tingir et al. (<xref ref-type="bibr" rid="B104">2017</xref>)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">76%</td>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Mobile devices</td>
<td valign="top" align="left">Achievement</td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">0.48</td>
<td valign="top" align="center">0.26; 0.71</td>
<td valign="top" align="left">Mobile devices</td>
<td valign="top" align="left">Math achievement</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0.16</td>
<td valign="top" align="center">&#x02212;0.55; 0.87</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Science achievement</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">0.53</td>
<td valign="top" align="center">0.40; 0.66</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left">Sung et al. (<xref ref-type="bibr" rid="B102">2017</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">57%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Mobile computer-supported-collaborative learning</td>
<td valign="top" align="left">Learning outcomes (achievement, attitude, peer-interaction)</td>
<td valign="top" align="center">163</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">0.38; 0.66</td>
<td valign="top" align="left">Mobile computer-supported-collaborative learning</td>
<td valign="top" align="left">Learning outcomes (achievement, attitude, peer-interaction)</td>
<td valign="top" align="center">163</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">0.38; 0.66</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Design of learning material</bold></td>
</tr>
<tr>
<td valign="top" align="left">Ginns et al. (<xref ref-type="bibr" rid="B36">2013</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">63%</td>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Conversational style instructional text</td>
<td valign="top" align="left">Retention</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">0.30</td>
<td valign="top" align="center">0.18; 0.41</td>
<td valign="top" align="left">Conversational style instructional text</td>
<td valign="top" align="left">Retention</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">0.30</td>
<td valign="top" align="center">0.18; 0.41</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Transfer</td>
<td valign="top" align="center">25</td>
<td valign="top" align="center">0.54</td>
<td valign="top" align="center">0.25; 0.83</td>
<td/>
<td valign="top" align="left">Transfer</td>
<td valign="top" align="center">25</td>
<td valign="top" align="center">0.54</td>
<td valign="top" align="center">0.25; 0.83</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Schneider et al. (<xref ref-type="bibr" rid="B80">2018</xref>)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">89%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Signaled multimedia material</td>
<td valign="top" align="left">Retention</td>
<td valign="top" align="center">139</td>
<td valign="top" align="center">0.53</td>
<td valign="top" align="center">0.42; 0.64</td>
<td valign="top" align="left">Signaled multimedia material</td>
<td valign="top" align="left">Retention in biology</td>
<td valign="top" align="center">32</td>
<td valign="top" align="center">0.35</td>
<td valign="top" align="center">0.11; 0.59</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Retention in chemistry</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.80</td>
<td valign="top" align="center">0.15; 1.45</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Retention in math</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">0.08</td>
<td valign="top" align="center">&#x02212;0.32; 0.49</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Retention in physics</td>
<td valign="top" align="center">36</td>
<td valign="top" align="center">0.43</td>
<td valign="top" align="center">0.21; 0.65</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Retention in geography</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.61</td>
<td valign="top" align="center">0.31; 0.92</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">6</td>
<td/>
<td/>
<td valign="top" align="left">Signaled multimedia material</td>
<td valign="top" align="left">Transfer</td>
<td valign="top" align="center">70</td>
<td valign="top" align="center">0.33</td>
<td valign="top" align="center">0.22; 0.43</td>
<td valign="top" align="left">Signaled multimedia material</td>
<td valign="top" align="left">Transfer</td>
<td valign="top" align="center">70</td>
<td valign="top" align="center">0.33</td>
<td valign="top" align="center">0.22; 0.43</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Schroeder and Cenkci (<xref ref-type="bibr" rid="B83">2018</xref>)</td>
<td valign="top" align="center">9</td>
<td valign="top" align="center">75%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Integrated multimedia design</td>
<td valign="top" align="left">Learning</td>
<td valign="top" align="center">58</td>
<td valign="top" align="center">0.63</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="left">Integrated multimedia design</td>
<td valign="top" align="left">Learning grade 6&#x02013;8</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">0.43</td>
<td valign="top" align="center">0.22; 0.63</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Learning grade 9&#x02013;12</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">0.81</td>
<td valign="top" align="center">0.55; 1.08</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Using similarities and differences</bold></td>
</tr>
<tr>
<td valign="top" align="left">Apthorp et al. (<xref ref-type="bibr" rid="B3">2012</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">60%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Similarities and differences</td>
<td valign="top" align="left">Achievement (math and science)</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">0.65</td>
<td valign="top" align="center">0.39; 0.91</td>
<td valign="top" align="left">Similarities and differences</td>
<td valign="top" align="left">Achievement (math and science)</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">0.65</td>
<td valign="top" align="center">0.39; 0.91</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Mathematical modeling</bold></td>
</tr>
<tr>
<td valign="top" align="left">Sokolowski (<xref ref-type="bibr" rid="B98">2015</xref>)</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">71%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Mathematical modeling</td>
<td valign="top" align="left">Math achievement</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">0.69</td>
<td valign="top" align="center">0.59; 0.79</td>
<td valign="top" align="left">Mathematical modeling</td>
<td valign="top" align="left">Math achievement high school</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">0.79; 1.08</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Self-grading</bold></td>
</tr>
<tr>
<td valign="top" align="left">Sanchez et al. (<xref ref-type="bibr" rid="B75">2017</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">86%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Self-grading</td>
<td valign="top" align="left">Test performance</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">0.15; 0.52</td>
<td valign="top" align="left">Self-grading</td>
<td valign="top" align="left">Test performance</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">0.15; 0.52</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Peer instruction</bold></td>
</tr>
<tr>
<td valign="top" align="left">Balta et al. (<xref ref-type="bibr" rid="B4">2017</xref>)</td>
<td valign="top" align="center">8</td>
<td valign="top" align="center">72%</td>
<td valign="top" align="left">Cohen&#x00027;s d</td>
<td valign="top" align="left">Peer instruction</td>
<td valign="top" align="left">Learning gains</td>
<td valign="top" align="center">35</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">0.70; 1.17</td>
<td valign="top" align="left">Peer instruction</td>
<td valign="top" align="left">Learning gains in physics</td>
<td valign="top" align="center">15</td>
<td valign="top" align="center">1.30</td>
<td valign="top" align="center">0.88; 1.71</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Learning gains in math</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">0.91</td>
<td valign="top" align="center">0.41; 1.40</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Learning gains in biology</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">0.78</td>
<td valign="top" align="center">0.48; 1.06</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Learning gains in geography</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.19</td>
<td valign="top" align="center">&#x02212;0.24; 0.63</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Learning gains in chemistry</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">&#x02212;0.07; 0.75</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Homework</bold></td>
</tr>
<tr>
<td valign="top" align="left">Fan et al. (<xref ref-type="bibr" rid="B32">2017</xref>)</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">91%</td>
<td valign="top" align="left">Weighted r</td>
<td valign="top" align="left">Homework</td>
<td valign="top" align="left">Performance math and science</td>
<td valign="top" align="center">61</td>
<td valign="top" align="center">0.22</td>
<td valign="top" align="center">0.19; 0.25</td>
<td valign="top" align="left">Homework</td>
<td valign="top" align="left">Performance math and science junior high school</td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">0.15</td>
<td valign="top" align="center">0.11; 0.18</td>
<td valign="top" align="center">2</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Performance math and science senior high school</td>
<td valign="top" align="center">17</td>
<td valign="top" align="center">0.30</td>
<td valign="top" align="center">0.25; 0.34</td>
<td valign="top" align="center">1</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Concept maps</bold></td>
</tr>
<tr>
<td valign="top" align="left">Schroeder et al. (<xref ref-type="bibr" rid="B84">2017</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">67%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Concept maps</td>
<td valign="top" align="left">Learning</td>
<td valign="top" align="center">142</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="left">Concept maps</td>
<td valign="top" align="left">Learning</td>
<td valign="top" align="center">142</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">6</td>
<td/>
<td/>
<td valign="top" align="left">Concept maps constructed</td>
<td valign="top" align="left">Learning</td>
<td valign="top" align="center">75</td>
<td valign="top" align="center">0.72</td>
<td valign="top" align="center">0.56; 0.88</td>
<td valign="top" align="left">Concept maps constructed</td>
<td valign="top" align="left">Learning</td>
<td valign="top" align="center">75</td>
<td valign="top" align="center">0.72</td>
<td valign="top" align="center">0.56; 0.88</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">9</td>
<td/>
<td/>
<td valign="top" align="left">Concept maps studied</td>
<td valign="top" align="left">Learning</td>
<td valign="top" align="center">67</td>
<td valign="top" align="center">0.43</td>
<td valign="top" align="center">0.29; 0.57</td>
<td valign="top" align="left">Concept maps studied</td>
<td valign="top" align="left">Learning intermediate level</td>
<td valign="top" align="center">7</td>
<td valign="top" align="center">0.82</td>
<td valign="top" align="center">0.62; 1.02</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Learning secondary level</td>
<td valign="top" align="center">4</td>
<td valign="top" align="center">1.24</td>
<td valign="top" align="center">0.79; 1.69</td>
<td valign="top" align="center">3</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Social and emotional learning programs</bold></td>
</tr>
<tr>
<td valign="top" align="left">Corcoran et al. (<xref ref-type="bibr" rid="B20">2017</xref>)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">81%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">School-based social and emotional learning programs</td>
<td valign="top" align="left">Academic achievement in math</td>
<td valign="top" align="center">33</td>
<td valign="top" align="center">0.26</td>
<td valign="top" align="center">0.18; 0.34</td>
<td valign="top" align="left">School-based social and emotional learning programs</td>
<td valign="top" align="left">Academic achievement in math</td>
<td valign="top" align="center">33</td>
<td valign="top" align="center">0.26</td>
<td valign="top" align="center">0.18; 0.34</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">School-based social and emotional learning programs</td>
<td valign="top" align="left">Academic achievement in science</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">0.19</td>
<td valign="top" align="center">0.05; 0.33</td>
<td valign="top" align="left">School-based social and emotional learning programs</td>
<td valign="top" align="left">Academic achievement in science</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">0.19</td>
<td valign="top" align="center">0.05; 0.33</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Learning from failure</bold></td>
</tr>
<tr>
<td valign="top" align="left">Darabi et al. (<xref ref-type="bibr" rid="B21">2018</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">64%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Learning from failure</td>
<td valign="top" align="left">Learning performance</td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">0.43</td>
<td valign="top" align="center">0.19; 0.68</td>
<td valign="top" align="left">Learning from failure</td>
<td valign="top" align="left">Learning performance</td>
<td valign="top" align="center">23</td>
<td valign="top" align="center">0.43</td>
<td valign="top" align="center">0.19; 0.68</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Flipped classroom</bold></td>
</tr>
<tr>
<td valign="top" align="left">van Alten et al. (<xref ref-type="bibr" rid="B107">2019</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">92%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Flipped classroom teaching</td>
<td valign="top" align="left">Achievement</td>
<td valign="top" align="center">115</td>
<td valign="top" align="center">0.36</td>
<td valign="top" align="center">0.28; 0.44</td>
<td valign="top" align="left">Flipped classroom teaching</td>
<td valign="top" align="left">Achievement</td>
<td valign="top" align="center">114</td>
<td valign="top" align="center">0.36</td>
<td valign="top" align="center">0.28; 0.44</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">6</td>
<td/>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Flipped classroom teaching</td>
<td valign="top" align="left">Satisfaction</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">0.05</td>
<td valign="top" align="center">&#x02212;0.23; 0.32</td>
<td valign="top" align="left">Flipped classroom teaching</td>
<td valign="top" align="left">Satisfaction</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">0.05</td>
<td valign="top" align="center">&#x02212;0.23; 0.32</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="center" colspan="15"><bold>Comparisons between strategies</bold></td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Comparisons between innovative approaches</bold></td>
</tr>
<tr>
<td valign="top" align="left">Schroeder et al. (<xref ref-type="bibr" rid="B82">2007</xref>)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">62%</td>
<td valign="top" align="left">Glass&#x00027; d</td>
<td valign="top" align="left">Teaching strategies</td>
<td valign="top" align="left">Science achievement</td>
<td valign="top" align="center">61</td>
<td valign="top" align="center">0.67</td>
<td valign="top" align="center">0.66; 0.68</td>
<td valign="top" align="left">Teaching strategies</td>
<td valign="top" align="left">Science achievement</td>
<td valign="top" align="center">61</td>
<td valign="top" align="center">0.67</td>
<td valign="top" align="center">0.66; 0.68</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Savelsbergh et al. (<xref ref-type="bibr" rid="B76">2016</xref>)</td>
<td valign="top" align="center">2</td>
<td valign="top" align="center">64%</td>
<td valign="top" align="left">Pooled d (Morris, <xref ref-type="bibr" rid="B65">2008</xref>)</td>
<td valign="top" align="left">Innovative teaching strategies</td>
<td valign="top" align="left">Math and science attitude</td>
<td valign="top" align="center">60</td>
<td valign="top" align="center">0.35</td>
<td valign="top" align="center">0.24; 0.47</td>
<td valign="top" align="left">Innovative teaching strategies</td>
<td valign="top" align="left">Math and science attitude</td>
<td valign="top" align="center">60</td>
<td valign="top" align="center">0.35</td>
<td valign="top" align="center">0.24; 0.47</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">2</td>
<td/>
<td valign="top" align="left">Pooled d (Morris, <xref ref-type="bibr" rid="B65">2008</xref>)</td>
<td valign="top" align="left">Innovative teaching strategies</td>
<td valign="top" align="left">Math and science achievement</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">0.78</td>
<td valign="top" align="center">0.60; 0.97</td>
<td valign="top" align="left">Innovative teaching strategies</td>
<td valign="top" align="left">Math and science achievement</td>
<td valign="top" align="center">40</td>
<td valign="top" align="center">0.78</td>
<td valign="top" align="center">0.60; 0.97</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Cheung et al. (<xref ref-type="bibr" rid="B13">2017</xref>)</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">52%</td>
<td valign="top" align="left">Weighted ES</td>
<td valign="top" align="left">Science programs</td>
<td valign="top" align="left">Science achievement</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">0.17</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="left">Science programs</td>
<td valign="top" align="left">Science achievement</td>
<td valign="top" align="center">21</td>
<td valign="top" align="center">0.17</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Comparisons of instructional methods for learning algebra</bold></td>
</tr>
<tr>
<td valign="top" align="left">Haas (<xref ref-type="bibr" rid="B39">2005</xref>)</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">26%</td>
<td valign="top" align="left">Glass&#x00027; d</td>
<td valign="top" align="left">Direct instruction</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">0.55</td>
<td valign="top" align="center">0.41; 0.69</td>
<td valign="top" align="left">Direct instruction</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">19</td>
<td valign="top" align="center">0.55</td>
<td valign="top" align="center">0.41; 0.69</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Problem-based learning</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">0.35; 0.69</td>
<td valign="top" align="left">Problem-based learning</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">14</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">0.35; 0.69</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Manipulatives, models, multiple representations</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">0.38</td>
<td valign="top" align="center">0.28; 0.48</td>
<td valign="top" align="left">Manipulatives, models, multiple representations</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">13</td>
<td valign="top" align="center">0.38</td>
<td valign="top" align="center">0.28; 0.48</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Cooperative learning</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">0.30; 0.38</td>
<td valign="top" align="left">Cooperative learning</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">0.30; 0.38</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Communication and study skills</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">0.07</td>
<td valign="top" align="center">0.01; 0.13</td>
<td valign="top" align="left">Communication and study skills</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">5</td>
<td valign="top" align="center">0.07</td>
<td valign="top" align="center">0.01; 0.13</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Technology aided instruction</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">0.07</td>
<td valign="top" align="center">&#x02212;0.10; 0.24</td>
<td valign="top" align="left">Technology aided instruction</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">12</td>
<td valign="top" align="center">0.07</td>
<td valign="top" align="center">&#x02212;0.10; 0.24</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Rakes et al. (<xref ref-type="bibr" rid="B73">2010</xref>)</td>
<td valign="top" align="center">1</td>
<td valign="top" align="center">63%</td>
<td valign="top" align="left">Weighted ES</td>
<td valign="top" align="left">New non-technology curricula</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.40</td>
<td valign="top" align="center">&#x02212;0.16; 0.64</td>
<td valign="top" align="left">New non-technology curricula</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.40</td>
<td valign="top" align="center">&#x02212;0.16; 0.64</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Instructional strategies</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.35</td>
<td valign="top" align="center">&#x02212;0.21; 0.49</td>
<td valign="top" align="left">Instructional strategies</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.35</td>
<td valign="top" align="center">&#x02212;0.21; 0.49</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Use of manipulatives</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">0.08; 0.60</td>
<td valign="top" align="left">Use of manipulatives</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.34</td>
<td valign="top" align="center">0.08; 0.60</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Technology tools</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.17</td>
<td valign="top" align="center">&#x02212;0.03; 0.31</td>
<td valign="top" align="left">Technology tools</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.17</td>
<td valign="top" align="center">&#x02212;0.03; 0.31</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td/>
<td/>
<td/>
<td/>
<td valign="top" align="left">Technology-based curricula</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.15</td>
<td valign="top" align="center">&#x02212;0.46; 0.76</td>
<td valign="top" align="left">Technology&#x02013;based curricula</td>
<td valign="top" align="left">Algebra achievement</td>
<td valign="top" align="center">Not reported</td>
<td valign="top" align="center">0.15</td>
<td valign="top" align="center">&#x02212;0.46; 0.76</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left" colspan="15"><bold>Comparisons of strategies for fostering critical thinking and scientific reasoning</bold></td>
</tr>
<tr>
<td valign="top" align="left">Abrami et al. (<xref ref-type="bibr" rid="B1">2015</xref>)</td>
<td valign="top" align="center">6</td>
<td valign="top" align="center">66%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Instructional strategies</td>
<td valign="top" align="left">Critical thinking skills</td>
<td valign="top" align="center">341</td>
<td valign="top" align="center">0.30</td>
<td valign="top" align="center">0.25; 0.34</td>
<td valign="top" align="left">Instructional strategies</td>
<td valign="top" align="left">Critical thinking skills</td>
<td valign="top" align="center">341</td>
<td valign="top" align="center">0.30</td>
<td valign="top" align="center">0.25; 0.34</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Schwichow et al. (<xref ref-type="bibr" rid="B85">2016</xref>)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">86%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Teaching control-of-variables-strategy</td>
<td valign="top" align="left">Control-of-variables-strategy skills</td>
<td valign="top" align="center">226</td>
<td valign="top" align="center">0.61</td>
<td valign="top" align="center">0.53; 0.69</td>
<td valign="top" align="left">Teaching control-of-variables-strategy</td>
<td valign="top" align="left">Control-of-variables-strategy skills</td>
<td valign="top" align="center">226</td>
<td valign="top" align="center">0.61</td>
<td valign="top" align="center">0.53; 0.69</td>
<td valign="top" align="center">n.a.</td>
</tr>
<tr>
<td valign="top" align="left">Engelmann et al. (<xref ref-type="bibr" rid="B29">2016</xref>)</td>
<td valign="top" align="center">3</td>
<td valign="top" align="center">66%</td>
<td valign="top" align="left">Hedges&#x00027; g</td>
<td valign="top" align="left">Interventions on scientific reasoning</td>
<td valign="top" align="left">Scientific reasoning</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">0.71</td>
<td valign="top" align="center">0.55; 0.87</td>
<td valign="top" align="left">Interventions</td>
<td valign="top" align="left">Scientific reasoning</td>
<td valign="top" align="center">30</td>
<td valign="top" align="center">0.71</td>
<td valign="top" align="center">0.55; 0.87</td>
<td valign="top" align="center">n.a.</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Code, code of meta-analysis for extracting effect sizes (see <xref ref-type="table" rid="T1">Table 1</xref>); k, number of raw effect sizes; ES, effect size; CI&#x000B1;, lower and upper limit of the confidence interval of the effect size estimation; specific effect, effect specific for mathematics and science education in secondary schooling level; n.a., not applicable for cases when overall effect equals specific effect; ES diff, difference from the comparison of overall effects with effect sizes specific for mathematics and science education in secondary schooling level (categories are described in the text)</italic>.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Analysis of Scientific Quality</title>
<p>To enable reproducibility and reduce threats to validity, researchers in different fields have developed manuals and standards that offer guidelines for meta-analysts (e.g., AMSTAR: Shea et al., <xref ref-type="bibr" rid="B91">2007</xref>; APA&#x00027;s Meta-Analysis Reporting Standards (MARS); PRISMA: Moher et al., <xref ref-type="bibr" rid="B64">2009</xref>). Together with handbooks (e.g., Borenstein et al., <xref ref-type="bibr" rid="B8">2011</xref>; Cooper, <xref ref-type="bibr" rid="B18">2015</xref>; Higgins et al., <xref ref-type="bibr" rid="B46">2019</xref>) and recent scientific evaluations of meta-analytic practice (e.g., Ahn et al., <xref ref-type="bibr" rid="B2">2012</xref>; Cooper and Koenka, <xref ref-type="bibr" rid="B19">2012</xref>; Polanin et al., <xref ref-type="bibr" rid="B71">2017</xref>; Schalken and Rietbergen, <xref ref-type="bibr" rid="B77">2017</xref>; Siddaway et al., <xref ref-type="bibr" rid="B92">2019</xref>), these guidelines provide a strong resource for ensuring the scientific quality of meta-analytic work. Moreover, systematic reviews risk accumulating bias and error when the methods used in the included meta-analyses and primary studies are not evaluated (e.g., Polanin et al., <xref ref-type="bibr" rid="B71">2017</xref>). Because researchers have noted wide variation in the transparency of reporting and in the use of sound methodologies (Ahn et al., <xref ref-type="bibr" rid="B2">2012</xref>), we analyzed all 41 publications with regard to the strategies they implemented to avoid biased findings. Our coding scheme is based on the literature cited above and comprises 37 items. The quality of a meta-analysis depends on numerous details, and our items are not intended to capture all of these aspects exhaustively.
However, taken together, these criteria, which are justified by recent literature, provide a reasonable indication of the efforts made to ensure high scientific quality and also reveal room for improvement. We organized the items into four categories in accordance with guidelines for conducting a meta-analysis of experimental research: open science (2 items), search and selection (7 items), coding and data collection (10 items), and meta-analytic methods (18 items). Intercoder agreement across items ranged from Cohen&#x00027;s kappa = 0.74 to 1.00. Inconsistencies were resolved by discussion. For a detailed description of each item, see <xref ref-type="table" rid="T3">Table 3</xref>.</p>
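<p>For readers less familiar with the agreement statistic reported above, the standard textbook definition of Cohen&#x00027;s kappa for two coders can be sketched as follows. The symbols <italic>p</italic><sub>o</sub> (observed proportion of agreement) and <italic>p</italic><sub>e</sub> (proportion of agreement expected by chance, derived from each coder&#x00027;s marginal code frequencies) are introduced here for illustration only and are not taken from our coding manual:
<disp-formula><tex-math><![CDATA[\kappa = \frac{p_{o} - p_{e}}{1 - p_{e}}]]></tex-math></disp-formula>
Under this definition, a value of 1.00 on a y/n item indicates perfect agreement between the coders, whereas a value of 0 indicates agreement no better than chance.</p>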
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Scientific quality.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th valign="top" align="left"><bold>Item</bold></th>
<th valign="top" align="center"><bold>Code</bold></th>
<th valign="top" align="left"><bold>Item description</bold></th>
<th valign="top" align="center"><bold>% of sample</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" colspan="4">&#x000A0;&#x000A0;&#x000A0;<bold>Open science</bold></td>
</tr>
<tr>
<td valign="top" align="left">Open protocol</td>
<td valign="top" align="center">q_pr</td>
<td valign="top" align="left">Is a pre-registered study plan/protocol published? (y/n)</td>
<td valign="top" align="center">0%</td>
</tr>
<tr>
<td valign="top" align="left">Open data</td>
<td valign="top" align="center">q_od</td>
<td valign="top" align="left">Are relevant data for reproducibility of statistical analyses published? (y/n)</td>
<td valign="top" align="center">44%</td>
</tr>
<tr>
<td valign="top" align="left" colspan="4">&#x000A0;&#x000A0;&#x000A0;<bold>Search and selection</bold></td>
</tr>
<tr>
<td valign="top" align="left">Search terms</td>
<td valign="top" align="center">q_st</td>
<td valign="top" align="left">Is a complete description of database search terms/full search string provided? (y/n)</td>
<td valign="top" align="center">93%</td>
</tr>
<tr>
<td valign="top" align="left">Search strategies</td>
<td valign="top" align="center">q_sp</td>
<td valign="top" align="left">Were additional search strategies applied? (e.g., hand-search) (y/n)</td>
<td valign="top" align="center">73%</td>
</tr>
<tr>
<td valign="top" align="left">Exclusion criteria</td>
<td valign="top" align="center">q_ec</td>
<td valign="top" align="left">Are inclusion/exclusion criteria clearly stated? (y/n)</td>
<td valign="top" align="center">100%</td>
</tr>
<tr>
<td valign="top" align="left">Search period</td>
<td valign="top" align="center">q_spr</td>
<td valign="top" align="left">Is information about search period provided? (y/n)</td>
<td valign="top" align="center">95%</td>
</tr>
<tr>
<td valign="top" align="left">NPR publications included</td>
<td valign="top" align="center">q_pi</td>
<td valign="top" align="left">Are effect sizes from non-peer-reviewed (NPR) publications included? (y/n)</td>
<td valign="top" align="center">71%</td>
</tr>
<tr>
<td valign="top" align="left">Selection reliability</td>
<td valign="top" align="center">q_sr</td>
<td valign="top" align="left">Is an indicator for selection reliability provided? (y/n)</td>
<td valign="top" align="center">34%</td>
</tr>
<tr>
<td valign="top" align="left">List of included publications</td>
<td valign="top" align="center">q_lip</td>
<td valign="top" align="left">Is a complete list of included publications provided? (y/n)</td>
<td valign="top" align="center">98%</td>
</tr>
<tr>
<td valign="top" align="left" colspan="4">&#x000A0;&#x000A0;&#x000A0;<bold>Coding and data collection</bold></td>
</tr>
<tr>
<td valign="top" align="left">Sample description</td>
<td valign="top" align="center">q_ip</td>
<td valign="top" align="left">Is the sample population for each included primary study specified? (y/n)</td>
<td valign="top" align="center">54%</td>
</tr>
<tr>
<td valign="top" align="left">Intervention description</td>
<td valign="top" align="center">q_ii</td>
<td valign="top" align="left">Is the intervention for each included primary study specified? (y/n)</td>
<td valign="top" align="center">78%</td>
</tr>
<tr>
<td valign="top" align="left">Control description</td>
<td valign="top" align="center">q_ic</td>
<td valign="top" align="left">Are control conditions for each included primary study specified? (y/n/na)</td>
<td valign="top" align="center">55%</td>
</tr>
<tr>
<td valign="top" align="left">Outcome description</td>
<td valign="top" align="center">q_io</td>
<td valign="top" align="left">Are outcome variables for each included primary study specified? (y/n)</td>
<td valign="top" align="center">51%</td>
</tr>
<tr>
<td valign="top" align="left">Outcome statistics</td>
<td valign="top" align="center">q_rs</td>
<td valign="top" align="left">Are descriptive statistics for outcome variables reported? (y/n)</td>
<td valign="top" align="center">41%</td>
</tr>
<tr>
<td valign="top" align="left">Study design</td>
<td valign="top" align="center">q_id</td>
<td valign="top" align="left">Is the study design for each included primary study specified? (y/n)</td>
<td valign="top" align="center">41%</td>
</tr>
<tr>
<td valign="top" align="left">Coding process</td>
<td valign="top" align="center">q_cp</td>
<td valign="top" align="left">Is the coding/data collection process described? (y/n)</td>
<td valign="top" align="center">78%</td>
</tr>
<tr>
<td valign="top" align="left">Coder qualification</td>
<td valign="top" align="center">q_cq</td>
<td valign="top" align="left">Is the qualification of coders reported? (y/n)</td>
<td valign="top" align="center">49%</td>
</tr>
<tr>
<td valign="top" align="left">Coding categories</td>
<td valign="top" align="center">q_cd</td>
<td valign="top" align="left">Are coding categories for all variables clearly defined? (y/n)</td>
<td valign="top" align="center">85%</td>
</tr>
<tr>
<td valign="top" align="left">Coding reliability</td>
<td valign="top" align="center">q_cr</td>
<td valign="top" align="left">Is an indicator for coding reliability provided? (y/n)</td>
<td valign="top" align="center">80%</td>
</tr>
<tr>
<td valign="top" align="left" colspan="4">&#x000A0;&#x000A0;&#x000A0;<bold>Meta-analytic methods</bold></td>
</tr>
<tr>
<td valign="top" align="left">Missing data handling</td>
<td valign="top" align="center">q_hdm</td>
<td valign="top" align="left">Is a procedure for handling of missing data described? (y/n)</td>
<td valign="top" align="center">71%</td>
</tr>
<tr>
<td valign="top" align="left">Effect size description</td>
<td valign="top" align="center">q_esd</td>
<td valign="top" align="left">Is there a verbal description of how raw effect sizes are determined? (y/n)</td>
<td valign="top" align="center">95%</td>
</tr>
<tr>
<td valign="top" align="left">Effect size calculation</td>
<td valign="top" align="center">q_esc</td>
<td valign="top" align="left">Is an exact formula for the calculation of raw effect sizes reported? (y/n)</td>
<td valign="top" align="center">39%</td>
</tr>
<tr>
<td valign="top" align="left">OAE: Statistical model</td>
<td valign="top" align="center">q_rm</td>
<td valign="top" align="left">Is a statistical model for the overall effect size estimation (OAE) reported? (y/n/na)</td>
<td valign="top" align="center">92%</td>
</tr>
<tr>
<td valign="top" align="left">OAE: Model justification</td>
<td valign="top" align="center">q_jm</td>
<td valign="top" align="left">Is a justification for the statistical model selection of the OAE provided? (y/n/na)</td>
<td valign="top" align="center">89%</td>
</tr>
<tr>
<td valign="top" align="left">OAE: Confidence intervals</td>
<td valign="top" align="center">q_rci</td>
<td valign="top" align="left">Are confidence intervals for OAE reported? (y/n/na)</td>
<td valign="top" align="center">90%</td>
</tr>
<tr>
<td valign="top" align="left">ME: Statistical model</td>
<td valign="top" align="center">q_rma</td>
<td valign="top" align="left">Is a statistical model for moderator effect size estimation (ME) reported? (y/n/na)</td>
<td valign="top" align="center">97%</td>
</tr>
<tr>
<td valign="top" align="left">ME: Model justification</td>
<td valign="top" align="center">q_jmm</td>
<td valign="top" align="left">Is a justification for the statistical model selection for ME provided? (y/n/na)</td>
<td valign="top" align="center">95%</td>
</tr>
<tr>
<td valign="top" align="left">ME: Confidence intervals</td>
<td valign="top" align="center">q_rcm</td>
<td valign="top" align="left">Are confidence intervals for ME reported? (y/n/na)</td>
<td valign="top" align="center">95%</td>
</tr>
<tr>
<td valign="top" align="left">ME: Multiple moderators</td>
<td valign="top" align="center">q_rmm</td>
<td valign="top" align="left">Is the issue of multiple moderator tests discussed? (y/n/na)</td>
<td valign="top" align="center">48%</td>
</tr>
<tr>
<td valign="top" align="left">BSV: indicator</td>
<td valign="top" align="center">q_rabv</td>
<td valign="top" align="left">Is an indicator for the quantity of between-study variance (BSV) reported? (y/n/na)</td>
<td valign="top" align="center">89%</td>
</tr>
<tr>
<td valign="top" align="left">BSV: estimation</td>
<td valign="top" align="center">q_rmbv</td>
<td valign="top" align="left">Is an exact formula for the estimation of between-study variance reported? (y/n/na)</td>
<td valign="top" align="center">50%</td>
</tr>
<tr>
<td valign="top" align="left">Dependent measures</td>
<td valign="top" align="center">q_rdm</td>
<td valign="top" align="left">Is a procedure for handling dependent data points reported? (y/n/na)</td>
<td valign="top" align="center">79%</td>
</tr>
<tr>
<td valign="top" align="left">Application of HLM</td>
<td valign="top" align="center">q_aa</td>
<td valign="top" align="left">Is hierarchical linear modeling applied for dependent data points? (y/n/na)</td>
<td valign="top" align="center">60%</td>
</tr>
<tr>
<td valign="top" align="left">Statistical power</td>
<td valign="top" align="center">q_stpr</td>
<td valign="top" align="left">Is a statistical power analysis reported? (y/n)</td>
<td valign="top" align="center">5%</td>
</tr>
<tr>
<td valign="top" align="left">Publication bias</td>
<td valign="top" align="center">m_pb</td>
<td valign="top" align="left">Is a publication bias test reported? (y/n)</td>
<td valign="top" align="center">83%</td>
</tr>
<tr>
<td valign="top" align="left">Outlier sensitivity analysis</td>
<td valign="top" align="center">m_os</td>
<td valign="top" align="left">Is an outlier sensitivity analysis reported? (y/n)</td>
<td valign="top" align="center">56%</td>
</tr>
<tr>
<td valign="top" align="left">Scientific quality</td>
<td valign="top" align="center">m_sq</td>
<td valign="top" align="left">Is an indicator for scientific quality (e.g., standardized measures; study design; publication status) of primary studies used for moderator analysis? (y/n)</td>
<td valign="top" align="center">61%</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Code (code information for matching with <xref ref-type="supplementary-material" rid="SM1">Supplemental Material</xref>); y, yes; n, no; na, not applicable (this item was not applicable for an individual meta-analysis and thus, the individual meta-analysis was not included in the percentage score)</italic>.</p>
</table-wrap-foot>
</table-wrap>
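<p>The footnote to <xref ref-type="table" rid="T3">Table 3</xref> states that meta-analyses coded as not applicable were left out of an item's percentage score. A minimal sketch of that calculation is given below; the function name and the list-of-codes input format are assumptions made for illustration and are not taken from the Supplemental Material.</p>
<preformat>
```python
from typing import Optional


def percent_fulfilled(codes: list[str]) -> Optional[float]:
    """Share of 'y' codes among applicable codes for one quality item.

    `codes` holds one entry per meta-analysis: 'y', 'n', or 'na'.
    Entries coded 'na' are dropped from the denominator, as described
    in the table footnote. The input format itself is hypothetical.
    """
    applicable = [code for code in codes if code in ("y", "n")]
    if not applicable:
        return None  # the item was not applicable to any meta-analysis
    return 100.0 * applicable.count("y") / len(applicable)
```
</preformat>
<p>For instance, an item coded y, y, n, and na across four meta-analyses would be scored on three applicable cases only, giving roughly 67% rather than 50%.</p>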
</sec>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Availability of Specific Aggregated Evidence</title>
<p>A total of 41 meta-analyses published between January 2004 and May 2019 met all our inclusion criteria. In a stepwise process of selection, 378 publications were excluded because they did not meet one or several inclusion criteria. For example, in the second step, 188 publications were excluded because they did not provide a context-specific effect size. Although these publications might also have been omitted for failing to meet other criteria (e.g., not investigating teaching effectiveness; focusing on a particular group of students), this number is still an indication that a substantial number of meta-analyses might not provide context-specific information. With one exception,<xref ref-type="fn" rid="fn0004"><sup>4</sup></xref> all the selected meta-analyses used aggregated d-family effect sizes based on comparisons between specific teaching strategy interventions and alternatives (mostly certain regular or traditional teaching practices as control condition). All publications provided information on the number of studies that were included. In sum, analyses are based on a total of 2,708 (M = 66.05; SD = 104.59)<xref ref-type="fn" rid="fn0005"><sup>5</sup></xref> primary studies reporting 4,594 (M = 112.05; SD = 1,151.99) effect sizes and involving an estimated number of 1,159,143 (M = 28,271.78; SD = 60,438.86) participants. The sampled meta-analyses were published in 17 different peer-reviewed journals and cover an average time span of 21 years (SD = 13.61) of primary research (see <xref ref-type="supplementary-material" rid="SM2">Supplementary Material S2</xref> for details).</p>
<p>Overall, we extracted 78 aggregated effect sizes specific for both science and mathematics education and the secondary student population that are not disaggregated for other (moderating) variables (i.e., variations in sample population, treatment, method, study context, etc.). These effect sizes provide the most inclusive estimate of context-specific effectiveness (see <xref ref-type="table" rid="T2">Table 2</xref>). Of these 78 context-specific aggregated effect sizes, 13 (17%) stem from 4 meta-analyses on mathematics and science interventions within the secondary student population (category 1),<xref ref-type="fn" rid="fn0006"><sup>6</sup></xref> 20 (26%) stem from 14 meta-analyses on mathematics and science interventions with schooling level as a moderator (category 2), 1 (1%) stems from 1 meta-analysis on secondary school interventions with school subject as a moderator (category 3), and 44 (56%) stem from 22 meta-analyses on teaching interventions with subject domain and schooling level as moderators (category 4).</p>
<p>In sum, the majority of meta-analyses providing context-specific aggregated effect size estimates in our sample are meta-analyses on teaching interventions across subjects and schooling levels (56% of extracted context-specific effect sizes) and meta-analyses on mathematics and science interventions across different schooling levels (26% of extracted context-specific effect sizes). With 17% of all extracted context-specific effect sizes, context-specific meta-analyses with a focus on mathematics and science subjects as well as the secondary student population provide a relatively small proportion of context-specific effectiveness information.</p>
</sec>
<sec>
<title>Comparison Between Overall and Specific Effect Sizes</title>
<p>Using 78 domain and schooling level-specific aggregated effect sizes that are not disaggregated for other variables, we compared overall and specific effect sizes in the sampled meta-analyses. In 47 cases, the overall effect reported in the meta-analysis is specific for secondary mathematics and science and, thus, represents the best available context-specific effect size. In 31 cases, the overall effect is not specific for secondary mathematics and science. In these cases, we compared the context-specific effect size based on a subsample of primary studies to the overall effect reported in the meta-analysis. <xref ref-type="table" rid="T2">Table 2</xref> provides a summary of overall effects, specific effects, and comparison results for all dependent and independent variables. In 1 out of 31 comparisons (3%), the overall effect and context-specific effect have the same numerical value (level 0). Further, 17 out of 31 comparisons (55%) yielded a weak level of difference with numerical values of means being different (level 1); 9 out of 31 comparisons (29%) yielded a moderate level of difference with at least one mean not being covered in the confidence interval of the other mean (level 2); and 4 out of 31 comparisons (13%) yielded a high level of difference with no overlap between the confidence intervals of the two means (level 3). In summary, the majority of comparisons (18 out of 31; 58%) yielded no or small differences between overall and specific effects, 29% of comparisons resulted in moderate differences, and a small number of comparisons (13%) indicated large differences.</p>
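<p>The four levels of difference described above can be read as a simple decision rule. The following sketch is one possible formalization under stated assumptions: the class and function names are hypothetical, the order in which the checks are applied (no overlap before coverage of means) is our reading of the level definitions, and the example values in the note below are illustrative rather than taken from <xref ref-type="table" rid="T2">Table 2</xref>.</p>
<preformat>
```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Effect:
    """Aggregated mean effect size with its confidence interval bounds."""
    mean: float
    ci_low: float
    ci_high: float

    def covers(self, value: float) -> bool:
        # True when `value` lies inside this effect's confidence interval.
        return self.ci_high >= value >= self.ci_low


def difference_level(overall: Effect, specific: Effect) -> int:
    """Return the level of difference (0 to 3) between two aggregated effects."""
    if overall.mean == specific.mean:
        return 0  # level 0: identical numerical values
    # Level 3: the two confidence intervals do not overlap at all.
    if overall.ci_low > specific.ci_high or specific.ci_low > overall.ci_high:
        return 3
    # Level 2: at least one mean lies outside the other interval.
    if not (overall.covers(specific.mean) and specific.covers(overall.mean)):
        return 2
    # Level 1: the means differ, but each lies within the other interval.
    return 1
```
</preformat>
<p>Under this reading, an overall effect of 0.40 [0.30, 0.50] compared with a context-specific effect of 0.55 [0.45, 0.65] would be classified as level 2, because the intervals overlap but neither mean is covered by the other interval.</p>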
</sec>
<sec>
<title>Summary of Effectiveness Information</title>
<p><xref ref-type="table" rid="T2">Table 2</xref> provides a comprehensive summary of effectiveness information. Row-wise, the table lists all 41 meta-analyses<xref ref-type="fn" rid="fn0007"><sup>7</sup></xref> that matched our selection criteria organized in specific categories (see the following paragraph). Column-wise, the table details information both on the overall effect reported in the publication, which is based on all primary studies of the meta-analyses (middle columns), and on the aggregated effect size(s) specific for the context of secondary mathematics and science education (right columns). As regards the categorization applied, our analysis showed that sampled meta-analyses follow two major organizing principles: First, most meta-analyses (<italic>N</italic> = 33) are teaching strategy-focused, that is, they analyze the effectiveness of a specific teaching strategy (e.g., inquiry learning, flipped classroom) with regard to one or several student outcomes related to mathematics and science learning (e.g., mathematics/science achievement, student motivation in mathematics/science) (see, e.g., Furtak et al., <xref ref-type="bibr" rid="B34">2012</xref>). Second, some meta-analyses (<italic>N</italic> = 8) are outcome-focused, that is, they compare several different teaching strategies (e.g., direct instruction vs. problem-based learning vs. cooperative learning, etc.) with regard to a specific student outcome related to mathematics and science learning (e.g., critical thinking, algebraic reasoning). In addition, some sampled meta-analyses focused on similar teaching strategies (e.g., three meta-analyses investigated inquiry project-based learning strategies) or similar student outcomes (e.g., critical thinking and scientific reasoning) and were thus further grouped together.</p>
<p>In line with our selection criteria, all sampled meta-analyses provide at least one aggregated effect size estimate specific for the effectiveness of mathematics and science teaching on the secondary level. Without exception, all of these 78 effect sizes are positive. <xref ref-type="fig" rid="F2">Figure 2</xref> presents the distribution of all context-specific mean effect sizes. Effect size estimates range between ES = 0.01 and ES = 1.3, with 12 effect size estimates failing to reach conventional thresholds of statistical significance (i.e., their 95% confidence intervals include the value zero). About 80% of context-specific aggregated mean effect sizes are 0.2 or larger and 54% are 0.4 or larger. Overall, the size of our sample signals that research has accumulated a substantial number of meta-analyses on various teaching strategies and student outcomes related to secondary mathematics and science teaching. With all effect sizes being positive, this research indicates higher aggregated effectiveness of experimental conditions compared to control conditions.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Distribution of effect sizes.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fpsyg-13-873995-g0002.tif"/>
</fig>
</sec>
<sec>
<title>Scientific Quality of Included Meta-Analyses</title>
<p>In order to provide a concise summary of quality information, we organized scientific quality data in two ways: (a) <xref ref-type="table" rid="T3">Table 3</xref> depicts all the quality items that were coded and summarizes the percentage of the 41 meta-analyses that performed and/or reported what was required by this item. (b) The third column of <xref ref-type="table" rid="T2">Table 2</xref> reports a summary quality score averaged across all 37 quality items for each meta-analysis individually (see <xref ref-type="supplementary-material" rid="SM3">Supplementary Material S3</xref> for details).</p>
<p>On average, sampled meta-analyses fulfilled 68% (<italic>SD</italic> = 13%) of all criteria coded. <xref ref-type="table" rid="T3">Table 3</xref> indicates that on 15 items, over 80% of sampled meta-analyses provided sufficient information. With no meta-analysis being pre-registered and less than half (44%) offering sufficient information to reproduce statistical analyses, issues of open science were not adequately addressed. Criteria relating to search and selection mostly achieved high ratings&#x02014;for example, with all meta-analyses clearly stating inclusion criteria (100%) and 93% providing sufficient information to reproduce the database search. When it comes to transparency of coding and data collection, approximately half of the sampled meta-analyses failed to provide sufficient information on which data they extracted from primary studies [e.g., specification of control condition (55%), outcome variable (51%), and related descriptive statistics (41%)]&#x02014;for example, by publishing a primary study coding table (Polanin et al., <xref ref-type="bibr" rid="B70">2020</xref>). The category meta-analytic methods yielded mixed results. Numerous issues relating to data aggregation and bias reduction were reported by a majority of meta-analyses. Yet, although 95% verbally describe how they determined raw effect sizes from primary studies, less than half (39%) provide precise formulas, which clearly describe how data from different primary study designs (e.g., comparison of post measures vs. comparison of pre-post gains) were converted into effect sizes. Similarly, most meta-analyses (89%) provide at least one indicator for between-study variance, but only half (50%) report the exact estimation method (Hedges et al., <xref ref-type="bibr" rid="B43">2010</xref>; Borenstein et al., <xref ref-type="bibr" rid="B8">2011</xref>).</p>
<p>Further, although almost all meta-analyses conducted multiple moderator tests, only half of these (48%) discussed issues such as Type 1 error inflation and confounding (moderator) variables (see Cafri et al., <xref ref-type="bibr" rid="B10">2010</xref>). A majority, but not all, of the meta-analyses (83%) tested for publication bias; moreover, although numbers and magnitude of raw effect sizes in moderator analyses are often relatively small, only two meta-analyses (5%) (Corcoran et al., <xref ref-type="bibr" rid="B20">2017</xref>; van Alten et al., <xref ref-type="bibr" rid="B107">2019</xref>) reported retrospective statistical power for the significance test used to determine the number of studies necessary for detecting a statistically significant effect (Hempel et al., <xref ref-type="bibr" rid="B45">2013</xref>). Further, 56% of the meta-analyses scanned their data for outliers, which could have biased the results; 61% of the meta-analyses investigated the moderating effects of at least one scientific quality indicator of the primary studies (e.g., utilization of standardized vs. non-standardized outcome measures), yet none of these included a multidimensional assessment based on a quality assessment tool (see, e.g., Valentine and Cooper, <xref ref-type="bibr" rid="B106">2008</xref>). <xref ref-type="table" rid="T2">Table 2</xref> indicates that scientific quality scores of individual meta-analyses ranged from 26% (min) to 92% (max), with a median score of 65%. In summary, the majority of sampled meta-analyses adhere to most quality criteria. A high level of scientific quality, however, is not a consistent finding, since some quality criteria are not adequately addressed by many meta-analyses and a few meta-analyses do not meet several important quality criteria.</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>In order to be successful, educational systems require orientation both in terms of goals as well as in pathways to attain these goals. Numerous countries have been successful in agreeing on common standards and, thus, specifying binding goals for mathematics and science education. As a consequence, pathways to attain these goals must be further specified. Educational research can contribute to attaining these goals by providing information on those pathways that have been revealed to be most effective. If this information is recognized and accounted for by different stakeholders, one of the most important capacities of educational sciences can be used to contribute to an ongoing improvement of educational systems (Kloser, <xref ref-type="bibr" rid="B51">2014</xref>; Slavin, <xref ref-type="bibr" rid="B96">2020</xref>).</p>
<p>This systematic review seeks to provide a systematic analysis and review of aggregated findings within the experimental or quasi-experimental framework for a certain subject domain and a certain educational level. Furthermore, this systematic review investigated to what extent reported effect sizes on an overall level systematically differ from effect sizes particularly determined for the field of secondary mathematics and science teaching. It also outlines to what extent included meta-analyses meet established quality criteria in meta-analysis research. Overall, this contribution complements efforts that seek to identify a set of core or high-leverage practices (Windschitl et al., <xref ref-type="bibr" rid="B110">2012</xref>; Kloser, <xref ref-type="bibr" rid="B51">2014</xref>) in science and mathematics education as well as more general efforts to synthesize knowledge on effective teaching and learning (e.g., Seidel and Shavelson, <xref ref-type="bibr" rid="B88">2007</xref>; Hattie, <xref ref-type="bibr" rid="B40">2009</xref>; Dunlosky et al., <xref ref-type="bibr" rid="B28">2013</xref>). In the following account, we summarize five major findings and highlight implications to inform future research.</p>
<sec>
<title>Research on Secondary Mathematics and Science Teaching Provides a Substantial Amount of Context-Specific Effectiveness Information</title>
<p>Regarding the field of mathematics and science teaching and current meta-analyses in this specific field, our results demonstrate that research offers a substantial number of specific aggregated effect sizes that encompass various kinds of teaching interventions that are relevant for secondary science and mathematics classrooms. We identified 78 aggregated effect sizes from the last 15 years that provide information that is specific to mathematics and science education. A majority of these effect sizes stem from more general meta-analyses on teaching interventions, which include mathematics and science subjects as well as secondary students as subpopulations. However, specific meta-analyses with a focus on mathematics and science teaching (e.g., Furtak et al., <xref ref-type="bibr" rid="B34">2012</xref>) or even an exclusive focus on secondary mathematics and science populations are forthcoming (e.g., Cheung et al., <xref ref-type="bibr" rid="B13">2017</xref>). Summarizing research from the previous decade (until 2004), Seidel and Shavelson (<xref ref-type="bibr" rid="B88">2007</xref>) concluded (for this time period) that the underlying primary research on teaching effectiveness was largely dominated by correlational design studies. The included meta-analyses in our current sample demonstrate (for the following 15 years) that experimental research on teaching effectiveness is increasingly available. In a majority of the underlying experimental primary studies, innovative teaching strategies were compared to some form of conventional, traditional, business-as-usual practice. In aggregating these effects, the meta-analyses in our sample generally enable conclusions regarding whether or not and under what circumstances the innovation is more effective than traditional practice. Moreover, this review also demonstrates that in current meta-analytic research, these comparisons are organized in three major ways, which allow for additional conclusions.</p>
<p>First, a minority of included meta-analyses (20%) were focused on a dependent variable specific to mathematics and science education (e.g., critical thinking, scientific reasoning, attitudes toward science), synthesizing all teaching-related research with this variable as a target outcome (&#x0201C;outcome-focused meta-analyses&#x0201D;). This entails that a number of teaching strategies (inquiry learning vs. collaborative learning vs. digital learning, etc.) potentially fostering this outcome are included. While Schroeder et al. (<xref ref-type="bibr" rid="B82">2007</xref>) focused on achievement as an outcome, several years later, other outcomes (that were less frequently covered in primary research) have been included in meta-analyses. For example, Savelsbergh et al. (<xref ref-type="bibr" rid="B76">2016</xref>) collected research on student attitudes toward mathematics and science, which is still a less frequently studied outcome in primary studies. Their meta-analysis (of <italic>k</italic> = 63 studies) includes various teaching strategies such as inquiry learning, digital learning, and collaborative learning. Second, a majority of included meta-analyses (80%) were focused on a specific teaching strategy in the field of mathematics and science teaching as an independent variable (e.g., inquiry learning, game-based learning, etc.), synthesizing all outcome-related research (&#x0201C;teaching strategy-focused meta-analyses&#x0201D;). These meta-analyses enable a nuanced analysis of the effectiveness of that strategy under different conditions and for different learning outcomes (e.g., Wouters et al., <xref ref-type="bibr" rid="B111">2013</xref>). Third, researchers can shift the focus of their meta-analytic investigation not only from dependent to independent variables but also with regard to the kind of comparison made in the underlying primary research. 
While numerous meta-analyses include primary research that compares some kind of innovative practice to a traditional practice to determine an effect size, a few more recent meta-analyses include primary studies that compare variations of innovative approaches, such as inquiry learning or game-based learning, with vs. without guidance (Wouters et al., <xref ref-type="bibr" rid="B111">2013</xref>; Lazonder and Harmsen, <xref ref-type="bibr" rid="B54">2016</xref>), or simple versions of automated adaptive guidance vs. advanced versions of automated adaptive guidance (Gerard et al., <xref ref-type="bibr" rid="B35">2015</xref>). Following the establishment of the general effectiveness of a certain teaching strategy, research and research synthesis are now moving forward by carefully studying specific features (and their variations), which can render the application of that teaching strategy more effective. Thus, while previous systematic reviews of STEM research mainly documented quantitative growth, for example, in the increasing number of journal publications (see Li et al., <xref ref-type="bibr" rid="B57">2020</xref>), this review shows the cumulative nature of this research.</p>
</sec>
<sec>
<title>Context (Subject Domain/Educational Level) Is Important in Research on Teaching Effectiveness</title>
<p>Since this systematic review seeks to provide context-specific information, we filtered meta-analyses that include aggregated effect sizes for outcomes of secondary mathematics and science teaching. Using this rationale for selection led to the exclusion of numerous meta-analyses that did not offer information that was sufficiently specific for this context. This is not surprising, as specific areas of teaching effectiveness research may not have accumulated a sufficient number of studies for context-specific analysis. Yet another reason lies in the fact that research syntheses are often not undertaken for the sake of providing context-specific effectiveness information in line with a particular field of practice, but rather for synthesizing findings in a particular research area for theory-building and for reaching broad generalizations (Gurevitch et al., <xref ref-type="bibr" rid="B38">2018</xref>).</p>
<p>However, this review highlights a research synthesis perspective for a particular field of practice. Conceptually, it provides a heuristic for formulating specific inclusion/exclusion criteria that are appropriate for selecting context-specific effectiveness information. In this sense, it showcases requirements that can be considered by researchers and meta-analysts in order to generate more context-specific information for evidence-based practice. Empirically, it demonstrates that context is of significance in terms of teaching effectiveness: when comparing context-specific effect sizes with overall effects in our sampled meta-analyses, we observe varying degrees of difference. Although the majority of comparisons (58%) indicated no or small differences, we also observed many instances (42%) with relevant differences (Schauer and Hedges, <xref ref-type="bibr" rid="B78">2020</xref>), in which case, using the overall effect could lead to different conclusions for evidence-based practice as compared to using the context-specific effect size.</p>
<p>In line with previous research (e.g., de Boer et al., <xref ref-type="bibr" rid="B22">2014</xref>), the findings of this review demonstrate that teaching strategies vary in terms of their effectiveness depending on the contextual conditions designated by a certain field of practice (Taylor et al., <xref ref-type="bibr" rid="B103">2018</xref>). This makes a good case for research and research synthesis that generates and provides context-specific effectiveness information. With this review, we hope to create more awareness for these issues so that researchers can take appropriate action, like conducting context-specific meta-analyses that select and synthesize context-specific primary studies (e.g., Hillmayr et al., <xref ref-type="bibr" rid="B48">2020</xref>).</p>
</sec>
<sec>
<title>Standards-Related Targets Are Addressed by Research on Effective Teaching Strategies</title>
<p>Beyond documenting the availability of specific effectiveness information, this review reveals that a variety of outcomes specified by current standards (such as the Framework for K-12 Science Education (National Research Council, <xref ref-type="bibr" rid="B66">2012</xref>), the Next Generation Science Standards, or the Common Core Standards in mathematics) can be attained through instruction using a number of effective pathways (de Kock et al., <xref ref-type="bibr" rid="B23">2004</xref>). In the context of the current standards, process skills such as inquiry and argumentation represent broader educational goals that are addressed in literacy conceptualizations. In this context, student attitudes and motivation, both as prerequisites to learning and as desirable outcomes, are considered important goals in their own right (Kuhn, <xref ref-type="bibr" rid="B53">2007</xref>). The research encompassed by this review includes outcomes such as attitudes and interest in science (e.g., Savelsbergh et al., <xref ref-type="bibr" rid="B76">2016</xref>), motivation (e.g., Wouters et al., <xref ref-type="bibr" rid="B111">2013</xref>), inquiry skills (Lazonder and Harmsen, <xref ref-type="bibr" rid="B54">2016</xref>), critical thinking skills (Abrami et al., <xref ref-type="bibr" rid="B1">2015</xref>), control of variables strategy skills (Schwichow et al., <xref ref-type="bibr" rid="B85">2016</xref>), scientific reasoning and argumentation skills (Engelmann et al., <xref ref-type="bibr" rid="B29">2016</xref>), knowledge transfer skills (e.g., Ginns et al., <xref ref-type="bibr" rid="B36">2013</xref>), and skills of knowledge acquisition and self-regulation (Donker et al., <xref ref-type="bibr" rid="B27">2014</xref>).
Thus, in addition to traditional outcome measures such as factual knowledge and achievement, a broader range of educational goals, particularly relevant for current mathematics and science curricula, is encompassed in primary studies and synthesized in meta-analyses. Moreover, a few multicriterial investigations in meta-analyses, assessing effectiveness simultaneously for more than one outcome, were able to demonstrate multicriterial effectiveness of a variety of teaching strategies (e.g., Savelsbergh et al., <xref ref-type="bibr" rid="B76">2016</xref>).</p>
<p>Our results also demonstrate that the sampled meta-analyses address goals of varying scope. Certain teaching strategies support specific targets in terms of standards. For example, the teaching strategy &#x0201C;inquiry learning&#x0201D; can support students effectively in acquiring inquiry skills in addition to domain-specific knowledge (Lazonder and Harmsen, <xref ref-type="bibr" rid="B54">2016</xref>). Other strategies are more universal and serve less as general approaches to teaching than as tools to be incorporated into any lesson or instructional unit to foster mathematics and science learning. Teaching strategies such as using concept maps, self-explaining, or self-grading are not merely easy and cost-efficient to integrate; they are also not restricted to specific content but lend themselves to fostering various learning goals related to standards and curricula.</p>
<p>In order to attain complex goals (such as critical thinking skills) set by standards and curricula in secondary mathematics and science education, classroom learning requires the implementation of more open-ended and complex tasks, which place higher demands on students and thus often require adequate guidance. Across the results of this review, guidance emerges as an important element of different teaching strategies. Students involved in problem-based learning, inquiry learning, or game-based learning were able to profit from teacher or software guidance (Furtak et al., <xref ref-type="bibr" rid="B34">2012</xref>; Wouters and van Oostendorp, <xref ref-type="bibr" rid="B112">2013</xref>; Belland et al., <xref ref-type="bibr" rid="B5">2017</xref>). Importantly, effect sizes in comparisons between guided and non-guided versions of these strategies were as high as effect sizes in the basic comparisons between an innovative strategy (i.e., inquiry-based or game-based learning) and a traditional approach (Furtak et al., <xref ref-type="bibr" rid="B34">2012</xref>; Wouters et al., <xref ref-type="bibr" rid="B111">2013</xref>; Lazonder and Harmsen, <xref ref-type="bibr" rid="B54">2016</xref>). Thus, given the learner population of secondary students, the increasing complexity of curricular demands, and the shift in practice from a teacher-centered to a learner-centered pedagogy, this suggests that guidance is a crucial element for students to succeed in meeting standards-related targets.</p>
</sec>
<sec>
<title>The Majority of Aggregated Effect Sizes Are Positive</title>
<p>All of<xref ref-type="fn" rid="fn0008"><sup>8</sup></xref> the investigated teaching strategies indicate beneficial effects on student outcomes in terms of positive aggregated mean effect sizes. Although research on the effectiveness of teaching rests on the basic assumption that research-based teaching strategies can be and generally are effective, it may still be surprising that virtually all aggregated effect sizes selected and presented here were positive. In other words, given the wide range of teaching strategies investigated, one might expect some of these strategies on average to have negative effect sizes and not every tested strategy to work well. This review, however, is not the first systematic review on treatment effectiveness in education to yield mostly positive findings. Other research synthesists have found similar results (e.g., Lipsey and Wilson, <xref ref-type="bibr" rid="B59">1993</xref>; Hattie, <xref ref-type="bibr" rid="B40">2009</xref>; Schneider and Preckel, <xref ref-type="bibr" rid="B79">2017</xref>). In their comprehensive review of 320 independent meta-analyses analyzing the efficacy of psychological, educational, and behavioral interventions, Lipsey and Wilson (<xref ref-type="bibr" rid="B59">1993</xref>) found almost only positive mean effect sizes. Similarly, Hattie&#x00027;s (<xref ref-type="bibr" rid="B40">2009</xref>) synthesis of over 800 meta-analyses, which includes 520 individual meta-analyses on the effects of different teaching approaches on student achievement across different educational levels and subject domains, yielded no negative aggregated effect size for any of the included teaching approaches.</p>
<p>More recently, Schneider and Preckel (<xref ref-type="bibr" rid="B79">2017</xref>), in their systematic review on variables associated with achievement in the context of higher education, identified only 2 out of 42 aggregated effect sizes (&#x0003C; 5%) indicating a negative association between an instructional approach and student achievement, with all others being positive. One of the two effect sizes represents evidence for the seductive detail effect and was thus expected to be negative. Moreover, after testing for different potential biases and finding no indication for a particular upward bias, Lipsey and Wilson (<xref ref-type="bibr" rid="B59">1993</xref>) concluded that &#x0201C;the treatment approaches represented in meta-analysis and reviewed in this article represent rather mature instances that are sufficiently well developed and credible to attract practitioners and sufficiently promising (or controversial) to attract a critical mass of research. For treatment approaches meeting these criteria, it is perhaps not surprising that a high proportion do prove at least moderately efficacious&#x0201D; (Lipsey and Wilson, <xref ref-type="bibr" rid="B59">1993</xref>, p. 1200). Thus, based on previous systematic reviews of meta-analytic research on educational interventions, our finding that all available effect sizes in the context of secondary mathematics and science teaching are positive was to be expected.</p>
<p>Although there seems to be no controversy around the positive direction of results of educational or instructional interventions, there is an ongoing controversy about the magnitude of standardized effect sizes as a metric for evaluating and interpreting the effectiveness of educational interventions (de Boer et al., <xref ref-type="bibr" rid="B22">2014</xref>; Cheung and Slavin, <xref ref-type="bibr" rid="B14">2016</xref>; Simpson, <xref ref-type="bibr" rid="B93">2018</xref>). A focal point of this discussion is the numerous factors (including potential biases) that have been shown to influence standardized effect sizes. By adopting a selection heuristic that takes into account effect size variation due to subject domain and educational level, we have filtered for two of these factors in order to provide a reliable estimate of the effectiveness of educational intervention in the context of secondary mathematics and science teaching (e.g., de Boer et al., <xref ref-type="bibr" rid="B22">2014</xref>). Nevertheless, previous research has documented other, equally important factors that influence results and should be considered when interpreting (aggregated) effect sizes of educational interventions (Cheung and Slavin, <xref ref-type="bibr" rid="B14">2016</xref>; Kraft, <xref ref-type="bibr" rid="B52">2020</xref>).</p>
<p>For example, Cheung and Slavin (<xref ref-type="bibr" rid="B14">2016</xref>) examined methodological impacts on effect sizes using a rather homogeneous sample of 645 high-quality studies of educational program evaluations across the grades of prekindergarten to 12, involving reading, mathematics, and science. Their results indicate that research design (randomized vs. non-randomized), sample size (small sample size &#x0003C; N = 250 participants &#x0003C; large sample size), outcome measures (researcher-made vs. standardized measures), and type of publication (published vs. non-published) were all independently associated with effect-size magnitude. Consequently, the authors conclude that these factors need to be accounted for by researchers and policy makers before interpreting and comparing effect sizes from program evaluations. Similarly, de Boer et al. (<xref ref-type="bibr" rid="B22">2014</xref>), in their meta-analysis of learning strategy interventions, found that four factors related to how interventions were implemented and how effects were examined together explained 64% of the variance in intervention effect size. Clearly, our sampled meta-analyses vary on many parameters that have been shown to influence effect sizes (e.g., Slavin and Madden, <xref ref-type="bibr" rid="B94">2011</xref>; de Boer et al., <xref ref-type="bibr" rid="B22">2014</xref>; Cheung and Slavin, <xref ref-type="bibr" rid="B14">2016</xref>). This simultaneous variation on several parameters, particularly but not only in research methodology (e.g., sampling, group assignment, comparison condition, outcome measure, effect size calculation), both on the level of primary research and on the synthesis level, complicates interpreting effect sizes as well as comparing and contrasting results across different meta-analyses.</p>
<p>Although this systematic review takes into account some of these aspects by filtering aggregations of experimental research in a particular context, it does not provide an in-depth analysis and discussion of all aspects. One reason is that information necessary for such an in-depth evaluation is oftentimes missing or not sufficiently documented in published meta-analyses. We address this issue in our analysis and discussion of scientific quality (see below). Another reason is that each meta-analysis in our sample, despite commonalities, represents a specific configuration with regard to study sampling and analysis of teaching effectiveness research. Thus, a thorough analysis and interpretation of findings that does justice to the complexity of such configurations needs to consider each meta-analysis individually, which is beyond the scope of this publication. We agree with many previous researchers (e.g., Coe, <xref ref-type="bibr" rid="B16">2002</xref>; Ferguson, <xref ref-type="bibr" rid="B33">2009</xref>; Schneider and Preckel, <xref ref-type="bibr" rid="B79">2017</xref>; Simpson, <xref ref-type="bibr" rid="B93">2018</xref>; Kraft, <xref ref-type="bibr" rid="B52">2020</xref>) who have cautioned readers not to reach simple conclusions from complex effect-size estimates. For orientation when interpreting effect sizes, readers can consult recent literature (e.g., Kraft, <xref ref-type="bibr" rid="B52">2020</xref>) or team up with trained researchers to reach informed conclusions.</p>
<p>To increase the potential for coherent interpretation and comparison of findings in future research synthesis, meta-analysts need to reduce heterogeneity when sampling primary research. Cheung and Slavin (<xref ref-type="bibr" rid="B15">2013</xref>, <xref ref-type="bibr" rid="B13">2017</xref>), for instance, put together a set of inclusion criteria&#x02014;particularly suited for the study of educational interventions in classrooms&#x02014;to increase quality and comparability of findings in meta-analytic research in this field (see also Slavin and Lake, <xref ref-type="bibr" rid="B97">2008</xref>). Similarly, Abrami et al. (<xref ref-type="bibr" rid="B1">2015</xref>) increased the homogeneity and the quality of sampled primary research by testing the influence of methodological study features on the effect sizes and consequently excluding pre-experimental designs and non-standardized measures from further analyses. Although these strategies depend on the availability of a sufficient number of primary studies of adequate quality and similarity, they might also encourage and orient researchers to design primary studies in accordance with such criteria and thus contribute to a more homogeneous database. Moreover, homogeneity of sampled primary research is also an important prerequisite for decisions regarding implementation of teaching strategies and educational interventions and related discussions about possible benchmarks for implementation. In this context, researchers have advocated empirical benchmarks &#x0201C;for specific classes of studies and outcome types based on the distribution of effect sizes from relevant literature&#x0201D; (Kraft, <xref ref-type="bibr" rid="B52">2020</xref>, p. 247).
Consequently, results from research synthesis in education can only appropriately inform the interpretation of intervention effectiveness and implementation decisions, as far as the interpreter considers the fact that aggregated mean effect sizes represent highly compounded information. Effect sizes generated by meta-analytic aggregation are atop a hierarchy that ultimately rests on the individual research design elements of all single primary studies included in the synthesis. Given the large heterogeneity of research included in this review, which is typical for the field, our results defy broad effectiveness conclusions and instead put the spotlight on each individual meta-analysis and aggregated mean effect size(s) reported therein.</p>
</sec>
<sec>
<title>Sampled Meta-Analytic Research Varies in Terms of Scientific Quality</title>
<p>The validity of information from empirical educational science rests on the appropriate application and reporting of research methodology to determine educational effectiveness. Consequently, systematic syntheses of research beyond summarizing results must also assess the scientific quality of the underlying research (Polanin et al., <xref ref-type="bibr" rid="B71">2017</xref>). Based on 37 selected assessment criteria, our results demonstrate that the sampled meta-analyses overall meet the current quality criteria in meta-analysis research to a large extent. However, single meta-analyses also still vary in their adherence to the complete set of quality criteria. As high quality cannot be taken for granted, quality ratings of individual meta-analyses should be considered when interpreting aggregated effectiveness information. Low ratings imply that the recommended research methodology was not employed or sufficient reporting was not provided (or both). While the former can often lead to biased results (Borenstein et al., <xref ref-type="bibr" rid="B7">2010</xref>), the latter at least impedes reproducibility and jeopardizes research progress (Polanin et al., <xref ref-type="bibr" rid="B70">2020</xref>). However, despite some heterogeneity of the observed scientific quality in our sample, a substantial number of meta-analyses followed most of the guidelines (e.g., Fan et al., <xref ref-type="bibr" rid="B32">2017</xref>; Schneider et al., <xref ref-type="bibr" rid="B80">2018</xref>; van Alten et al., <xref ref-type="bibr" rid="B107">2019</xref>). Along with guidance from standard documents (e.g., MARS) and recent publications (Pigott and Polanin, <xref ref-type="bibr" rid="B69">2020</xref>), these can provide practical examples on how to adequately conduct and report meta-analytic research. 
Moreover, certain important criteria (e.g., search details, clear statement of inclusion criteria, analysis of publication bias) have been considered by a large majority of authors, thereby demonstrating additional improvement as compared to previous reviews (Ahn et al., <xref ref-type="bibr" rid="B2">2012</xref>). Further, recent analyses of the quality of quantitative research synthesis in education and psychology (Schneider and Preckel, <xref ref-type="bibr" rid="B79">2017</xref>; Polanin et al., <xref ref-type="bibr" rid="B70">2020</xref>; Wedderhoff and Bosnjak, <xref ref-type="bibr" rid="B108">2020</xref>) have revealed that our results are in line with the current practice in high-impact publication outlets. A few recurring issues in the literature as well as in our sample include insufficient reporting and accessibility of raw data (that is, coding information) and insufficient application of meta-analytic methods to prevent biased results (Schneider and Preckel, <xref ref-type="bibr" rid="B79">2017</xref>). Since none of the sampled publications was pre-registered and fewer than half (44%) of the publications provide sufficient data for replication, open research practices are still a matter of concern in research synthesis, as they are in educational research more generally (Makel et al., <xref ref-type="bibr" rid="B62">2021</xref>). This underlines the importance of efforts to facilitate preregistration of research synthesis, for example by providing elaborated templates that specify information necessary for transparent reporting.</p>
<p>Importantly, the scientific quality of meta-analytic findings also rests on the quality of primary research. Even in its most advanced and differentiated form, the meta-analytic technique is limited by the number and quality of the primary studies to which it is applied (Lipsey and Wilson, <xref ref-type="bibr" rid="B59">1993</xref>). This aspect deserves special attention, as the so-called &#x0201C;garbage in&#x02013;garbage out&#x0201D; problem has been around as long as meta-analytic research (see Eysenck, <xref ref-type="bibr" rid="B31">1978</xref>) and remains unresolved. A recent review (Wedderhoff and Bosnjak, <xref ref-type="bibr" rid="B108">2020</xref>) on the assessment of primary study quality in quantitative reviews revealed that from among 225 meta-analyses published in <italic>Psychological Bulletin</italic> in the last 10 years, 40 (18%) considered quality differences in primary studies. Moreover, assessment strategies varied widely, which is attributed to a lack of a consensual operationalization of study quality. Considering that the underlying primary research of this review also demonstrates variation in terms of several quality indicators and that this variation can be associated with effect size variance (Cheung and Slavin, <xref ref-type="bibr" rid="B14">2016</xref>; Lazonder and Harmsen, <xref ref-type="bibr" rid="B54">2016</xref>), a more systematic investigation&#x02014;that transcends testing single indicators as moderators and employs existing study quality assessment tools (e.g., Study DIAD, Valentine and Cooper, <xref ref-type="bibr" rid="B106">2008</xref>)&#x02014;is paramount for the further development of this evidence base.</p>
<p>Thus, echoing the concerns about the quality of research in education&#x02014;both on a primary research and research synthesis level&#x02014;(see Makel et al., <xref ref-type="bibr" rid="B62">2021</xref>), our quality analysis demonstrates that despite increasing adherence to quality criteria in published meta-analyses, there is still considerable room for improvement. Initial action has been taken by the research community in providing standard documents, assessment tools, protocols, templates and websites (e.g., <ext-link ext-link-type="uri" xlink:href="https://osf.io/">https://osf.io/</ext-link>) to increase transparency and quality. It is now up to researchers to make better use of these aids and guidelines in planning, conducting and reporting their research given their high responsibility for the research community but also for communities of practice who rely on their expertise and integrity.</p>
</sec>
<sec>
<title>Limitations and Implications for Future Research</title>
<p>In this section, we describe salient limitations that warrant attention and further discussion in future research. Our conclusions are subject to a few of the general concerns and potential biases that are almost inherent in educational effectiveness research, such as a general bias to overestimate the effects of new forms of instruction compared with regular forms (e.g., Ma et al., <xref ref-type="bibr" rid="B61">2014</xref>; Schneider and Preckel, <xref ref-type="bibr" rid="B79">2017</xref>).</p>
<p>One of the first limitations concerns the generalizability of our findings for current and international secondary mathematics and science education. Although most of the included meta-analyses were published within the last 5 years, there is still a considerable time lag between the experiments that generated the primary data and the publication of this review. Within this time interval, the development of effective interventions and technologies has continued, and studies that document this effectiveness have been published that are not part of this review. This is a common concern in the review literature, which is more pronounced in systematic reviews of meta-analytic research (as a second-order synthesis) and in rapidly developing fields of research such as educational instruction (see Polanin et al., <xref ref-type="bibr" rid="B71">2017</xref>). For future research, open and transparent study protocols and open data could facilitate the updating process for rapidly outdated meta-analyses (Pigott and Polanin, <xref ref-type="bibr" rid="B69">2020</xref>; Polanin et al., <xref ref-type="bibr" rid="B70">2020</xref>). Similar to research in higher education (Schneider and Preckel, <xref ref-type="bibr" rid="B79">2017</xref>), a large proportion of the included research stems from the United States, followed by substantially fewer studies from other countries (see also Li et al., <xref ref-type="bibr" rid="B57">2020</xref>). However, the results from several meta-analyses in our sample demonstrate that the geographical origin of a study can significantly moderate effect sizes (e.g., Schroeder et al., <xref ref-type="bibr" rid="B84">2017</xref>; Chen and Yang, <xref ref-type="bibr" rid="B12">2019</xref>). This raises the question of how experimental research on educational effectiveness can be promoted in countries outside North America in order to enhance the generalizability of findings worldwide.</p>
<p>A second drawback concerns our reliance on results from statistical significance testing, particularly in moderator analyses. The authors in our sampled meta-analyses used either analogs to analysis-of-variance models to examine the moderating effects of single moderators or meta-regression models to test multiple moderators and their association with effect-size variation in a single model. Both model types rely on statistical significance testing to ascertain whether or not a moderator effect is present. Since this systematic review utilizes information from moderator tests both for study selection and in reporting context-specific effectiveness information, we have implicitly accepted the criterion of statistical significance (at a 0.05 alpha level) for crucial decisions in what we present as evidence. Although recently criticized, null-hypothesis statistical significance testing remains a dominant practice in the social sciences, and there is evidence from psychological research that statistical significance tests and Bayes factors as alternatives almost always agree with regard to which hypothesis is better supported by the data (Wetzels et al., <xref ref-type="bibr" rid="B109">2011</xref>). However, a common problem in meta-analyses&#x02014;particularly in moderator tests&#x02014;is the issue of low statistical power due to the small numbers of available effect sizes from primary research, which brings an increased likelihood of false negatives or Type II errors when applying statistical significance tests (Cafri et al., <xref ref-type="bibr" rid="B10">2010</xref>; Hempel et al., <xref ref-type="bibr" rid="B45">2013</xref>). Consequently, our sample might suffer from the inappropriate inclusion of certain aggregated effect sizes because the significance test failed to detect a moderator effect (by schooling level and/or subject domain) that is actually present.
Conversely, numerous meta-analyses in our sample conducted multiple univariate moderator tests without correcting alpha levels, which raises concerns about the inflation of Type I error rates and increases the likelihood of falsely identified moderator effects (Polanin and Pigott, <xref ref-type="bibr" rid="B72">2015</xref>). However, this practice is more a problem of the accurate application of statistical significance testing in moderator analysis than one of statistical significance testing <italic>per se</italic>.</p>
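<p>The two error types discussed in this paragraph can be illustrated with a minimal Python sketch. It is not taken from any of the sampled meta-analyses: the function names and the subgroup values are hypothetical, the familywise error rate assumes independent moderator tests, the Bonferroni adjustment is shown as one common correction among several, and the power calculation assumes a simple two-sided z-test on the difference between two subgroup mean effect sizes.</p>
<preformat>
```python
from statistics import NormalDist


def familywise_error_rate(alpha, n_tests):
    """Chance of at least one false positive across n_tests independent tests."""
    return 1.0 - (1.0 - alpha) ** n_tests


def bonferroni_alpha(alpha, n_tests):
    """Per-test alpha level that keeps the familywise rate at or below alpha."""
    return alpha / n_tests


def moderator_test_power(true_difference, se_a, se_b, alpha=0.05):
    """Approximate power of a two-sided z-test on the difference between
    two subgroup mean effect sizes with standard errors se_a and se_b."""
    normal = NormalDist()
    se_diff = (se_a ** 2 + se_b ** 2) ** 0.5
    z_crit = normal.inv_cdf(1.0 - alpha / 2.0)
    shift = abs(true_difference) / se_diff
    # Probability that the test statistic falls in either rejection region.
    return normal.cdf(shift - z_crit) + normal.cdf(-shift - z_crit)


if __name__ == "__main__":
    for k in (1, 5, 10):
        print(k, round(familywise_error_rate(0.05, k), 3), bonferroni_alpha(0.05, k))
    # Hypothetical subgroup values, chosen only for illustration.
    print(round(moderator_test_power(0.20, 0.15, 0.15), 2))
```
</preformat>
<p>Under the independence assumption, ten uncorrected tests at an alpha level of 0.05 yield a familywise error rate of about 0.40, whereas imprecisely estimated subgroup effects leave a moderator test with little power to detect even a sizeable true difference.</p>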
<p>A third limitation that warrants discussion concerns the usage of the presented findings as evidence in context-specific decision-making. Although meta-analytic findings are often praised for their usefulness to decision-makers&#x02014;since they represent comprehensive summaries based on a robust database (e.g., Pigott and Polanin, <xref ref-type="bibr" rid="B69">2020</xref>)&#x02014;interventions adopted on the basis of this aggregated evidence often fail to be effective in practice. According to Joyce and Cartwright (<xref ref-type="bibr" rid="B50">2020</xref>), this is not surprising, as findings based on one or several experimental studies provide evidence that the intervention worked in the past (causal ascription) but no evidence that the intervention will work in a specific context in the future (local effectiveness prediction). Nevertheless, findings related to aggregated positive effectiveness for a certain context play a role in supporting a prediction, as they indicate that the intervention <italic>can</italic> produce the effect under more or less similar sets of circumstances. A targeted collection of such indications is where we believe the contribution of this review lies. Even though we selected and summarized this information not only for the research community but also for practitioners such as teachers and teacher educators, it seems clear that these non-specialist audiences often face challenges in accessing and interpreting current research (see Diery et al., <xref ref-type="bibr" rid="B25">2020</xref>, <xref ref-type="bibr" rid="B24">2021</xref>). One way to offer support is to provide services that select and translate research for non-specialist audiences.
Whereas the selection part is mainly described by this contribution, for the translation part we have established an online service platform that provides plain language summaries for meta-analyses which are selected and included in this review (Seidel et al., <xref ref-type="bibr" rid="B87">2017a</xref>,<xref ref-type="bibr" rid="B86">b</xref>). This service, funded by the German Federal Ministry of Education and Research, can be accessed by any teacher and teacher educator free of charge via <ext-link ext-link-type="uri" xlink:href="http://www.clearinghouse-unterricht.de">http://www.clearinghouse-unterricht.de</ext-link>. The website additionally includes a glossary and other educative material to help practitioners adequately interpret research evidence.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusion</title>
<p>Through this systematic review of meta-analyses, we put forward a multi-step approach to determine an evidence base for a particular field of educational practice. As a first step, we chose effective teaching as a prominent field of educational practice. Since targets in teaching are provided on the level of a certain subject <italic>and</italic> educational level, we argued that effectiveness information that cuts across these two categories for specification is best suited for informing the practice of effective teaching. In this regard, our study is the first to provide and apply a heuristic for filtering the best available effectiveness information based on such a context specification. Our results from the field of secondary mathematics and science teaching demonstrate that context-specific effect size information may often differ from more general effect size information on teaching effectiveness. Although our findings indicate that there is a substantial amount of relevant and encouraging context-specific information available, they also show that we had to exclude many studies because they did not offer information generalizable to this specific context. Thus, although meta-analytic research has developed strongly over the last few years, providing context-specific and high-quality evidence still needs to be a focus in the field of secondary mathematics and science teaching and beyond. This systematic review could offer guidance and encouragement on this continuous path.</p>
</sec>
<sec sec-type="data-availability" id="s6">
<title>Data Availability Statement</title>
<p>Publicly available datasets were analyzed in this study. This data can be found online at: <ext-link ext-link-type="uri" xlink:href="https://osf.io/9n99n/?view_only=bb30c83e9bf34d73a79138ddcf91da5c">https://osf.io/9n99n/?view_only=bb30c83e9bf34d73a79138ddcf91da5c</ext-link> and in the supplements of this article.</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>MK and AH developed the coding scheme and carried out the literature search and coding. MK wrote the first draft of the manuscript. All authors contributed to the conception of the review and to manuscript revision, and read and approved the submitted version.</p>
</sec>
<sec sec-type="funding-information" id="s8">
<title>Funding</title>
<p>The present study was conducted as part of the Project Clearinghouse on Effective Teaching (<ext-link ext-link-type="uri" xlink:href="https://www.clearinghouse-unterricht.de">www.clearinghouse-unterricht.de</ext-link>), funded by the German Federal Ministry of Education and Research (01JA1801).</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x00027;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec sec-type="supplementary-material" id="s10">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fpsyg.2022.873995/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fpsyg.2022.873995/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.PDF" id="SM1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_2.PDF" id="SM2" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_3.XLSX" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_4.XLSX" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><xref ref-type="fn" rid="fn0009"><sup>&#x0002A;</sup></xref><person-group person-group-type="author"><name><surname>Abrami</surname> <given-names>P. C.</given-names></name> <name><surname>Bernard</surname> <given-names>R. M.</given-names></name> <name><surname>Borokhovski</surname> <given-names>E.</given-names></name> <name><surname>Waddington</surname> <given-names>D. I.</given-names></name> <name><surname>Wade</surname> <given-names>C. A.</given-names></name> <name><surname>Persson</surname> <given-names>T.</given-names></name></person-group> (<year>2015</year>). <article-title>Strategies for teaching students to think critically: a meta-analysis</article-title>. <source>Rev. Educ. Res.</source> <volume>85</volume>, <fpage>275</fpage>&#x02013;<lpage>314</lpage>. <pub-id pub-id-type="doi">10.3102/0034654314551063</pub-id><pub-id pub-id-type="pmid">9357708</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ahn</surname> <given-names>S.</given-names></name> <name><surname>Ames</surname> <given-names>A. J.</given-names></name> <name><surname>Myers</surname> <given-names>N. D.</given-names></name></person-group> (<year>2012</year>). <article-title>A review of meta-analyses in education: methodological strengths and weaknesses</article-title>. <source>Rev. Educ. Res.</source> <volume>82</volume>, <fpage>436</fpage>&#x02013;<lpage>476</lpage>. <pub-id pub-id-type="doi">10.3102/0034654312458162</pub-id><pub-id pub-id-type="pmid">33022192</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Apthorp</surname> <given-names>H. S.</given-names></name> <name><surname>Igel</surname> <given-names>C.</given-names></name> <name><surname>Dean</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>Using similarities and differences: a meta-analysis of its effects and emergent patterns</article-title>. <source>Sch. Sci. Math.</source> <volume>112</volume>, <fpage>204</fpage>&#x02013;<lpage>216</lpage>. <pub-id pub-id-type="doi">10.1111/j.1949-8594.2012.00139.x</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Balta</surname> <given-names>N.</given-names></name> <name><surname>Michinov</surname> <given-names>N.</given-names></name> <name><surname>Balyimez</surname> <given-names>S.</given-names></name> <name><surname>Ayaz</surname> <given-names>M. F.</given-names></name></person-group> (<year>2017</year>). <article-title>A meta-analysis of the effect of Peer Instruction on learning gain: identification of informational and cultural moderators</article-title>. <source>Int. J. Educ. Res.</source> <volume>86</volume>, <fpage>66</fpage>&#x02013;<lpage>77</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijer.2017.08.009</pub-id></citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Belland</surname> <given-names>B. R.</given-names></name> <name><surname>Walker</surname> <given-names>A. E.</given-names></name> <name><surname>Kim</surname> <given-names>N. J.</given-names></name> <name><surname>Lefler</surname> <given-names>M.</given-names></name></person-group> (<year>2017</year>). <article-title>Synthesizing results from empirical research on computer-based scaffolding in STEM education</article-title>. <source>Rev. Educ. Res.</source> <volume>87</volume>, <fpage>309</fpage>&#x02013;<lpage>344</lpage>. <pub-id pub-id-type="doi">10.3102/0034654316670999</pub-id><pub-id pub-id-type="pmid">28344365</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Bisra</surname> <given-names>K.</given-names></name> <name><surname>Liu</surname> <given-names>Q.</given-names></name> <name><surname>Nesbit</surname> <given-names>J. C.</given-names></name> <name><surname>Salimi</surname> <given-names>F.</given-names></name> <name><surname>Winne</surname> <given-names>P. H.</given-names></name></person-group> (<year>2018</year>). <article-title>Inducing self-explanation: a meta-analysis</article-title>. <source>Educ. Psychol. Rev.</source> <volume>30</volume>, <fpage>703</fpage>&#x02013;<lpage>725</lpage>. <pub-id pub-id-type="doi">10.1007/s10648-018-9434-x</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borenstein</surname> <given-names>M.</given-names></name> <name><surname>Hedges</surname> <given-names>L. V.</given-names></name> <name><surname>Higgins</surname> <given-names>J. P.</given-names></name> <name><surname>Rothstein</surname> <given-names>H. R.</given-names></name></person-group> (<year>2010</year>). <article-title>A basic introduction to fixed-effect and random-effects models for meta-analysis</article-title>. <source>Res. Synth. Methods</source> <volume>1</volume>, <fpage>97</fpage>&#x02013;<lpage>111</lpage>. <pub-id pub-id-type="doi">10.1002/jrsm.12</pub-id><pub-id pub-id-type="pmid">26061376</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Borenstein</surname> <given-names>M.</given-names></name> <name><surname>Hedges</surname> <given-names>L. V.</given-names></name> <name><surname>Higgins</surname> <given-names>J. P.</given-names></name> <name><surname>Rothstein</surname> <given-names>H. R.</given-names></name></person-group> (<year>2011</year>). <source>Introduction to Meta-Analysis</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>John Wiley and Sons</publisher-name>.</citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>The current status of STEM education research</article-title>. <source>J. STEM Educ. Innov. Res.</source> <volume>13</volume>, <fpage>7</fpage>&#x02013;<lpage>11</lpage>.</citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cafri</surname> <given-names>G.</given-names></name> <name><surname>Kromrey</surname> <given-names>J. D.</given-names></name> <name><surname>Brannick</surname> <given-names>M. T.</given-names></name></person-group> (<year>2010</year>). <article-title>A meta-meta-analysis: Empirical review of statistical power, type I error rates, effect sizes, and model selection of meta-analyses published in psychology</article-title>. <source>Multivar. Behav. Res.</source> <volume>45</volume>, <fpage>239</fpage>&#x02013;<lpage>270</lpage>. <pub-id pub-id-type="doi">10.1080/00273171003680187</pub-id><pub-id pub-id-type="pmid">26760285</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cain</surname> <given-names>T.</given-names></name> <name><surname>Brindley</surname> <given-names>S.</given-names></name> <name><surname>Brown</surname> <given-names>C.</given-names></name> <name><surname>Jones</surname> <given-names>G.</given-names></name> <name><surname>Riga</surname> <given-names>F.</given-names></name></person-group> (<year>2019</year>). <article-title>Bounded decision-making, teachers&#x00027; reflection and organisational learning: How research can inform teachers and teaching</article-title>. <source>Br. Educ. Res. J.</source> <volume>45</volume>, <fpage>1072</fpage>&#x02013;<lpage>1087</lpage>. <pub-id pub-id-type="doi">10.1002/berj.3551</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>C. H.</given-names></name> <name><surname>Yang</surname> <given-names>Y. C.</given-names></name></person-group> (<year>2019</year>). <article-title>Revisiting the effects of project-based learning on students&#x00027; academic achievement: a meta-analysis investigating moderators</article-title>. <source>Educ. Res. Rev.</source> <volume>26</volume>, <fpage>71</fpage>&#x02013;<lpage>81</lpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2018.11.001</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Cheung</surname> <given-names>A.</given-names></name> <name><surname>Slavin</surname> <given-names>R. E.</given-names></name> <name><surname>Kim</surname> <given-names>E.</given-names></name> <name><surname>Lake</surname> <given-names>C.</given-names></name></person-group> (<year>2017</year>). <article-title>Effective secondary science programs: a best-evidence synthesis</article-title>. <source>J. Res. Sci. Teach.</source> <volume>54</volume>, <fpage>58</fpage>&#x02013;<lpage>81</lpage>. <pub-id pub-id-type="doi">10.1002/tea.21338</pub-id></citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheung</surname> <given-names>A. C.</given-names></name> <name><surname>Slavin</surname> <given-names>R. E.</given-names></name></person-group> (<year>2016</year>). <article-title>How methodological features affect effect sizes in education</article-title>. <source>Educ. Res.</source> <volume>45</volume>, <fpage>283</fpage>&#x02013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.3102/0013189X16656615</pub-id><pub-id pub-id-type="pmid">32339107</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Cheung</surname> <given-names>A. C. K.</given-names></name> <name><surname>Slavin</surname> <given-names>R. E.</given-names></name></person-group> (<year>2013</year>). <article-title>The effectiveness of educational technology applications for enhancing mathematics achievement in K-12 classrooms: a meta-analysis</article-title>. <source>Educ. Res. Rev.</source> <volume>9</volume>, <fpage>88</fpage>&#x02013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2013.01.001</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Coe</surname> <given-names>R.</given-names></name></person-group> (<year>2002</year>). <source>It&#x00027;s the Effect Size, Stupid: What Effect Size is and Why it is Important</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://f.hubspotusercontent30.net/hubfs/5191137/attachments/ebe/ESguide.pdf">https://f.hubspotusercontent30.net/hubfs/5191137/attachments/ebe/ESguide.pdf</ext-link></citation>
</ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>D. K.</given-names></name> <name><surname>Spillane</surname> <given-names>J. P.</given-names></name> <name><surname>Peurach</surname> <given-names>D. J.</given-names></name></person-group> (<year>2018</year>). <article-title>The dilemmas of educational reform</article-title>. <source>Educ. Res.</source> <volume>47</volume>, <fpage>204</fpage>&#x02013;<lpage>212</lpage>. <pub-id pub-id-type="doi">10.3102/0013189X17743488</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Cooper</surname> <given-names>H.</given-names></name></person-group> (<year>2015</year>). <source>Research Synthesis and Meta-Analysis: A Step-by-Step Approach</source> (Vol. 2). <publisher-loc>London</publisher-loc>: <publisher-name>Sage Publications</publisher-name>.</citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cooper</surname> <given-names>H.</given-names></name> <name><surname>Koenka</surname> <given-names>A. C.</given-names></name></person-group> (<year>2012</year>). <article-title>The overview of reviews: Unique challenges and opportunities when research syntheses are the principal elements of new integrative scholarship</article-title>. <source>Am. Psychol.</source> <volume>67</volume>, <fpage>446</fpage>. <pub-id pub-id-type="doi">10.1037/a0027119</pub-id><pub-id pub-id-type="pmid">22352742</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Corcoran</surname> <given-names>R. P.</given-names></name> <name><surname>Cheung</surname> <given-names>A.</given-names></name> <name><surname>Kim</surname> <given-names>E.</given-names></name> <name><surname>Xie</surname> <given-names>C.</given-names></name></person-group> (<year>2017</year>). <article-title>Effective universal school-based social and emotional learning programs for improving academic achievement: a systematic review and meta-analysis of 50 years of research</article-title>. <source>Educ. Res. Rev.</source> <volume>25</volume>, <fpage>56</fpage>&#x02013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2017.12.001</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Darabi</surname> <given-names>A.</given-names></name> <name><surname>Arrington</surname> <given-names>T. L.</given-names></name> <name><surname>Sayilir</surname> <given-names>E.</given-names></name></person-group> (<year>2018</year>). <article-title>Learning from failure: a meta-analysis of the empirical studies</article-title>. <source>Educ. Technol. Res. Dev.</source> <volume>66</volume>, <fpage>1101</fpage>&#x02013;<lpage>1118</lpage>. <pub-id pub-id-type="doi">10.1007/s11423-018-9579-9</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>de Boer</surname> <given-names>H.</given-names></name> <name><surname>Donker</surname> <given-names>A. S.</given-names></name> <name><surname>van der Werf</surname> <given-names>M. P. C.</given-names></name></person-group> (<year>2014</year>). <article-title>Effects of the attributes of educational interventions on students&#x00027; academic performance: a meta-analysis</article-title>. <source>Rev. Educ. Res.</source> <volume>84</volume>, <fpage>509</fpage>&#x02013;<lpage>545</lpage>. <pub-id pub-id-type="doi">10.3102/0034654314540006</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Kock</surname> <given-names>A.</given-names></name> <name><surname>Sleegers</surname> <given-names>P.</given-names></name> <name><surname>Voeten</surname> <given-names>M. J.</given-names></name></person-group> (<year>2004</year>). <article-title>New learning and the classification of learning environments in secondary education</article-title>. <source>Rev. Educ. Res.</source> <volume>74</volume>, <fpage>141</fpage>&#x02013;<lpage>170</lpage>. <pub-id pub-id-type="doi">10.3102/00346543074002141</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Diery</surname> <given-names>A.</given-names></name> <name><surname>Knogler</surname> <given-names>M.</given-names></name> <name><surname>Seidel</surname> <given-names>T.</given-names></name></person-group> (<year>2021</year>). <article-title>Supporting evidence-based practice through teacher education: A study on teacher educators as central agents</article-title>. <source>Int. J. Educ. Res. Open</source> <volume>2</volume>, <fpage>100056</fpage>. <pub-id pub-id-type="doi">10.1016/j.ijedro.2021.100056</pub-id></citation>
</ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Diery</surname> <given-names>A.</given-names></name> <name><surname>Vogel</surname> <given-names>F.</given-names></name> <name><surname>Knogler</surname> <given-names>M.</given-names></name> <name><surname>Seidel</surname> <given-names>T.</given-names></name></person-group> (<year>2020</year>). <article-title>Evidence-based practice in higher education: teacher educators&#x00027; attitudes, challenges, and uses</article-title>. <source>Front. Educ.</source> <volume>5</volume>:<fpage>62</fpage>. <pub-id pub-id-type="doi">10.3389/feduc.2020.00062</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Dignath</surname> <given-names>C.</given-names></name> <name><surname>Buttner</surname> <given-names>G.</given-names></name></person-group> (<year>2008</year>). <article-title>Components of fostering self-regulated learning among students. A meta-analysis on intervention studies at primary and secondary school level</article-title>. <source>Metacogn. Learn.</source> <volume>3</volume>, <fpage>231</fpage>&#x02013;<lpage>264</lpage>. <pub-id pub-id-type="doi">10.1007/s11409-008-9029-x</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Donker</surname> <given-names>A. S.</given-names></name> <name><surname>de Boer</surname> <given-names>H.</given-names></name> <name><surname>Kostons</surname> <given-names>D.</given-names></name> <name><surname>Van Ewijk</surname> <given-names>C. D.</given-names></name> <name><surname>van der Werf</surname> <given-names>M. P.</given-names></name></person-group> (<year>2014</year>). <article-title>Effectiveness of learning strategy instruction on academic performance: a meta-analysis</article-title>. <source>Educ. Res. Rev.</source> <volume>11</volume>, <fpage>1</fpage>&#x02013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2013.11.002</pub-id></citation>
</ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dunlosky</surname> <given-names>J.</given-names></name> <name><surname>Rawson</surname> <given-names>K. A.</given-names></name> <name><surname>Marsh</surname> <given-names>E. J.</given-names></name> <name><surname>Nathan</surname> <given-names>M. J.</given-names></name> <name><surname>Willingham</surname> <given-names>D. T.</given-names></name></person-group> (<year>2013</year>). <article-title>Improving students&#x00027; learning with effective learning techniques: promising directions from cognitive and educational psychology</article-title>. <source>Psychol. Sci. Public Interest</source> <volume>14</volume>, <fpage>4</fpage>&#x02013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.1177/1529100612453266</pub-id><pub-id pub-id-type="pmid">26173288</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Engelmann</surname> <given-names>K.</given-names></name> <name><surname>Neuhaus</surname> <given-names>B. J.</given-names></name> <name><surname>Fischer</surname> <given-names>F.</given-names></name></person-group> (<year>2016</year>). <article-title>Fostering scientific reasoning in education&#x02013;meta-analytic evidence from intervention studies</article-title>. <source>Educ. Res. Eval.</source> <volume>22</volume>, <fpage>333</fpage>&#x02013;<lpage>349</lpage>. <pub-id pub-id-type="doi">10.1080/13803611.2016.1240089</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="web"><person-group person-group-type="author"><collab>Every Student Succeeds Act</collab></person-group> (<year>2015</year>). Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.ed.gov/essa?src=rn">https://www.ed.gov/essa?src=rn</ext-link></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eysenck</surname> <given-names>H. J.</given-names></name></person-group> (<year>1978</year>). <article-title>An exercise in mega-silliness</article-title>. <source>Am. Psychol.</source> <volume>33</volume>, <fpage>517</fpage>. <pub-id pub-id-type="doi">10.1037/0003-066X.33.5.517.a</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Fan</surname> <given-names>H.</given-names></name> <name><surname>Xu</surname> <given-names>J.</given-names></name> <name><surname>Cai</surname> <given-names>Z.</given-names></name> <name><surname>He</surname> <given-names>J.</given-names></name> <name><surname>Fan</surname> <given-names>X.</given-names></name></person-group> (<year>2017</year>). <article-title>Homework and students&#x00027; achievement in math and science: a 30-year meta-analysis, 1986&#x02013;2015</article-title>. <source>Educ. Res. Rev.</source> <volume>20</volume>, <fpage>35</fpage>&#x02013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2016.11.003</pub-id></citation>
</ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ferguson</surname> <given-names>C. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Is psychological research really as good as medical research? Effect size comparisons between psychology and medicine</article-title>. <source>Rev. General Psychol.</source> <volume>13</volume>, <fpage>130</fpage>&#x02013;<lpage>136</lpage>. <pub-id pub-id-type="doi">10.1037/a0015103</pub-id><pub-id pub-id-type="pmid">33539561</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Furtak</surname> <given-names>E. M.</given-names></name> <name><surname>Seidel</surname> <given-names>T.</given-names></name> <name><surname>Iverson</surname> <given-names>H.</given-names></name> <name><surname>Briggs</surname> <given-names>D. C.</given-names></name></person-group> (<year>2012</year>). <article-title>Experimental and quasi-experimental studies of inquiry-based science teaching: a meta-analysis</article-title>. <source>Rev. Educ. Res.</source> <volume>82</volume>, <fpage>300</fpage>&#x02013;<lpage>329</lpage>. <pub-id pub-id-type="doi">10.3102/0034654312457206</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Gerard</surname> <given-names>L.</given-names></name> <name><surname>Matuk</surname> <given-names>C.</given-names></name> <name><surname>McElhaney</surname> <given-names>K.</given-names></name> <name><surname>Linn</surname> <given-names>M. C.</given-names></name></person-group> (<year>2015</year>). <article-title>Automated, adaptive guidance for K-12 education</article-title>. <source>Educ. Res. Rev.</source> <volume>15</volume>, <fpage>41</fpage>&#x02013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2015.04.001</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Ginns</surname> <given-names>P.</given-names></name> <name><surname>Martin</surname> <given-names>A. J.</given-names></name> <name><surname>Marsh</surname> <given-names>H. W.</given-names></name></person-group> (<year>2013</year>). <article-title>Designing instructional text in a conversational style: a meta-analysis</article-title>. <source>Educ. Psychol. Rev.</source> <volume>25</volume>, <fpage>445</fpage>&#x02013;<lpage>472</lpage>. <pub-id pub-id-type="doi">10.1007/s10648-013-9228-0</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grossman</surname> <given-names>P.</given-names></name> <name><surname>Compton</surname> <given-names>C.</given-names></name> <name><surname>Igra</surname> <given-names>D.</given-names></name> <name><surname>Ronfeldt</surname> <given-names>M.</given-names></name> <name><surname>Shahan</surname> <given-names>E.</given-names></name> <name><surname>Williamson</surname> <given-names>P.</given-names></name></person-group> (<year>2009</year>). <article-title>Teaching practice: a cross-professional perspective</article-title>. <source>Teach. Coll. Rec.</source> <volume>111</volume>, <fpage>2055</fpage>&#x02013;<lpage>2100</lpage>. <pub-id pub-id-type="doi">10.1177/016146810911100905</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gurevitch</surname> <given-names>J.</given-names></name> <name><surname>Koricheva</surname> <given-names>J.</given-names></name> <name><surname>Nakagawa</surname> <given-names>S.</given-names></name> <name><surname>Stewart</surname> <given-names>G.</given-names></name></person-group> (<year>2018</year>). <article-title>Meta-analysis and the science of research synthesis</article-title>. <source>Nature</source> <volume>555</volume>, <fpage>175</fpage>&#x02013;<lpage>182</lpage>. <pub-id pub-id-type="doi">10.1038/nature25753</pub-id><pub-id pub-id-type="pmid">29517004</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Haas</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>Teaching methods for secondary algebra: a meta-analysis of findings</article-title>. <source>NASSP Bull.</source> <volume>89</volume>, <fpage>24</fpage>&#x02013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.1177/019263650508964204</pub-id></citation>
</ref>
<ref id="B40">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hattie</surname> <given-names>J. A. C.</given-names></name></person-group> (<year>2009</year>). <source>Visible Learning: A Synthesis of Over 800 Meta-Analyses Relating to Achievement</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Routledge</publisher-name>.</citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hedges</surname> <given-names>L. V.</given-names></name></person-group> (<year>2013</year>). <article-title>Recommendations for practice: justifying claims of generalizability</article-title>. <source>Educ. Psychol. Rev.</source> <volume>25</volume>, <fpage>331</fpage>&#x02013;<lpage>337</lpage>. <pub-id pub-id-type="doi">10.1007/s10648-013-9239-x</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hedges</surname> <given-names>L. V.</given-names></name></person-group> (<year>2018</year>). <article-title>Challenges in building usable knowledge in education</article-title>. <source>J. Res. Educ. Eff.</source> <volume>11</volume>, <fpage>1</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1080/19345747.2017.1375583</pub-id><pub-id pub-id-type="pmid">26463730</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hedges</surname> <given-names>L. V.</given-names></name> <name><surname>Tipton</surname> <given-names>E.</given-names></name> <name><surname>Johnson</surname> <given-names>M. C.</given-names></name></person-group> (<year>2010</year>). <article-title>Robust variance estimation in meta-regression with dependent effect size estimates</article-title>. <source>Res. Synth. Methods</source> <volume>1</volume>, <fpage>39</fpage>&#x02013;<lpage>65</lpage>. <pub-id pub-id-type="doi">10.1002/jrsm.5</pub-id><pub-id pub-id-type="pmid">26061381</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hedges</surname> <given-names>L. V.</given-names></name> <name><surname>Vevea</surname> <given-names>J. L.</given-names></name></person-group> (<year>1998</year>). <article-title>Fixed- and random-effects models in meta-analysis</article-title>. <source>Psychol. Methods</source> <volume>3</volume>, <fpage>486</fpage>.</citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hempel</surname> <given-names>S.</given-names></name> <name><surname>Miles</surname> <given-names>J. N.</given-names></name> <name><surname>Booth</surname> <given-names>M. J.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name> <name><surname>Morton</surname> <given-names>S. C.</given-names></name> <name><surname>Shekelle</surname> <given-names>P. G.</given-names></name></person-group> (<year>2013</year>). <article-title>Risk of bias: a simulation study of power to detect study-level moderator effects in meta-analysis</article-title>. <source>System. Rev.</source> <volume>2</volume>, <fpage>107</fpage>. <pub-id pub-id-type="doi">10.1186/2046-4053-2-107</pub-id><pub-id pub-id-type="pmid">24286208</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Higgins</surname> <given-names>J. P.</given-names></name> <name><surname>Thomas</surname> <given-names>J.</given-names></name> <name><surname>Chandler</surname> <given-names>J.</given-names></name> <name><surname>Cumpston</surname> <given-names>M.</given-names></name> <name><surname>Li</surname> <given-names>T.</given-names></name> <name><surname>Page</surname> <given-names>M. J.</given-names></name> <etal/></person-group>. (<year>2019</year>). <source>Cochrane Handbook for Systematic Reviews of Interventions.</source> <publisher-loc>New York</publisher-loc>: <publisher-name>Wiley</publisher-name>. <pub-id pub-id-type="doi">10.1002/9781119536604</pub-id><pub-id pub-id-type="pmid">35352103</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hill</surname> <given-names>C. J.</given-names></name> <name><surname>Bloom</surname> <given-names>H. S.</given-names></name> <name><surname>Black</surname> <given-names>A. R.</given-names></name> <name><surname>Lipsey</surname> <given-names>M. W.</given-names></name></person-group> (<year>2008</year>). <article-title>Empirical benchmarks for interpreting effect sizes in research</article-title>. <source>Child Dev. Perspect.</source> <volume>2</volume>, <fpage>172</fpage>&#x02013;<lpage>177</lpage>. <pub-id pub-id-type="doi">10.1111/j.1750-8606.2008.00061.x</pub-id><pub-id pub-id-type="pmid">27694127</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hillmayr</surname> <given-names>D.</given-names></name> <name><surname>Ziernwald</surname> <given-names>L.</given-names></name> <name><surname>Reinhold</surname> <given-names>F.</given-names></name> <name><surname>Hofer</surname> <given-names>S. I.</given-names></name> <name><surname>Reiss</surname> <given-names>K. M.</given-names></name></person-group> (<year>2020</year>). <article-title>The potential of digital tools to enhance mathematics and science learning in secondary schools: a context-specific meta-analysis</article-title>. <source>Comput. Educ.</source> <volume>153</volume>, <fpage>103897</fpage>. <pub-id pub-id-type="doi">10.1016/j.compedu.2020.103897</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Howe</surname> <given-names>K.</given-names></name></person-group> (<year>2009</year>). <article-title>Epistemology, methodology, and education sciences</article-title>. <source>Educ. Res.</source> <volume>38</volume>, <fpage>428</fpage>. <pub-id pub-id-type="doi">10.3102/0013189X09342003</pub-id></citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Joyce</surname> <given-names>K. E.</given-names></name> <name><surname>Cartwright</surname> <given-names>N.</given-names></name></person-group> (<year>2020</year>). <article-title>Bridging the gap between research and practice: predicting what will work locally</article-title>. <source>Am. Educ. Res. J.</source> <volume>57</volume>, <fpage>1045</fpage>&#x02013;<lpage>1082</lpage>. <pub-id pub-id-type="doi">10.3102/0002831219866687</pub-id></citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kloser</surname> <given-names>M.</given-names></name></person-group> (<year>2014</year>). <article-title>Identifying a core set of science teaching practices: a Delphi expert panel approach</article-title>. <source>J. Res. Sci. Teach.</source> <volume>51</volume>, <fpage>1185</fpage>&#x02013;<lpage>1217</lpage>. <pub-id pub-id-type="doi">10.1002/tea.21171</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kraft</surname> <given-names>M. A.</given-names></name></person-group> (<year>2020</year>). <article-title>Interpreting effect sizes of education interventions</article-title>. <source>Educ. Res.</source> <volume>49</volume>, <fpage>241</fpage>&#x02013;<lpage>253</lpage>. <pub-id pub-id-type="doi">10.3102/0013189X20912798</pub-id></citation>
</ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuhn</surname> <given-names>D.</given-names></name></person-group> (<year>2007</year>). <article-title>Is direct instruction an answer to the right question?</article-title> <source>Educ. Psychol.</source> <volume>42</volume>, <fpage>109</fpage>&#x02013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.1080/00461520701263376</pub-id></citation>
</ref>
<ref id="B54">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Lazonder</surname> <given-names>A. W.</given-names></name> <name><surname>Harmsen</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>Meta-analysis of inquiry-based learning: effects of guidance</article-title>. <source>Rev. Educ. Res.</source> <volume>86</volume>, <fpage>681</fpage>&#x02013;<lpage>718</lpage>. <pub-id pub-id-type="doi">10.3102/0034654315627366</pub-id></citation>
</ref>
<ref id="B55">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>Y.</given-names></name> <name><surname>Capraro</surname> <given-names>M. M.</given-names></name> <name><surname>Capraro</surname> <given-names>R. M.</given-names></name> <name><surname>Bicer</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>). <article-title>A meta-analysis: Improvement of students&#x00027; algebraic reasoning through metacognitive training</article-title>. <source>Int. Educ. Stud.</source> <volume>11</volume>, <fpage>42</fpage>&#x02013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.5539/ies.v11n10p42</pub-id></citation>
</ref>
<ref id="B56">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Q.</given-names></name> <name><surname>Ma</surname> <given-names>X.</given-names></name></person-group> (<year>2010</year>). <article-title>A meta-analysis of the effects of computer technology on school students&#x00027; mathematics learning</article-title>. <source>Educ. Psychol. Rev.</source> <volume>22</volume>, <fpage>215</fpage>&#x02013;<lpage>243</lpage>. <pub-id pub-id-type="doi">10.1007/s10648-010-9125-8</pub-id></citation>
</ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>K.</given-names></name> <name><surname>Xiao</surname> <given-names>Y.</given-names></name> <name><surname>Froyd</surname> <given-names>J. E.</given-names></name></person-group> (<year>2020</year>). <article-title>Research and trends in STEM education: a systematic review of journal publications</article-title>. <source>Int. J. STEM Educ.</source> <volume>7</volume>, <fpage>1</fpage>&#x02013;<lpage>16</lpage>. <pub-id pub-id-type="doi">10.1186/2196-7822-1-1</pub-id></citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>T. J.</given-names></name> <name><surname>Lin</surname> <given-names>T. C.</given-names></name> <name><surname>Potvin</surname> <given-names>P.</given-names></name> <name><surname>Tsai</surname> <given-names>C. C.</given-names></name></person-group> (<year>2019</year>). <article-title>Research trends in science education from 2013 to 2017: a systematic content analysis of publications in selected journals</article-title>. <source>Int. J. Sci. Educ.</source> <volume>41</volume>, <fpage>367</fpage>&#x02013;<lpage>387</lpage>. <pub-id pub-id-type="doi">10.1080/09500693.2018.1550274</pub-id></citation>
</ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lipsey</surname> <given-names>M. W.</given-names></name> <name><surname>Wilson</surname> <given-names>D. B.</given-names></name></person-group> (<year>1993</year>). <article-title>The efficacy of psychological, educational, and behavioral treatment: confirmation from meta-analysis</article-title>. <source>Am. Psychol.</source> <volume>48</volume>, <fpage>1181</fpage>. <pub-id pub-id-type="doi">10.1037/0003-066X.48.12.1181</pub-id><pub-id pub-id-type="pmid">8297057</pub-id></citation></ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lynch</surname> <given-names>K.</given-names></name> <name><surname>Hill</surname> <given-names>H. C.</given-names></name> <name><surname>Gonzalez</surname> <given-names>K. E.</given-names></name> <name><surname>Pollard</surname> <given-names>C.</given-names></name></person-group> (<year>2019</year>). <article-title>Strengthening the research base that informs STEM instructional improvement efforts: a meta-analysis</article-title>. <source>Educ. Eval. Policy Anal.</source> <volume>41</volume>, <fpage>260</fpage>&#x02013;<lpage>293</lpage>. <pub-id pub-id-type="doi">10.3102/0162373719849044</pub-id></citation>
</ref>
<ref id="B61">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Ma</surname> <given-names>W.</given-names></name> <name><surname>Adesope</surname> <given-names>O. O.</given-names></name> <name><surname>Nesbit</surname> <given-names>J. C.</given-names></name> <name><surname>Liu</surname> <given-names>Q.</given-names></name></person-group> (<year>2014</year>). <article-title>Intelligent tutoring systems and learning outcomes: a meta-analysis</article-title>. <source>J. Educ. Psychol.</source> <volume>106</volume>, <fpage>901</fpage>&#x02013;<lpage>918</lpage>. <pub-id pub-id-type="doi">10.1037/a0037123</pub-id></citation>
</ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Makel</surname> <given-names>M. C.</given-names></name> <name><surname>Hodges</surname> <given-names>J.</given-names></name> <name><surname>Cook</surname> <given-names>B. G.</given-names></name> <name><surname>Plucker</surname> <given-names>J. A.</given-names></name></person-group> (<year>2021</year>). <article-title>Both questionable and open research practices are prevalent in education research</article-title>. <source>Educ. Res.</source> <volume>50</volume>, <fpage>493</fpage>&#x02013;<lpage>504</lpage>. <pub-id pub-id-type="doi">10.3102/0013189X211001356</pub-id></citation>
</ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mayer</surname> <given-names>R. E.</given-names></name></person-group> (<year>2004</year>). <article-title>Should there be a three-strikes rule against pure discovery learning?</article-title>. <source>Am. Psychol.</source> <volume>59</volume>, <fpage>14</fpage>. <pub-id pub-id-type="doi">10.1037/0003-066X.59.1.14</pub-id><pub-id pub-id-type="pmid">14736316</pub-id></citation></ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moher</surname> <given-names>D.</given-names></name> <name><surname>Liberati</surname> <given-names>A.</given-names></name> <name><surname>Tetzlaff</surname> <given-names>J.</given-names></name> <name><surname>Altman</surname> <given-names>D. G.</given-names></name></person-group> (<year>2009</year>). <article-title>Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement</article-title>. <source>PLoS Med.</source> <volume>6</volume>, <fpage>e1000097</fpage>&#x02013;<lpage>e1000097</lpage>. <pub-id pub-id-type="doi">10.1371/journal.pmed.1000097</pub-id><pub-id pub-id-type="pmid">20171303</pub-id></citation></ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Morris</surname> <given-names>S.</given-names></name></person-group> (<year>2008</year>). <article-title>Estimating effect sizes from pretest-posttest-control group designs</article-title>. <source>Organ. Res. Methods.</source> <volume>11</volume>, <fpage>364</fpage>&#x02013;<lpage>386</lpage>. <pub-id pub-id-type="doi">10.1177/1094428106291059</pub-id><pub-id pub-id-type="pmid">19271847</pub-id></citation></ref>
<ref id="B66">
<citation citation-type="book"><person-group person-group-type="author"><collab>National Research Council</collab></person-group> (<year>2012</year>). <source>A Framework for K-12 Science Education: Practices, Crosscutting Concepts, and Core Ideas.</source> <publisher-loc>Washington, DC</publisher-loc>: <publisher-name>National Academies Press</publisher-name>.</citation>
</ref>
<ref id="B67">
<citation citation-type="web"><person-group person-group-type="author"><collab>No Child Left Behind Act</collab></person-group> (<year>2002</year>). Available online at: <ext-link ext-link-type="uri" xlink:href="https://www2.ed.gov/nclb/landing.jhtml">https://www2.ed.gov/nclb/landing.jhtml</ext-link></citation>
</ref>
<ref id="B68">
<citation citation-type="book"><person-group person-group-type="author"><collab>OECD</collab></person-group> (<year>2019</year>). <source>PISA 2018 Assessment and Analytical Framework</source>. PISA. <publisher-loc>Paris</publisher-loc>: <publisher-name>OECD Publishing</publisher-name>. <pub-id pub-id-type="doi">10.1787/b25efab8-en</pub-id></citation>
</ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pigott</surname> <given-names>T. D.</given-names></name> <name><surname>Polanin</surname> <given-names>J. R.</given-names></name></person-group> (<year>2020</year>). <article-title>Methodological guidance paper: high-quality meta-analysis in a systematic review</article-title>. <source>Rev. Educ. Res.</source> <volume>90</volume>, <fpage>24</fpage>&#x02013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.3102/0034654319877153</pub-id></citation>
</ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Polanin</surname> <given-names>J. R.</given-names></name> <name><surname>Hennessy</surname> <given-names>E. A.</given-names></name> <name><surname>Tsuji</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Transparency and reproducibility of meta-analyses in psychology: a meta-review</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>2020</volume>:<fpage>1745691620906416</fpage>. <pub-id pub-id-type="doi">10.1177/1745691620906416</pub-id><pub-id pub-id-type="pmid">32516081</pub-id></citation></ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Polanin</surname> <given-names>J. R.</given-names></name> <name><surname>Maynard</surname> <given-names>B. R.</given-names></name> <name><surname>Dell</surname> <given-names>N. A.</given-names></name></person-group> (<year>2017</year>). <article-title>Overviews in education research: a systematic review and analysis</article-title>. <source>Rev. Educ. Res.</source> <volume>87</volume>, <fpage>172</fpage>&#x02013;<lpage>203</lpage>. <pub-id pub-id-type="doi">10.3102/0034654316631117</pub-id></citation>
</ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Polanin</surname> <given-names>J. R.</given-names></name> <name><surname>Pigott</surname> <given-names>T. D.</given-names></name></person-group> (<year>2015</year>). <article-title>The use of meta-analytic statistical significance testing</article-title>. <source>Res. Synth. Methods</source> <volume>6</volume>, <fpage>63</fpage>&#x02013;<lpage>73</lpage>. <pub-id pub-id-type="doi">10.1002/jrsm.1124</pub-id><pub-id pub-id-type="pmid">26035470</pub-id></citation></ref>
<ref id="B73">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Rakes</surname> <given-names>C. R.</given-names></name> <name><surname>Valentine</surname> <given-names>J. C.</given-names></name> <name><surname>McGatha</surname> <given-names>M. B.</given-names></name> <name><surname>Ronau</surname> <given-names>R. N.</given-names></name></person-group> (<year>2010</year>). <article-title>Methods of instructional improvement in algebra: a systematic review and meta-analysis</article-title>. <source>Rev. Educ. Res.</source> <volume>80</volume>, <fpage>372</fpage>&#x02013;<lpage>400</lpage>. <pub-id pub-id-type="doi">10.3102/0034654310374880</pub-id></citation>
</ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robinson</surname> <given-names>D. H.</given-names></name> <name><surname>Levin</surname> <given-names>J. R.</given-names></name> <name><surname>Schraw</surname> <given-names>G.</given-names></name> <name><surname>Patall</surname> <given-names>E. A.</given-names></name> <name><surname>Hunt</surname> <given-names>E. B.</given-names></name></person-group> (<year>2013</year>). <article-title>On going (way) beyond one&#x00027;s data: a proposal to restrict recommendations for practice in primary educational research journals</article-title>. <source>Educ. Psychol. Rev.</source> <volume>25</volume>, <fpage>291</fpage>&#x02013;<lpage>302</lpage>. <pub-id pub-id-type="doi">10.1007/s10648-013-9223-5</pub-id></citation>
</ref>
<ref id="B75">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Sanchez</surname> <given-names>C. E.</given-names></name> <name><surname>Atkinson</surname> <given-names>K. M.</given-names></name> <name><surname>Koenka</surname> <given-names>A. C.</given-names></name> <name><surname>Moshontz</surname> <given-names>H.</given-names></name> <name><surname>Cooper</surname> <given-names>H.</given-names></name></person-group> (<year>2017</year>). <article-title>Self-grading and peer-grading for formative and summative assessments in 3rd through 12th grade classrooms: a meta-analysis</article-title>. <source>J. Educ. Psychol.</source> <volume>109</volume>, <fpage>1049</fpage>&#x02013;<lpage>1066</lpage>. <pub-id pub-id-type="doi">10.1037/edu0000190</pub-id></citation>
</ref>
<ref id="B76">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Savelsbergh</surname> <given-names>E. R.</given-names></name> <name><surname>Prins</surname> <given-names>G. T.</given-names></name> <name><surname>Rietbergen</surname> <given-names>C.</given-names></name> <name><surname>Fechner</surname> <given-names>S.</given-names></name> <name><surname>Vaessen</surname> <given-names>B. E.</given-names></name> <name><surname>Draijer</surname> <given-names>J. M.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Effects of innovative science and mathematics teaching on student attitudes and achievement: a meta-analytic study</article-title>. <source>Educ. Res. Rev.</source> <volume>19</volume>, <fpage>158</fpage>&#x02013;<lpage>172</lpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2016.07.003</pub-id></citation>
</ref>
<ref id="B77">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schalken</surname> <given-names>N.</given-names></name> <name><surname>Rietbergen</surname> <given-names>C.</given-names></name></person-group> (<year>2017</year>). <article-title>The reporting quality of systematic reviews and meta-analyses in industrial and organizational psychology: a systematic review</article-title>. <source>Front. Psychol.</source> <volume>8</volume>, <fpage>1395</fpage>&#x02013;<lpage>1395</lpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2017.01395</pub-id><pub-id pub-id-type="pmid">28878704</pub-id></citation></ref>
<ref id="B78">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schauer</surname> <given-names>J. M.</given-names></name> <name><surname>Hedges</surname> <given-names>L. V.</given-names></name></person-group> (<year>2020</year>). <article-title>Assessing heterogeneity and power in replications of psychological experiments</article-title>. <source>Psychol. Bull.</source> <volume>146</volume>, <fpage>701</fpage>&#x02013;<lpage>719</lpage>. <pub-id pub-id-type="doi">10.1037/bul0000232</pub-id><pub-id pub-id-type="pmid">32271029</pub-id></citation></ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schneider</surname> <given-names>M.</given-names></name> <name><surname>Preckel</surname> <given-names>F.</given-names></name></person-group> (<year>2017</year>). <article-title>Variables associated with achievement in higher education: a systematic review of meta-analyses</article-title>. <source>Psychol. Bull.</source> <volume>143</volume>, <fpage>565</fpage>. <pub-id pub-id-type="doi">10.1037/bul0000098</pub-id><pub-id pub-id-type="pmid">28333495</pub-id></citation></ref>
<ref id="B80">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Schneider</surname> <given-names>S.</given-names></name> <name><surname>Beege</surname> <given-names>M.</given-names></name> <name><surname>Nebel</surname> <given-names>S.</given-names></name> <name><surname>Rey</surname> <given-names>G. D.</given-names></name></person-group> (<year>2018</year>). <article-title>A meta-analysis of how signaling affects learning with media</article-title>. <source>Educ. Res. Rev.</source> <volume>23</volume>, <fpage>1</fpage>&#x02013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2017.11.001</pub-id></citation>
</ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schraw</surname> <given-names>G.</given-names></name> <name><surname>Patall</surname> <given-names>E. A.</given-names></name></person-group> (<year>2013</year>). <article-title>Using principles of evidence-based practice to improve prescriptive recommendations</article-title>. <source>Educ. Psychol. Rev.</source> <volume>25</volume>, <fpage>345</fpage>&#x02013;<lpage>351</lpage>. <pub-id pub-id-type="doi">10.1007/s10648-013-9237-z</pub-id></citation>
</ref>
<ref id="B82">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Schroeder</surname> <given-names>C. M.</given-names></name> <name><surname>Scott</surname> <given-names>T. P.</given-names></name> <name><surname>Tolson</surname> <given-names>H.</given-names></name> <name><surname>Huang</surname> <given-names>T. Y.</given-names></name> <name><surname>Lee</surname> <given-names>Y. H.</given-names></name></person-group> (<year>2007</year>). <article-title>A meta-analysis of national research: effects of teaching strategies on student achievement in science in the United States</article-title>. <source>J. Res. Sci. Teach.</source> <volume>44</volume>, <fpage>1436</fpage>&#x02013;<lpage>1460</lpage>. <pub-id pub-id-type="doi">10.1002/tea.20212</pub-id></citation>
</ref>
<ref id="B83">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Schroeder</surname> <given-names>N. L.</given-names></name> <name><surname>Cenkci</surname> <given-names>A. T.</given-names></name></person-group> (<year>2018</year>). <article-title>Spatial contiguity and spatial split-attention effects in multimedia learning environments: a meta-analysis</article-title>. <source>Educ. Psychol. Rev.</source> <volume>30</volume>, <fpage>679</fpage>&#x02013;<lpage>701</lpage>. <pub-id pub-id-type="doi">10.1007/s10648-018-9435-9</pub-id></citation>
</ref>
<ref id="B84">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Schroeder</surname> <given-names>N. L.</given-names></name> <name><surname>Nesbit</surname> <given-names>J. C.</given-names></name> <name><surname>Anguiano</surname> <given-names>C. J.</given-names></name> <name><surname>Adesope</surname> <given-names>O. O.</given-names></name></person-group> (<year>2017</year>). <article-title>Studying and constructing concept maps: a meta-analysis</article-title>. <source>Educ. Psychol. Rev.</source> <volume>30</volume>, <fpage>431</fpage>&#x02013;<lpage>455</lpage>. <pub-id pub-id-type="doi">10.1007/s10648-017-9403-9</pub-id></citation>
</ref>
<ref id="B85">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Schwichow</surname> <given-names>M.</given-names></name> <name><surname>Croker</surname> <given-names>S.</given-names></name> <name><surname>Zimmerman</surname> <given-names>C.</given-names></name> <name><surname>H&#x000F6;ffler</surname> <given-names>T.</given-names></name> <name><surname>H&#x000E4;rtig</surname> <given-names>H.</given-names></name></person-group> (<year>2016</year>). <article-title>Teaching the control-of-variables strategy: a meta-analysis</article-title>. <source>Dev. Rev.</source> <volume>39</volume>, <fpage>37</fpage>&#x02013;<lpage>63</lpage>. <pub-id pub-id-type="doi">10.1016/j.dr.2015.12.001</pub-id></citation>
</ref>
<ref id="B86">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seidel</surname> <given-names>T.</given-names></name> <name><surname>Knogler</surname> <given-names>M.</given-names></name> <name><surname>Mok</surname> <given-names>S. Y.</given-names></name> <name><surname>Hetmanek</surname> <given-names>A.</given-names></name> <name><surname>Bauer</surname> <given-names>J.</given-names></name> <name><surname>Vogel</surname> <given-names>F.</given-names></name> <etal/></person-group>. (<year>2017b</year>). <article-title>Forschung f&#x000F6;rdert (Lehrer) Bildung. Das Clearing House Unterricht [Research supports (teacher) education. The Clearing House Unterricht]</article-title>. <source>J. Lehrerinnen- und Lehrerbildung</source> <volume>3</volume>, <fpage>23</fpage>&#x02013;<lpage>28</lpage>.</citation>
</ref>
<ref id="B87">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seidel</surname> <given-names>T.</given-names></name> <name><surname>Mok</surname> <given-names>S. Y.</given-names></name> <name><surname>Hetmanek</surname> <given-names>A.</given-names></name> <name><surname>Knogler</surname> <given-names>M.</given-names></name></person-group> (<year>2017a</year>). <article-title>Meta-Analysen zur Unterrichtsforschung und ihr Beitrag f&#x000FC;r die Realisierung eines Clearing House Unterricht f&#x000FC;r die Lehrerbildung</article-title>. <source>Zeitschrift f&#x000FC;r Bildungsforschung</source> <volume>7</volume>, <fpage>311</fpage>&#x02013;<lpage>325</lpage>. <pub-id pub-id-type="doi">10.1007/s35834-017-0191-6</pub-id></citation>
</ref>
<ref id="B88">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Seidel</surname> <given-names>T.</given-names></name> <name><surname>Shavelson</surname> <given-names>R. J.</given-names></name></person-group> (<year>2007</year>). <article-title>Teaching effectiveness research in the past decade: The role of theory and research design in disentangling meta-analysis results</article-title>. <source>Rev. Educ. Res.</source> <volume>77</volume>, <fpage>454</fpage>&#x02013;<lpage>499</lpage>. <pub-id pub-id-type="doi">10.3102/0034654307310317</pub-id></citation>
</ref>
<ref id="B89">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Shadish</surname> <given-names>W. R.</given-names></name> <name><surname>Cook</surname> <given-names>T. D.</given-names></name> <name><surname>Campbell</surname> <given-names>D. T.</given-names></name></person-group> (<year>2002</year>). <source>Experimental and Quasi-Experimental Designs for Generalized Causal Inference</source>. <publisher-loc>Boston</publisher-loc>: <publisher-name>Houghton Mifflin</publisher-name>.</citation>
</ref>
<ref id="B90">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Shavelson</surname> <given-names>R. J.</given-names></name> <name><surname>Towne</surname> <given-names>L.</given-names></name></person-group> (<year>2002</year>). <source>Scientific Research in Education.</source> <publisher-loc>Washington, DC</publisher-loc>: <publisher-name>National Academies Press</publisher-name>.</citation>
</ref>
<ref id="B91">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shea</surname> <given-names>B. J.</given-names></name> <name><surname>Grimshaw</surname> <given-names>J. M.</given-names></name> <name><surname>Wells</surname> <given-names>G. A.</given-names></name> <name><surname>Boers</surname> <given-names>M.</given-names></name> <name><surname>Andersson</surname> <given-names>N.</given-names></name> <name><surname>Hamel</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2007</year>). <article-title>Development of AMSTAR: a measurement tool to assess the methodological quality of systematic reviews</article-title>. <source>BMC Med. Res. Methodol.</source> <volume>7</volume>, <fpage>10</fpage>. <pub-id pub-id-type="doi">10.1186/1471-2288-7-10</pub-id><pub-id pub-id-type="pmid">17302989</pub-id></citation></ref>
<ref id="B92">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Siddaway</surname> <given-names>A. P.</given-names></name> <name><surname>Wood</surname> <given-names>A. M.</given-names></name> <name><surname>Hedges</surname> <given-names>L. V.</given-names></name></person-group> (<year>2019</year>). <article-title>How to do a systematic review: a best practice guide for conducting and reporting narrative reviews, meta-analyses, and meta-syntheses</article-title>. <source>Annu. Rev. Psychol.</source> <volume>70</volume>, <fpage>747</fpage>&#x02013;<lpage>770</lpage>. <pub-id pub-id-type="doi">10.1146/annurev-psych-010418-102803</pub-id><pub-id pub-id-type="pmid">30089228</pub-id></citation></ref>
<ref id="B93">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simpson</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>). <article-title>Princesses are bigger than elephants: effect size as a category error in evidence-based education</article-title>. <source>Br. Educ. Res. J.</source> <volume>44</volume>, <fpage>897</fpage>&#x02013;<lpage>913</lpage>. <pub-id pub-id-type="doi">10.1002/berj.3474</pub-id></citation>
</ref>
<ref id="B94">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slavin</surname> <given-names>R.</given-names></name> <name><surname>Madden</surname> <given-names>N. A.</given-names></name></person-group> (<year>2011</year>). <article-title>Measures inherent to treatments in program effectiveness reviews</article-title>. <source>J. Res. Educ. Eff.</source> <volume>4</volume>, <fpage>370</fpage>&#x02013;<lpage>380</lpage>. <pub-id pub-id-type="doi">10.1080/19345747.2011.558986</pub-id></citation>
</ref>
<ref id="B95">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slavin</surname> <given-names>R. E.</given-names></name></person-group> (<year>2008</year>). <article-title>Perspectives on evidence-based research in education&#x02014;What works? Issues in synthesizing educational program evaluations</article-title>. <source>Educ. Res.</source> <volume>37</volume>, <fpage>5</fpage>&#x02013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.3102/0013189X08314117</pub-id></citation>
</ref>
<ref id="B96">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slavin</surname> <given-names>R. E.</given-names></name></person-group> (<year>2020</year>). <article-title>How evidence-based reform will transform research and practice in education</article-title>. <source>Educ. Psychol.</source> <volume>55</volume>, <fpage>21</fpage>&#x02013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1080/00461520.2019.1611432</pub-id></citation>
</ref>
<ref id="B97">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Slavin</surname> <given-names>R. E.</given-names></name> <name><surname>Lake</surname> <given-names>C.</given-names></name></person-group> (<year>2008</year>). <article-title>Effective programs in elementary mathematics: A best-evidence synthesis</article-title>. <source>Rev. Educ. Res.</source> <volume>78</volume>, <fpage>427</fpage>&#x02013;<lpage>515</lpage>. <pub-id pub-id-type="doi">10.3102/0034654308317473</pub-id></citation>
</ref>
<ref id="B98">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Sokolowski</surname> <given-names>A.</given-names></name></person-group> (<year>2015</year>). <article-title>The effects of mathematical modelling on students&#x00027; achievement-meta-analysis of research</article-title>. <source>IAFOR J. Educ.</source> <volume>3</volume>, <fpage>93</fpage>&#x02013;<lpage>114</lpage>. <pub-id pub-id-type="doi">10.22492/ije.3.1.06</pub-id></citation>
</ref>
<ref id="B99">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Staines</surname> <given-names>G. L.</given-names></name></person-group> (<year>2008</year>). <article-title>The causal generalization paradox: the case of treatment outcome research</article-title>. <source>Rev. General Psychol.</source> <volume>12</volume>, <fpage>236</fpage>&#x02013;<lpage>252</lpage>. <pub-id pub-id-type="doi">10.1037/1089-2680.12.3.236</pub-id></citation>
</ref>
<ref id="B100">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Steenbergen-Hu</surname> <given-names>S.</given-names></name> <name><surname>Cooper</surname> <given-names>H.</given-names></name></person-group> (<year>2013</year>). <article-title>A meta-analysis of the effectiveness of intelligent tutoring systems on K-12 students&#x00027; mathematical learning</article-title>. <source>J. Educ. Psychol.</source> <volume>105</volume>, <fpage>970</fpage>&#x02013;<lpage>987</lpage>. <pub-id pub-id-type="doi">10.1037/a0032447</pub-id></citation>
</ref>
<ref id="B101">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Sung</surname> <given-names>Y.-T.</given-names></name> <name><surname>Chang</surname> <given-names>K.-E.</given-names></name> <name><surname>Liu</surname> <given-names>T.-C.</given-names></name></person-group> (<year>2016</year>). <article-title>The effects of integrating mobile devices with teaching and learning on students&#x00027; learning performance: a meta-analysis and research synthesis</article-title>. <source>Comput. Educ.</source> <volume>94</volume>, <fpage>252</fpage>&#x02013;<lpage>275</lpage>. <pub-id pub-id-type="doi">10.1016/j.compedu.2015.11.008</pub-id></citation>
</ref>
<ref id="B102">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Sung</surname> <given-names>Y. T.</given-names></name> <name><surname>Yang</surname> <given-names>J. M.</given-names></name> <name><surname>Lee</surname> <given-names>H. Y.</given-names></name></person-group> (<year>2017</year>). <article-title>The effects of mobile-computer-supported collaborative learning: meta-analysis and critical synthesis</article-title>. <source>Rev. Educ. Res.</source> <volume>87</volume>, <fpage>768</fpage>&#x02013;<lpage>805</lpage>. <pub-id pub-id-type="doi">10.3102/0034654317704307</pub-id><pub-id pub-id-type="pmid">28989193</pub-id></citation></ref>
<ref id="B103">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Taylor</surname> <given-names>J. A.</given-names></name> <name><surname>Kowalski</surname> <given-names>S. M.</given-names></name> <name><surname>Polanin</surname> <given-names>J. R.</given-names></name> <name><surname>Askinas</surname> <given-names>K.</given-names></name> <name><surname>Stuhlsatz</surname> <given-names>M. A.</given-names></name> <name><surname>Wilson</surname> <given-names>C. D.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Investigating science education effect sizes: implications for power analyses and programmatic decisions</article-title>. <source>AERA Open</source> <volume>4</volume>, <fpage>2332858418791991</fpage>. <pub-id pub-id-type="doi">10.1177/2332858418791991</pub-id></citation>
</ref>
<ref id="B104">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Tingir</surname> <given-names>S.</given-names></name> <name><surname>Cavlazoglu</surname> <given-names>B.</given-names></name> <name><surname>Caliskan</surname> <given-names>O.</given-names></name> <name><surname>Koklu</surname> <given-names>O.</given-names></name> <name><surname>Intepe-Tingir</surname> <given-names>S.</given-names></name></person-group> (<year>2017</year>). <article-title>Effects of mobile devices on k-12 students&#x00027; achievement: a meta-analysis</article-title>. <source>J. Comput. Assist. Learn.</source> <volume>33</volume>, <fpage>355</fpage>&#x02013;<lpage>369</lpage>. <pub-id pub-id-type="doi">10.1111/jcal.12184</pub-id></citation>
</ref>
<ref id="B105">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Tokac</surname> <given-names>U.</given-names></name> <name><surname>Novak</surname> <given-names>E.</given-names></name> <name><surname>Thompson</surname> <given-names>C. G.</given-names></name></person-group> (<year>2019</year>). <article-title>Effects of game-based learning on students&#x00027; mathematics achievement: a meta-analysis</article-title>. <source>J. Comput. Assist. Learn.</source> <volume>35</volume>, <fpage>407</fpage>&#x02013;<lpage>420</lpage>. <pub-id pub-id-type="doi">10.1111/jcal.12347</pub-id></citation>
</ref>
<ref id="B106">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Valentine</surname> <given-names>J. C.</given-names></name> <name><surname>Cooper</surname> <given-names>H.</given-names></name></person-group> (<year>2008</year>). <article-title>A systematic and transparent approach for assessing the methodological quality of intervention effectiveness research: The Study Design and Implementation Assessment Device (Study DIAD)</article-title>. <source>Psychol. Methods</source> <volume>13</volume>, <fpage>130</fpage>. <pub-id pub-id-type="doi">10.1037/1082-989X.13.2.130</pub-id><pub-id pub-id-type="pmid">18557682</pub-id></citation></ref>
<ref id="B107">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Alten</surname> <given-names>D. C.</given-names></name> <name><surname>Phielix</surname> <given-names>C.</given-names></name> <name><surname>Janssen</surname> <given-names>J.</given-names></name> <name><surname>Kester</surname> <given-names>L.</given-names></name></person-group> (<year>2019</year>). <article-title>Effects of flipping the classroom on learning outcomes and satisfaction: a meta-analysis</article-title>. <source>Educ. Res. Rev.</source> <volume>28</volume>, <fpage>100281</fpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2019.05.003</pub-id></citation>
</ref>
<ref id="B108">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Wedderhoff</surname> <given-names>N.</given-names></name> <name><surname>Bosnjak</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>Erfassung der Prim&#x000E4;rstudienqualit&#x000E4;t in psychologischen Meta-Analysen. [Assessment of primary study quality in meta-analyses in psychology]</article-title>. <source>Psychol. Rund.</source> <volume>71</volume>, <fpage>119</fpage>&#x02013;<lpage>126</lpage>. <pub-id pub-id-type="doi">10.1026/0033-3042/a000484</pub-id></citation>
</ref>
<ref id="B109">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wetzels</surname> <given-names>R.</given-names></name> <name><surname>Matzke</surname> <given-names>D.</given-names></name> <name><surname>Lee</surname> <given-names>M. D.</given-names></name> <name><surname>Rouder</surname> <given-names>J. N.</given-names></name> <name><surname>Iverson</surname> <given-names>G. J.</given-names></name> <name><surname>Wagenmakers</surname> <given-names>E. J.</given-names></name></person-group> (<year>2011</year>). <article-title>Statistical evidence in experimental psychology: an empirical comparison using 855 t tests</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>6</volume>, <fpage>291</fpage>&#x02013;<lpage>298</lpage>. <pub-id pub-id-type="doi">10.1177/1745691611406923</pub-id><pub-id pub-id-type="pmid">26168519</pub-id></citation></ref>
<ref id="B110">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Windschitl</surname> <given-names>M.</given-names></name> <name><surname>Thompson</surname> <given-names>J.</given-names></name> <name><surname>Braaten</surname> <given-names>M.</given-names></name> <name><surname>Stroupe</surname> <given-names>D.</given-names></name></person-group> (<year>2012</year>). <article-title>Proposing a core set of instructional practices and tools for teachers of science</article-title>. <source>Sci. Educ.</source> <volume>96</volume>, <fpage>878</fpage>&#x02013;<lpage>903</lpage>. <pub-id pub-id-type="doi">10.1002/sce.21027</pub-id></citation>
</ref>
<ref id="B111">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Wouters</surname> <given-names>P.</given-names></name> <name><surname>van Nimwegen</surname> <given-names>C.</given-names></name> <name><surname>van Oostendorp</surname> <given-names>H.</given-names></name> <name><surname>van der Spek</surname> <given-names>E. D.</given-names></name></person-group> (<year>2013</year>). <article-title>A meta-analysis of the cognitive and motivational effects of serious games</article-title>. <source>J. Educ. Psychol.</source> <volume>105</volume>, <fpage>249</fpage>&#x02013;<lpage>265</lpage>. <pub-id pub-id-type="doi">10.1037/a0031311</pub-id><pub-id pub-id-type="pmid">32773374</pub-id></citation></ref>
<ref id="B112">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Wouters</surname> <given-names>P.</given-names></name> <name><surname>van Oostendorp</surname> <given-names>H.</given-names></name></person-group> (<year>2013</year>). <article-title>A meta-analytic review of the role of instructional support in game-based learning</article-title>. <source>Comput. Educ.</source> <volume>60</volume>, <fpage>412</fpage>&#x02013;<lpage>425</lpage>. <pub-id pub-id-type="doi">10.1016/j.compedu.2012.07.018</pub-id></citation>
</ref>
<ref id="B113">
<citation citation-type="journal"><sup>&#x0002A;</sup><person-group person-group-type="author"><name><surname>Zheng</surname> <given-names>L. Q.</given-names></name></person-group> (<year>2016</year>). <article-title>The effectiveness of self-regulated learning scaffolds on academic performance in computer-based learning environments: a meta-analysis</article-title>. <source>Asia Pacific Educ. Rev.</source> <volume>17</volume>, <fpage>187</fpage>&#x02013;<lpage>202</lpage>. <pub-id pub-id-type="doi">10.1007/s12564-016-9426-9</pub-id></citation>
</ref>
</ref-list>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>We use the term &#x0201C;strategy&#x0201D; to denote all kinds of instructional interventions, ranging from multicomponent programmes to specific instructional approaches and practices that teachers can adopt to support student learning.</p></fn>
<fn id="fn0002"><p><sup>2</sup>This specification allows for multiple types of study designs related to the experimental research paradigm.</p></fn>
<fn id="fn0003"><p><sup>3</sup>For details on the operationalization of specificity, please consult the following section on &#x0201C;extraction of effect sizes&#x0201D;.</p></fn>
<fn id="fn0004"><p><sup>4</sup>In their meta-analysis on homework and student achievement, Fan et al. (<xref ref-type="bibr" rid="B32">2017</xref>) used an aggregated correlation coefficient <italic>r</italic> to estimate how different homework practices relate to student achievement outcomes.</p></fn>
<fn id="fn0005"><p><sup>5</sup>Statistical average for individual meta-analyses.</p></fn>
<fn id="fn0006"><p><sup>6</sup>See <xref ref-type="table" rid="T1">Table 1</xref> for category description.</p></fn>
<fn id="fn0007"><p><sup>7</sup>Although several meta-analyses contain some of the same studies, we decided to retain all selected meta-analyses and effect sizes in our summary for several reasons. First, the overlap of primary studies is limited to only a few meta-analyses. Second, the overlap is usually small and concerns only a few primary studies. Third, although some meta-analyses include some of the same studies, they adopt a different focus of analysis. Fourth, our aim is to provide an overview of the existing meta-analyses and effect sizes for secondary mathematics and science teaching and not to conduct a second-order meta-analysis.</p></fn>
<fn id="fn0008"><p><sup>8</sup>Our sampled meta-analyses did not report any negative context-specific effect sizes (see <xref ref-type="table" rid="T2">Table 2</xref>). Of the 78 context-specific effect sizes, 13 (17%) were not statistically significant at the conventional threshold (<italic>p</italic> &#x0003E; 0.05).</p></fn>
<fn id="fn0009"><p><sup>&#x0002A;</sup>References marked with an asterisk are included in the review.</p></fn>
</fn-group>
</back>
</article>