<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Educ.</journal-id>
<journal-title>Frontiers in Education</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Educ.</abbrev-journal-title>
<issn pub-type="epub">2504-284X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/feduc.2023.1073829</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Education</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>An application of Bayesian inference to examine student retention and attrition in the STEM classroom</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Bertolini</surname>
<given-names>Roberto</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="c001" ref-type="corresp"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/2059545/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Finch</surname>
<given-names>Stephen J.</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/2189055/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Nehm</surname>
<given-names>Ross H.</given-names>
</name>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/1856477/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Applied Mathematics and Statistics, Stony Brook University</institution>, <addr-line>Stony Brook, NY</addr-line>, <country>United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Ecology and Evolution, Program in Science Education, Stony Brook University</institution>, <addr-line>Stony Brook, NY</addr-line>, <country>United States</country></aff>
<author-notes>
<fn id="fn0001" fn-type="edited-by"><p>Edited by: Xiaoming Zhai, University of Georgia, United States</p></fn>
<fn id="fn0002" fn-type="edited-by"><p>Reviewed by: Marcus Kubsch, University of Kiel, Germany; Mariel Fernanda Musso, CONICET Centro Interdisciplinario de Investigaciones en Psicolog&#x00ED;a Matem&#x00E1;tica y Experimental, Argentina; Joshua Rosenberg, The University of Tennessee, Knoxville, United States</p></fn>
<corresp id="c001">&#x002A;Correspondence: Roberto Bertolini, &#x02709; <email>roberto.bertolini@alumni.stonybrook.edu</email>; &#x02709; <email>rbertolini.math@gmail.com</email></corresp>
<fn id="fn0003" fn-type="other"><p>This article was submitted to STEM Education, a section of the journal Frontiers in Education</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>14</day>
<month>02</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>8</volume>
<elocation-id>1073829</elocation-id>
<history>
<date date-type="received">
<day>19</day>
<month>10</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>16</day>
<month>01</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Bertolini, Finch and Nehm.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Bertolini, Finch and Nehm</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<sec>
<title>Introduction</title>
<p>As artificial intelligence (AI) technology becomes more widespread in the classroom environment, educators have relied on data-driven machine learning (ML) techniques and statistical frameworks to derive insights into student performance patterns. Bayesian methodologies have emerged as a more intuitive approach to frequentist methods of inference since they link prior assumptions and data together to provide a quantitative distribution of final model parameter estimates. Despite their alignment with four recent ML assessment criteria developed in the educational literature, Bayesian methodologies have received considerably less attention by academic stakeholders prompting the need to empirically discern how these techniques can be used to provide actionable insights into student performance.</p>
</sec>
<sec>
<title>Methods</title>
<p>To identify the factors most indicative of student retention and attrition, we apply a Bayesian framework to comparatively examine the differential impact that the amalgamation of traditional and AI-driven predictors has on student performance in an undergraduate in-person science, technology, engineering, and mathematics (STEM) course.</p>
</sec>
<sec>
<title>Results</title>
<p>Interaction with the course learning management system (LMS) and performance on diagnostic concept inventory (CI) assessments provided the greatest insights into final course performance. Establishing informative prior values using historical classroom data did not always appreciably enhance model fit.</p>
</sec>
<sec>
<title>Discussion</title>
<p>We discuss how Bayesian methodologies are a more pragmatic and interpretable way of assessing student performance and are a promising tool for use in science education research and assessment.</p>
</sec>
</abstract>
<kwd-group>
<kwd>Bayesian methods</kwd>
<kwd>retention and attrition</kwd>
<kwd>learning management system</kwd>
<kwd>concept inventory</kwd>
<kwd>machine learning</kwd>
<kwd>STEM education</kwd>
<kwd>undergraduate biology</kwd>
</kwd-group>
<contract-num rid="cn1">79545</contract-num>
<contract-sponsor id="cn1">The Howard Hughes Medical Institute Science Education Program</contract-sponsor>
<counts>
<fig-count count="3"/>
<table-count count="4"/>
<equation-count count="3"/>
<ref-count count="187"/>
<page-count count="16"/>
<word-count count="15035"/>
</counts>
</article-meta>
</front>
<body>
<sec id="sec1" sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>Over the last three decades, the development and emergence of artificial intelligence (AI) technology has revolutionized the classroom environment (<xref ref-type="bibr" rid="ref112">McArthur et al., 2005</xref>; <xref ref-type="bibr" rid="ref142">Roll and Wylie, 2016</xref>; <xref ref-type="bibr" rid="ref36">Chen L. et al., 2020</xref>). <xref ref-type="bibr" rid="ref13">Baker et al. (2019)</xref> define AI as &#x201C;computers which perform cognitive tasks, usually associated with human minds, particularly learning, and problem-solving.&#x201D; Adaptive pedagogical frameworks, early warning systems, and learning management systems (LMS) have been developed incorporating AI-driven capabilities to provide students, teachers, and educational administrators with a plethora of tools and data that can be leveraged to assess, track, and monitor student performance patterns (<xref ref-type="bibr" rid="ref180">Wen and Lin, 2008</xref>; <xref ref-type="bibr" rid="ref173">Vandenewaetere et al., 2011</xref>; <xref ref-type="bibr" rid="ref57">Fern&#x00E1;ndez-Caram&#x00E9;s and Fraga-Lamas, 2019</xref>; <xref ref-type="bibr" rid="ref79">Kabudi et al., 2021</xref>). In recent years, summative and formative assessments that provide instantaneous feedback to students using automated and explainable AI grading and response systems have personalized the classroom environment, giving instructors the ability to tailor curricula to the individual aptitude levels of students using these interfaces (<xref ref-type="bibr" rid="ref78">Jokhan et al., 2019</xref>; <xref ref-type="bibr" rid="ref14">Ba&#x00F1;eres et al., 2020</xref>; <xref ref-type="bibr" rid="ref1">Afzaal et al., 2021</xref>; <xref ref-type="bibr" rid="ref182">Xu et al., 2021</xref>; <xref ref-type="bibr" rid="ref121">Nawaz et al., 2022</xref>). Bolstered by the emergence of newer technological innovations such as virtual reality, augmented reality, and gamification in the classroom, digital tools continue to supplement traditional pedagogical strategies and have spurred the development of diverse and novel data sources (<xref ref-type="bibr" rid="ref73">Huang et al., 2019</xref>; <xref ref-type="bibr" rid="ref146">Sailer and Homner, 2020</xref>; <xref ref-type="bibr" rid="ref185">Yang et al., 2021</xref>; <xref ref-type="bibr" rid="ref3">Alam, 2022</xref>). Despite these advances, a major challenge that has emerged with the growth of classroom technology is how to meaningfully derive cognitive insights and inferences pertaining to student learning and performance from the plethora of data and knowledge created and contained within these systems (<xref ref-type="bibr" rid="ref166">Van Camp et al., 2017</xref>; <xref ref-type="bibr" rid="ref37">Chen X. et al., 2020</xref>; <xref ref-type="bibr" rid="ref118">Musso et al., 2020</xref>; <xref ref-type="bibr" rid="ref185">Yang et al., 2021</xref>; <xref ref-type="bibr" rid="ref88">Kubsch et al., 2022</xref>).</p>
<p>Underpinning the analyses of these technological tools are a series of mathematical frameworks and statistical methodologies that have been applied to quantitatively assess the impact of various complex constructs, assessments, and remediation/intervention strategies on student cognition and learning. Machine learning (ML) serves as a critical tool in this endeavor due to its ability to leverage knowledge from large quantities of structured, unstructured, and semi-structured corpora to generate performance implications with a high degree of accuracy (<xref ref-type="bibr" rid="ref188">Zhai et al., 2020a</xref>,<xref ref-type="bibr" rid="ref190">b</xref>; <xref ref-type="bibr" rid="ref187">Zhai, 2021</xref>; <xref ref-type="bibr" rid="ref189">Zhai et al., 2021</xref>). The study and use of ML in education has spurred the growth of various subfields, including educational data mining (EDM) and predictive learning analytics (LA), to study, develop, and apply these techniques to different pedagogical settings (<xref ref-type="bibr" rid="ref12">Baker, 2010</xref>; <xref ref-type="bibr" rid="ref143">Romero and Ventura, 2020</xref>). To advance the fields of EDM and LA, researchers seek to refine existing statistical methodologies and ML techniques for analyzing large and diverse educational corpora (<xref ref-type="bibr" rid="ref27">Brooks and Thompson, 2017</xref>; <xref ref-type="bibr" rid="ref18">Bertolini, 2021</xref>).</p>
<p>Student retention and attrition in introductory science, technology, engineering, and mathematics (STEM) classes is one critical issue that continues to remain a paramount concern for academic stakeholders (<xref ref-type="bibr" rid="ref35">Chen, 2013</xref>; <xref ref-type="bibr" rid="ref136">Penprase, 2020</xref>). Identifying the factors associated with student performance and implementing pedagogical strategies to foster student success in STEM settings is an international priority in education research. (<xref ref-type="bibr" rid="ref33">Chang et al., 2014</xref>; <xref ref-type="bibr" rid="ref94">Lee et al., 2015</xref>; <xref ref-type="bibr" rid="ref75">Ikuma et al., 2019</xref>; <xref ref-type="bibr" rid="ref84">Kricorian et al., 2020</xref>; <xref ref-type="bibr" rid="ref103">L&#x00F3;pez Zambrano et al., 2021</xref>). Many studies have applied existing ML frameworks, or proposed their own novel methodologies, to make predictions of student success. Depending on the pedagogical environment, course context, and grade level (e.g., hybrid, remote, asynchronous, and in-person classroom settings), different types of academic and non-academic factors have been shown to impact student performance (<xref ref-type="bibr" rid="ref127">Nouri et al., 2019</xref>; <xref ref-type="bibr" rid="ref182">Xu et al., 2021</xref>; <xref ref-type="bibr" rid="ref19">Bertolini et al., 2021a</xref>; <xref ref-type="bibr" rid="ref4">Albreiki, 2022</xref>).</p>
<p>With ML becoming more mainstream and commonplace within the body of educational research, a major criticism of its usage is that model development and its subsequent output are often complex, esoteric, and at times uninterpretable (<xref ref-type="bibr" rid="ref43">Conati et al., 2018</xref>; <xref ref-type="bibr" rid="ref102">Liu and Tan, 2020</xref>). While the usage of these &#x201C;black box&#x201D; methodologies have led to the development of more accurate data-driven models for forecasting student performance (see <xref ref-type="bibr" rid="ref119">Musso et al., 2013</xref>; <xref ref-type="bibr" rid="ref29">Cascallar et al., 2014</xref>; <xref ref-type="bibr" rid="ref164">Tsiakmaki et al., 2020</xref> for examples), statistical and mathematical intricacies governing these tools and their outputs often hinder communication of these results to faculty and other educational stakeholders (<xref ref-type="bibr" rid="ref144">Rudin, 2019</xref>). While various statistical frameworks have been developed and produced to make these &#x201C;black box&#x201D; algorithms more interpretable, it is difficult to precisely quantify the informative candidate features that were used in a ML algorithm to arrive at a certain outcome, making it difficult to communicate and formulate educational actions and interventions among stakeholders (<xref ref-type="bibr" rid="ref9">Arrieta et al., 2020</xref>; <xref ref-type="bibr" rid="ref19">Bertolini et al., 2021a</xref>).</p>
<p>In education, uncertainty in estimates for ML model parameters and mechanisms to assess the differential efficacy of competing prediction algorithms have predominately used frequentist statistical techniques, most notably null hypothesis significance testing. Bayesian inference and modeling, which account for the relationship between data and prespecified information about the distribution of model parameters, are methods of statistical inference that emerged due to the widespread availability of technological software, minimizing the need for researchers to rely on the usage of large-scale computing architectures (<xref ref-type="bibr" rid="ref25">Brooks, 1998</xref>; <xref ref-type="bibr" rid="ref105">Lunn et al., 2000</xref>; <xref ref-type="bibr" rid="ref138">Plummer, 2003</xref>; <xref ref-type="bibr" rid="ref91">Lambert et al., 2005</xref>; <xref ref-type="bibr" rid="ref85">Kruschke, 2011a</xref>; <xref ref-type="bibr" rid="ref64">Gelman et al., 2015</xref>; <xref ref-type="bibr" rid="ref170">Van den Bergh et al., 2021</xref>). Bayesian approaches to modeling are commonly employed in many scientific disciplines including medicine (<xref ref-type="bibr" rid="ref156">Spiegelhalter et al., 1999</xref>), ecology (<xref ref-type="bibr" rid="ref113">McCarthy, 2007</xref>), and cosmology (<xref ref-type="bibr" rid="ref71">Hobson et al., 2010</xref>), but have been sparsely incorporated into EDM and LA research to systematically compare performance variability in models of student classroom success based on the characteristics of input predictors. <xref ref-type="bibr" rid="ref001">Homer (2016)</xref> remarks that the use of Bayesian methods and their application to forecast student performance and STEM attrition has the potential to revolutionize EDM and LA in the next decade.</p>
<p>In this study, a Bayesian framework is applied to model student success in an introductory baccalaureate biology course. We are interested in establishing the effectiveness of traditional data types (i.e., demographics, standardized aptitude tests, prior academic performance) and data from nascent AI-driven technological software and formative assessments (e.g., LMS, diagnostic concept inventory (CI) assessments) to identify factors that impact student performance. After introducing our research questions (<xref ref-type="sec" rid="sec2">Section 2</xref>), we provide a brief overview of the strengths of Bayesian analytics compared to traditional frequentist and ML frameworks (<xref ref-type="sec" rid="sec4">Section 3.1</xref>). This is followed by a brief literature review on their usage in STEM education research, and how Bayesian modeling aligns with four components of ML assessment proposed in the literature (<xref ref-type="sec" rid="sec5">Section 3.2</xref>). We then outline the methodologies used in this study (<xref ref-type="sec" rid="sec6">Section 4</xref>), our results (<xref ref-type="sec" rid="sec11">Section 5</xref>), and conclude with a discussion (<xref ref-type="sec" rid="sec14">Section 6</xref>) and future research directions (<xref ref-type="sec" rid="sec15">Section 7</xref>).</p>
</sec>
<sec id="sec2">
<label>2.</label>
<title>Research questions</title>
<p>Our study addressed the following research questions:</p>
<disp-quote>
<p><italic>(RQ 1)</italic> How do various student- and course-specific data types impact the odds of student retention in a STEM classroom context?</p>
<p><italic>(RQ 2)</italic> Given the ability to integrate prior knowledge into Bayesian models via prespecified probability distributions, does incorporating aggregated historical records of student performance data enhance model fit, compared to when uninformative priors are used?</p>
</disp-quote>
</sec>
<sec id="sec3">
<label>3.</label>
<title>Literature review</title>
<sec id="sec4">
<label>3.1.</label>
<title>Overview of Bayesian methods</title>
<p>Bayesian inference uses probability to quantify uncertainty in the estimates of model parameters. Unlike frequentist statistical techniques, parameters are treated as random variables which take on an associated probability distribution, instead of fixed quantities (<xref ref-type="bibr" rid="ref55">Ellison, 1996</xref>; <xref ref-type="bibr" rid="ref70">Hobbs and Hooten, 2015</xref>; <xref ref-type="bibr" rid="ref120">Muth et al., 2018</xref>; <xref ref-type="bibr" rid="ref72">Hooten and Hefley, 2019</xref>). <xref rid="tab1" ref-type="table">Table 1</xref> depicts the major differences between Bayesian and frequentist methods commonly cited and summarized in the literature (<xref ref-type="bibr" rid="ref17">Berger and Berry, 1988</xref>; <xref ref-type="bibr" rid="ref55">Ellison, 1996</xref>; <xref ref-type="bibr" rid="ref158">Stephens et al., 2007</xref>). Unlike frequentist methods, Bayesian methods are capable of &#x201C;yield[ing] answers which are much easier to understand than standard statistical answers, and hence much less likely to be misinterpreted&#x201D; (<xref ref-type="bibr" rid="ref17">Berger and Berry, 1988</xref>).</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption><p>Comparison of frequentist and Bayesian methods.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Frequentist</th>
<th align="left" valign="top">Bayesian</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Examines the probability of observing data <bold>given a hypothesis</bold></td>
<td align="left" valign="top">Examines the probability a hypothesis is true <bold>given data</bold></td>
</tr>
<tr>
<td align="left" valign="top"><bold>Does not incorporate</bold> prior probabilities</td>
<td align="left" valign="top"><bold>Incorporates</bold> prior probabilities</td>
</tr>
<tr>
<td align="left" valign="top">Use of <bold><italic>p</italic>-values</bold> (i.e., point estimates) which is an expectation of a <bold>long-run frequency</bold></td>
<td align="left" valign="top">Use of <bold>posterior probability distributions</bold> (i.e., variability along with point estimates) which is an expression of a <bold>degree of belief</bold></td>
</tr>
<tr>
<td align="left" valign="top">Model parameters are <bold>fixed quantities</bold></td>
<td align="left" valign="top">Model parameters are <bold>random variables</bold></td>
</tr>
<tr>
<td align="left" valign="top">Conclusions depend on the subjectivity of the <bold>investigator</bold></td>
<td align="left" valign="top">Conclusions depend on the subjectivity of the <bold>user</bold></td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The strength of Bayesian techniques lies in the prespecification of probability distributions for analytical parameters. These prior distributions are explicit mathematical statements that either incorporate previous information from published studies (known as informative priors), or a plausible range of values that specific model parameters can take on (known as noninformative priors; <xref ref-type="bibr" rid="ref115">McCarthy and Masters, 2005</xref>; <xref ref-type="bibr" rid="ref95">Lemoine, 2019</xref>; <xref ref-type="bibr" rid="ref15">Banner et al., 2020</xref>). As output, Bayesian techniques produce posterior outputs providing researchers with a quantitative distribution and range of final parameter estimates that explicitly account for uncertainty and variability in predictive efficacy (<xref ref-type="bibr" rid="ref122">Neal, 2004</xref>). Bayesian inference is not a strictly separate type of ML model but is a probabilistic method of inference that can be incorporated into these existing algorithmic frameworks. ML algorithms generally use the raw data to generate inferences, while Bayesian methods use the raw data along with explicitly assigned probability distributions (i.e., priors) to estimate model parameters. In testing for statistical significance, one advantage of Bayesian methodologies is that the posterior distribution can be used to tabulate the probability that different hypotheses are true (e.g., both the null and alternative hypotheses), which is more intuitive compared to frequentist methods. Traditional null hypothesis significance testing only calculates a <italic>p</italic>-value, a long-run probability of obtaining a data set at least as extreme as the one observed (<xref ref-type="bibr" rid="ref59">Fornacon-Wood et al., 2022</xref>).</p>
<p>When a plethora of candidate features are included in a model, Bayesian methods can minimize the impact of highly correlated variables by using regularization priors to shrink posterior estimates toward their parameter values to induce sparsity and perform variable selection (<xref ref-type="bibr" rid="ref82">Komaki, 2006</xref>). Unlike frequentist methods, Bayesian shrinkage methods define a criterion for selecting values on the credible or high density intervals of posterior distributions rather than constraining the magnitude of coefficient estimates (<xref ref-type="bibr" rid="ref97">Li and Pati, 2017</xref>). These regularization priors are generally mixture models that combine multiple statistical distributions together resulting in a high concentration point mass and a diffusive prior with a heavy tail (<xref ref-type="bibr" rid="ref168">Van de Schoot et al., 2021</xref>). While Bayesian regularization priors do not produce unstable variance estimates for model parameters, a common criticism of frequentist methods, Bayesian regularization priors are more mathematically sophisticated compared to traditional uninformative and informative univariate prior distributions (<xref ref-type="bibr" rid="ref30">Casella et al., 2010</xref>; <xref ref-type="bibr" rid="ref171">Van Erp et al., 2019</xref>).</p>
<p>Bayesian methods can also be used to study different cohorts of a population nested within and between different factors. Such frameworks can yield more conservative parameter estimates, do not rely on asymptotics like frequentist methods, and are capable of handling heterogeneous and imbalanced corpora, the latter of which is commonly encountered in education (<xref ref-type="bibr" rid="ref58">Fordyce et al., 2011</xref>; <xref ref-type="bibr" rid="ref62">Gelman et al., 2012</xref>; <xref ref-type="bibr" rid="ref169">van de Schoot et al., 2014</xref>). To summarize, Bayesian frameworks are a plausible alternative to frequentist techniques with some documented theoretical and pragmatic benefits (see <xref ref-type="bibr" rid="ref85">Kruschke, 2011a</xref>,<xref ref-type="bibr" rid="ref86">b</xref>; <xref ref-type="bibr" rid="ref169">van de Schoot et al., 2014</xref>). In the next section, we highlight prior studies that have incorporated Bayesian methods to examine diverse student data types and how these techniques align with four ML assessment educational criteria.</p>
</sec>
<sec id="sec5">
<label>3.2.</label>
<title>Application to STEM educational settings and ML assessment</title>
<p>In previous STEM classroom studies examining student performance, emphasis has been placed on using conventional sources of university data for this endeavor, which traditionally encompass past student academic performance and achievement predictors such as high school grade point average and student demographics (<xref ref-type="bibr" rid="ref129">Orr and Foster, 2013</xref>; <xref ref-type="bibr" rid="ref16">Berens et al., 2019</xref>). There is increasing interest in examining how combining these traditional formative data types with course-specific data-driven tools and assessment data (e.g., LMS usage patterns, diagnostic tests) extracted from intelligent systems may differentially inform models suitable for course-level instructor actions in the STEM classroom. These novel assessment types, in conjunction with academic characteristics and personalized data records, have been shown to improve the overall performance of ML algorithms (<xref ref-type="bibr" rid="ref94">Lee et al., 2015</xref>; <xref ref-type="bibr" rid="ref186">Zabriskie et al., 2019</xref>; <xref ref-type="bibr" rid="ref184">Yang et al., 2020</xref>; <xref ref-type="bibr" rid="ref188">Zhai et al., 2020a</xref>,<xref ref-type="bibr" rid="ref190">b</xref>; <xref ref-type="bibr" rid="ref19">Bertolini et al., 2021a</xref>,<xref ref-type="bibr" rid="ref20">b</xref>, <xref ref-type="bibr" rid="ref21">2022</xref>). However, frequentist and non-Bayesian methods have been the primary techniques utilized in these analyses to assess competing performance variability between different algorithms and to identify the significant features that drive overall ML model performance.</p>
<p>In many prior EDM and LA studies, researchers have employed a type of ML algorithm, known as Na&#x00EF;ve Bayes, to forecast student performance in various STEM settings (see <xref ref-type="bibr" rid="ref150">Shahiri and Husain, 2015</xref>; <xref ref-type="bibr" rid="ref2">Ahmed et al., 2021</xref>; <xref ref-type="bibr" rid="ref137">Perez and Perez, 2021</xref> for examples). In recent systematic literature reviews, <xref ref-type="bibr" rid="ref149">Shafiq et al. (2022)</xref>, <xref ref-type="bibr" rid="ref135">Pe&#x00F1;a-Ayala (2014)</xref>, and <xref ref-type="bibr" rid="ref11">Baashar et al. (2021)</xref>, found that Na&#x00EF;ve Bayes was used in 35%, 20%, and 14% of education studies surveyed, respectively. While this supervised ML algorithm has the word &#x201C;bayes&#x201D; in its name, it has not been traditionally classified as a Bayesian methodology because it assumes that all features included in the model are independent of one another (<xref ref-type="bibr" rid="ref66">Hand and Yu, 2001</xref>; <xref ref-type="bibr" rid="ref145">Russell, 2010</xref>). While having a firm theoretical basis, independence between student-specific factors do not typically hold in practice, as there are correlations and associations between them which impact performance outcomes. For example, if an educator or institutional researcher wanted to develop a model to predict student performance in a class using socioeconomic data factors and SAT scores, Na&#x00EF;ve Bayes would treat these features as being independent of one another when rendering the final predictions. However, there are documented studies that have identified an association between socioeconomic status and student performance on the SAT (<xref ref-type="bibr" rid="ref191">Zwick and Himelfarb, 2011</xref>; <xref ref-type="bibr" rid="ref69">Higdem et al., 2016</xref>). In a survey of 100 EDM and LA studies over the last 5&#x2009;years, <xref ref-type="bibr" rid="ref149">Shafiq et al. (2022)</xref> found that only 5% of studies used a formal type of Bayesian methodology (i.e., did not assume independence between features).</p>
<p>The three most common applications of Bayesian inference in education have been their usage in unsupervised text mining, natural language processing, and in Bayesian knowledge tracing. Unsupervised methods (such as Latent Dirichlet Allocation) and natural language processing provide educators with the capability of synthesizing words, phrases, categories, and topics from student text corpora to extract data and inferences pertaining to student cognition, learning and concept retention, factors that impact student performance (<xref ref-type="bibr" rid="ref6">Almond et al., 2015</xref>; <xref ref-type="bibr" rid="ref48">Culbertson, 2016</xref>; <xref ref-type="bibr" rid="ref181">Xiao et al., 2022</xref>). Moreover, many AI-driven educational tools have been developed using these techniques to automatically score open-ended and constructed response assessments using these methodologies, achieving a high degree of accuracy that was comparable with manual human scoring (<xref ref-type="bibr" rid="ref117">Moharreri et al., 2014</xref>; <xref ref-type="bibr" rid="ref101">Liu et al., 2016</xref>). However, these techniques have limited applications and use if text corpora are not being incorporated into ML models. In Bayesian knowledge tracing, hidden Markov models use probability to determine the likelihood of an outcome based on a sequence of prior events (<xref ref-type="bibr" rid="ref167">Van de Sande, 2013</xref>). These techniques are used to scrutinize student learning dynamics to study concept retention and mastery by tracking the student learning process over time. Observed data from educational assessments and interventions (e.g., tutoring sessions, personalized learning technology) acquired at distinct longitudinal time points during the students&#x2019; academic tenure are used as input to these models (<xref ref-type="bibr" rid="ref44">Corbett and Anderson, 1994</xref>; <xref ref-type="bibr" rid="ref107">Mao et al., 2018</xref>; <xref ref-type="bibr" rid="ref47">Cui et al., 2019</xref>).</p>
<p>Despite their limited use in AI education-based research, Bayesian inference techniques align with the four components of ML assessment proposed and outlined by <xref ref-type="bibr" rid="ref187">Zhai (2021)</xref>. The first criterion &#x201C;allows assessment practices to target complex, diverse, and structural constructs, and thus better approach science learning goals.&#x201D; This has been the primary focus and application of Bayesian methods in education thus far. Indeed, most studies employing Bayesian methods have used them to perform psychometric and factor analyses of novel assessment types (e.g., multi-skill itemized activities and question types) and surveys to study student comprehension, cognition, and attitudes toward learning (<xref ref-type="bibr" rid="ref51">Desmarais and Gagnon, 2006</xref>; <xref ref-type="bibr" rid="ref133">Pardos et al., 2008</xref>; <xref ref-type="bibr" rid="ref23">Brassil and Couch, 2019</xref>; <xref ref-type="bibr" rid="ref110">Martinez, 2021</xref>; <xref ref-type="bibr" rid="ref134">Parkin and Wang, 2021</xref>; <xref ref-type="bibr" rid="ref174">Vaziri et al., 2021</xref>; <xref ref-type="bibr" rid="ref178">Wang et al., 2021</xref>). The insights obtained from these studies have led to the design, development, and deployment of more adaptive learning and student-focused knowledge assessment content, based on their aptitude levels, allowing educators to learn more about student comprehension and how individualized content can be tailored to students (<xref ref-type="bibr" rid="ref53">Drigas et al., 2009</xref>).</p>
<p>The second and third criteria &#x201C;extends the approaches used to elicit performance and evidence collection&#x201D; and &#x201C;provide a means to better interpret observations and use evidence&#x201D; are the crux of Bayesian modeling, as described in <xref ref-type="sec" rid="sec4">Section 3.1</xref>. Within this statistical framework, the data models are defined explicitly using intuitive notions and knowledge about the relationships between different features and their distributions (<xref ref-type="bibr" rid="ref52">Dienes, 2011</xref>; <xref ref-type="bibr" rid="ref86">Kruschke, 2011b</xref>) via expert elicitation, knowledge, and experimental findings to inform priors for Bayesian statistical models (<xref ref-type="bibr" rid="ref40">Choy et al., 2009</xref>).</p>
<p>The fourth criterion &#x201C;supports immediate and complex decision-making and action-taking.&#x201D; The Bayesian paradigm allows users to update knowledge via prior distributions without testing multiple hypotheses repeatedly, allowing researchers to reflect on the similarities and differences between model outputs, thereby placing decision making on the subjectivity of the recipients and consumers of the model results (<xref ref-type="bibr" rid="ref17">Berger and Berry, 1988</xref>; <xref ref-type="bibr" rid="ref158">Stephens et al., 2007</xref>). ML algorithms primarily rely on using aggregated training data where hyperparameters are tuned to enhance model efficacy and performance. In contrast, Bayesian inference methodologies incorporate probabilistic prior knowledge, beliefs, and findings from past studies into these models. Standard statistical assumptions that encompass many frequentist techniques, such as regression, do not need to be satisfied in Bayesian frameworks, allowing models to be developed with greater complexity that utilize asymmetric probability distributions, a current limitation of some frequentist approaches such as maximum likelihood estimation which does not explicitly assign probabilities and only provides a point estimate for model parameters (<xref ref-type="bibr" rid="ref169">van de Schoot et al., 2014</xref>, <xref ref-type="bibr" rid="ref168">2021</xref>). Moreover, Bayesian methods have been shown to be computationally faster compared to default numerical integration techniques traditionally employed in frequentist mixed effects models (<xref ref-type="bibr" rid="ref111">McArdle et al., 2009</xref>; <xref ref-type="bibr" rid="ref169">van de Schoot et al., 2014</xref>).</p>
<p>Despite their alignment with these four ML assessment criteria, compared to the use of traditional statistical methodologies, Bayesian ML methods are an underrepresented and underutilized statistical methodology employed in education research (<xref ref-type="bibr" rid="ref159">Subbiah et al., 2011</xref>; <xref ref-type="bibr" rid="ref83">K&#x00F6;nig and van de Schoot, 2018</xref>). A limited amount of work in the literature has used Bayesian techniques to understand the factors impacting student performance such as the grade point average (GPA) of college students (<xref ref-type="bibr" rid="ref68">Hien and Haddawy, 2007</xref>), graduation rates (<xref ref-type="bibr" rid="ref46">Crisp et al., 2018</xref>; <xref ref-type="bibr" rid="ref60">Gebretekle and Goshu, 2019</xref>), and final examination performance (<xref ref-type="bibr" rid="ref10">Ayers and Junker, 2006</xref>). Even less work has focused on quantitatively assessing the impact of different data types on student performance outcomes. In this study, we explore the use of a Bayesian framework to comparatively examine the differential impact that the amalgamation of traditional and AI-driven predictors has on overall model fit and performance.</p>
</sec>
</sec>
<sec id="sec6" sec-type="materials|methods">
<label>4.</label>
<title>Materials and methods</title>
<sec id="sec7">
<label>4.1.</label>
<title>Course context</title>
<p>Our study focused on examining student performance in a baccalaureate, lecture-based, in-person biology course at a public higher educational research institution in the United States. A core topic in this course is evolution. In total, 3,225 students enrolled in the class over six academic semesters (fall 2014, spring 2015, fall 2015, spring 2016, fall 2016, and spring 2017) were examined in this observational study (<xref rid="fig1" ref-type="fig">Figure 1</xref>).</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption><p>Course grade information by semester examined.</p></caption>
<graphic xlink:href="feduc-08-1073829-g001.tif"/>
</fig>
<p>This analysis focused on the pass/fail status for each student, the dependent variable <inline-formula><mml:math id="M1"><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, which was modeled as Bernoulli-distributed (<xref ref-type="disp-formula" rid="EQ1">Equation 1</xref>):</p>
<disp-formula id="EQ1"><label>(1)</label><mml:math id="M2"><mml:mrow><mml:mstyle mathvariant="bold"><mml:msub><mml:mi mathvariant="bold-italic">Y</mml:mi><mml:mrow><mml:mi mathvariant="bold-italic">i</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">j</mml:mi></mml:mrow></mml:msub><mml:mo>~</mml:mo><mml:mtext mathvariant="bold-italic">Bernoulli</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">&#x03B8;</mml:mi><mml:mrow><mml:mi mathvariant="bold-italic">i</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mstyle></mml:mrow></mml:math></disp-formula>
<p><inline-formula><mml:math id="M3"><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> takes on a value of &#x2018;1&#x2019; with probability <inline-formula><mml:math id="M4"><mml:mrow><mml:msub><mml:mi>&#x03B8;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and a value of &#x2018;0&#x2019; with probability 1&#x2009;&#x2212;&#x2009;<inline-formula><mml:math id="M5"><mml:mrow><mml:msub><mml:mi>&#x03B8;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M6"><mml:mrow><mml:msub><mml:mi>&#x03B8;</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the probability that student <italic>i</italic> passed the course when enrolled in term <italic>j</italic>. The tilde relation in <xref ref-type="disp-formula" rid="EQ1">Equation 1</xref> &#x201C;~&#x201D; means &#x201C;is distributed as&#x201D; (<xref ref-type="bibr" rid="ref5">Allenby and Rossi, 2006</xref>). A passing grade (<inline-formula><mml:math id="M7"><mml:mrow><mml:msub><mml:mi>Y</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> = 1) included the marks A, A&#x2212;, B+, B, B&#x2212;, C+, C, and C&#x2212;, while a failing course grade (<italic>Y</italic><sub><italic>i,&#x2009;j</italic></sub>&#x2009;=&#x2009;0) included the marks D+, D, F, I (incomplete), I/F (incomplete course mark which turned into an F), NC (no credit), and W (withdrawal). The biology class selected for this analysis was chosen because it is a gateway STEM course categorized by a relatively large disparity between retention and attrition rates at our institution. Across all six semesters, the overall failing rate was 11.7% (<italic>n</italic>&#x2009;=&#x2009;378). Fall semester passing rates ranged between 77.3% and 85.5%, which was lower than spring passing rates ranging between 92.5% and 95.8%.</p>
</sec>
<sec id="sec8">
<label>4.2.</label>
<title>Data sources</title>
<p>A diverse set of student academic and non-academic features were extracted from the institution&#x2019;s data warehouse (<xref rid="tab2" ref-type="table">Table 2</xref>). Traditional student-specific data features pertained to (1) demographics, (2) pre-collegiate characteristics, (3) collegiate characteristics, and (4) financial aid data. For technological systems and novel assessment types, student engagement with the LMS Blackboard, and performance on two concept inventory (CI) diagnostic assessments: the Assessing COntextual Reasoning about Natural Selection (ACORNS); <xref ref-type="bibr" rid="ref124">Nehm et al. (2012)</xref> and the Conceptual Inventory of Natural Selection (CINS); <xref ref-type="bibr" rid="ref8">Anderson et al. (2002)</xref> were incorporated into the Bayesian framework. CI assessments are widely used in the collegiate biology classroom to provide novel insights into student perceptions and attitudes toward biological concepts and theory and may employ automatic grading capabilities using ML and AI (<xref ref-type="bibr" rid="ref123">Nehm, 2019</xref>). Detailed summary statistics for these variables can be found in <xref rid="sec22" ref-type="sec">Supplementary material</xref>. All predictors corresponded to variables acquired by the institution and instructor prior to the third week in the course, based on the findings of <xref ref-type="bibr" rid="ref94">Lee et al. (2015)</xref>, <xref ref-type="bibr" rid="ref183">Xue (2018)</xref>, and <xref ref-type="bibr" rid="ref18">Bertolini (2021)</xref>.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption><p>Description of predictor variables by data category.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Data category</th>
<th align="left" valign="top">Predictor</th>
<th align="left" valign="top">Description [Factor Levels;base comparison (if applicable)]</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="4">Demographics</td>
<td align="left" valign="top">Gender</td>
<td align="left" valign="top">Student&#x2019;s sex (female, <bold>male</bold>)</td>
</tr>
<tr>
<td align="left" valign="top">Ethnicity</td>
<td align="left" valign="top">Student&#x2019;s ethnicity (White, Asian, Hispanic, Black, <bold>Multiracial</bold>)</td>
</tr>
<tr>
<td align="left" valign="top">Citizenship Status</td>
<td align="left" valign="top">Indicator of the student&#x2019;s citizenship status (<bold>native</bold>, naturalized, foreign)</td>
</tr>
<tr>
<td align="left" valign="top">Age</td>
<td align="left" valign="top">Student&#x2019;s age</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Pre-collegiate academic variables</td>
<td align="left" valign="top">High School GPA</td>
<td align="left" valign="top">Student&#x2019;s high school GPA</td>
</tr>
<tr>
<td align="left" valign="top">SAT Score</td>
<td align="left" valign="top">Student&#x2019;s highest SAT score (out of 1,600) submitted to the university</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="5">Collegiate characteristics</td>
<td align="left" valign="top">Math Placement Score</td>
<td align="left" valign="top">Student&#x2019;s mathematics placement examination score</td>
</tr>
<tr>
<td align="left" valign="top">Enrollment Status</td>
<td align="left" valign="top">Student&#x2019;s enrollment status (continuing student, <bold>new freshmen</bold>, graduate student, transfer student)</td>
</tr>
<tr>
<td align="left" valign="top">Pre-Total Course Credits</td>
<td align="left" valign="top">Number of credits taken the semester prior to taking the biology course (if applicable)</td>
</tr>
<tr>
<td align="left" valign="top">Pre-Cumulative GPA</td>
<td align="left" valign="top">Cumulative GPA of the student up until the semester they took the biology course</td>
</tr>
<tr>
<td align="left" valign="top">Units Taking</td>
<td align="left" valign="top">Number of credits taken the same term as the biology course</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Financial aid</td>
<td align="left" valign="top">Aid Amount</td>
<td align="left" valign="top">Disbursed amount of financial aid the student received</td>
</tr>
<tr>
<td align="left" valign="top">PELL</td>
<td align="left" valign="top">Indicator of whether the student was a PELL grant recipient (recipient, <bold>non-recipient</bold>)</td>
</tr>
<tr>
<td align="left" valign="top">TAP</td>
<td align="left" valign="top">Indicator of whether the student was a TAP grant recipient (recipient, <bold>non-recipient</bold>)</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Learning management system (LMS)</td>
<td align="left" valign="top">LMS Logins</td>
<td align="left" valign="top">Logins aggregated up until the third week of the class</td>
</tr>
<tr>
<td align="left" valign="top">Total Courses</td>
<td align="left" valign="top">Total number of courses taken the same semester as the biology course</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Concept inventory (CI) assessments</td>
<td align="left" valign="top">CINS</td>
<td align="left" valign="top">Student&#x2019;s CINS assessment score</td>
</tr>
<tr>
<td align="left" valign="top">ACORNS KC</td>
<td align="left" valign="top">The number of key concepts (KC) the student used in their responses to the ACORNS instrument</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Bolded covariates denote reference variables used as a baseline level to compare feature levels for categorical predictors.</p>
</table-wrap-foot>
</table-wrap>
<p>During data preprocessing, categorical predictors were converted into indicator variables. Following the recommendation by <xref ref-type="bibr" rid="ref108">Marshall et al. (2010)</xref>, missing data were imputed using the predictive mean matching imputation technique in the &#x2018;mice&#x2019; package for the R programming environment (<xref ref-type="bibr" rid="ref165">Van Buuren and Groothuis-Oudshoorn, 2011</xref>). Prior to model fitting, covariates were standardized to have a zero mean and a standard deviation of one.</p>
</sec>
<sec id="sec9">
<label>4.3.</label>
<title>Bayesian statistical analysis</title>
<p>To answer RQ 1, we ran a multiple logistic regression model incorporating the effects of both traditional and course-specific predictors using a Bayesian framework:</p>
<disp-formula id="EQ2"><label>(2)</label><mml:math id="M9"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext mathvariant="bold-italic">logit</mml:mtext><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">&#x03B8;</mml:mi><mml:mrow><mml:mi mathvariant="bold-italic">i</mml:mi><mml:mo>,</mml:mo><mml:mi mathvariant="bold-italic">j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">&#x03B2;</mml:mi><mml:mn>0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">&#x03B1;</mml:mi><mml:mi mathvariant="bold-italic">j</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:mstyle displaystyle="true"><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>24</mml:mn></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mi mathvariant="bold-italic">&#x03B2;</mml:mi><mml:mi mathvariant="bold-italic">p</mml:mi></mml:msub><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi mathvariant="bold-italic">p</mml:mi></mml:msub></mml:mstyle></mml:mrow></mml:math></disp-formula>
<p>Since the coefficients in logistic regression models are either positive, negative or zero, broad uninformative normal distribution priors were used for these parameters in <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref>. The normal distribution is a common statistical distribution that many institutional researchers and educators are familiar with and utilize (see <xref ref-type="bibr" rid="ref45">Coughlin and Pagano, 1997</xref>; <xref ref-type="bibr" rid="ref172">Van Zyl, 2015</xref>). These prior distributions can be written mathematically as <inline-formula><mml:math id="M10"><mml:mrow><mml:mi>N</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>&#x03BC;</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x03C4;</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math id="M11"><mml:mi>N</mml:mi></mml:math></inline-formula> is a normal distribution centered at mean <italic>&#x03BC;</italic> with precision <inline-formula><mml:math id="M13"><mml:mrow><mml:mi>&#x03C4;</mml:mi><mml:mo>=</mml:mo><mml:mfrac bevelled="true"><mml:mn>1</mml:mn><mml:mrow><mml:msup><mml:mi>&#x03C3;</mml:mi><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mrow></mml:math></inline-formula> (the inverse of the variance <inline-formula><mml:math id="M14"><mml:mrow><mml:msup><mml:mi>&#x03C3;</mml:mi><mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>). In <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref>, the covariate features were assigned uninformative priors with a mean of zero and small precision of 0.000001: <inline-formula><mml:math id="M15"><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mn>0.000001</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math id="M16"><mml:mi>p</mml:mi></mml:math></inline-formula>&#x2009;=&#x2009;1,&#x2026;,24. <xref rid="tab3" ref-type="table">Table 3</xref> maps the data features described in <xref rid="tab2" ref-type="table">Table 2</xref> with the parameters found in <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref>. We also performed a prior predictive simulation to validate the suitability of these prior distribution choices by using synthetic data to confirm that the Bayesian logistic regression model could recover numerical values prescribed on the analytical parameters. Due to word count limitations, this analysis is detailed in <xref rid="sec22" ref-type="sec">Supplementary material</xref>.</p>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption><p>Logistic regression parameter estimates, credible intervals, 89% high density interval, and ROPE overlap percentage estimates.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Data category</th>
<th align="left" valign="top">Predictor [model parameter from <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref>]</th>
<th align="center" valign="top">Mean (standard deviation) of parameter estimate</th>
<th align="center" valign="top">Median parameter estimate</th>
<th align="center" valign="top">90% credible interval</th>
<th align="center" valign="top">95% credible interval</th>
<th align="center" valign="top">99% credible interval</th>
<th align="center" valign="top">89% high density interval</th>
<th align="center" valign="top">% of high-density interval (HDI) inside ROPE</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="9">Demographics</td>
<td align="left" valign="top">Intercept [&#x03B2;<sub>0</sub>]</td>
<td align="char" valign="top" char=".">3.007 (0.394)</td>
<td align="char" valign="top" char=".">3.005</td>
<td align="char" valign="top" char=".">(2.556, 3.465)</td>
<td align="char" valign="top" char=".">(2.228, 3.800)</td>
<td align="char" valign="top" char=".">(1.989, 4.047)</td>
<td align="char" valign="top" char=".">(2.41, 3.59)</td>
<td align="char" valign="top" char=".">0.01%</td>
</tr>
<tr>
<td align="left" valign="top">Ethnicity Black [&#x03B2;<sub>1</sub>]</td>
<td align="char" valign="top" char=".">0.054 (0.092)</td>
<td align="char" valign="top" char=".">0.054</td>
<td align="char" valign="top" char=".">(&#x2212;0.064, 0.172)</td>
<td align="char" valign="top" char=".">(&#x2212;0.122, 0.232)</td>
<td align="char" valign="top" char=".">(&#x2212;0.160, 0.268)</td>
<td align="char" valign="top" char=".">(&#x2212;0.09, 0.20)</td>
<td align="char" valign="top" char=".">91.20%</td>
</tr>
<tr>
<td align="left" valign="top">Ethnicity Hispanic [&#x03B2;<sub>2</sub>]</td>
<td align="char" valign="top" char=".">0.003 (0.098)</td>
<td align="char" valign="top" char=".">0.003</td>
<td align="char" valign="top" char=".">(&#x2212;0.124, 0.127)</td>
<td align="char" valign="top" char=".">(&#x2212;0.188, 0.192)</td>
<td align="char" valign="top" char=".">(&#x2212;0.231, 0.225)</td>
<td align="char" valign="top" char=".">(&#x2212;0.16, 0.16)</td>
<td align="char" valign="top" char=".">93.57%</td>
</tr>
<tr>
<td align="left" valign="top">Gender [&#x03B2;<sub>3</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.039 (0.067)</td>
<td align="char" valign="top" char=".">&#x2212;0.039</td>
<td align="char" valign="top" char=".">(&#x2212;0.125, 0.047)</td>
<td align="char" valign="top" char=".">(&#x2212;0.169, 0.092)</td>
<td align="char" valign="top" char=".">(&#x2212;0.197, 0.117)</td>
<td align="char" valign="top" char=".">(&#x2212;0.15, 0.07)</td>
<td align="char" valign="top" char=".">98.21%</td>
</tr>
<tr>
<td align="left" valign="top">Age [&#x03B2;<sub>4</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.061 (0.072)</td>
<td align="char" valign="top" char=".">&#x2212;0.062</td>
<td align="char" valign="top" char=".">(&#x2212;0.153, 0.032)</td>
<td align="char" valign="top" char=".">(&#x2212;0.201, 0.082)</td>
<td align="char" valign="top" char=".">(&#x2212;0.226, 0.110)</td>
<td align="char" valign="top" char=".">(&#x2212;0.18, 0.05)</td>
<td align="char" valign="top" char=".">95.48%</td>
</tr>
<tr>
<td align="left" valign="top">Citizenship Status Naturalized Student [&#x03B2;<sub>5</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.073 (0.066)</td>
<td align="char" valign="top" char=".">&#x2212;0.074</td>
<td align="char" valign="top" char=".">(&#x2212;0.156, 0.011)</td>
<td align="char" valign="top" char=".">(&#x2212;0.199, 0.059)</td>
<td align="char" valign="top" char=".">(&#x2212;0.222, 0.086)</td>
<td align="char" valign="top" char=".">(&#x2212;0.18, 0.03)</td>
<td align="char" valign="top" char=".">95.40%</td>
</tr>
<tr>
<td align="left" valign="top">Ethnicity Asian [&#x03B2;<sub>6</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.128 (0.134)</td>
<td align="char" valign="top" char=".">&#x2212;0.127</td>
<td align="char" valign="top" char=".">(&#x2212;0.302, 0.413)</td>
<td align="char" valign="top" char=".">(&#x2212;0.393, 0.127)</td>
<td align="char" valign="top" char=".">(&#x2212;0.453, 0.171)</td>
<td align="char" valign="top" char=".">(&#x2212;0.34, 0.09)</td>
<td align="char" valign="top" char=".">64.75%</td>
</tr>
<tr>
<td align="left" valign="top">Ethnicity White [&#x03B2;<sub>7</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.152 (0.132)</td>
<td align="char" valign="top" char=".">&#x2212;0.151</td>
<td align="char" valign="top" char=".">(&#x2212;0.324, 0.016)</td>
<td align="char" valign="top" char=".">(&#x2212;0.413, 0.100)</td>
<td align="char" valign="top" char=".">(&#x2212;0.471, 0.146)</td>
<td align="char" valign="top" char=".">(&#x2212;0.36, 0.06)</td>
<td align="char" valign="top" char=".">58.63%</td>
</tr>
<tr>
<td align="left" valign="top">Citizenship Status Foreign Student [&#x03B2;<sub>8</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.249 (0.067)</td>
<td align="char" valign="top" char=".">&#x2212;0.249</td>
<td align="char" valign="top" char=".">(&#x2212;0.335, &#x2212;0.162)</td>
<td align="char" valign="top" char=".">(&#x2212;0.381, &#x2212;0.117)</td>
<td align="char" valign="top" char=".">(&#x2212;0.405, &#x2212;0.091)</td>
<td align="char" valign="top" char=".">(&#x2212;0.36, &#x2212;0.14)</td>
<td align="char" valign="top" char=".">15.84%</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Pre-collegiate academic variables</td>
<td align="left" valign="top">High School GPA [&#x03B2;<sub>9</sub>]</td>
<td align="char" valign="top" char=".">0.266 (0.074)</td>
<td align="char" valign="top" char=".">0.226</td>
<td align="char" valign="top" char=".">(0.172, 0.360)</td>
<td align="char" valign="top" char=".">(0.122, 0.410)</td>
<td align="char" valign="top" char=".">(0.095, 0.438)</td>
<td align="char" valign="top" char=".">(0.15, 0.38)</td>
<td align="char" valign="top" char=".">12.40%</td>
</tr>
<tr>
<td align="left" valign="top">SAT Score [&#x03B2;<sub>10</sub>]</td>
<td align="char" valign="top" char=".">0.244 (0.080)</td>
<td align="char" valign="top" char=".">0.243</td>
<td align="char" valign="top" char=".">(0.142, 0.347)</td>
<td align="char" valign="top" char=".">(0.088, 0.401)</td>
<td align="char" valign="top" char=".">(0.060, 0.431)</td>
<td align="char" valign="top" char=".">(0.12, 0.37)</td>
<td align="char" valign="top" char=".">21.78%</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="7">Collegiate characteristics</td>
<td align="left" valign="top">Pre-Cumulative GPA [&#x03B2;<sub>11</sub>]</td>
<td align="char" valign="top" char=".">0.468 (0.066)</td>
<td align="char" valign="top" char=".">0.468</td>
<td align="char" valign="top" char=".">(0.384, 0.554)</td>
<td align="char" valign="top" char=".">(0.339, 0.598)</td>
<td align="char" valign="top" char=".">(0.316, 0.624)</td>
<td align="char" valign="top" char=".">(0.36, 0.57)</td>
<td align="char" valign="top" char=".">0.00%</td>
</tr>
<tr>
<td align="left" valign="top">Enrollment Status Continuing Student [&#x03B2;<sub>12</sub>]</td>
<td align="char" valign="top" char=".">0.272 (0.096)</td>
<td align="char" valign="top" char=".">0.273</td>
<td align="char" valign="top" char=".">(0.148, 0.394)</td>
<td align="char" valign="top" char=".">(0.083, 0.459)</td>
<td align="char" valign="top" char=".">(0.046, 0.490)</td>
<td align="char" valign="top" char=".">(0.12, 0.42)</td>
<td align="char" valign="top" char=".">17.43%</td>
</tr>
<tr>
<td align="left" valign="top">Enrollment Status New Graduate Student [&#x03B2;<sub>13</sub>]</td>
<td align="char" valign="top" char=".">0.194 (0.069)</td>
<td align="char" valign="top" char=".">0.193</td>
<td align="char" valign="top" char=".">(0.107, 0.282)</td>
<td align="char" valign="top" char=".">(0.060, 0.330)</td>
<td align="char" valign="top" char=".">(0.038, 0.357)</td>
<td align="char" valign="top" char=".">(0.09, 0.31)</td>
<td align="char" valign="top" char=".">43.24%</td>
</tr>
<tr>
<td align="left" valign="top">Pre-Total Course Credits [&#x03B2;<sub>14</sub>]</td>
<td align="char" valign="top" char=".">0.139 (0.082)</td>
<td align="char" valign="top" char=".">0.137</td>
<td align="char" valign="top" char=".">(0.034, 0.243)</td>
<td align="char" valign="top" char=".">(&#x2212;0.020, 0.299)</td>
<td align="char" valign="top" char=".">(&#x2212;0.051, 0.329)</td>
<td align="char" valign="top" char=".">(0.01, 0.27)</td>
<td align="char" valign="top" char=".">70.27%</td>
</tr>
<tr>
<td align="left" valign="top">Enrollment Status Transfer Student [&#x03B2;<sub>15</sub>]</td>
<td align="char" valign="top" char=".">0.124 (0.099)</td>
<td align="char" valign="top" char=".">0.124</td>
<td align="char" valign="top" char=".">(&#x2212;0.003, 0.250)</td>
<td align="char" valign="top" char=".">(&#x2212;0.067, 0.317)</td>
<td align="char" valign="top" char=".">(&#x2212;0.104, 0.356)</td>
<td align="char" valign="top" char=".">(&#x2212;0.03, 0.28)</td>
<td align="char" valign="top" char=".">71.95%</td>
</tr>
<tr>
<td align="left" valign="top">Math Placement Score [&#x03B2;<sub>16</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.039 (0.078)</td>
<td align="char" valign="top" char=".">&#x2212;0.040</td>
<td align="char" valign="top" char=".">(&#x2212;0.139, 0.062)</td>
<td align="char" valign="top" char=".">(&#x2212;0.193, 0.116)</td>
<td align="char" valign="top" char=".">(&#x2212;0.220, 0.144)</td>
<td align="char" valign="top" char=".">(&#x2212;0.16, 0.09)</td>
<td align="char" valign="top" char=".">96.34%</td>
</tr>
<tr>
<td align="left" valign="top">Units Taking [&#x03B2;<sub>17</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.116 (0.102)</td>
<td align="char" valign="top" char=".">&#x2212;0.116</td>
<td align="char" valign="top" char=".">(&#x2212;0.246, 0.016)</td>
<td align="char" valign="top" char=".">(&#x2212;0.316, 0.084)</td>
<td align="char" valign="top" char=".">(&#x2212;0.353, 0.121)</td>
<td align="char" valign="top" char=".">(&#x2212;0.28, 0.05)</td>
<td align="char" valign="top" char=".">73.87%</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Financial aid</td>
<td align="left" valign="top">TAP [&#x03B2;<sub>18</sub>]</td>
<td align="char" valign="top" char=".">0.016 (0.075)</td>
<td align="char" valign="top" char=".">0.015</td>
<td align="char" valign="top" char=".">(&#x2212;0.080, 0.111)</td>
<td align="char" valign="top" char=".">(&#x2212;0.131, 0.163)</td>
<td align="char" valign="top" char=".">(&#x2212;0.159, 0.190)</td>
<td align="char" valign="top" char=".">(&#x2212;0.11, 0.13)</td>
<td align="char" valign="top" char=".">98.16%</td>
</tr>
<tr>
<td align="left" valign="top">Aid Amount [&#x03B2;<sub>19</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.015 (0.076)</td>
<td align="char" valign="top" char=".">&#x2212;0.016</td>
<td align="char" valign="top" char=".">(&#x2212;0.112, 0.082)</td>
<td align="char" valign="top" char=".">(&#x2212;0.163, 0.104)</td>
<td align="char" valign="top" char=".">(&#x2212;0.191, 0.161)</td>
<td align="char" valign="top" char=".">(&#x2212;0.13, 0.11)</td>
<td align="char" valign="top" char=".">98.13%</td>
</tr>
<tr>
<td align="left" valign="top">PELL [&#x03B2;<sub>20</sub>]</td>
<td align="char" valign="top" char=".">&#x2212;0.226 (0.086)</td>
<td align="char" valign="top" char=".">&#x2212;0.225</td>
<td align="char" valign="top" char=".">(&#x2212;0.337, &#x2212;0.115)</td>
<td align="char" valign="top" char=".">(&#x2212;0.397, 0.059)</td>
<td align="char" valign="top" char=".">(&#x2212;0.427, &#x2212;0.027)</td>
<td align="char" valign="top" char=".">(&#x2212;0.36, &#x2212;0.09)</td>
<td align="char" valign="top" char=".">30.64%</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Learning management system (LMS)</td>
<td align="left" valign="top">LMS Logins [&#x03B2;<sub>21</sub>]</td>
<td align="char" valign="top" char=".">0.586 (0.082)</td>
<td align="char" valign="top" char=".">0.585</td>
<td align="char" valign="top" char=".">(0.481, 0.691)</td>
<td align="char" valign="top" char=".">(0.428, 0.749)</td>
<td align="char" valign="top" char=".">(0.397, 0.779)</td>
<td align="char" valign="top" char=".">(0.45, 0.71)</td>
<td align="char" valign="top" char=".">0.00%</td>
</tr>
<tr>
<td align="left" valign="top">Total Courses [&#x03B2;<sub>22</sub>]</td>
<td align="char" valign="top" char=".">0.193 (0.101)</td>
<td align="char" valign="top" char=".">0.193</td>
<td align="char" valign="top" char=".">(0.065, 0.322)</td>
<td align="char" valign="top" char=".">(&#x2212;0.004, 0.394)</td>
<td align="char" valign="top" char=".">(&#x2212;0.378, 0.428)</td>
<td align="char" valign="top" char=".">(0.03, 0.35)</td>
<td align="char" valign="top" char=".">45.59%</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Concept inventory (CI) assessments</td>
<td align="left" valign="top">ACORNS KC [&#x03B2;<sub>23</sub>]</td>
<td align="char" valign="top" char=".">0.574 (0.116)</td>
<td align="char" valign="top" char=".">0.570</td>
<td align="char" valign="top" char=".">(0.425, 0.722)</td>
<td align="char" valign="top" char=".">(0.352, 0.804)</td>
<td align="char" valign="top" char=".">(0.311, 0.851)</td>
<td align="char" valign="top" char=".">(0.39, 0.76)</td>
<td align="char" valign="top" char=".">0.03%</td>
</tr>
<tr>
<td align="left" valign="top">CINS [&#x03B2;<sub>24</sub>]</td>
<td align="char" valign="top" char=".">0.500 (0.093)</td>
<td align="char" valign="top" char=".">0.499</td>
<td align="char" valign="top" char=".">(0.382, 0.619)</td>
<td align="char" valign="top" char=".">(0.320, 0.683)</td>
<td align="char" valign="top" char=".">(0.287, 0.719)</td>
<td align="char" valign="top" char=".">(0.35, 0.65)</td>
<td align="char" valign="top" char=".">0.02%</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>WAIC for model: &#x2212;3,384.40.</p>
</table-wrap-foot>
</table-wrap>
<p>All students had the same estimated intercept <inline-formula><mml:math id="M42"><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mn>0</mml:mn></mml:msub><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mn>0.000001</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> and coefficient estimates (i.e., fixed effects). A semester-specific random effects term (<inline-formula><mml:math id="M43"><mml:mrow><mml:msub><mml:mi>&#x03B1;</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> where <inline-formula><mml:math id="M44"><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mn>6</mml:mn></mml:mrow></mml:math></inline-formula>) was added to quantify variability in student performance across the different semesters. Since <inline-formula><mml:math id="M45"><mml:mrow><mml:msub><mml:mi>&#x03B1;</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is a random-effects term, a nested prior for this model parameter was used: <inline-formula><mml:math id="M46"><mml:mrow><mml:msub><mml:mi>&#x03B1;</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>&#x03B1;</mml:mi></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>. In this context, <inline-formula><mml:math id="M47"><mml:mrow><mml:msub><mml:mi>&#x03B1;</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is a normal distribution with a zero mean and precision denoted as <inline-formula><mml:math id="M48"><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>&#x03B1;</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, which follows another statistical distribution <inline-formula><mml:math id="M49"><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>&#x03B1;</mml:mi></mml:msub><mml:mo>~</mml:mo><mml:mtext mathvariant="italic">Gamma</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>0.001</mml:mn><mml:mo>,</mml:mo><mml:mn>0.001</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>; a gamma distribution with a shape and scale parameter value of 0.001. The parameter <inline-formula><mml:math id="M50"><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>&#x03B1;</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is called a hyperparameter and the distribution <inline-formula><mml:math id="M51"><mml:mrow><mml:mtext mathvariant="italic">Gamma</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>0.001</mml:mn><mml:mo>,</mml:mo><mml:mn>0.001</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula> is known as a hyperprior distribution (<xref ref-type="bibr" rid="ref70">Hobbs and Hooten, 2015</xref>). The gamma distribution is a continuous probability distribution that is traditionally used as a prior distribution for the variance when nested priors are used (<xref ref-type="bibr" rid="ref61">Gelman, 2006</xref>). The nesting of priors resembles a hierarchical form of a Bayesian model, which considers data from multiple levels to compare similarities and differences between independent groups (<xref ref-type="bibr" rid="ref115">McCarthy and Masters, 2005</xref>). In this research context, we are interested in discerning whether the term the student took the course (either fall or spring) impacted student performance (retention or attrition), due to differences in the composition of the student body between these semesters.</p>
<p>To identify the features that significantly impacted student retention and attrition in our STEM classroom context, the region of practical equivalence (ROPE) was calculated for each of the model parameters. ROPE corresponds to a statistical &#x201C;null&#x201D; hypothesis for the model parameter. The overlap percentage between each credible interval and ROPE region are used to ascertain statistical significance (<xref ref-type="bibr" rid="ref86">Kruschke, 2011b</xref>). An overlap percentage closer to zero indicates that the feature is significant in the model, while a value closer to 100% indicates that the model parameter is not statistically significant. This differs from the frequentist way of identifying statistically significant features by determining whether their model parameter values differ significantly from zero. Based on the recommendations by <xref ref-type="bibr" rid="ref86">Kruschke (2011b)</xref> and <xref ref-type="bibr" rid="ref116">McElreath (2018)</xref>, a specific type of credible interval based on probability density, known as the 89% high density interval, was used. Since a Bayesian logistic regression model was used in this study, per <xref ref-type="bibr" rid="ref87">Kruschke and Liddell (2018)</xref>, the ROPE range was prespecified between &#x2212;0.18 and 0.18.</p>
<p>The Bayesian model was implemented in JAGS (<xref ref-type="bibr" rid="ref138">Plummer, 2003</xref>) using the R2jags package (<xref ref-type="bibr" rid="ref139">Plummer, 2013</xref>) found in the R programming environment. JAGS uses Markov chain Monte Carlo (MCMC) methods to obtain the posterior distribution for each regression parameter by sampling values from it, following an initial burn-in period, before the posterior distribution stabilizes (<xref ref-type="bibr" rid="ref115">McCarthy and Masters, 2005</xref>).</p>
<p>Posterior distributions for the logistic regression coefficients were computed using two chains. The number of iterations run in the MCMC sampling was 50,000 with a burn-in number of 5,000. Thinning was not applied to the chains and all chains converged unambiguously. Convergence was assessed using the Gelman-Rubric statistic (<inline-formula><mml:math id="M52"><mml:mrow><mml:mover accent="true"><mml:mi>R</mml:mi><mml:mo>^</mml:mo></mml:mover><mml:mo>&#x003C;</mml:mo><mml:mn>1.1</mml:mn></mml:mrow></mml:math></inline-formula>) for all regression parameters (<xref ref-type="bibr" rid="ref26">Brooks and Gelman, 1998</xref>). This model was then used to ascertain the factors that were predictors of student performance in our collegiate biology course setting.</p>
<p>In RQ 2, an empirical Bayesian approach was taken to examine whether incorporating informative priors using knowledge from aggregated historical corpora (i.e., prior information of student performance from past semesters) enhanced model fit, compared to the use of traditional uninformative normal distribution priors. Data from two, three, four, and five past semesters of course data were used to assign values for the prior distributions of the regression coefficients. For this research question, the semester-specific random effect term was omitted. The Bayesian logistic regression model was run on a single subsequent semester of course data (<xref rid="fig2" ref-type="fig">Figure 2</xref>). Since passing and failing rates differed between fall and spring semesters, these terms were also examined separately (<xref rid="fig2" ref-type="fig">Figures 2E</xref>,<xref rid="fig2" ref-type="fig">F</xref>).</p>
<disp-formula id="EQ3"><label>(3)</label><mml:math id="M53"><mml:mrow><mml:mstyle mathvariant="bold"><mml:mtext mathvariant="bold-italic">logit</mml:mtext><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">&#x03B8;</mml:mi><mml:mi mathvariant="bold-italic">i</mml:mi></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">&#x03B2;</mml:mi><mml:mn>0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:mstyle displaystyle="true"><mml:munderover><mml:mo>&#x2211;</mml:mo><mml:mrow><mml:mi mathvariant="bold-italic">p</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mn>24</mml:mn></mml:mrow></mml:munderover></mml:mstyle><mml:msub><mml:mi mathvariant="bold-italic">&#x03B2;</mml:mi><mml:mi mathvariant="bold-italic">p</mml:mi></mml:msub><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi mathvariant="bold-italic">p</mml:mi></mml:msub></mml:mstyle></mml:mrow></mml:math></disp-formula>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption><p>Empirical Bayesian methodology using aggregated semesters of prior data. The &#x201C;Prior Semesters&#x201D; are used to specify the mean and precision for the distribution of the model covariates. The &#x201C;Data&#x201D; terms are the single semesters of course data that the logistic regression models were run on.</p></caption>
<graphic xlink:href="feduc-08-1073829-g002.tif"/>
</fig>
<p>In this modified setup for RQ 2, we used informative normal prior distributions estimated from aggregated past corpora records (<xref ref-type="disp-formula" rid="EQ3">Equation 3</xref>) where <inline-formula><mml:math id="M54"><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>. <inline-formula><mml:math id="M55"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M56"><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> were estimates for the mean and precision of the covariate, which differed depending on whether the predictor was continuous or categorical. For continuous predictors, <inline-formula><mml:math id="M57"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M58"><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> corresponded to the mean and precision for the <italic>p</italic><sup>th</sup> covariate, tabulated from prior course records. For categorical predictors, <inline-formula><mml:math id="M59"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> was the proportion of entries from aggregated semesters, while <inline-formula><mml:math id="M60"><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>= 0.000001. For example, in <xref rid="fig2" ref-type="fig">Figure 2A</xref>, for the continuous covariate age using fall 2015 data, the value <inline-formula><mml:math id="M61"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> was the average age and <inline-formula><mml:math id="M62"><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> was the precision of age for students who took the biology course in fall 2014 and spring 2015. For the categorical covariate pertaining to Asian ethnicity, <inline-formula><mml:math id="M63"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> was the proportion of Asian students enrolled in the biology course in fall 2014 and spring 2015, and <inline-formula><mml:math id="M64"><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mspace width="thickmathspace"/></mml:mrow></mml:math></inline-formula>=&#x2009;0.000001. All mean and precision values were calculated prior to data imputation. The values of <inline-formula><mml:math id="M65"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M66"><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> for all covariates can be found in <xref rid="sec22" ref-type="sec">Supplementary material</xref>. A broad uninformative prior was used for the intercept: <inline-formula><mml:math id="M67"><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mn>0</mml:mn></mml:msub><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mn>0.000001</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:math></inline-formula>.</p>
<p>Unlike RQ 1, for RQ 2 we focused on comparing model fit, instead of studying differences in the model estimates for the Bayesian parameters between individual models. In this auxiliary analysis, posterior distributions were computed using two chains. The number of iterations run in the MCMC sampling was 200,000 with a burn-in number of 50,000. Thinning was not applied to these chains. Model performance using informative prior distributions was compared to when broad uninformative normal distributions priors replaced the informative prior distributions in <xref ref-type="disp-formula" rid="EQ3">Equation 3</xref>: <inline-formula><mml:math id="M68"><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>~</mml:mo><mml:mi>N</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mn>0.000001</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>,</mml:mo><mml:mi>p</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mo>&#x2026;</mml:mo><mml:mo>,</mml:mo><mml:mn>24</mml:mn></mml:mrow></mml:math></inline-formula>.</p>
</sec>
<sec id="sec10">
<label>4.4.</label>
<title>Widely applicable information criterion (WAIC) evaluation metric</title>
<p>For all models, performance was compared using the widely applicable information criterion (WAIC), also known as the Watanabe-Akaike information criterion. This is a generalized version of the Akaike information criterion (<xref ref-type="bibr" rid="ref002">Akaike, 1973</xref>) which is a commonly employed evaluation metric in EDM and LA (<xref ref-type="bibr" rid="ref157">Stamper et al., 2013</xref>). This metric is used to estimate out-of-sample performance for a model by computing a logarithmic pointwise posterior predictive density and correcting this estimate based on the number of parameters included in the model to prevent overfitting (<xref ref-type="bibr" rid="ref63">Gelman et al., 2014</xref>). Smaller WAIC values are indicative of a better fitting model.</p>
</sec>
</sec>
<sec id="sec11" sec-type="results">
<label>5.</label>
<title>Results</title>
<sec id="sec12">
<label>5.1.</label>
<title>(RQ 1): How do various student- and course-specific data types impact the odds of student retention in a STEM classroom context?</title>
<p>Standardized parameter estimates are shown in <xref rid="tab3" ref-type="table">Table 3</xref>. Many traditional university-specific predictors were found to be associated with classroom success. A one standard deviation increase in the student&#x2019;s cumulative collegiate GPA, high school GPA, and SAT score increased their odds of passing the course by 1.600 (60.0%), 1.305 (30.5%) and 1.277 (27.7%), respectively, controlling for all other factors. Compared to native students, international/foreign students were forecasted to perform worst (odds ratio&#x2009;=&#x2009;<inline-formula><mml:math id="M69"><mml:mrow><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>0.249</mml:mn></mml:mrow></mml:msup><mml:mspace width="thickmathspace"/></mml:mrow></mml:math></inline-formula>&#x2009;=&#x2009;0.780), along with students who received a PELL grant (odds ratio&#x2009;=&#x2009;<inline-formula><mml:math id="M70"><mml:mrow><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mo>&#x2212;</mml:mo><mml:mn>0.226</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>&#x2009;=&#x2009;0.798). Relative to new freshmen, transfer students performed slightly, but not significantly better (odds ratio&#x2009;=&#x2009;<inline-formula><mml:math id="M71"><mml:mrow><mml:mspace width="thickmathspace"/><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mn>0.124</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>&#x2009;=&#x2009;1.132). Continuing students (i.e., students who are not taking the biology course during their first term at the institution) were most likely to pass the course (odds ratio&#x2009;=&#x2009;<inline-formula><mml:math id="M72"><mml:mrow><mml:msup><mml:mi>e</mml:mi><mml:mrow><mml:mn>0.272</mml:mn></mml:mrow></mml:msup><mml:mspace width="thickmathspace"/></mml:mrow></mml:math></inline-formula>&#x2009;=&#x2009;1.313).</p>
<p>The magnitude for the course-specific predictors was positive and the largest among all other variables incorporated into the model. LMS logins had the greatest association with student performance; a one standard deviation increase in the number of student logins increased the odds of passing the course by 1.800 (80.0%). While the effects of both CI assessments on student performance were comparable (ACORNS KC: <inline-formula><mml:math id="M73"><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mn>23</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>&#x2009;=&#x2009;0.574; CINS: <inline-formula><mml:math id="M74"><mml:mrow><mml:msub><mml:mi>&#x03B2;</mml:mi><mml:mrow><mml:mn>24</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>&#x2009;=&#x2009;0.500), higher scores on these assessments yielded a greater likelihood of passing (57% and 50% for the ACORNS and CINS assessments, respectively).</p>
<p>Weak semester-specific effects were also observed (<inline-formula><mml:math id="M75"><mml:mrow><mml:msubsup><mml:mi>&#x03C3;</mml:mi><mml:mi>&#x03B1;</mml:mi><mml:mn>2</mml:mn></mml:msubsup><mml:mo>=</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mrow><mml:msub><mml:mi>&#x03C4;</mml:mi><mml:mi>&#x03B1;</mml:mi></mml:msub></mml:mrow></mml:mfrac><mml:mo>=</mml:mo><mml:mn>0.5</mml:mn></mml:mrow></mml:math></inline-formula>). The average modes of the Bayesian posterior densities for the deviations of individual semester effects were non-negative for spring semesters, compared to fall semesters (<xref rid="fig3" ref-type="fig">Figure 3</xref>).</p>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption><p>Bayesian posterior modes for semester-specific random effects. Thick white lines indicate 50% credible intervals, while thin white lines indicate 95% credible intervals.</p></caption>
<graphic xlink:href="feduc-08-1073829-g003.tif"/>
</fig>
</sec>
<sec id="sec13">
<label>5.2.</label>
<title>(RQ 2): Given the ability to integrate prior knowledge into Bayesian models via prespecified probability distributions, does incorporating aggregated historical records of student performance data enhance model fit, compared to when uninformative priors are used?</title>
<p><xref rid="tab4" ref-type="table">Table 4</xref> provides a comparative assessment of the differences between the WAIC values, <inline-formula><mml:math id="M76"><mml:mrow><mml:msub><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi>W</mml:mi><mml:mi>A</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>, between the logistic regression models incorporating uninformative and informative normal distribution priors. Negative values for <inline-formula><mml:math id="M77"><mml:mrow><mml:msub><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi>W</mml:mi><mml:mi>A</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> indicate that the model performed better when uninformative priors were used. Positive values for <inline-formula><mml:math id="M78"><mml:mrow><mml:msub><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi>W</mml:mi><mml:mi>A</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> indicate that the model using informative priors performed better. Mixed results were observed pertaining to the superiority of the logistic regression model when informative prior values were used &#x2013; for some semesters such as spring 2017, informative prior values enhanced model fit except when two semesters of historical data were used to prescribe the normal distribution priors (<inline-formula><mml:math id="M79"><mml:mrow><mml:msub><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi>W</mml:mi><mml:mi>A</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> = &#x2212;34.90). Except for the spring 2017 corpus, the magnitude of <inline-formula><mml:math id="M80"><mml:mrow><mml:msub><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi>W</mml:mi><mml:mi>A</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> increased as more historical data were considered. The best model performance was achieved when prior distribution parameters values were prescribed using data from two prior semesters of the same term (i.e., two fall and two spring semesters). For the fall 2016 and spring 2017 corpus, models incorporating uninformative prior values performed slightly better compared to the use of informative priors (<inline-formula><mml:math id="M81"><mml:mrow><mml:msub><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi>W</mml:mi><mml:mi>A</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>&#x2009;=&#x2009;&#x2212;1.30 for fall 2016 and <inline-formula><mml:math id="M82"><mml:mrow><mml:msub><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi>W</mml:mi><mml:mi>A</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>&#x2009;=&#x2009;&#x2212;34.00 for spring 2017).</p>
<table-wrap position="float" id="tab4">
<label>Table 4</label>
<caption><p>WAIC results comparing Bayesian models using uninformative and informative model priors per the study design in <xref rid="fig2" ref-type="fig">Figure 2</xref>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Semester</th>
<th align="left" valign="top">Number of prior semesters [Prior semesters of course data: Reference from <xref rid="fig2" ref-type="fig">Figure 2</xref>]</th>
<th align="center" valign="top">WAIC using uninformative priors</th>
<th align="center" valign="top">WAIC using informative priors</th>
<th align="center" valign="top"><inline-formula><mml:math id="M83"><mml:mrow><mml:msub><mml:mi mathvariant="normal">&#x0394;</mml:mi><mml:mrow><mml:mi>W</mml:mi><mml:mi>A</mml:mi><mml:mi>I</mml:mi><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>(Uninformative WAIC - informative WAIC)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Fall 2015</td>
<td align="left" valign="top">Two [Fall 2014, Spring 2015: <xref rid="fig2" ref-type="fig">Figure 2A</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;1,391.80</td>
<td align="char" valign="top" char=".">&#x2212;1,459.20</td>
<td align="char" valign="top" char=".">67.40</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="2">Spring 2016</td>
<td align="left" valign="top">Two [Spring 2015, Fall 2015: <xref rid="fig2" ref-type="fig">Figure 2A</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;1,513.00</td>
<td align="char" valign="top" char=".">&#x2212;1,512.60</td>
<td align="char" valign="top" char=".">&#x2212;0.40</td>
</tr>
<tr>
<td align="left" valign="top">Three [Fall 2014, Spring 2015, Fall 2015: <xref rid="fig2" ref-type="fig">Figure 2B</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;2,374.40</td>
<td align="char" valign="top" char=".">&#x2212;2,401.20</td>
<td align="char" valign="top" char=".">26.80</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">Fall 2016</td>
<td align="left" valign="top">Two [Fall 2015, Spring 2016: <xref rid="fig2" ref-type="fig">Figure 2A</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;1,444.60</td>
<td align="char" valign="top" char=".">&#x2212;1,411.30</td>
<td align="char" valign="top" char=".">&#x2212;33.30</td>
</tr>
<tr>
<td align="left" valign="top">Three [Spring 2015, Fall 2015, Spring 2016: <xref rid="fig2" ref-type="fig">Figure 2B</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;1,517.80</td>
<td align="char" valign="top" char=".">&#x2212;1,460.70</td>
<td align="char" valign="top" char=".">&#x2212;57.10</td>
</tr>
<tr>
<td align="left" valign="top">Four [Fall 2014, Spring 2015, Fall 2015, Spring 2016: <xref rid="fig2" ref-type="fig">Figure 2C</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;3,015.20</td>
<td align="char" valign="top" char=".">&#x2212;2,869.00</td>
<td align="char" valign="top" char=".">&#x2212;146.20</td>
</tr>
<tr>
<td align="left" valign="top">Two Fall [Fall 2014, Fall 2015: <xref rid="fig2" ref-type="fig">Figure 2E</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;808.20</td>
<td align="char" valign="top" char=".">&#x2212;806.90</td>
<td align="char" valign="top" char=".">&#x2212;1.30</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="5">Spring 2017</td>
<td align="left" valign="top">Two [Spring 2016, Fall 2016: <xref rid="fig2" ref-type="fig">Figure 2A</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;1,355.70</td>
<td align="char" valign="top" char=".">&#x2212;1,320.80</td>
<td align="char" valign="top" char=".">&#x2212;34.90</td>
</tr>
<tr>
<td align="left" valign="top">Three [Fall 2015, Spring 2016, Fall 2016: <xref rid="fig2" ref-type="fig">Figure 2B</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;1,400.40</td>
<td align="char" valign="top" char=".">&#x2212;1,445.20</td>
<td align="char" valign="top" char=".">44.80</td>
</tr>
<tr>
<td align="left" valign="top">Four [Spring 2015, Fall 2015, Spring 2016, Fall 2016: <xref rid="fig2" ref-type="fig">Figure 2C</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;2,856.00</td>
<td align="char" valign="top" char=".">&#x2212;2,886.70</td>
<td align="char" valign="top" char=".">30.70</td>
</tr>
<tr>
<td align="left" valign="top">Five[Fall 2014, Spring 2015, Fall 2015, Spring 2016, Fall 2016: <xref rid="fig2" ref-type="fig">Figure 2D</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;3,740.10</td>
<td align="char" valign="top" char=".">&#x2212;3,766.70</td>
<td align="char" valign="top" char=".">26.60</td>
</tr>
<tr>
<td align="left" valign="top">Two Spring [Spring 2015, Spring 2016: <xref rid="fig2" ref-type="fig">Figure 2F</xref>]</td>
<td align="char" valign="top" char=".">&#x2212;400.90</td>
<td align="char" valign="top" char=".">&#x2212;366.90</td>
<td align="char" valign="top" char=".">&#x2212;34.00</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>Smaller WAIC values indicate a better fitting model.</p>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec id="sec14" sec-type="discussions">
<label>6.</label>
<title>Discussion</title>
<p>Modeling student performance is not a new development in EDM and LA (see <xref ref-type="bibr" rid="ref34">Chatti et al., 2012</xref>; <xref ref-type="bibr" rid="ref41">Clow, 2013</xref>; <xref ref-type="bibr" rid="ref154">Sin and Muthu, 2015</xref>; <xref ref-type="bibr" rid="ref92">Lang et al., 2017</xref>). Although a plethora of studies have investigated different mathematical frameworks for modeling student outcomes in STEM settings using ML and frequentist methods, much less AI educational research has used Bayesian methods to explore the impact of different data types and sources on student performance.</p>
<p>The answer to RQ 1 is that course-specific data types provided the greatest insight into student performance patterns. A one standard deviation increase in LMS logins and CI scores significantly increased the odds of course retention. These findings are consistent with similar observations in other classroom contexts and STEM disciplines that utilized non-Bayesian methods, demonstrating the utility of these novel assessment types as being highly informative of student retention and attrition (<xref ref-type="bibr" rid="ref147">Salehi et al., 2019</xref>; <xref ref-type="bibr" rid="ref153">Simmons and Heckler, 2020</xref>; <xref ref-type="bibr" rid="ref18">Bertolini, 2021</xref>; <xref ref-type="bibr" rid="ref38">Chen and Zhang, 2021</xref>). While prior academic experiences were identified as factors that were significant predictors of course performance in our biology course setting, they were not as strong predictors as those derived from AI-driven technology; this finding supports calls for educators to embrace and incorporate these tools into the classroom environment since they can be used to provide valuable insights into student performance.</p>
<p>The inclusion of LMS data in EDM and LA models have been predominantly utilized in online, blended, or flipped classroom environments where they were deemed necessary tools for guiding administrative and pedagogical interventions (see <xref ref-type="bibr" rid="ref7">Al-Shabandar et al., 2017</xref>; <xref ref-type="bibr" rid="ref176">Wang, 2017</xref>; <xref ref-type="bibr" rid="ref100">Lisitsyna and Oreshin, 2019</xref>; <xref ref-type="bibr" rid="ref152">Shayan and van Zaanen, 2019</xref>; <xref ref-type="bibr" rid="ref104">Louhab et al., 2020</xref>; <xref ref-type="bibr" rid="ref126">Nieuwoudt, 2020</xref>). Our findings demonstrated that using technological resources with in-class instruction provided greater insights into student achievement. While not considered, the utility of other information extracted from (LMSs) (e.g., student access to course deliverables; see <xref ref-type="bibr" rid="ref32">Chandler and Skallos, 2012</xref>) aside from student login data should be examined to further explore student comprehension, learning, and course interaction (<xref ref-type="bibr" rid="ref20">Bertolini et al., 2021b</xref>).</p>
<p>Since instructors may be more confident in their ability to address student misconceptions of various course topics instead of developing models to forecast classroom success, CIs were incorporated since they are capable of diagnosing student learning barriers (<xref ref-type="bibr" rid="ref67">Haudek et al., 2011</xref>; <xref ref-type="bibr" rid="ref123">Nehm, 2019</xref>). It is important to note that there are some documented cases where incorporating multiple CI assessments on the same subject matter into the classroom environment may cloud intervention planning (<xref ref-type="bibr" rid="ref42">Coletta et al., 2007</xref>; <xref ref-type="bibr" rid="ref93">Lasry et al., 2011</xref>). While performance on the AI-scored ACORNS and traditionally scored CINS was positively correlated (<italic>&#x03C1;</italic>&#x2009;=&#x2009;0.321) across all six semesters, we do not believe that studying both diminishes the impact of these CIs due to the nature of the two assessments. The ACORNS is a constructed-response assessment that requires a student to generate expository responses to explain evolutionary concepts (i.e., develop scientific explanations), while the CINS is a multiple-choice assessment that prompts students to recognize accurate information (i.e., select a statement). Our findings suggest that utilizing CI assessments with a diverse array of question types may provide differential and greater insight into student learning. While pre-and post-hoc analyses have examined student performance on these assessments before and after course completion, it is still an open question in biology education research whether the administration of these CI assessments at different time points in the course would be more effective in quantifying and forecasting student success (<xref ref-type="bibr" rid="ref177">Wang, 2018</xref>; <xref ref-type="bibr" rid="ref125">Nehm et al., 2022</xref>).</p>
<p>Demographic characteristics were not significant factors that impacted classroom performance, compared to student academic attributes in this classroom context. This finding is consistent with many non-Bayesian EDM and LA studies (<xref ref-type="bibr" rid="ref96">Leppel, 2002</xref>; <xref ref-type="bibr" rid="ref161">Thomas and Galambos, 2004</xref>; <xref ref-type="bibr" rid="ref74">Hussain et al., 2018</xref>; <xref ref-type="bibr" rid="ref132">Paquette et al., 2020</xref>; <xref ref-type="bibr" rid="ref19">Bertolini et al., 2021a</xref>). Except for PELL recipients, financial aid data were not highly informative in quantifying the odds of passing this biology course. These data types were included since financial needs have a negative effect on student persistence in STEM (<xref ref-type="bibr" rid="ref76">Johnson, 2012</xref>; <xref ref-type="bibr" rid="ref31">Castleman et al., 2018</xref>). It is important to note that these data types should not be considered as proxies for individual or parental socioeconomic status since they group middle-income and low-income students together, as well as undercount the latter group (see <xref ref-type="bibr" rid="ref160">Tebbs and Turner, 2005</xref>; <xref ref-type="bibr" rid="ref50">Delisle, 2017</xref>). Further scrutiny of these features is needed given these limitations.</p>
<p>New freshmen students were less likely to pass the course compared to transfer students, even though there is substantial documentation that transfer students struggle academically after transitioning to a 4-year institution (<xref ref-type="bibr" rid="ref90">Laanan, 2001</xref>; <xref ref-type="bibr" rid="ref54">Duggan and Pickering, 2008</xref>; <xref ref-type="bibr" rid="ref151">Shaw et al., 2019</xref>). There are several factors that may have contributed to this finding. While a significant portion of student attrition occurs in the student&#x2019;s first term at an institution (<xref ref-type="bibr" rid="ref49">Delen, 2011</xref>; <xref ref-type="bibr" rid="ref109">Martin, 2017</xref>; <xref ref-type="bibr" rid="ref130">Ortiz-Lozano et al., 2018</xref>), for new freshmen, academic performance is strongly associated with each student&#x2019;s social interaction with the campus environment (<xref ref-type="bibr" rid="ref163">Tinto, 1987</xref>; <xref ref-type="bibr" rid="ref175">Virdyanawaty and Mansur, 2016</xref>; <xref ref-type="bibr" rid="ref162">Thomas et al., 2018</xref>). Large introductory STEM courses have often been associated with student alienation (<xref ref-type="bibr" rid="ref28">Brown and Fitzke, 2019</xref>). Furthermore, insufficient mastery of prerequisite material coupled with a decrease in morale may also be attributed to poorer freshmen performance in a course (<xref ref-type="bibr" rid="ref114">McCarthy and Kuh, 2006</xref>). Further research should explore these factors in this and other collegiate STEM courses by educational stakeholders and institutional researchers at our university.</p>
<p>Minimal variability was observed between semester-specific effects, consistent with the findings of <xref ref-type="bibr" rid="ref19">Bertolini et al. (2021a</xref>,<xref ref-type="bibr" rid="ref20">b)</xref> who compared ML performance using frequentist statistical techniques. Differences between student enrollment characteristics were likely the reason for the disproportionate number of passing and failing students between the fall and spring course offerings. In addition to having a lower passing rate, the fall semesters enrolled students with lower high school GPAs (mean: 91.8 vs. 93.0) and more transfer students (8.7% vs. 4.7%), compared to spring semesters.</p>
<p>Although the current study focused on developing a Bayesian framework to examine retention and attrition, factors that impact student persistence, it is valuable to consider the ways in which the results could be applied to our classroom setting, given that these methodologies have received limited attention in the literature (<xref ref-type="bibr" rid="ref18">Bertolini, 2021</xref>). By identifying student characteristics and features that impact student performance, instructors and academic stakeholders can work to develop educational interventions and psychosocial support structures to foster student success (see <xref ref-type="bibr" rid="ref20">Bertolini et al., 2021b</xref> for a list of examples). Overall, while diverse data types have the potential to enhance the generality of student success predictions and guide instructor engagement and action, these findings suggest that educational interventions and psychosocial groups should be structured based on both the academic achievements and characteristics of students. For example, if the instructor chooses to place students into collaborative learning groups, these support structures should avoid homogeneous groups composed of students likely to fail the course (e.g., new freshmen and international students). At the institution level, educational stakeholders can work to provide greater support services for these students through tutoring, outreach, and mentoring services. While students on track to succeed can benefit from an intervention, timely identification of struggling students is critical to reduce attrition and high dropout STEM rates (<xref ref-type="bibr" rid="ref130">Ortiz-Lozano et al., 2018</xref>; <xref ref-type="bibr" rid="ref20">Bertolini et al., 2021b</xref>).</p>
<p>In RQ 2, using informative priors from aggregated past semesters of course corpora (i.e., more historical semesters) did not always enhance model fit. Some prior work in education found that utilizing information from larger data sets improves model performance (<xref ref-type="bibr" rid="ref56">Epling et al., 2003</xref>; <xref ref-type="bibr" rid="ref22">Boyd and Crawford, 2011</xref>; <xref ref-type="bibr" rid="ref98">Liao et al., 2019</xref>). The purpose of presenting this empirical analysis was to mirror prior frequentist EDM, LA, and ML studies where researchers increased the amount of historical data used in their training corpora to see if this enhanced model efficacy (<xref ref-type="bibr" rid="ref18">Bertolini, 2021</xref>). Since the use of Bayesian inference is nascent in education, incorporating subjective and elucidated priors are a documented concern for educators since it is difficult for them to precisely decide what the distributions for model parameters should be, and they fear that this specification of prior knowledge may allow researchers to deliberately bias posterior results (<xref ref-type="bibr" rid="ref81">Kassler et al., 2019</xref>). It is imperative to note that the underlying mathematical frameworks of frequentist techniques also utilize implicit priors; however, they are rather nonsensical since underlying parameters are fixed and remain constant even during data resampling. Many education researchers are likely unaware of these priors governing traditional frequentist models, even though they have been adhered to and incorporated into a plethora of educational research contexts. Greater knowledge and instruction on the mathematical underpinnings of frequentist and Bayesian techniques are warranted and may provide educators with a new perspective and greater appreciation toward using informative prior distributions in Bayesian analytics, embracing them as a pragmatic alternative to frequentist statistical methodologies.</p>
<p>For these educational corpora examined in this research context, this empirical Bayesian design may not always be suitable for establishing informative normally distributed priors for covariates using historical data, as indicated by the large amount of variability in model performance and fit shown in <xref rid="tab4" ref-type="table">Table 4</xref>. This differs from other educational studies which found that incorporating informative priors leads to more meaningful insights into student comprehension and learning (<xref ref-type="bibr" rid="ref77">Johnson and Jenkins, 2004</xref>; <xref ref-type="bibr" rid="ref89">Kubsch et al., 2021</xref>). Several plausible reasons that may account for our contrasting findings include (1) running models on a single semester of course data (either fall or spring), (2) variability of student engagement and heterogeneity in the students&#x2019; aptitude over different semesters, (3) more selective admissions criteria over different academic terms, and (4) choice of the normal prior distribution. The role of domain-specific knowledge and further scrutiny of these prior distributions and model parameters need to be the focus of future Bayesian educational studies going forward.</p>
<p>ML and its integration with AI technology has tremendous potential to enhance student learning activities, assessments, and scientific inquiries, while providing academic stakeholders with greater insight into student learning, cognition, and performance to address a plethora of STEM challenges (<xref ref-type="bibr" rid="ref190">Zhai et al., 2020b</xref>; <xref ref-type="bibr" rid="ref187">Zhai, 2021</xref>). Our study demonstrated that Bayesian methods are another tool that educators can utilize to quantify student retention and attrition, factors that impact student performance, in the science classroom. These techniques are a more intuitive approach to the rejection/acceptance criteria of frequentist methods, linking prior assumptions, and data together to provide a quantitative distribution of final model parameter estimates. Additional studies in the EDM and LA literature are needed to continue studying the effectiveness of these methods in alternative educational contexts, STEM settings, and AI/ML educational tools for informing data-driven pedagogical decisions.</p>
</sec>
<sec id="sec15">
<label>7.</label>
<title>Limitations and future directions</title>
<p>There are several limitations to this observational study. The results obtained are corpora dependent and may not generalize to other introductory STEM classes based on (1) institution type (e.g., public, private, for-profit), (2) class size, (3) course duration, and (4) course content coverage (<xref ref-type="bibr" rid="ref20">Bertolini et al., 2021b</xref>). Given the centrality of evolution to the undergraduate biology curriculum (<xref ref-type="bibr" rid="ref24">Brewer and Smith, 2011</xref>), we used scores from the ACORNS and CINS assessments. There are many additional published, validated, and commonly employed CI assessments that should be studied as alternative possible sources for modeling (<xref ref-type="bibr" rid="ref123">Nehm, 2019</xref>). Furthermore, Bayesian methods should be applied to examine whether the findings in this manuscript generalize to other STEM subjects (e.g., physics, chemistry) and classroom contexts (e.g., smaller classes, summer, or winter sessions).</p>
<p>In this study, we focused on comparing model performance and fit using the WAIC metric. Other Bayesian evaluation metrics, such as Bayes factor, were not utilized in the study since this metric does not explicitly include a term quantifying model complexity; furthermore, the Bayes factor tends to be unstable and sensitive to the choice of the prior distribution (<xref ref-type="bibr" rid="ref80">Kadane and Lazar, 2004</xref>; <xref ref-type="bibr" rid="ref179">Ward, 2008</xref>). Moreover, we also did not employ the deviance information criterion since this metric is not a completely Bayesian evaluation metric (<xref ref-type="bibr" rid="ref141">Richards, 2005</xref>; <xref ref-type="bibr" rid="ref113">McCarthy, 2007</xref>; <xref ref-type="bibr" rid="ref155">Spiegelhalter et al., 2014</xref>).</p>
<p>One premise of this study was to identify the features associated with biology classroom success. An analogous analysis can use these Bayesian logistic regression models to predict student success in subsequent semesters of the course offering. Moreover, alternative prior distributions, aside from a normal distribution, for the regression parameters should be considered in future studies, including regularization priors and variable selection methodologies.</p>
<p>Biology course performance was categorized as a dichotomous outcome. In future studies, the student&#x2019;s raw course grade can be modeled using linear regression techniques. Individualized logistic regression models were not run in this study since they have been thoroughly explored in other EDM and LA studies (see <xref ref-type="bibr" rid="ref65">Goldstein et al., 2007</xref>; <xref ref-type="bibr" rid="ref39">Chowdry et al., 2013</xref>; <xref ref-type="bibr" rid="ref94">Lee et al., 2015</xref>; <xref ref-type="bibr" rid="ref177">Wang, 2018</xref>). Furthermore, we did not consider synergistic effects between different covariates in this analysis. A comprehensive study of these interactions would be a pragmatic next step.</p>
<p>In the future, this work can be extended to model student performance in online and hybrid classroom settings. Due to the recent and dramatic rise of remote instruction, leveraging diverse forms of information from other AI-enhanced learning tools, as well as phenotypic variables from video conferencing software, may provide greater insight into student learning and comprehension. Moreover, the inclusion of these data types has the potential to yield more accurate predictions of retention and attrition, factors that impact student performance, when aggregated with traditional university-specific corpora (<xref ref-type="bibr" rid="ref18">Bertolini, 2021</xref>).</p>
</sec>
<sec id="sec16" sec-type="conclusions">
<label>8.</label>
<title>Conclusion</title>
<p>The special issue <italic>AI for Tackling STEM Education Challenges</italic> focuses on the technological, educational, and methodological advances devised by academic researchers in AI to address a multitude of STEM educational challenges. While ML algorithms have been widely used in the literature to discern insights into student performance patterns, our study has sought to advance this work by demonstrating that Bayesian inference techniques are a useful and pragmatic alternative for ascertaining the differential association between traditional and novel assessment data types on STEM retention and attrition. Features extracted from the LMS and CI assessments were found to be the most significant factors associated with student performance in a baccalaureate biology course setting, compared to traditional features such as demographics and prior course performance. These findings are a small, yet important step for leveraging the power of Bayesian modeling to examine educational outcomes and aid stakeholders in designing personalized content, interventions, and psychosocial structures to support student STEM success.</p>
</sec>
<sec id="sec17" sec-type="data-availability">
<title>Data availability statement</title>
<p>The data analyzed in this study is subject to the following licenses/restrictions: The research grant supporting this study is still ongoing. Therefore, all data analyzed in this study will be available from the corresponding author on reasonable request after the grant end date of August 2023. Requests to access these datasets should be directed to <email>roberto.bertolini@alumni.stonybrook.edu</email>.</p>
</sec>
<sec id="sec18">
<title>Ethics statement</title>
<p>The studies involving human participants were reviewed and approved by Stony Brook University. The patients/participants provided their written informed consent to participate in this study.</p>
</sec>
<sec id="sec19">
<title>Author contributions</title>
<p>RB, SF, and RN conceptualized the study, reviewed and approved the final manuscript. RB performed all data analyses, prepared all tables and figures, and wrote the first draft of the manuscript. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec id="sec20" sec-type="funding-information">
<title>Funding</title>
<p>The Howard Hughes Medical Institute Science Education Program provided funding (grant number 79545). The views in this contribution do not necessarily reflect those of the Howard Hughes Medical Institute.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec100" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack>
<p>The authors thank Heather J. Lynch for helpful suggestions and for providing feedback on an early draft of the methods section in this manuscript. We also acknowledge Yaqi Xue and Nora Galambos for assembling the data files analyzed in this study. The authors thank the Howard Hughes Medical Institute Science Education Program for providing funding. We also thank the guest editor and reviewers for their helpful comments and feedback on the manuscript.</p>
</ack>
<sec id="sec22" sec-type="supplementary-material">
<title>Supplementary material</title>
<p>The Supplementary material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/feduc.2023.1073829/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/feduc.2023.1073829/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Afzaal</surname> <given-names>M.</given-names></name> <name><surname>Nouri</surname> <given-names>J.</given-names></name> <name><surname>Zia</surname> <given-names>A.</given-names></name> <name><surname>Papapetrou</surname> <given-names>P.</given-names></name> <name><surname>Fors</surname> <given-names>U.</given-names></name> <name><surname>Wu</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Explainable AI for data-driven feedback and intelligent action recommendations to support students self-regulation</article-title>. <source>Front. Artif. Intell.</source> <volume>4</volume>:<fpage>723447</fpage>. doi: <pub-id pub-id-type="doi">10.3389/frai.2021.723447</pub-id>, PMID: <pub-id pub-id-type="pmid">34870183</pub-id></citation></ref>
<ref id="ref2"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Ahmed</surname> <given-names>D. M.</given-names></name> <name><surname>Abdulazeez</surname> <given-names>A. M.</given-names></name> <name><surname>Zeebaree</surname> <given-names>D. Q.</given-names></name> <name><surname>Ahmed</surname> <given-names>F. Y.</given-names></name></person-group> (<year>2021</year>). &#x201C;<article-title>Predicting university&#x2019;s students performance based on machine learning techniques</article-title>,&#x201D; in <conf-name>2021 IEEE International Conference on Automatic Control &#x0026; Intelligent Systems (I2CACIS)</conf-name>. (<conf-loc>Shah Alam, Malaysia</conf-loc>: <publisher-name>IEEE</publisher-name>) <fpage>276</fpage>&#x2013;<lpage>281</lpage>.</citation></ref>
<ref id="ref002"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Akaike</surname> <given-names>H.</given-names></name></person-group> (<year>1973</year>). &#x201C;<article-title>Information theory and an extension of the maximum likelihood principle</article-title>&#x201D; in <source>2nd International Symposium on Information Theory</source>. eds. <person-group person-group-type="editor"><name><surname>Petrov</surname> <given-names>B. N.</given-names></name> <name><surname>Cs&#x00E1;ki</surname> <given-names>F.</given-names></name></person-group> (<publisher-loc>Tsahkadsor, Armenia, USSR. Budapest</publisher-loc>: <publisher-name>Akad&#x00E9;miai Kiad&#x00F3;</publisher-name>), <fpage>267</fpage>&#x2013;<lpage>281</lpage>.</citation></ref>
<ref id="ref3"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Alam</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). &#x201C;<article-title>Employing adaptive learning and intelligent tutoring robots for virtual classrooms and smart campuses: reforming education in the age of artificial intelligence</article-title>&#x201D; in <source>Advanced computing and intelligent technologies</source> (<publisher-loc>Singapore</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>395</fpage>&#x2013;<lpage>406</lpage>.</citation></ref>
<ref id="ref4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Albreiki</surname> <given-names>B.</given-names></name></person-group> (<year>2022</year>). <article-title>Framework for automatically suggesting remedial actions to help students at risk based on explainable ML and rule-based models</article-title>. <source>Int. J. Educ. Technol. High. Educ.</source> <volume>19</volume>, <fpage>1</fpage>&#x2013;<lpage>26</lpage>. doi: <pub-id pub-id-type="doi">10.1186/s41239-022-00354-6</pub-id></citation></ref>
<ref id="ref5"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Allenby</surname> <given-names>G. M.</given-names></name> <name><surname>Rossi</surname> <given-names>P. E.</given-names></name></person-group> (<year>2006</year>). &#x201C;<article-title>Hierarchical bayes models</article-title>&#x201D; in <source>The Handbook of Marketing Research: Uses, Misuses, and Future Advances</source>. <publisher-loc>Thousand Oaks, California, United States</publisher-loc>: <publisher-name>SAGE Publications, Inc.</publisher-name>, <fpage>418</fpage>&#x2013;<lpage>440</lpage>.</citation></ref>
<ref id="ref6"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Almond</surname> <given-names>R. G.</given-names></name> <name><surname>Mislevy</surname> <given-names>R. J.</given-names></name> <name><surname>Steinberg</surname> <given-names>L. S.</given-names></name> <name><surname>Yan</surname> <given-names>D.</given-names></name> <name><surname>Williamson</surname> <given-names>D. M.</given-names></name></person-group> (<year>2015</year>). <source>Bayesian Networks in Educational Assessment</source>. <publisher-loc>New York, United States</publisher-loc>: <publisher-name>Springer</publisher-name>.</citation></ref>
<ref id="ref7"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Al-Shabandar</surname> <given-names>R.</given-names></name> <name><surname>Hussain</surname> <given-names>A.</given-names></name> <name><surname>Laws</surname> <given-names>A.</given-names></name> <name><surname>Keight</surname> <given-names>R.</given-names></name> <name><surname>Lunn</surname> <given-names>J.</given-names></name> <name><surname>Radi</surname> <given-names>N.</given-names></name></person-group> (<year>2017</year>). &#x201C;<article-title>Machine learning approaches to predict learning outcomes in Massive open online courses</article-title>&#x201D; in <conf-name>2017 International Joint Conference on Neural Networks (IJCNN) (IEEE)</conf-name>. <fpage>713</fpage>&#x2013;<lpage>71720</lpage>.</citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anderson</surname> <given-names>D. L.</given-names></name> <name><surname>Fisher</surname> <given-names>K. M.</given-names></name> <name><surname>Norman</surname> <given-names>G. J.</given-names></name></person-group> (<year>2002</year>). <article-title>Development and evaluation of the conceptual inventory of natural selection</article-title>. <source>J. Res. Sci. Teach.</source> <volume>39</volume>, <fpage>952</fpage>&#x2013;<lpage>978</lpage>. doi: <pub-id pub-id-type="doi">10.1002/tea.10053</pub-id>, PMID: <pub-id pub-id-type="pmid">12115116</pub-id></citation></ref>
<ref id="ref9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Arrieta</surname> <given-names>A. B.</given-names></name> <name><surname>D&#x00ED;az-Rodr&#x00ED;guez</surname> <given-names>N.</given-names></name> <name><surname>Del Ser</surname> <given-names>J.</given-names></name> <name><surname>Bennetot</surname> <given-names>A.</given-names></name> <name><surname>Tabik</surname> <given-names>S.</given-names></name> <name><surname>Barbado</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>Explainable Artificial Intelligence (XAI): concepts, taxonomies, opportunities and challenges toward responsible AI</article-title>. <source>Inf. Fusion.</source> <volume>58</volume>, <fpage>82</fpage>&#x2013;<lpage>115</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.inffus.2019.12.012</pub-id></citation></ref>
<ref id="ref10"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Ayers</surname> <given-names>E.</given-names></name> <name><surname>Junker</surname> <given-names>B. W.</given-names></name></person-group> (<year>2006</year>). &#x201C;<article-title>Do skills combine additively to predict task difficulty in eighth grade mathematics</article-title>&#x201D; in <source>Educational data mining: Papers from the AAAI Workshop</source> (<publisher-loc>Washington D.C., United States</publisher-loc>: <publisher-name>AAAI Press</publisher-name>).</citation></ref>
<ref id="ref11"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Baashar</surname> <given-names>Y.</given-names></name> <name><surname>Alkawsi</surname> <given-names>G.</given-names></name> <name><surname>Ali</surname> <given-names>N. A.</given-names></name> <name><surname>Alhussian</surname> <given-names>H.</given-names></name> <name><surname>Bahbouh</surname> <given-names>H. T.</given-names></name></person-group> (<year>2021</year>). &#x201C;<article-title>Predicting student&#x2019;s performance using machine learning methods: a systematic literature review</article-title>&#x201D; in <source>2021 International Conference on Computer &#x0026; Information Sciences (ICCOINS) IEEE</source>, <fpage>357</fpage>&#x2013;<lpage>362</lpage>.</citation></ref>
<ref id="ref12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baker</surname> <given-names>R. S.</given-names></name></person-group> (<year>2010</year>). <article-title>Data mining for education</article-title>. <source>Int. Encycl. Educ.</source> <volume>7</volume>, <fpage>112</fpage>&#x2013;<lpage>118</lpage>. doi: <pub-id pub-id-type="doi">10.1016/B978-0-08-044894-7.01318-X</pub-id></citation></ref>
<ref id="ref13"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Baker</surname> <given-names>T.</given-names></name> <name><surname>Smith</surname> <given-names>L.</given-names></name> <name><surname>Anissa</surname> <given-names>N.</given-names></name></person-group> (<year>2019</year>). <source>Educ-AI-Tion Rebooted? Exploring the Future of Artificial Intelligence in Schools and Colleges</source> (<publisher-loc>London</publisher-loc>: <publisher-name>Nesta</publisher-name>). Available at: <ext-link xlink:href="https://www.nesta.org.uk/report/education-rebooted" ext-link-type="uri">https://www.nesta.org.uk/report/education-rebooted</ext-link> (Accessed January 28, 2023).</citation></ref>
<ref id="ref14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ba&#x00F1;eres</surname> <given-names>D.</given-names></name> <name><surname>Rodr&#x00ED;guez</surname> <given-names>M. E.</given-names></name> <name><surname>Guerrero-Rold&#x00E1;n</surname> <given-names>A. E.</given-names></name> <name><surname>Karadeniz</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>An early warning system to detect at-risk students in online higher education</article-title>. <source>Appl. Sci.</source> <volume>10</volume>:<fpage>4427</fpage>. doi: <pub-id pub-id-type="doi">10.3390/app10134427</pub-id></citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Banner</surname> <given-names>K. M.</given-names></name> <name><surname>Irvine</surname> <given-names>K. M.</given-names></name> <name><surname>Rodhouse</surname> <given-names>T. J.</given-names></name></person-group> (<year>2020</year>). <article-title>The use of Bayesian priors in ecology: the good, the bad and the not great</article-title>. <source>Methods Ecol. Evol.</source> <volume>11</volume>, <fpage>882</fpage>&#x2013;<lpage>889</lpage>. doi: <pub-id pub-id-type="doi">10.1111/2041-210X.13407</pub-id></citation></ref>
<ref id="ref16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berens</surname> <given-names>J.</given-names></name> <name><surname>Schneider</surname> <given-names>K.</given-names></name> <name><surname>G&#x00F6;rtz</surname> <given-names>S.</given-names></name> <name><surname>Oster</surname> <given-names>S.</given-names></name> <name><surname>Burghoff</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Early detection of students at risk &#x2013; predicting student dropouts using administrative student data and machine learning methods</article-title>. <source>J. Educ. Data Mining.</source> <volume>11</volume>, <fpage>1</fpage>&#x2013;<lpage>41</lpage>. doi: <pub-id pub-id-type="doi">10.5281/zenodo.3594771</pub-id></citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berger</surname> <given-names>J. O.</given-names></name> <name><surname>Berry</surname> <given-names>D. A.</given-names></name></person-group> (<year>1988</year>). <article-title>Statistical analysis and the illusion of objectivity</article-title>. <source>Am. Sci.</source> <volume>76</volume>, <fpage>159</fpage>&#x2013;<lpage>165</lpage>.</citation></ref>
<ref id="ref18"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bertolini</surname> <given-names>R.</given-names></name></person-group> (<year>2021</year>). <source>Evaluating performance variability of data pipelines for binary classification with applications to predictive learning analytics. [Dissertation]</source>. <publisher-loc>Stony Brook (NY)</publisher-loc>: <publisher-name>Stony Brook University</publisher-name>.</citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bertolini</surname> <given-names>R.</given-names></name> <name><surname>Finch</surname> <given-names>S. J.</given-names></name> <name><surname>Nehm</surname> <given-names>R. H.</given-names></name></person-group> (<year>2021a</year>). <article-title>Enhancing data pipelines for forecasting student performance: integrating feature selection with cross-validation</article-title>. <source>Int. J. Educ. Technol. High. Educ.</source> <volume>18</volume>, <fpage>1</fpage>&#x2013;<lpage>23</lpage>. doi: <pub-id pub-id-type="doi">10.1186/s41239-021-00279-6</pub-id></citation></ref>
<ref id="ref20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bertolini</surname> <given-names>R.</given-names></name> <name><surname>Finch</surname> <given-names>S. J.</given-names></name> <name><surname>Nehm</surname> <given-names>R. H.</given-names></name></person-group> (<year>2021b</year>). <article-title>Testing the impact of novel assessment sources and machine learning methods on predictive outcome modeling in undergraduate biology</article-title>. <source>J. Sci. Educ. Technol.</source> <volume>30</volume>, <fpage>193</fpage>&#x2013;<lpage>209</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10956-020-09888-8</pub-id></citation></ref>
<ref id="ref21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bertolini</surname> <given-names>R.</given-names></name> <name><surname>Finch</surname> <given-names>S. J.</given-names></name> <name><surname>Nehm</surname> <given-names>R. H.</given-names></name></person-group> (<year>2022</year>). <article-title>Quantifying variability in predictions of student performance: examining the impact of bootstrap resampling in data pipelines</article-title>. <source>Comput. Educ. Artif. Intell.</source> <volume>3</volume>:<fpage>100067</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.caeai.2022.100067</pub-id></citation></ref>
<ref id="ref22"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Boyd</surname> <given-names>D.</given-names></name> <name><surname>Crawford</surname> <given-names>K.</given-names></name></person-group> (<year>2011</year>). &#x201C;<article-title>Six provocations for big data</article-title>&#x201D; in <source>A decade in internet time: Symposium on the dynamics of the internet and society</source>. <publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford Institute</publisher-name>.</citation></ref>
<ref id="ref23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brassil</surname> <given-names>C. E.</given-names></name> <name><surname>Couch</surname> <given-names>B. A.</given-names></name></person-group> (<year>2019</year>). <article-title>Multiple-true-false questions reveal more thoroughly the complexity of student thinking than multiple-choice questions: a Bayesian item response model comparison</article-title>. <source>Int. J. STEM Educ.</source> <volume>6</volume>, <fpage>1</fpage>&#x2013;<lpage>17</lpage>. doi: <pub-id pub-id-type="doi">10.1186/s40594-019-0169-0</pub-id></citation></ref>
<ref id="ref24"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Brewer</surname> <given-names>C. A.</given-names></name> <name><surname>Smith</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <source>Vision and change in undergraduate biology education: a call to action</source>. <publisher-name>American Association for the Advancement of Science</publisher-name>, <publisher-loc>Washington, DC</publisher-loc>.</citation></ref>
<ref id="ref25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brooks</surname> <given-names>S.</given-names></name></person-group> (<year>1998</year>). <article-title>Markov chain Monte Carlo method and its application</article-title>. <source>J. R. Stat. Soc. Ser. D (The Statistician).</source> <volume>47</volume>, <fpage>69</fpage>&#x2013;<lpage>100</lpage>.</citation></ref>
<ref id="ref26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brooks</surname> <given-names>S. P.</given-names></name> <name><surname>Gelman</surname> <given-names>A.</given-names></name></person-group> (<year>1998</year>). <article-title>General methods for monitoring convergence of iterative simulations</article-title>. <source>J. Comput. Graph. Stat.</source> <volume>7</volume>, <fpage>434</fpage>&#x2013;<lpage>455</lpage>.</citation></ref>
<ref id="ref27"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Brooks</surname> <given-names>C.</given-names></name> <name><surname>Thompson</surname> <given-names>C.</given-names></name></person-group> (<year>2017</year>). &#x201C;<article-title>Predictive modelling in teaching and learning</article-title>&#x201D; in <source>Handbook of learning analytics</source>, <fpage>61</fpage>&#x2013;<lpage>68</lpage>.</citation></ref>
<ref id="ref28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>M. A.</given-names></name> <name><surname>Fitzke</surname> <given-names>R. E.</given-names></name></person-group> (<year>2019</year>). <article-title>The importance of student engagement and experiential learning in undergraduate education</article-title>. <source>J. Undergrad. Res.</source> <volume>10</volume>:<fpage>2</fpage>. Available at <ext-link xlink:href="https://par.nsf.gov/servlets/purl/10204919" ext-link-type="uri">https://par.nsf.gov/servlets/purl/10204919</ext-link> (Accessed January 28, 2023).</citation></ref>
<ref id="ref29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cascallar</surname> <given-names>E.</given-names></name> <name><surname>Musso</surname> <given-names>M.</given-names></name> <name><surname>Kyndt</surname> <given-names>E.</given-names></name> <name><surname>Dochy</surname> <given-names>F.</given-names></name></person-group> (<year>2014</year>). <article-title>Modelling for understanding AND for prediction/classification--the power of neural networks in research</article-title>. <source>Frontline Learn. Res.</source> <volume>2</volume>, <fpage>67</fpage>&#x2013;<lpage>81</lpage>. doi: <pub-id pub-id-type="doi">10.14786/flr.v2i5.135</pub-id></citation></ref>
<ref id="ref30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Casella</surname> <given-names>G.</given-names></name> <name><surname>Ghosh</surname> <given-names>M.</given-names></name> <name><surname>Gill</surname> <given-names>J.</given-names></name> <name><surname>Kyung</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Penalized regression, standard errors, and Bayesian lassos</article-title>. <source>Bayesian Anal.</source> <volume>5</volume>, <fpage>369</fpage>&#x2013;<lpage>411</lpage>. doi: <pub-id pub-id-type="doi">10.1214/10-BA607</pub-id></citation></ref>
<ref id="ref31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Castleman</surname> <given-names>B. L.</given-names></name> <name><surname>Long</surname> <given-names>B. T.</given-names></name> <name><surname>Mabel</surname> <given-names>Z.</given-names></name></person-group> (<year>2018</year>). <article-title>Can financial aid help to address the growing need for STEM education? The effects of need-based grants on the completion of science, technology, engineering, and math courses and degrees</article-title>. <source>J. Policy Anal. Manage.</source> <volume>37</volume>, <fpage>136</fpage>&#x2013;<lpage>166</lpage>. doi: <pub-id pub-id-type="doi">10.1002/pam.22039</pub-id></citation></ref>
<ref id="ref32"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Chandler</surname> <given-names>S. D.</given-names></name> <name><surname>Skallos</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). &#x201C;<article-title>Do Learning Management System Tools Help Students Learn?</article-title>&#x201D; in <conf-name>23rd International Conference on College Teaching and Learning</conf-name>.</citation></ref>
<ref id="ref33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chang</surname> <given-names>M. J.</given-names></name> <name><surname>Sharkness</surname> <given-names>J.</given-names></name> <name><surname>Hurtado</surname> <given-names>S.</given-names></name> <name><surname>Newman</surname> <given-names>C. B.</given-names></name></person-group> (<year>2014</year>). <article-title>What matters in college for retaining aspiring scientists and engineers from underrepresented racial groups</article-title>. <source>J. Res. Sci. Teach.</source> <volume>51</volume>, <fpage>555</fpage>&#x2013;<lpage>580</lpage>. doi: <pub-id pub-id-type="doi">10.1002/tea.21146</pub-id></citation></ref>
<ref id="ref34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chatti</surname> <given-names>M. A.</given-names></name> <name><surname>Dyckhoff</surname> <given-names>A. L.</given-names></name> <name><surname>Schroeder</surname> <given-names>U.</given-names></name> <name><surname>Th&#x00FC;s</surname> <given-names>H.</given-names></name></person-group> (<year>2012</year>). <article-title>A reference model for learning analytics</article-title>. <source>Int. J. Technol. Enhanced Learn.</source> <volume>4</volume>, <fpage>318</fpage>&#x2013;<lpage>331</lpage>. doi: <pub-id pub-id-type="doi">10.1504/IJTEL.2012.051815</pub-id>, PMID: <pub-id pub-id-type="pmid">36627410</pub-id></citation></ref>
<ref id="ref35"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name></person-group> (<year>2013</year>). <italic>STEM Attrition: College Students&#x2019; Paths into and out of STEM Fields</italic>. Statistical Analysis Report. NCES 2014-001. National Center for Education Statistics.</citation></ref>
<ref id="ref36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>L.</given-names></name> <name><surname>Chen</surname> <given-names>P.</given-names></name> <name><surname>Lin</surname> <given-names>Z.</given-names></name></person-group> (<year>2020</year>). <article-title>Artificial intelligence in education: a review</article-title>. <source>IEEE Access.</source> <volume>8</volume>, <fpage>75264</fpage>&#x2013;<lpage>75278</lpage>. doi: <pub-id pub-id-type="doi">10.1109/ACCESS.2020.2988510</pub-id>, PMID: <pub-id pub-id-type="pmid">36654703</pub-id></citation></ref>
<ref id="ref37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Xie</surname> <given-names>H.</given-names></name> <name><surname>Zou</surname> <given-names>D.</given-names></name> <name><surname>Hwang</surname> <given-names>G. J.</given-names></name></person-group> (<year>2020</year>). <article-title>Application and theory gaps during the rise of artificial intelligence in education</article-title>. <source>Comput. Educ.: Artif. Intell.</source> <volume>1</volume>:<fpage>100002</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.caeai.2020.100002</pub-id></citation></ref>
<ref id="ref38"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Z.</given-names></name> <name><surname>Zhang</surname> <given-names>T.</given-names></name></person-group> (<year>2021</year>). Analyzing the Heterogeneous Impact of Remote Learning on Students&#x2019; Ability to Stay on Track During the Pandemic. <italic>arXiv</italic> [2108.00601]. Available at: <ext-link xlink:href="https://arxiv.org/abs/2108.00601" ext-link-type="uri">https://arxiv.org/abs/2108.00601</ext-link> (Accessed October 7, 2022).</citation></ref>
<ref id="ref39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chowdry</surname> <given-names>H.</given-names></name> <name><surname>Crawford</surname> <given-names>C.</given-names></name> <name><surname>Dearden</surname> <given-names>L.</given-names></name> <name><surname>Goodman</surname> <given-names>A.</given-names></name> <name><surname>Vignoles</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <article-title>Widening participation in higher education: analysis using linked administrative data</article-title>. <source>J. R. Stat. Soc. A. Stat. Soc.</source> <volume>176</volume>, <fpage>431</fpage>&#x2013;<lpage>457</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1467-985X.2012.01043.x</pub-id></citation></ref>
<ref id="ref40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Choy</surname> <given-names>S. L.</given-names></name> <name><surname>O&#x2019;Leary</surname> <given-names>R.</given-names></name> <name><surname>Mengersen</surname> <given-names>K.</given-names></name></person-group> (<year>2009</year>). <article-title>Elicitation by design in ecology: using expert opinion to inform priors for Bayesian statistical models</article-title>. <source>Ecology</source> <volume>90</volume>, <fpage>265</fpage>&#x2013;<lpage>277</lpage>. doi: <pub-id pub-id-type="doi">10.1890/07-1886.1</pub-id>, PMID: <pub-id pub-id-type="pmid">19294931</pub-id></citation></ref>
<ref id="ref41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clow</surname> <given-names>D.</given-names></name></person-group> (<year>2013</year>). <article-title>An overview of learning analytics</article-title>. <source>Teach. High. Educ.</source> <volume>18</volume>, <fpage>683</fpage>&#x2013;<lpage>695</lpage>. doi: <pub-id pub-id-type="doi">10.1080/13562517.2013.827653</pub-id>, PMID: <pub-id pub-id-type="pmid">36635924</pub-id></citation></ref>
<ref id="ref42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Coletta</surname> <given-names>V. P.</given-names></name> <name><surname>Phillips</surname> <given-names>J. A.</given-names></name> <name><surname>Steinert</surname> <given-names>J. J.</given-names></name></person-group> (<year>2007</year>). <article-title>Interpreting force concept inventory scores: normalized gain and SAT scores</article-title>. <source>Phys. Rev. Spec. Top. &#x2013; Phys. Educ. Res.</source> <volume>3</volume>:<fpage>010106</fpage>. doi: <pub-id pub-id-type="doi">10.1103/PhysRevSTPER.3.010106</pub-id></citation></ref>
<ref id="ref43"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Conati</surname> <given-names>C.</given-names></name> <name><surname>Porayska-Pomsta</surname> <given-names>K.</given-names></name> <name><surname>Mavrikis</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). AI in Education needs interpretable machine learning: Lessons from Open Learner Modelling. <italic>arXiv</italic> [1807.00154]. Available at: <ext-link xlink:href="https://arxiv.org/abs/1807.00154" ext-link-type="uri">https://arxiv.org/abs/1807.00154</ext-link> (Accessed October 7, 2022).</citation></ref>
<ref id="ref44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Corbett</surname> <given-names>A. T.</given-names></name> <name><surname>Anderson</surname> <given-names>J. R.</given-names></name></person-group> (<year>1994</year>). <article-title>Knowledge tracing: modeling the acquisition of procedural knowledge</article-title>. <source>User Model. User-Adap. Inter.</source> <volume>4</volume>, <fpage>253</fpage>&#x2013;<lpage>278</lpage>.</citation></ref>
<ref id="ref45"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Coughlin</surname> <given-names>M. A.</given-names></name> <name><surname>Pagano</surname> <given-names>M.</given-names></name></person-group> (<year>1997</year>). <source>Case study applications of statistics in institutional research: resources in institutional research, number ten</source>. <publisher-name>Association for Institutional Research, Florida State University</publisher-name>, <publisher-loc>Tallahassee, FL</publisher-loc>.</citation></ref>
<ref id="ref46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Crisp</surname> <given-names>G.</given-names></name> <name><surname>Doran</surname> <given-names>E.</given-names></name> <name><surname>Salis Reyes</surname> <given-names>N. A.</given-names></name></person-group> (<year>2018</year>). <article-title>Predicting graduation rates at 4-year broad access institutions using a Bayesian modeling approach</article-title>. <source>Res. High. Educ.</source> <volume>59</volume>, <fpage>133</fpage>&#x2013;<lpage>155</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s11162-017-9459-x</pub-id></citation></ref>
<ref id="ref47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cui</surname> <given-names>Y.</given-names></name> <name><surname>Chu</surname> <given-names>M. W.</given-names></name> <name><surname>Chen</surname> <given-names>F.</given-names></name></person-group> (<year>2019</year>). <article-title>Analyzing student process data in game-based assessment with Bayesian knowledge tracing and dynamic Bayesian networks</article-title>. <source>J. Educ. Data Mining.</source> <volume>11</volume>, <fpage>80</fpage>&#x2013;<lpage>100</lpage>. doi: <pub-id pub-id-type="doi">10.5281/zenodo.3554751</pub-id></citation></ref>
<ref id="ref48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Culbertson</surname> <given-names>M. J.</given-names></name></person-group> (<year>2016</year>). <article-title>Bayesian networks in educational assessment: the state of the field</article-title>. <source>Appl. Psychol. Meas.</source> <volume>40</volume>, <fpage>3</fpage>&#x2013;<lpage>21</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0146621615590401</pub-id>, PMID: <pub-id pub-id-type="pmid">29881033</pub-id></citation></ref>
<ref id="ref49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delen</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <article-title>Predicting student attrition with data mining methods</article-title>. <source>J. College Stud. Retention: Res. Theory Pract.</source> <volume>13</volume>, <fpage>17</fpage>&#x2013;<lpage>35</lpage>. doi: <pub-id pub-id-type="doi">10.2190/CS.13.1.b</pub-id></citation></ref>
<ref id="ref50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Delisle</surname> <given-names>J.</given-names></name></person-group> (<year>2017</year>). <article-title>The Pell Grant proxy: a ubiquitous but flawed measure of low-income student enrollment</article-title>. <source>Evidence Speaks Rep.</source> <volume>2</volume>, <fpage>1</fpage>&#x2013;<lpage>12</lpage>. Available at <ext-link xlink:href="https://www.brookings.edu/wp-content/uploads/2017/10/pell-grants-report.pdf" ext-link-type="uri">https://www.brookings.edu/wp-content/uploads/2017/10/pell-grants-report.pdf</ext-link> (Accessed January 28, 2023).</citation></ref>
<ref id="ref51"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Desmarais</surname> <given-names>M. C.</given-names></name> <name><surname>Gagnon</surname> <given-names>M.</given-names></name></person-group> (<year>2006</year>). &#x201C;<article-title>Bayesian student models based on item to item knowledge structures</article-title>&#x201D; in <source>European Conference on Technology Enhanced Learning</source>, <publisher-name>Springer</publisher-name> <fpage>111</fpage>&#x2013;<lpage>124</lpage>.</citation></ref>
<ref id="ref52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dienes</surname> <given-names>Z.</given-names></name></person-group> (<year>2011</year>). <article-title>Bayesian versus orthodox statistics: which side are you on?</article-title> <source>Perspect. Psychol. Sci.</source> <volume>6</volume>, <fpage>274</fpage>&#x2013;<lpage>290</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1745691611406920</pub-id>, PMID: <pub-id pub-id-type="pmid">26168518</pub-id></citation></ref>
<ref id="ref53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Drigas</surname> <given-names>A. S.</given-names></name> <name><surname>Argyri</surname> <given-names>K.</given-names></name> <name><surname>Vrettaros</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>Decade review (1999-2009): progress of application of artificial intelligence tools in student diagnosis</article-title>. <source>Int. J. Social Humanistic Comput.</source> <volume>1</volume>, <fpage>175</fpage>&#x2013;<lpage>191</lpage>. doi: <pub-id pub-id-type="doi">10.1504/IJSHC.2009.031006</pub-id></citation></ref>
<ref id="ref54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Duggan</surname> <given-names>M. H.</given-names></name> <name><surname>Pickering</surname> <given-names>J. W.</given-names></name></person-group> (<year>2008</year>). <article-title>Barriers to transfer student academic success and retention</article-title>. <source>J. College Stud. Retention: Res. Theory Pract.</source> <volume>9</volume>, <fpage>437</fpage>&#x2013;<lpage>459</lpage>. doi: <pub-id pub-id-type="doi">10.2190/CS.9.4.c</pub-id>, PMID: <pub-id pub-id-type="pmid">33897099</pub-id></citation></ref>
<ref id="ref55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ellison</surname> <given-names>A. M.</given-names></name></person-group> (<year>1996</year>). <article-title>An introduction to Bayesian inference for ecological research and environmental decision-making</article-title>. <source>Ecol. Appl.</source> <volume>6</volume>, <fpage>1036</fpage>&#x2013;<lpage>1046</lpage>. doi: <pub-id pub-id-type="doi">10.2307/2269588</pub-id></citation></ref>
<ref id="ref56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Epling</surname> <given-names>M.</given-names></name> <name><surname>Timmons</surname> <given-names>S.</given-names></name> <name><surname>Wharrad</surname> <given-names>H.</given-names></name></person-group> (<year>2003</year>). <article-title>An educational panopticon? New technology, nurse education and surveillance</article-title>. <source>Nurse Educ. Today</source> <volume>23</volume>, <fpage>412</fpage>&#x2013;<lpage>418</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0260-6917(03)00002-9</pub-id>, PMID: <pub-id pub-id-type="pmid">12900189</pub-id></citation></ref>
<ref id="ref57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fern&#x00E1;ndez-Caram&#x00E9;s</surname> <given-names>T. M.</given-names></name> <name><surname>Fraga-Lamas</surname> <given-names>P.</given-names></name></person-group> (<year>2019</year>). <article-title>Towards next generation teaching, learning, and context-aware applications for higher education: a review on blockchain, IoT, fog and edge computing enabled smart campuses and universities</article-title>. <source>Appl. Sci.</source> <volume>9</volume>:<fpage>4479</fpage>. doi: <pub-id pub-id-type="doi">10.3390/app9214479</pub-id></citation></ref>
<ref id="ref58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fordyce</surname> <given-names>J. A.</given-names></name> <name><surname>Gompert</surname> <given-names>Z.</given-names></name> <name><surname>Forister</surname> <given-names>M. L.</given-names></name> <name><surname>Nice</surname> <given-names>C. C.</given-names></name></person-group> (<year>2011</year>). <article-title>A hierarchical Bayesian approach to ecological count data: a flexible tool for ecologists</article-title>. <source>PLoS One</source> <volume>6</volume>:<fpage>e26785</fpage>. doi: <pub-id pub-id-type="doi">10.1371/journal.pone.0026785</pub-id>, PMID: <pub-id pub-id-type="pmid">22132077</pub-id></citation></ref>
<ref id="ref59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fornacon-Wood</surname> <given-names>I.</given-names></name> <name><surname>Mistry</surname> <given-names>H.</given-names></name> <name><surname>Johnson-Hart</surname> <given-names>C.</given-names></name> <name><surname>Faivre-Finn</surname> <given-names>C.</given-names></name> <name><surname>O&#x2019;Connor</surname> <given-names>J. P.</given-names></name> <name><surname>Price</surname> <given-names>G. J.</given-names></name></person-group> (<year>2022</year>). <article-title>Understanding the differences between Bayesian and frequentist statistics</article-title>. <source>Int. J. Radiat. Oncol. Biol. Phys.</source> <volume>112</volume>, <fpage>1076</fpage>&#x2013;<lpage>1082</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.ijrobp.2021.12.011</pub-id>, PMID: <pub-id pub-id-type="pmid">35286881</pub-id></citation></ref>
<ref id="ref60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gebretekle</surname> <given-names>T. K.</given-names></name> <name><surname>Goshu</surname> <given-names>A. T.</given-names></name></person-group> (<year>2019</year>). <article-title>Bayesian analysis of retention and graduation of female students of higher education institution: the case of Hawassa University (HU), Ethiopia</article-title>. <source>Am. J. Theor. Appl. Stat.</source> <volume>8</volume>, <fpage>47</fpage>&#x2013;<lpage>66</lpage>. doi: <pub-id pub-id-type="doi">10.11648/j.ajtas.20190802.12</pub-id></citation></ref>
<ref id="ref61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gelman</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). <article-title>Prior distributions for variance parameters in hierarchical models</article-title>. <source>Bayesian Anal.</source> <volume>1</volume>, <fpage>515</fpage>&#x2013;<lpage>534</lpage>. doi: <pub-id pub-id-type="doi">10.1214/06-BA117A</pub-id></citation></ref>
<ref id="ref62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gelman</surname> <given-names>A.</given-names></name> <name><surname>Hill</surname> <given-names>J.</given-names></name> <name><surname>Yajima</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>Why we (usually) don&#x2019;t have to worry about multiple comparisons</article-title>. <source>J. Res. Educ. Effect.</source> <volume>5</volume>, <fpage>189</fpage>&#x2013;<lpage>211</lpage>. doi: <pub-id pub-id-type="doi">10.1080/19345747.2011.618213</pub-id></citation></ref>
<ref id="ref63"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gelman</surname> <given-names>A.</given-names></name> <name><surname>Hwang</surname> <given-names>J.</given-names></name> <name><surname>Vehtari</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Understanding predictive information criteria for Bayesian models</article-title>. <source>Stat. Comput.</source> <volume>24</volume>, <fpage>997</fpage>&#x2013;<lpage>1016</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s11222-013-9416-2</pub-id>, PMID: <pub-id pub-id-type="pmid">36517117</pub-id></citation></ref>
<ref id="ref64"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gelman</surname> <given-names>A.</given-names></name> <name><surname>Lee</surname> <given-names>D.</given-names></name> <name><surname>Guo</surname> <given-names>J.</given-names></name></person-group> (<year>2015</year>). <article-title>Stan: a probabilistic programming language for Bayesian inference and optimization</article-title>. <source>J. Educ. Behav. Stat.</source> <volume>40</volume>, <fpage>530</fpage>&#x2013;<lpage>543</lpage>. doi: <pub-id pub-id-type="doi">10.3102/1076998615606113</pub-id></citation></ref>
<ref id="ref65"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goldstein</surname> <given-names>H.</given-names></name> <name><surname>Burgess</surname> <given-names>S.</given-names></name> <name><surname>McConnell</surname> <given-names>B.</given-names></name></person-group> (<year>2007</year>). <article-title>Modelling the effect of pupil mobility on school differences in educational achievement</article-title>. <source>J. R. Stat. Soc. A. Stat. Soc.</source> <volume>170</volume>, <fpage>941</fpage>&#x2013;<lpage>954</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1467-985X.2007.00491.x</pub-id></citation></ref>
<ref id="ref66"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hand</surname> <given-names>D. J.</given-names></name> <name><surname>Yu</surname> <given-names>K.</given-names></name></person-group> (<year>2001</year>). <article-title>Idiot&#x2019;s Bayes &#x2013; not so stupid after all?</article-title> <source>Int. Stat. Rev.</source> <volume>69</volume>, <fpage>385</fpage>&#x2013;<lpage>398</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1751-5823.2001.tb00465.x</pub-id></citation></ref>
<ref id="ref67"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haudek</surname> <given-names>K. C.</given-names></name> <name><surname>Kaplan</surname> <given-names>J. J.</given-names></name> <name><surname>Knight</surname> <given-names>J.</given-names></name> <name><surname>Long</surname> <given-names>T.</given-names></name> <name><surname>Merrill</surname> <given-names>J.</given-names></name> <name><surname>Munn</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Harnessing technology to improve formative assessment of student conceptions in STEM: forging a national network</article-title>. <source>CBE&#x2013;Life Sci. Educ.</source> <volume>10</volume>, <fpage>149</fpage>&#x2013;<lpage>155</lpage>. doi: <pub-id pub-id-type="doi">10.1187/cbe.11-03-0019</pub-id>, PMID: <pub-id pub-id-type="pmid">21633063</pub-id></citation></ref>
<ref id="ref68"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Hien</surname> <given-names>N. T. N.</given-names></name> <name><surname>Haddawy</surname> <given-names>P.</given-names></name></person-group> (<year>2007</year>). &#x201C;<article-title>A decision support system for evaluating international student applications</article-title>&#x201D; in <source>2007 37th annual frontiers in education conference &#x2013; global engineering: knowledge without borders, opportunities without passports (IEEE), F2A-1</source>.</citation></ref>
<ref id="ref69"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Higdem</surname> <given-names>J. L.</given-names></name> <name><surname>Kostal</surname> <given-names>J. W.</given-names></name> <name><surname>Kuncel</surname> <given-names>N. R.</given-names></name> <name><surname>Sackett</surname> <given-names>P. R.</given-names></name> <name><surname>Shen</surname> <given-names>W.</given-names></name> <name><surname>Beatty</surname> <given-names>A. S.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>The role of socioeconomic status in SAT&#x2013;freshman grade relationships across gender and racial subgroups</article-title>. <source>Educ. Meas. Issues Pract.</source> <volume>35</volume>, <fpage>21</fpage>&#x2013;<lpage>28</lpage>. doi: <pub-id pub-id-type="doi">10.1111/emip.12103</pub-id></citation></ref>
<ref id="ref70"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hobbs</surname> <given-names>N. T.</given-names></name> <name><surname>Hooten</surname> <given-names>M. B.</given-names></name></person-group> (<year>2015</year>). <source>Bayesian models</source>. <publisher-loc>Princeton, New Jersey, United States</publisher-loc>: <publisher-name>Princeton University Press</publisher-name>.</citation></ref>
<ref id="ref71"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hobson</surname> <given-names>M. P.</given-names></name> <name><surname>Jaffe</surname> <given-names>A. H.</given-names></name> <name><surname>Liddle</surname> <given-names>A. R.</given-names></name> <name><surname>Mukherjee</surname> <given-names>P.</given-names></name> <name><surname>Parkinson</surname> <given-names>D.</given-names></name></person-group> (<year>2010</year>). <source>Bayesian methods in cosmology</source>. <publisher-loc>Cambridge, England</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="ref001"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Homer</surname> <given-names>M.</given-names></name></person-group> (<year>2016</year>). <source>The future of quantitative educational research methods: Bigger, better and, perhaps, bayesian</source>? Available at: <ext-link xlink:href="http://hpp.education.leeds.ac.uk/wp-content/uploads/sites/131/2016/02/HPP2016-3-Homer.pdf" ext-link-type="uri">http://hpp.education.leeds.ac.uk/wp-content/uploads/sites/131/2016/02/HPP2016-3-Homer.pdf</ext-link> (Accessed January 28, 2023).</citation></ref>
<ref id="ref72"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hooten</surname> <given-names>M. B.</given-names></name> <name><surname>Hefley</surname> <given-names>T. J.</given-names></name></person-group> (<year>2019</year>). <source>Bringing Bayesian models to life</source>. <publisher-loc>Boca Raton, Florida</publisher-loc>: <publisher-name>CRC Press</publisher-name>.</citation></ref>
<ref id="ref73"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>K. T.</given-names></name> <name><surname>Ball</surname> <given-names>C.</given-names></name> <name><surname>Francis</surname> <given-names>J.</given-names></name> <name><surname>Ratan</surname> <given-names>R.</given-names></name> <name><surname>Boumis</surname> <given-names>J.</given-names></name> <name><surname>Fordham</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Augmented versus virtual reality in education: an exploratory study examining science knowledge retention when using augmented reality/virtual reality mobile applications</article-title>. <source>Cyberpsychol. Behav. Soc. Netw.</source> <volume>22</volume>, <fpage>105</fpage>&#x2013;<lpage>110</lpage>. doi: <pub-id pub-id-type="doi">10.1089/cyber.2018.0150</pub-id>, PMID: <pub-id pub-id-type="pmid">30657334</pub-id></citation></ref>
<ref id="ref74"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hussain</surname> <given-names>S.</given-names></name> <name><surname>Dahan</surname> <given-names>N. A.</given-names></name> <name><surname>Ba-Alwib</surname> <given-names>F. M.</given-names></name> <name><surname>Ribata</surname> <given-names>N.</given-names></name></person-group> (<year>2018</year>). <article-title>Educational data mining and analysis of students&#x2019; academic performance using WEKA</article-title>. <source>Indones. J. Electr. Eng. Comput. Sci.</source> <volume>9</volume>, <fpage>447</fpage>&#x2013;<lpage>459</lpage>. doi: <pub-id pub-id-type="doi">10.11591/ijeecs.v9.i2.pp447-459</pub-id></citation></ref>
<ref id="ref75"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ikuma</surname> <given-names>L. H.</given-names></name> <name><surname>Steele</surname> <given-names>A.</given-names></name> <name><surname>Dann</surname> <given-names>S.</given-names></name> <name><surname>Adio</surname> <given-names>O.</given-names></name> <name><surname>Waggenspack</surname> <given-names>W. N.</given-names> <suffix>Jr.</suffix></name></person-group> (<year>2019</year>). <article-title>Large-scale student programs increase persistence in STEM fields in a public university setting</article-title>. <source>J. Eng. Educ.</source> <volume>108</volume>, <fpage>57</fpage>&#x2013;<lpage>81</lpage>. doi: <pub-id pub-id-type="doi">10.1002/jee.20244</pub-id></citation></ref>
<ref id="ref76"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Johnson</surname> <given-names>M. H.</given-names></name></person-group> (<year>2012</year>). <source>An analysis of retention factors in undergraduate degree programs in science, technology, engineering, and mathematics. [Dissertation]</source>. <publisher-loc>(Missoula (MT)</publisher-loc>: <publisher-name>University of Montana</publisher-name>.</citation></ref>
<ref id="ref77"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johnson</surname> <given-names>M. S.</given-names></name> <name><surname>Jenkins</surname> <given-names>F.</given-names></name></person-group> (<year>2004</year>). <article-title>A Bayesian hierarchical model for large-scale educational surveys: an application to the National Assessment of Educational Progress</article-title>. <source>ETS Res. Rep. Ser.</source> <volume>2004</volume>, <fpage>i</fpage>&#x2013;<lpage>28</lpage>. doi: <pub-id pub-id-type="doi">10.1002/j.2333-8504.2004.tb01965.x</pub-id></citation></ref>
<ref id="ref78"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jokhan</surname> <given-names>A.</given-names></name> <name><surname>Sharma</surname> <given-names>B.</given-names></name> <name><surname>Singh</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <article-title>Early warning system as a predictor for student performance in higher education blended courses</article-title>. <source>Stud. High. Educ.</source> <volume>44</volume>, <fpage>1900</fpage>&#x2013;<lpage>1911</lpage>. doi: <pub-id pub-id-type="doi">10.1080/03075079.2018.1466872</pub-id></citation></ref>
<ref id="ref79"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kabudi</surname> <given-names>T.</given-names></name> <name><surname>Pappas</surname> <given-names>I.</given-names></name> <name><surname>Olsen</surname> <given-names>D. H.</given-names></name></person-group> (<year>2021</year>). <article-title>AI-enabled adaptive learning systems: a systematic mapping of the literature</article-title>. <source>Comput. Educ.: Artif. Intell.</source> <volume>2</volume>:<fpage>100017</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.caeai.2021.100017</pub-id></citation></ref>
<ref id="ref80"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kadane</surname> <given-names>J. B.</given-names></name> <name><surname>Lazar</surname> <given-names>N. A.</given-names></name></person-group> (<year>2004</year>). <article-title>Methods and criteria for model selection</article-title>. <source>J. Am. Stat. Assoc.</source> <volume>99</volume>, <fpage>279</fpage>&#x2013;<lpage>290</lpage>. doi: <pub-id pub-id-type="doi">10.1198/016214504000000269</pub-id>, PMID: <pub-id pub-id-type="pmid">36658559</pub-id></citation></ref>
<ref id="ref81"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kassler</surname> <given-names>D.</given-names></name> <name><surname>Nichols-Barrer</surname> <given-names>I.</given-names></name> <name><surname>Finucane</surname> <given-names>M.</given-names></name></person-group> (<year>2019</year>). <article-title>Beyond &#x201C;treatment versus control&#x201D;: how Bayesian analysis makes factorial experiments feasible in educational research</article-title>. <source>Eval. Rev.</source> <volume>4</volume>, <fpage>238</fpage>&#x2013;<lpage>261</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0193841X1881890</pub-id></citation></ref>
<ref id="ref82"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Komaki</surname> <given-names>F.</given-names></name></person-group> (<year>2006</year>). <article-title>Shrinkage priors for Bayesian prediction</article-title>. <source>Ann. Stat.</source> <volume>34</volume>, <fpage>808</fpage>&#x2013;<lpage>819</lpage>. doi: <pub-id pub-id-type="doi">10.1214/009053606000000010</pub-id></citation></ref>
<ref id="ref83"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>K&#x00F6;nig</surname> <given-names>C.</given-names></name> <name><surname>van de Schoot</surname> <given-names>R.</given-names></name></person-group> (<year>2018</year>). <article-title>Bayesian statistics in educational research: a look at the current state of affairs</article-title>. <source>Educ. Rev.</source> <volume>70</volume>, <fpage>486</fpage>&#x2013;<lpage>509</lpage>. doi: <pub-id pub-id-type="doi">10.1080/00131911.2017.1350636</pub-id></citation></ref>
<ref id="ref84"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kricorian</surname> <given-names>K.</given-names></name> <name><surname>Seu</surname> <given-names>M.</given-names></name> <name><surname>Lopez</surname> <given-names>D.</given-names></name> <name><surname>Ureta</surname> <given-names>E.</given-names></name> <name><surname>Equils</surname> <given-names>O.</given-names></name></person-group> (<year>2020</year>). <article-title>Factors influencing participation of underrepresented students in STEM fields: matched mentors and mindsets</article-title>. <source>Int. J. STEM Educ.</source> <volume>7</volume>, <fpage>1</fpage>&#x2013;<lpage>9</lpage>. doi: <pub-id pub-id-type="doi">10.1186/s40594-020-00219-2</pub-id></citation></ref>
<ref id="ref85"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kruschke</surname> <given-names>J. K.</given-names></name></person-group> (<year>2011a</year>). <article-title>Bayesian assessment of null values via parameter estimation and model comparison</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>6</volume>, <fpage>299</fpage>&#x2013;<lpage>312</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1745691611406925</pub-id>, PMID: <pub-id pub-id-type="pmid">26168520</pub-id></citation></ref>
<ref id="ref86"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Kruschke</surname> <given-names>J. K.</given-names></name></person-group> (<year>2011b</year>). <source>Doing Bayesian data analysis: a tutorial with R and BUGS</source>. <publisher-loc>London, United Kingdom</publisher-loc>: <publisher-name>Academic Press</publisher-name>.</citation></ref>
<ref id="ref87"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kruschke</surname> <given-names>J. K.</given-names></name> <name><surname>Liddell</surname> <given-names>T. M.</given-names></name></person-group> (<year>2018</year>). <article-title>The Bayesian New Statistics: hypothesis testing, estimation, meta-analysis, and power analysis from a Bayesian perspective</article-title>. <source>Psychon. Bull. Rev.</source> <volume>25</volume>, <fpage>178</fpage>&#x2013;<lpage>206</lpage>. doi: <pub-id pub-id-type="doi">10.3758/s13423-016-1221-4</pub-id>, PMID: <pub-id pub-id-type="pmid">28176294</pub-id></citation></ref>
<ref id="ref88"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Kubsch</surname> <given-names>M.</given-names></name> <name><surname>Czinczel</surname> <given-names>B.</given-names></name> <name><surname>Lossjew</surname> <given-names>J.</given-names></name> <name><surname>Wyrwich</surname> <given-names>T.</given-names></name> <name><surname>Bednorz</surname> <given-names>D.</given-names></name> <name><surname>Bernholt</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2022</year>). &#x201C;<article-title>Toward learning progression analytics&#x2014;developing learning environments for the automated analysis of learning using evidence centered design</article-title>&#x201D; in <source>Frontiers in education</source>, vol. <volume>605</volume> (<publisher-loc>Lausanne, Switzerland</publisher-loc>: <publisher-name>Frontiers</publisher-name>)</citation></ref>
<ref id="ref89"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kubsch</surname> <given-names>M.</given-names></name> <name><surname>Stamer</surname> <given-names>I.</given-names></name> <name><surname>Steiner</surname> <given-names>M.</given-names></name> <name><surname>Neumann</surname> <given-names>K.</given-names></name> <name><surname>Parchmann</surname> <given-names>I.</given-names></name></person-group> (<year>2021</year>). <article-title>Beyond p-values: Using bayesian data analysis in science education research</article-title>. <source>Pract. Assess. Res. Eval.</source> <volume>26</volume>, <fpage>1</fpage>&#x2013;<lpage>18</lpage>. doi: <pub-id pub-id-type="doi">10.7275/vzpw-ng13</pub-id></citation></ref>
<ref id="ref90"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Laanan</surname> <given-names>F. S.</given-names></name></person-group> (<year>2001</year>). <article-title>Transfer student adjustment</article-title>. <source>New Directions Community Colleges</source> <volume>2001</volume>, <fpage>5</fpage>&#x2013;<lpage>13</lpage>. doi: <pub-id pub-id-type="doi">10.1002/cc.16</pub-id>, PMID: <pub-id pub-id-type="pmid">36564994</pub-id></citation></ref>
<ref id="ref91"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lambert</surname> <given-names>P. C.</given-names></name> <name><surname>Sutton</surname> <given-names>A. J.</given-names></name> <name><surname>Burton</surname> <given-names>P. R.</given-names></name> <name><surname>Abrams</surname> <given-names>K. R.</given-names></name> <name><surname>Jones</surname> <given-names>D. R.</given-names></name></person-group> (<year>2005</year>). <article-title>How vague is vague? A simulation study of the impact of the use of vague prior distributions in MCMC using WinBUGS</article-title>. <source>Stat. Med.</source> <volume>24</volume>, <fpage>2401</fpage>&#x2013;<lpage>2428</lpage>. doi: <pub-id pub-id-type="doi">10.1002/sim.2112</pub-id>, PMID: <pub-id pub-id-type="pmid">16015676</pub-id></citation></ref>
<ref id="ref92"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Lang</surname> <given-names>C.</given-names></name> <name><surname>Siemens</surname> <given-names>G.</given-names></name> <name><surname>Wise</surname> <given-names>A.</given-names></name> <name><surname>Ga&#x0161;evi&#x0107;</surname> <given-names>D.</given-names></name></person-group> (<year>2017</year>). <source>The handbook of learning analytics</source>. <publisher-loc>Beaumont, Alberta, Canada</publisher-loc>: <publisher-name>SOLAR, Society for Learning Analytics and Research</publisher-name>.</citation></ref>
<ref id="ref93"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lasry</surname> <given-names>N.</given-names></name> <name><surname>Rosenfield</surname> <given-names>S.</given-names></name> <name><surname>Dedic</surname> <given-names>H.</given-names></name> <name><surname>Dahan</surname> <given-names>A.</given-names></name> <name><surname>Reshef</surname> <given-names>O.</given-names></name></person-group> (<year>2011</year>). <article-title>The puzzling reliability of the force concept inventory</article-title>. <source>Am. J. Phys.</source> <volume>79</volume>, <fpage>909</fpage>&#x2013;<lpage>912</lpage>. doi: <pub-id pub-id-type="doi">10.1119/1.3602073</pub-id></citation></ref>
<ref id="ref94"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>U. J.</given-names></name> <name><surname>Sbeglia</surname> <given-names>G. C.</given-names></name> <name><surname>Ha</surname> <given-names>M.</given-names></name> <name><surname>Finch</surname> <given-names>S. J.</given-names></name> <name><surname>Nehm</surname> <given-names>R. H.</given-names></name></person-group> (<year>2015</year>). <article-title>Clicker score trajectories and concept inventory scores as predictors for early warning systems for large STEM classes</article-title>. <source>J. Sci. Educ. Technol.</source> <volume>24</volume>, <fpage>848</fpage>&#x2013;<lpage>860</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10956-015-9568-2</pub-id></citation></ref>
<ref id="ref95"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lemoine</surname> <given-names>N. P.</given-names></name></person-group> (<year>2019</year>). <article-title>Moving beyond noninformative priors: why and how to choose weakly informative priors in Bayesian analysis</article-title>. <source>Oikos</source> <volume>128</volume>, <fpage>912</fpage>&#x2013;<lpage>928</lpage>. doi: <pub-id pub-id-type="doi">10.1111/oik.05985</pub-id></citation></ref>
<ref id="ref96"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Leppel</surname> <given-names>K.</given-names></name></person-group> (<year>2002</year>). <article-title>Similarities and differences in the college persistence of men and women</article-title>. <source>Rev. High. Educ.</source> <volume>25</volume>, <fpage>433</fpage>&#x2013;<lpage>450</lpage>. doi: <pub-id pub-id-type="doi">10.1353/rhe.2002.0021</pub-id>, PMID: <pub-id pub-id-type="pmid">36319751</pub-id></citation></ref>
<ref id="ref97"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Pati</surname> <given-names>D.</given-names></name></person-group> (<year>2017</year>). <article-title>Variable selection using shrinkage priors</article-title>. <source>Comput. Stat. Data Anal.</source> <volume>107</volume>, <fpage>107</fpage>&#x2013;<lpage>119</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.csda.2016.10.008</pub-id>, PMID: <pub-id pub-id-type="pmid">36589140</pub-id></citation></ref>
<ref id="ref98"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Liao</surname> <given-names>S. N.</given-names></name> <name><surname>Zingaro</surname> <given-names>D.</given-names></name> <name><surname>Alvarado</surname> <given-names>C.</given-names></name> <name><surname>Griswold</surname> <given-names>W. G.</given-names></name> <name><surname>Porter</surname> <given-names>L.</given-names></name></person-group> (<year>2019</year>). &#x201C;<article-title>Exploring the value of different data sources for predicting student performance in multiple cs courses</article-title>&#x201D; in <source>Proceedings of the 50th ACM technical symposium on computer science education</source>.</citation></ref>
<ref id="ref100"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Lisitsyna</surname> <given-names>L.</given-names></name> <name><surname>Oreshin</surname> <given-names>S. A.</given-names></name></person-group> (<year>2019</year>). &#x201C;<article-title>Machine learning approach of predicting learning outcomes of MOOCs to increase its performance</article-title>&#x201D; in <source>Smart Education and e-Learning 2019</source> (<publisher-loc>New York, United States</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>107</fpage>&#x2013;<lpage>115</lpage>.</citation></ref>
<ref id="ref101"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>O. L.</given-names></name> <name><surname>Rios</surname> <given-names>J. A.</given-names></name> <name><surname>Heilman</surname> <given-names>M.</given-names></name> <name><surname>Gerard</surname> <given-names>L.</given-names></name> <name><surname>Linn</surname> <given-names>M. C.</given-names></name></person-group> (<year>2016</year>). <article-title>Validation of automated scoring of science assessments</article-title>. <source>J. Res. Sci. Teach.</source> <volume>53</volume>, <fpage>215</fpage>&#x2013;<lpage>233</lpage>. doi: <pub-id pub-id-type="doi">10.1002/tea.21299</pub-id>, PMID: <pub-id pub-id-type="pmid">36656126</pub-id></citation></ref>
<ref id="ref102"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>R.</given-names></name> <name><surname>Tan</surname> <given-names>A.</given-names></name></person-group> (<year>2020</year>). <article-title>Towards interpretable automated machine learning for STEM career prediction</article-title>. <source>J. Educ. Data Mining.</source> <volume>12</volume>, <fpage>19</fpage>&#x2013;<lpage>32</lpage>. doi: <pub-id pub-id-type="doi">10.1002/tea.21299</pub-id></citation></ref>
<ref id="ref103"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>L&#x00F3;pez Zambrano</surname> <given-names>J.</given-names></name> <name><surname>Lara Torralbo</surname> <given-names>J. A.</given-names></name> <name><surname>Romero Morales</surname> <given-names>C.</given-names></name></person-group> (<year>2021</year>). <article-title>Early prediction of student learning performance through data mining: a systematic review</article-title>. <source>Psicothema Oviedo.</source> <volume>33</volume>, <fpage>456</fpage>&#x2013;<lpage>465</lpage>. doi: <pub-id pub-id-type="doi">10.7334/psicothema2021.62</pub-id></citation></ref>
<ref id="ref104"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Louhab</surname> <given-names>F. E.</given-names></name> <name><surname>Bahnasse</surname> <given-names>A.</given-names></name> <name><surname>Bensalah</surname> <given-names>F.</given-names></name> <name><surname>Khiat</surname> <given-names>A.</given-names></name> <name><surname>Khiat</surname> <given-names>Y.</given-names></name> <name><surname>Talea</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>Novel approach for adaptive flipped classroom based on learning management system</article-title>. <source>Educ. Inf. Technol.</source> <volume>25</volume>, <fpage>755</fpage>&#x2013;<lpage>773</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10639-019-09994-0</pub-id></citation></ref>
<ref id="ref105"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lunn</surname> <given-names>D. J.</given-names></name> <name><surname>Thomas</surname> <given-names>A.</given-names></name> <name><surname>Best</surname> <given-names>N.</given-names></name> <name><surname>Spiegelhalter</surname> <given-names>D.</given-names></name></person-group> (<year>2000</year>). <article-title>WinBUGS &#x2013; a Bayesian modelling framework: concepts, structure, and extensibility</article-title>. <source>Stat. Comput.</source> <volume>10</volume>, <fpage>325</fpage>&#x2013;<lpage>337</lpage>. doi: <pub-id pub-id-type="doi">10.1023/A:1008929526011</pub-id></citation></ref>
<ref id="ref107"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mao</surname> <given-names>Y.</given-names></name> <name><surname>Lin</surname> <given-names>C.</given-names></name> <name><surname>Chi</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>Deep Learning vs. Bayesian Knowledge Tracing: Student Models for Interventions</article-title>. <source>J. Educ. Data Mining</source> <volume>10</volume>, <fpage>28</fpage>&#x2013;<lpage>54</lpage>. doi: <pub-id pub-id-type="doi">10.5281/zenodo.3554691</pub-id></citation></ref>
<ref id="ref108"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marshall</surname> <given-names>A.</given-names></name> <name><surname>Altman</surname> <given-names>D. G.</given-names></name> <name><surname>Holder</surname> <given-names>R. L.</given-names></name></person-group> (<year>2010</year>). <article-title>Comparison of imputation methods for handling missing covariate data when fitting a cox proportional hazards model: a resampling study</article-title>. <source>BMC Med. Res. Methodol.</source> <volume>10</volume>, <fpage>1</fpage>&#x2013;<lpage>10</lpage>. doi: <pub-id pub-id-type="doi">10.1186/1471-2288-10-112</pub-id></citation></ref>
<ref id="ref109"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martin</surname> <given-names>J. M.</given-names></name></person-group> (<year>2017</year>). <article-title>It just didn&#x2019;t work out: Examining nonreturning students&#x2019; stories about their freshman experience</article-title>. <source>J. College Stud. Retention: Res. Theory Pract.</source> <volume>19</volume>, <fpage>176</fpage>&#x2013;<lpage>198</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1521025115611670</pub-id></citation></ref>
<ref id="ref110"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martinez</surname> <given-names>A. J.</given-names></name></person-group> (<year>2021</year>). <article-title>Factor structure and measurement invariance of the academic time management and procrastination measure</article-title>. <source>J. Psychoeduc. Assess.</source> <volume>39</volume>, <fpage>891</fpage>&#x2013;<lpage>901</lpage>. doi: <pub-id pub-id-type="doi">10.1177/07342829211034252</pub-id></citation></ref>
<ref id="ref111"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McArdle</surname> <given-names>J. J.</given-names></name> <name><surname>Grimm</surname> <given-names>K. J.</given-names></name> <name><surname>Hamagami</surname> <given-names>F.</given-names></name> <name><surname>Bowles</surname> <given-names>R. P.</given-names></name> <name><surname>Meredith</surname> <given-names>W.</given-names></name></person-group> (<year>2009</year>). <article-title>Modeling life-span growth curves of cognition using longitudinal data with multiple samples and changing scales of measurement</article-title>. <source>Psychol. Methods</source> <volume>14</volume>, <fpage>126</fpage>&#x2013;<lpage>149</lpage>. doi: <pub-id pub-id-type="doi">10.1037/a0015857</pub-id>, PMID: <pub-id pub-id-type="pmid">19485625</pub-id></citation></ref>
<ref id="ref112"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McArthur</surname> <given-names>D.</given-names></name> <name><surname>Lewis</surname> <given-names>M.</given-names></name> <name><surname>Bishary</surname> <given-names>M.</given-names></name></person-group> (<year>2005</year>). <article-title>The roles of artificial intelligence in education: current progress and future prospects</article-title>. <source>J. Educ. Technol.</source> <volume>1</volume>, <fpage>42</fpage>&#x2013;<lpage>80</lpage>. doi: <pub-id pub-id-type="doi">10.26634/jet.1.4.972</pub-id></citation></ref>
<ref id="ref113"><citation citation-type="book"><person-group person-group-type="author"><name><surname>McCarthy</surname> <given-names>M. A.</given-names></name></person-group> (<year>2007</year>). <source>Bayesian methods for ecology</source>. <publisher-loc>Cambridge, England</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="ref114"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCarthy</surname> <given-names>M.</given-names></name> <name><surname>Kuh</surname> <given-names>G. D.</given-names></name></person-group> (<year>2006</year>). <article-title>Are students ready for college? What student engagement data say</article-title>. <source>Phi Delta Kappan.</source> <volume>87</volume>, <fpage>664</fpage>&#x2013;<lpage>669</lpage>. doi: <pub-id pub-id-type="doi">10.1177/003172170608700909</pub-id></citation></ref>
<ref id="ref115"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McCarthy</surname> <given-names>M. A.</given-names></name> <name><surname>Masters</surname> <given-names>P. I.</given-names></name></person-group> (<year>2005</year>). <article-title>Profiting from prior information in Bayesian analyses of ecological data</article-title>. <source>J. Appl. Ecol.</source> <volume>42</volume>, <fpage>1012</fpage>&#x2013;<lpage>1019</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1365-2664.2005.01101.x</pub-id></citation></ref>
<ref id="ref116"><citation citation-type="book"><person-group person-group-type="author"><name><surname>McElreath</surname> <given-names>R.</given-names></name></person-group> (<year>2018</year>). <source>Statistical rethinking: a bayesian course with examples in R and stan</source>. <publisher-loc>Chapman</publisher-loc>; <publisher-name>Hall/CRC</publisher-name>.</citation></ref>
<ref id="ref117"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moharreri</surname> <given-names>K.</given-names></name> <name><surname>Ha</surname> <given-names>M.</given-names></name> <name><surname>Nehm</surname> <given-names>R. H.</given-names></name></person-group> (<year>2014</year>). <article-title>EvoGrader: an online formative assessment tool for automatically evaluating written evolutionary explanations</article-title>. <source>Evol.: Educ. Outreach.</source> <volume>7</volume>, <fpage>1</fpage>&#x2013;<lpage>14</lpage>. doi: <pub-id pub-id-type="doi">10.1186/s12052-014-0015-2</pub-id></citation></ref>
<ref id="ref118"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Musso</surname> <given-names>M. F.</given-names></name> <name><surname>Hern&#x00E1;ndez</surname> <given-names>C. F. R.</given-names></name> <name><surname>Cascallar</surname> <given-names>E. C.</given-names></name></person-group> (<year>2020</year>). <article-title>Predicting key educational outcomes in academic trajectories: a machine-learning approach</article-title>. <source>High. Educ.</source> <volume>80</volume>, <fpage>875</fpage>&#x2013;<lpage>894</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10734-020-00520-7</pub-id></citation></ref>
<ref id="ref119"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Musso</surname> <given-names>M. F.</given-names></name> <name><surname>Kyndt</surname> <given-names>E.</given-names></name> <name><surname>Cascallar</surname> <given-names>E. C.</given-names></name> <name><surname>Dochy</surname> <given-names>F.</given-names></name></person-group> (<year>2013</year>). <article-title>Predicting general academic performance and identifying the differential contribution of participating variables using artificial neural networks</article-title>. <source>Frontline Learn. Res.</source> <volume>1</volume>, <fpage>42</fpage>&#x2013;<lpage>71</lpage>. doi: <pub-id pub-id-type="doi">10.14786/flr.v1i1.13</pub-id></citation></ref>
<ref id="ref120"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Muth</surname> <given-names>C.</given-names></name> <name><surname>Oravecz</surname> <given-names>Z.</given-names></name> <name><surname>Gabry</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>User-friendly Bayesian regression modeling: a tutorial with rstanarm and shinystan</article-title>. <source>Quant. Methods Psychol.</source> <volume>14</volume>, <fpage>99</fpage>&#x2013;<lpage>119</lpage>. doi: <pub-id pub-id-type="doi">10.20982/tqmp.14.2.p099</pub-id></citation></ref>
<ref id="ref121"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nawaz</surname> <given-names>R.</given-names></name> <name><surname>Sun</surname> <given-names>Q.</given-names></name> <name><surname>Shardlow</surname> <given-names>M.</given-names></name> <name><surname>Kontonatsios</surname> <given-names>G.</given-names></name> <name><surname>Aljohani</surname> <given-names>N. R.</given-names></name> <name><surname>Visvizi</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2022</year>). <article-title>Leveraging AI and machine learning for national student survey: actionable insights from textual feedback to enhance quality of teaching and learning in UK&#x2019;s higher education</article-title>. <source>Appl. Sci.</source> <volume>12</volume>:<fpage>514</fpage>. doi: <pub-id pub-id-type="doi">10.3390/app12010514</pub-id></citation></ref>
<ref id="ref122"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Neal</surname> <given-names>R. M.</given-names></name></person-group> (<year>2004</year>). <source>Bayesian methods for machine learning</source> <publisher-name>NIPS Tutorial</publisher-name>. Available at: <ext-link xlink:href="https://www.cs.toronto.edu/radford/ftp/bayes-tut.pdf" ext-link-type="uri">https://www.cs.toronto.edu/radford/ftp/bayes-tut.pdf</ext-link> (Accessed January 28, 2023).</citation></ref>
<ref id="ref123"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nehm</surname> <given-names>R. H.</given-names></name></person-group> (<year>2019</year>). <article-title>Biology education research: building integrative frameworks for teaching and learning about living systems</article-title>. <source>Discip. Interdiscip. Sci. Educ. Res.</source> <volume>1</volume>, <fpage>1</fpage>&#x2013;<lpage>18</lpage>. doi: <pub-id pub-id-type="doi">10.1186/s43031-019-0017-6</pub-id></citation></ref>
<ref id="ref124"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nehm</surname> <given-names>R. H.</given-names></name> <name><surname>Beggrow</surname> <given-names>E. P.</given-names></name> <name><surname>Opfer</surname> <given-names>J. E.</given-names></name> <name><surname>Ha</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>Reasoning about natural selection: diagnosing contextual competency using the ACORNS instrument</article-title>. <source>Am. Biol. Teach.</source> <volume>74</volume>, <fpage>92</fpage>&#x2013;<lpage>98</lpage>. doi: <pub-id pub-id-type="doi">10.1525/abt.2012.74.2.6</pub-id></citation></ref>
<ref id="ref125"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nehm</surname> <given-names>R. H.</given-names></name> <name><surname>Finch</surname> <given-names>S. J.</given-names></name> <name><surname>Sbeglia</surname> <given-names>G. C.</given-names></name></person-group> (<year>2022</year>). <article-title>Is active learning enough? The contributions of misconception-focused instruction and active-learning dosage on student learning of evolution</article-title>. <source>Bioscience</source> <volume>72</volume>, <fpage>1105</fpage>&#x2013;<lpage>1117</lpage>. doi: <pub-id pub-id-type="doi">10.1093/biosci/biac073</pub-id></citation></ref>
<ref id="ref126"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nieuwoudt</surname> <given-names>J. E.</given-names></name></person-group> (<year>2020</year>). <article-title>Investigating synchronous and asynchronous class attendance as predictors of academic success in online education</article-title>. <source>Australas. J. Educ. Technol.</source> <volume>36</volume>, <fpage>15</fpage>&#x2013;<lpage>25</lpage>. doi: <pub-id pub-id-type="doi">10.14742/ajet.5137</pub-id></citation></ref>
<ref id="ref127"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Nouri</surname> <given-names>J.</given-names></name> <name><surname>Saqr</surname> <given-names>M.</given-names></name> <name><surname>Fors</surname> <given-names>U.</given-names></name></person-group> (<year>2019</year>). &#x201C;<article-title>Predicting performance of students in a flipped classroom using machine learning: towards automated data-driven formative feedback</article-title>&#x201D; in <source>10th International conference on education, training and informatics (ICETI 2019)</source> <volume>17</volume>, <fpage>17</fpage>&#x2013;<lpage>21</lpage>.</citation></ref>
<ref id="ref129"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Orr</surname> <given-names>R.</given-names></name> <name><surname>Foster</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <article-title>Increasing student success using online quizzing in introductory (majors) biology</article-title>. <source>CBE&#x2013;Life Sci. Educ.</source> <volume>12</volume>, <fpage>509</fpage>&#x2013;<lpage>514</lpage>. doi: <pub-id pub-id-type="doi">10.1187/cbe.12-10-0183</pub-id>, PMID: <pub-id pub-id-type="pmid">24006398</pub-id></citation></ref>
<ref id="ref130"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ortiz-Lozano</surname> <given-names>J. M.</given-names></name> <name><surname>Rua-Vieites</surname> <given-names>A.</given-names></name> <name><surname>Bilbao-Calabuig</surname> <given-names>P.</given-names></name> <name><surname>Casades&#x00FA;s-Fa</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>University student retention: Best time and data to identify undergraduate students at risk of dropout</article-title>. <source>Innov. Educ. Teach. Int.</source> <volume>57</volume>, <fpage>1</fpage>&#x2013;<lpage>12</lpage>. doi: <pub-id pub-id-type="doi">10.1080/14703297.2018.1502090</pub-id></citation></ref>
<ref id="ref132"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paquette</surname> <given-names>L.</given-names></name> <name><surname>Ocumpaugh</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Andres</surname> <given-names>A.</given-names></name> <name><surname>Baker</surname> <given-names>R.</given-names></name></person-group> (<year>2020</year>). <article-title>Who&#x2019;s learning? Using demographics in EDM research</article-title>. <source>J. Educ. Data Mining.</source> <volume>12</volume>, <fpage>1</fpage>&#x2013;<lpage>30</lpage>. doi: <pub-id pub-id-type="doi">10.5281/zenodo.4143612</pub-id></citation></ref>
<ref id="ref133"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Pardos</surname> <given-names>Z.</given-names></name> <name><surname>Heffernan</surname> <given-names>N.</given-names></name> <name><surname>Ruiz</surname> <given-names>C.</given-names></name> <name><surname>Beck</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). &#x201C;<article-title>The composite effect: Conjuntive or compensatory? An analysis of multi-skill math questions in ITS</article-title>&#x201D; in <conf-name>Proceedings of the 1st International Conference on Educational Data Mining</conf-name>. <conf-loc>Montreal, Canada</conf-loc>, <fpage>147</fpage>&#x2013;<lpage>156</lpage>.</citation></ref>
<ref id="ref134"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parkin</surname> <given-names>J. R.</given-names></name> <name><surname>Wang</surname> <given-names>Z.</given-names></name></person-group> (<year>2021</year>). <article-title>Confirmatory factor analysis of the WIAT-III in a referral sample</article-title>. <source>Psychol. Sch.</source> <volume>58</volume>, <fpage>837</fpage>&#x2013;<lpage>852</lpage>. doi: <pub-id pub-id-type="doi">10.1002/pits.22474</pub-id></citation></ref>
<ref id="ref135"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pe&#x00F1;a-Ayala</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Educational data mining: a survey and a data mining-based analysis of recent works</article-title>. <source>Expert Syst. Appl.</source> <volume>41</volume>, <fpage>1432</fpage>&#x2013;<lpage>1462</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.eswa.2013.08.042</pub-id></citation></ref>
<ref id="ref136"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Penprase</surname> <given-names>B. E.</given-names></name></person-group> (<year>2020</year>). &#x201C;<article-title>History of STEM in the USA</article-title>&#x201D; in <source>STEM education for the 21st century</source>. (<publisher-loc>New York, United States</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>1</fpage>&#x2013;<lpage>16</lpage>.</citation></ref>
<ref id="ref137"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Perez</surname> <given-names>J. G.</given-names></name> <name><surname>Perez</surname> <given-names>E. S.</given-names></name></person-group> (<year>2021</year>). <article-title>Predicting student program completion using Na&#x00EF;ve Bayes classification algorithm</article-title>. <source>Int. J. Modern Educ. Comput. Sci.</source> <volume>13</volume>, <fpage>57</fpage>&#x2013;<lpage>67</lpage>. doi: <pub-id pub-id-type="doi">10.5815/ijmecs.2021.03.05</pub-id></citation></ref>
<ref id="ref138"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Plummer</surname> <given-names>M.</given-names></name></person-group> (<year>2003</year>). &#x201C;<article-title>JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling</article-title>&#x201D; in <source>Proceedings of the 3rd International Workshop on Distributed Statistical Computing</source>. <volume>124</volume>, <fpage>1</fpage>&#x2013;<lpage>10</lpage>.</citation></ref>
<ref id="ref139"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Plummer</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <italic>rjags: Bayesian graphical models using MCMC</italic>. Available at: <ext-link xlink:href="https://CRAN.R-project.org/package=rjags" ext-link-type="uri">https://CRAN.R-project.org/package=rjags</ext-link> (Accessed October 7, 2022).</citation></ref>
<ref id="ref141"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richards</surname> <given-names>S. A.</given-names></name></person-group> (<year>2005</year>). <article-title>Testing ecological theory using the information-theoretic approach: examples and cautionary results</article-title>. <source>Ecology</source> <volume>86</volume>, <fpage>2805</fpage>&#x2013;<lpage>2814</lpage>. doi: <pub-id pub-id-type="doi">10.1890/05-0074</pub-id></citation></ref>
<ref id="ref142"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roll</surname> <given-names>I.</given-names></name> <name><surname>Wylie</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>Evolution and revolution in artificial intelligence in education</article-title>. <source>Int. J. Artif. Intell. Educ.</source> <volume>26</volume>, <fpage>582</fpage>&#x2013;<lpage>599</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s40593-016-0110-3</pub-id>, PMID: <pub-id pub-id-type="pmid">36300397</pub-id></citation></ref>
<ref id="ref143"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Romero</surname> <given-names>C.</given-names></name> <name><surname>Ventura</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Educational data mining and learning analytics: an updated survey</article-title>. <source>Wiley Interdiscip. Rev.: Data Min. Knowl. Discovery.</source> <volume>10</volume>:<fpage>e1355</fpage>. doi: <pub-id pub-id-type="doi">10.1002/9781118956588.ch16</pub-id></citation></ref>
<ref id="ref144"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rudin</surname> <given-names>C.</given-names></name></person-group> (<year>2019</year>). <article-title>Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead</article-title>. <source>Nat. Mach. Intell.</source> <volume>1</volume>, <fpage>206</fpage>&#x2013;<lpage>215</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s42256-019-0048-x</pub-id>, PMID: <pub-id pub-id-type="pmid">35603010</pub-id></citation></ref>
<ref id="ref145"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Russell</surname> <given-names>S. J.</given-names></name></person-group> (<year>2010</year>). <source>Artificial intelligence: a modern approach</source>. <publisher-loc>Essex, England</publisher-loc>: <publisher-name>Pearson Education, Inc</publisher-name>.</citation></ref>
<ref id="ref146"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sailer</surname> <given-names>M.</given-names></name> <name><surname>Homner</surname> <given-names>L.</given-names></name></person-group> (<year>2020</year>). <article-title>The gamification of learning: a meta-analysis</article-title>. <source>Educ. Psychol. Rev.</source> <volume>32</volume>, <fpage>77</fpage>&#x2013;<lpage>112</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10648-019-09498-w</pub-id>, PMID: <pub-id pub-id-type="pmid">35917707</pub-id></citation></ref>
<ref id="ref147"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Salehi</surname> <given-names>S.</given-names></name> <name><surname>Burkholder</surname> <given-names>E.</given-names></name> <name><surname>Lepage</surname> <given-names>G. P.</given-names></name> <name><surname>Pollock</surname> <given-names>S.</given-names></name> <name><surname>Wieman</surname> <given-names>C.</given-names></name></person-group> (<year>2019</year>). <article-title>Demographic gaps or preparation gaps?: The large impact of incoming preparation on performance of students in introductory physics</article-title>. <source>Phys. Rev. Phys. Educ. Res.</source> <volume>15</volume>:<fpage>020114</fpage>. doi: <pub-id pub-id-type="doi">10.1103/PhysRevPhysEducRes.15.020114</pub-id>, PMID: <pub-id pub-id-type="pmid">36655389</pub-id></citation></ref>
<ref id="ref149"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shafiq</surname> <given-names>D. A.</given-names></name> <name><surname>Marjani</surname> <given-names>M.</given-names></name> <name><surname>Habeeb</surname> <given-names>R. A. A.</given-names></name> <name><surname>Asirvatham</surname> <given-names>D.</given-names></name></person-group> (<year>2022</year>). <article-title>Student retention using educational data mining and predictive analytics: a systematic literature review</article-title>. <source>IEEE Access.</source> <volume>10</volume>, <fpage>72480</fpage>&#x2013;<lpage>72503</lpage>. doi: <pub-id pub-id-type="doi">10.1109/ACCESS.2022.3188767</pub-id></citation></ref>
<ref id="ref150"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shahiri</surname> <given-names>A. M.</given-names></name> <name><surname>Husain</surname> <given-names>W.</given-names></name></person-group> (<year>2015</year>). <article-title>A review on predicting student&#x2019;s performance using data mining techniques</article-title>. <source>Procedia Comput. Sci.</source> <volume>72</volume>, <fpage>414</fpage>&#x2013;<lpage>422</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.procs.2015.12.157</pub-id>, PMID: <pub-id pub-id-type="pmid">36571084</pub-id></citation></ref>
<ref id="ref151"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shaw</surname> <given-names>S. T.</given-names></name> <name><surname>Spink</surname> <given-names>K.</given-names></name> <name><surname>Chin-Newman</surname> <given-names>C.</given-names></name></person-group> (<year>2019</year>). <article-title>&#x201C;Do I really belong here?&#x201D;: The stigma of being a community college transfer student at a four-year university</article-title>. <source>Community Coll. J. Res. Pract.</source> <volume>43</volume>, <fpage>657</fpage>&#x2013;<lpage>660</lpage>. doi: <pub-id pub-id-type="doi">10.1080/10668926.2018.1528907</pub-id></citation></ref>
<ref id="ref152"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shayan</surname> <given-names>P.</given-names></name> <name><surname>van Zaanen</surname> <given-names>M.</given-names></name></person-group> (<year>2019</year>). <article-title>Predicting student performance from their behavior in learning management systems</article-title>. <source>Int. J. Inf. Educ. Technol.</source> <volume>9</volume>, <fpage>337</fpage>&#x2013;<lpage>341</lpage>. doi: <pub-id pub-id-type="doi">10.18178/ijiet.2019.9.5.1223</pub-id>, PMID: <pub-id pub-id-type="pmid">36566626</pub-id></citation></ref>
<ref id="ref153"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simmons</surname> <given-names>A. B.</given-names></name> <name><surname>Heckler</surname> <given-names>A. F.</given-names></name></person-group> (<year>2020</year>). <article-title>Grades, grade component weighting, and demographic disparities in introductory physics</article-title>. <source>Phys. Rev. Phys. Educ. Res.</source> <volume>16</volume>:<fpage>020125</fpage>. doi: <pub-id pub-id-type="doi">10.1103/PhysRevPhysEducRes.16.020125</pub-id></citation></ref>
<ref id="ref154"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sin</surname> <given-names>K.</given-names></name> <name><surname>Muthu</surname> <given-names>L.</given-names></name></person-group> (<year>2015</year>). <article-title>Application of big data in educational data mining and learning analytics &#x2013; a literature review</article-title>. <source>ICTACT J. Soft Comput.</source> <volume>5</volume>, <fpage>1035</fpage>&#x2013;<lpage>1049</lpage>. doi: <pub-id pub-id-type="doi">10.21917/ijsc.2015.0145</pub-id>, PMID: <pub-id pub-id-type="pmid">34398394</pub-id></citation></ref>
<ref id="ref155"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spiegelhalter</surname> <given-names>D. J.</given-names></name> <name><surname>Best</surname> <given-names>N. G.</given-names></name> <name><surname>Carlin</surname> <given-names>B. P.</given-names></name> <name><surname>Van Der Linde</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>The deviance information criterion: 12 years on</article-title>. <source>J. R. Stat. Soc.: Ser. B (Statistical Methodology).</source> <volume>76</volume>, <fpage>485</fpage>&#x2013;<lpage>493</lpage>. doi: <pub-id pub-id-type="doi">10.1111/rssb.12062</pub-id>, PMID: <pub-id pub-id-type="pmid">33002963</pub-id></citation></ref>
<ref id="ref156"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spiegelhalter</surname> <given-names>D. J.</given-names></name> <name><surname>Myles</surname> <given-names>J. P.</given-names></name> <name><surname>Jones</surname> <given-names>D. R.</given-names></name> <name><surname>Abrams</surname> <given-names>K. R.</given-names></name></person-group> (<year>1999</year>). <article-title>An introduction to Bayesian methods in health technology assessment</article-title>. <source>Br. Med. J.</source> <volume>319</volume>, <fpage>508</fpage>&#x2013;<lpage>512</lpage>. doi: <pub-id pub-id-type="doi">10.1136/bmj.319.7208.508</pub-id>, PMID: <pub-id pub-id-type="pmid">10454409</pub-id></citation></ref>
<ref id="ref157"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Stamper</surname> <given-names>J.</given-names></name> <name><surname>Koedinger</surname> <given-names>K.</given-names></name> <name><surname>McLaughlin</surname> <given-names>E.</given-names></name></person-group> (<year>2013</year>). &#x201C;<article-title>A comparison of model selection metrics in Datashop</article-title>&#x201D; in <source>Proceedings of the 6th International Conference on Educational Data Mining</source>.</citation></ref>
<ref id="ref158"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stephens</surname> <given-names>P. A.</given-names></name> <name><surname>Buskirk</surname> <given-names>S. W.</given-names></name> <name><surname>del Rio</surname> <given-names>C. M.</given-names></name></person-group> (<year>2007</year>). <article-title>Inference in ecology and evolution</article-title>. <source>Trends Ecol. Evol.</source> <volume>22</volume>, <fpage>192</fpage>&#x2013;<lpage>197</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.tree.2006.12.003</pub-id>, PMID: <pub-id pub-id-type="pmid">36646949</pub-id></citation></ref>
<ref id="ref159"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Subbiah</surname> <given-names>M.</given-names></name> <name><surname>Srinivasan</surname> <given-names>M. R.</given-names></name> <name><surname>Shanthi</surname> <given-names>S.</given-names></name></person-group> (<year>2011</year>). <article-title>Revisiting higher education data analysis: a Bayesian perspective</article-title>. <source>Int. J. Sci. Technol. Educ. Res.</source> <volume>2</volume>, <fpage>32</fpage>&#x2013;<lpage>38</lpage>. doi: <pub-id pub-id-type="doi">10.5897/IJSTER.9000027</pub-id></citation></ref>
<ref id="ref160"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tebbs</surname> <given-names>J.</given-names></name> <name><surname>Turner</surname> <given-names>S.</given-names></name></person-group> (<year>2005</year>). <article-title>Low-income students: a caution about using data on Pell grant recipients</article-title>. <source>Change Mag. Higher Learn.</source> <volume>37</volume>, <fpage>34</fpage>&#x2013;<lpage>43</lpage>. doi: <pub-id pub-id-type="doi">10.3200/CHNG.37.4.34-43</pub-id></citation></ref>
<ref id="ref161"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname> <given-names>E. H.</given-names></name> <name><surname>Galambos</surname> <given-names>N.</given-names></name></person-group> (<year>2004</year>). <article-title>What satisfies students? Mining student-opinion data with regression and decision tree analysis</article-title>. <source>Res. High. Educ.</source> <volume>45</volume>, <fpage>251</fpage>&#x2013;<lpage>269</lpage>. doi: <pub-id pub-id-type="doi">10.1023/B:RIHE.0000019589.79439.6e</pub-id></citation></ref>
<ref id="ref162"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname> <given-names>D. T.</given-names></name> <name><surname>Walsh</surname> <given-names>E. T.</given-names></name> <name><surname>Torr</surname> <given-names>B. M.</given-names></name> <name><surname>Alvarez</surname> <given-names>A. S.</given-names></name> <name><surname>Malagon</surname> <given-names>M. C.</given-names></name></person-group> (<year>2018</year>). <article-title>Incorporating high-impact practices for retention: a learning community model for transfer students</article-title>. <source>J. College Stud. Retention: Res. Theory Pract.</source> <volume>23</volume>, <fpage>243</fpage>&#x2013;<lpage>263</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1521025118813618</pub-id></citation></ref>
<ref id="ref163"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Tinto</surname> <given-names>V.</given-names></name></person-group> (<year>1987</year>). <source>Leaving college: rethinking the causes and cures of student attrition</source>. <publisher-loc>Chicago, Illinois, United States</publisher-loc>: <publisher-name>The University of Chicago Press</publisher-name>.</citation></ref>
<ref id="ref164"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tsiakmaki</surname> <given-names>M.</given-names></name> <name><surname>Kostopoulos</surname> <given-names>G.</given-names></name> <name><surname>Kotsiantis</surname> <given-names>S.</given-names></name> <name><surname>Ragos</surname> <given-names>O.</given-names></name></person-group> (<year>2020</year>). <article-title>Transfer learning from deep neural networks for predicting student performance</article-title>. <source>Appl. Sci.</source> <volume>10</volume>:<fpage>2145</fpage>. doi: <pub-id pub-id-type="doi">10.3390/app10062145</pub-id>, PMID: <pub-id pub-id-type="pmid">36588380</pub-id></citation></ref>
<ref id="ref165"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Buuren</surname> <given-names>S.</given-names></name> <name><surname>Groothuis-Oudshoorn</surname> <given-names>K.</given-names></name></person-group> (<year>2011</year>). <article-title>mice: multivariate imputation by chained equations in R</article-title>. <source>J. Stat. Softw.</source> <volume>45</volume>, <fpage>1</fpage>&#x2013;<lpage>67</lpage>. doi: <pub-id pub-id-type="doi">10.18637/jss.v045.i03</pub-id></citation></ref>
<ref id="ref166"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Camp</surname> <given-names>L. S. C.</given-names></name> <name><surname>Sabbe</surname> <given-names>B. G. C.</given-names></name> <name><surname>Oldenburg</surname> <given-names>J. F. E.</given-names></name></person-group> (<year>2017</year>). <article-title>Cognitive insight; a systematic review</article-title>. <source>Clin. Psychol. Rev.</source> <volume>55</volume>, <fpage>12</fpage>&#x2013;<lpage>24</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.cpr.2017.04.011</pub-id>, PMID: <pub-id pub-id-type="pmid">28478270</pub-id></citation></ref>
<ref id="ref167"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van de Sande</surname> <given-names>B.</given-names></name></person-group> (<year>2013</year>). <article-title>Properties of the Bayesian knowledge tracing model</article-title>. <source>J. Educ. Data Min.</source> <volume>5</volume>, <fpage>1</fpage>&#x2013;<lpage>10</lpage>. doi: <pub-id pub-id-type="doi">10.5281/zenodo.3554629</pub-id></citation></ref>
<ref id="ref168"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van de Schoot</surname> <given-names>R.</given-names></name> <name><surname>Depaoli</surname> <given-names>S.</given-names></name> <name><surname>King</surname> <given-names>R.</given-names></name> <name><surname>Kramer</surname> <given-names>B.</given-names></name> <name><surname>M&#x00E4;rtens</surname> <given-names>K.</given-names></name> <name><surname>Tadesse</surname> <given-names>M. G.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Bayesian statistics and modelling</article-title>. <source>Nat. Rev. Methods Primers</source> <volume>1</volume>, <fpage>1</fpage>&#x2013;<lpage>26</lpage>. doi: <pub-id pub-id-type="doi">10.1038/s43586-020-00001-2</pub-id></citation></ref>
<ref id="ref169"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van de Schoot</surname> <given-names>R.</given-names></name> <name><surname>Kaplan</surname> <given-names>D.</given-names></name> <name><surname>Denissen</surname> <given-names>J.</given-names></name> <name><surname>Asendorpf</surname> <given-names>J. B.</given-names></name> <name><surname>Neyer</surname> <given-names>F. J.</given-names></name> <name><surname>Van Aken</surname> <given-names>M. A.</given-names></name></person-group> (<year>2014</year>). <article-title>A gentle introduction to Bayesian analysis: applications to development research</article-title>. <source>Child Dev.</source> <volume>85</volume>, <fpage>842</fpage>&#x2013;<lpage>860</lpage>. doi: <pub-id pub-id-type="doi">10.1111/cdev.12169</pub-id>, PMID: <pub-id pub-id-type="pmid">24116396</pub-id></citation></ref>
<ref id="ref170"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van den Bergh</surname> <given-names>D.</given-names></name> <name><surname>Clyde</surname> <given-names>M. A.</given-names></name> <name><surname>Gupta</surname> <given-names>A. R. K. N.</given-names></name> <name><surname>de Jong</surname> <given-names>T.</given-names></name> <name><surname>Gronau</surname> <given-names>Q. F.</given-names></name> <name><surname>Marsman</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>A tutorial on Bayesian multi-model linear regression with BAS and JASP</article-title>. <source>Behav. Res. Methods</source> <volume>53</volume>, <fpage>1</fpage>&#x2013;<lpage>21</lpage>. doi: <pub-id pub-id-type="doi">10.3758/s13428-021-01552-2</pub-id></citation></ref>
<ref id="ref171"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Erp</surname> <given-names>S.</given-names></name> <name><surname>Oberski</surname> <given-names>D. L.</given-names></name> <name><surname>Mulder</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Shrinkage priors for Bayesian penalized regression</article-title>. <source>J. Math. Psychol.</source> <volume>89</volume>, <fpage>31</fpage>&#x2013;<lpage>50</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.jmp.2018.12.004</pub-id>, PMID: <pub-id pub-id-type="pmid">35412893</pub-id></citation></ref>
<ref id="ref172"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Van Zyl</surname> <given-names>D.</given-names></name></person-group> (<year>2015</year>). <italic>Introduction to statistics for institutional research. Southern African Association for Institutional Research</italic>. Available at: <ext-link xlink:href="https://www.saair-web.co.za/wp-content/uploads/2015/08/5-SAAIR-IR-Foundations-Intro-to-stats.pdf" ext-link-type="uri">https://www.saair-web.co.za/wp-content/uploads/2015/08/5-SAAIR-IR-Foundations-Intro-to-stats.pdf</ext-link> (Accessed October 7, 2022).</citation></ref>
<ref id="ref173"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vandenewaetere</surname> <given-names>M.</given-names></name> <name><surname>Desmet</surname> <given-names>P.</given-names></name> <name><surname>Clarebout</surname> <given-names>G.</given-names></name></person-group> (<year>2011</year>). <article-title>The contribution of learner characteristics in the development of computer-based adaptive learning environments</article-title>. <source>Comput. Hum. Behav.</source> <volume>27</volume>, <fpage>118</fpage>&#x2013;<lpage>130</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.chb.2010.07.038</pub-id></citation></ref>
<ref id="ref174"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vaziri</surname> <given-names>S.</given-names></name> <name><surname>Vaziri</surname> <given-names>B.</given-names></name> <name><surname>Novoa</surname> <given-names>L. J.</given-names></name> <name><surname>Torabi</surname> <given-names>E.</given-names></name></person-group> (<year>2021</year>). <article-title>Academic motivation in introductory business analytics courses: a Bayesian approach</article-title>. <source>INFORMS Trans. Educ.</source> <volume>22</volume>, <fpage>121</fpage>&#x2013;<lpage>129</lpage>. doi: <pub-id pub-id-type="doi">10.1287/ited.2021.0247</pub-id></citation></ref>
<ref id="ref175"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Virdyanawaty</surname> <given-names>R. I.</given-names></name> <name><surname>Mansur</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). &#x201C;<article-title>Drop out estimation students based on the study period: comparison between naive bayes and support vector machines algorithm methods</article-title>&#x201D; in <source>IOP conference series: materials science and engineering</source>. <publisher-loc>Bristol, England</publisher-loc>: <publisher-name>IOP Publishing</publisher-name>. <volume>15</volume>, <fpage>012039</fpage>.</citation></ref>
<ref id="ref176"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>F. H.</given-names></name></person-group> (<year>2017</year>). <article-title>An exploration of online behaviour engagement and achievement in flipped classroom supported by learning management system</article-title>. <source>Comput. Educ.</source> <volume>114</volume>, <fpage>79</fpage>&#x2013;<lpage>91</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.compedu.2017.06.012</pub-id></citation></ref>
<ref id="ref177"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>X.</given-names></name></person-group> (<year>2018</year>). <source><italic>Longitudinal learning dynamics and the conceptual restructuring of evolutionary understanding</italic> [Dissertation]</source>. <publisher-loc>Stony Brook (NY)</publisher-loc>: <publisher-name>Stony Brook, New York</publisher-name>.</citation></ref>
<ref id="ref178"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Stein</surname> <given-names>D.</given-names></name> <name><surname>Liu</surname> <given-names>Q.</given-names></name> <name><surname>Chen</surname> <given-names>W.</given-names></name></person-group> (<year>2021</year>). <article-title>The structure of Chinese beginning online instructors&#x2019; competencies: evidence from Bayesian factor analysis</article-title>. <source>J. Comput. Educ.</source> <volume>8</volume>, <fpage>411</fpage>&#x2013;<lpage>440</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s40692-021-00186-9</pub-id></citation></ref>
<ref id="ref179"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ward</surname> <given-names>E. J.</given-names></name></person-group> (<year>2008</year>). <article-title>A review and comparison of four commonly used Bayesian and maximum likelihood model selection tools</article-title>. <source>Ecol. Model.</source> <volume>211</volume>, <fpage>1</fpage>&#x2013;<lpage>10</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.ecolmodel.2007.10.030</pub-id></citation></ref>
<ref id="ref180"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Wen</surname> <given-names>D.</given-names></name> <name><surname>Lin</surname> <given-names>F.</given-names></name></person-group> (<year>2008</year>). &#x201C;<article-title>Ways and means of employing AI technology in e-learning systems</article-title>&#x201D; in <source>2008 Eighth IEEE International Conference on Advanced Learning Technologies. (IEEE)</source>, <fpage>1005</fpage>&#x2013;<lpage>1006</lpage>.</citation></ref>
<ref id="ref181"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xiao</surname> <given-names>W.</given-names></name> <name><surname>Ji</surname> <given-names>P.</given-names></name> <name><surname>Hu</surname> <given-names>J.</given-names></name></person-group> (<year>2022</year>). <article-title>A survey on educational data mining methods used for predicting students&#x2019; performance</article-title>. <source>Eng. Rep.</source> <volume>4</volume>:<fpage>e12482</fpage>. doi: <pub-id pub-id-type="doi">10.1002/eng2.12482</pub-id>, PMID: <pub-id pub-id-type="pmid">36106178</pub-id></citation></ref>
<ref id="ref182"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>W.</given-names></name> <name><surname>Meng</surname> <given-names>J.</given-names></name> <name><surname>Kanaga Suba Raja</surname> <given-names>S.</given-names></name> <name><surname>Padma Priya</surname> <given-names>M.</given-names></name> <name><surname>Kiruthiga Devi</surname> <given-names>M.</given-names></name></person-group> (<year>2021</year>). <article-title>Artificial intelligence in constructing personalized and accurate feedback systems for students</article-title>. <source>Int. J. Model. Simul. Sci. Comput.</source>:<fpage>2341001</fpage>. doi: <pub-id pub-id-type="doi">10.1142/S1793962323410015</pub-id></citation></ref>
<ref id="ref183"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Xue</surname> <given-names>Y.</given-names></name></person-group> (<year>2018</year>). <source>Testing the differential efficacy of data mining techniques to predicting student outcomes in higher education [Dissertation]</source>. <publisher-name>Stony Brook, New York</publisher-name>, <publisher-loc>Stony Brook (NY)</publisher-loc>.</citation></ref>
<ref id="ref184"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>J.</given-names></name> <name><surname>DeVore</surname> <given-names>S.</given-names></name> <name><surname>Hewagallage</surname> <given-names>D.</given-names></name> <name><surname>Miller</surname> <given-names>P.</given-names></name> <name><surname>Ryan</surname> <given-names>Q. X.</given-names></name> <name><surname>Stewart</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). <article-title>Using machine learning to identify the most at-risk students in physics classes</article-title>. <source>Phys. Rev. Phys. Educ. Res.</source> <volume>16</volume>:<fpage>020130</fpage>. doi: <pub-id pub-id-type="doi">10.1103/PhysRevPhysEducRes.16.020130</pub-id></citation></ref>
<ref id="ref185"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>S. J.</given-names></name> <name><surname>Ogata</surname> <given-names>H.</given-names></name> <name><surname>Matsui</surname> <given-names>T.</given-names></name> <name><surname>Chen</surname> <given-names>N. S.</given-names></name></person-group> (<year>2021</year>). <article-title>Human-centered artificial intelligence in education: seeing the invisible through the visible</article-title>. <source>Comput. Educ.: Artif. Intell.</source> <volume>2</volume>:<fpage>100008</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.caeai.2021.100008</pub-id></citation></ref>
<ref id="ref186"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zabriskie</surname> <given-names>C.</given-names></name> <name><surname>Yang</surname> <given-names>J.</given-names></name> <name><surname>DeVore</surname> <given-names>S.</given-names></name> <name><surname>Stewart</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Using machine learning to predict physics course outcomes</article-title>. <source>Phys. Rev. Phys. Educ. Res.</source> <volume>15</volume>:<fpage>020120</fpage>. doi: <pub-id pub-id-type="doi">10.1103/PhysRevPhysEducRes.15.020120</pub-id>, PMID: <pub-id pub-id-type="pmid">35313293</pub-id></citation></ref>
<ref id="ref187"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhai</surname> <given-names>X.</given-names></name></person-group> (<year>2021</year>). <article-title>Practices and theories: how can machine learning assist in innovative assessment practices in science education</article-title>. <source>J. Sci. Educ. Technol.</source> <volume>30</volume>, <fpage>139</fpage>&#x2013;<lpage>149</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10956-021-09901-8</pub-id></citation></ref>
<ref id="ref188"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhai</surname> <given-names>X.</given-names></name> <name><surname>C Haudek</surname> <given-names>K.</given-names></name> <name><surname>Shi</surname> <given-names>L.</given-names></name> <name><surname>H Nehm</surname> <given-names>R.</given-names></name> <name><surname>Urban-Lurain</surname> <given-names>M.</given-names></name></person-group> (<year>2020a</year>). <article-title>From substitution to redefinition: a framework of machine learning-based science assessment</article-title>. <source>J. Res. Sci. Teach.</source> <volume>57</volume>, <fpage>1430</fpage>&#x2013;<lpage>1459</lpage>. doi: <pub-id pub-id-type="doi">10.1002/tea.21658</pub-id></citation></ref>
<ref id="ref189"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhai</surname> <given-names>X.</given-names></name> <name><surname>Shi</surname> <given-names>L.</given-names></name> <name><surname>Nehm</surname> <given-names>R. H.</given-names></name></person-group> (<year>2021</year>). <article-title>A meta-analysis of machine learning-based science assessments: factors impacting machine-human score agreements</article-title>. <source>J. Sci. Educ. Technol.</source> <volume>30</volume>, <fpage>361</fpage>&#x2013;<lpage>379</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10956-020-09875-z</pub-id></citation></ref>
<ref id="ref190"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhai</surname> <given-names>X.</given-names></name> <name><surname>Yin</surname> <given-names>Y.</given-names></name> <name><surname>Pellegrino</surname> <given-names>J. W.</given-names></name> <name><surname>Haudek</surname> <given-names>K. C.</given-names></name> <name><surname>Shi</surname> <given-names>L.</given-names></name></person-group> (<year>2020b</year>). <article-title>Applying machine learning in science assessments: a systematic review</article-title>. <source>Stud. Sci. Educ.</source> <volume>56</volume>, <fpage>111</fpage>&#x2013;<lpage>151</lpage>. doi: <pub-id pub-id-type="doi">10.1080/03057267.2020.1735757</pub-id></citation></ref>
<ref id="ref191"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zwick</surname> <given-names>R.</given-names></name> <name><surname>Himelfarb</surname> <given-names>I.</given-names></name></person-group> (<year>2011</year>). <article-title>The effect of high school socioeconomic status on the predictive validity of SAT scores and high school grade-point average</article-title>. <source>J. Educ. Meas.</source> <volume>48</volume>, <fpage>101</fpage>&#x2013;<lpage>121</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1745-3984.2011.00136.x</pub-id></citation></ref>
</ref-list>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term>ACORNS</term><def><p>Assessing COntextual Reasoning about Natural Selection</p></def></def-item>
<def-item><term>AI</term><def><p>Artificial intelligence</p></def></def-item>
<def-item><term>CI</term><def><p>Concept inventory</p></def></def-item>
<def-item><term>CINS</term><def><p>Conceptual Inventory of Natural Selection</p></def></def-item>
<def-item><term>EDM</term><def><p>Educational data mining</p></def></def-item>
<def-item><term>GPA</term><def><p>Grade point average</p></def></def-item>
<def-item><term>KC</term><def><p>Key concepts</p></def></def-item>
<def-item><term>MCMC</term><def><p>Markov chain Monte Carlo</p></def></def-item>
<def-item><term>LA</term><def><p>Learning analytics</p></def></def-item>
<def-item><term>LMS</term><def><p>Learning management system</p></def></def-item>
<def-item><term>ML</term><def><p>Machine learning</p></def></def-item>
<def-item><term>ROPE</term><def><p>Region of practical equivalence</p></def></def-item>
<def-item><term>STEM</term><def><p>Science, technology, engineering, and mathematics</p></def></def-item>
<def-item><term>WAIC</term><def><p>Widely applicable information criterion</p></def></def-item>
</def-list>
</glossary>
</back>
</article>
