<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2025.1487111</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A random forest dynamic threshold imputation method for handling missing data in cognitive diagnosis assessments</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>You</surname> <given-names>Xiaofeng</given-names></name>
<uri xlink:href="https://loop.frontiersin.org/people/2827525/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Yang</surname> <given-names>Jianqin</given-names></name>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Xu</surname> <given-names>Xinai</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<role content-type="https://credit.niso.org/contributor-roles/funding-acquisition/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/software/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
</contrib-group>
<aff><institution>School of Mathematics and Information Science, Nanchang Normal University</institution>, <addr-line>Nanchang</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by" id="fn0001">
<p>Edited by: Wenchao Ma, University of Minnesota Twin Cities, United States</p>
</fn>
<fn fn-type="edited-by" id="fn0002">
<p>Reviewed by: Shenghai Dai, Washington State University, United States</p>
<p>Xuliang Gao, Guizhou Normal University, China</p>
</fn>
<corresp id="c001">&#x002A;Correspondence: Xinai Xu, <email>jxxxa@ncnu.edu.cn</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>08</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="ecorrected">
<day>12</day>
<month>09</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<volume>16</volume>
<elocation-id>1487111</elocation-id>
<history>
<date date-type="received">
<day>28</day>
<month>08</month>
<year>2024</year>
</date>
<date date-type="accepted">
<day>20</day>
<month>03</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2025 You, Yang and Xu.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>You, Yang and Xu</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>The handling of missing data in cognitive diagnostic assessment is an important issue. The Random Forest Threshold Imputation (RFTI) method proposed by You et al. in 2023 is specifically designed for cognitive diagnostic models (CDMs) and built on the random forest imputation. However, in RFTI, the threshold for determining imputed values to be 0 is fixed at 0.5, which may result in uncertainty in this imputation. To address this issue, we proposed an improved method, Random Forest Dynamic Threshold Imputation (RFDTI), which possess two dynamic thresholds for dichotomous imputed values. A simulation study showed that the classification of attribute profiles when using RFDTI to impute missing data was always better than the four commonly used traditional methods (i.e., person mean imputation, two-way imputation, expectation&#x2013;maximization algorithm, and multiple imputation). Compared with RFTI, RFDTI was slightly better for MAR or MCAR data, but slightly worse for MNAR or MIXED data, especially with a larger missingness proportion. An empirical example with MNAR data demonstrates the applicability of RFDTI, which performed similarly as RFTI and much better than the other four traditional methods. An R package is provided to facilitate the application of the proposed method.</p>
</abstract>
<kwd-group>
<kwd>missing data</kwd>
<kwd>cognitive diagnosis assessment</kwd>
<kwd>random forest threshold imputation</kwd>
<kwd>machine learning</kwd>
<kwd>dynamic thresholds</kwd>
</kwd-group>
<contract-num rid="cn1">NSBSJJ2020013</contract-num>
<contract-sponsor id="cn1">Normal University Doctoral Research Start Up Fund</contract-sponsor>
<counts>
<fig-count count="1"/>
<table-count count="4"/>
<equation-count count="10"/>
<ref-count count="55"/>
<page-count count="13"/>
<word-count count="11415"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Quantitative Psychology and Measurement</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="sec1">
<title>Introduction</title>
<p>With the development of educational research, there is more demand for personalized feedback to enhance students&#x2019; learning in a targeted manner (<xref ref-type="bibr" rid="ref3">Chatzopoulou and Economides, 2010</xref>; <xref ref-type="bibr" rid="ref23">Hattie and Timperley, 2007</xref>; <xref ref-type="bibr" rid="ref32">Parsons et al., 2018</xref>; <xref ref-type="bibr" rid="ref50">Wu and Chang, 2023</xref>). To address this issue, Cognitive Diagnosis Models (CDMs) provide a useful psychometric framework that can finely classify students into different attribute profiles according to their responses on the test items (<xref ref-type="bibr" rid="ref15">de la Torre and Minchen, 2014</xref>; <xref ref-type="bibr" rid="ref25">Ketterlin-Geller and Yovanoff, 2009</xref>; <xref ref-type="bibr" rid="ref39">Sia and Lim, 2018</xref>; <xref ref-type="bibr" rid="ref51">You et al., 2019</xref>). However, the presence of missing responses is usually inevitable in such setting. For example, with the rise of personalized learning and hierarchical teaching in recent years, students often respond to only a portion of the items that match their ability instead of completing the entire test. Also, in large-scale assessments, the balanced-incomplete-block (BIB) design is typically used, in which examinees are administrated only a subset of items (<xref ref-type="bibr" rid="ref9">Dai et al., 2018</xref>). In such cases, the data may be missing completely at random (MCAR), missing at random (MAR), missing not at random (MNAR), or even a mixture of two or three of those missingness mechanisms (e.g., <xref ref-type="bibr" rid="ref29">Liu and Loken, 2025</xref>). Therefore, how to obtain accurate classifications of examinees&#x2019; attribute mastery status in diverse missing data scenarios is a crucial issue (<xref ref-type="bibr" rid="ref7">Dai, 2017</xref>).</p>
<p>However, most commonly-used methods for handling missing data in the field of educational and psychological measurement have obvious limitations, and few methods are specifically designed for CDM applications. First, the existing methods (such as case deletion methods, regression imputation, likelihood-based estimation) often make assumptions about the missingness mechanism, whereas the mechanism is often unknown and even untestable and two or more mechanisms may be mixed in practice (<xref ref-type="bibr" rid="ref7">Dai, 2017</xref>; <xref ref-type="bibr" rid="ref11">de Ayala et al., 2001</xref>). Next, it is difficult for these methods to deal with a high missing data proportion (e.g., more than 30%), which however is common in practice due to test designs. Furthermore, whether the existing conclusions on these methods are applicable to CDMs needs further investigation. Current discussions about these methods are mostly based on Item Response Theory (IRT) models (<xref ref-type="bibr" rid="ref24">Holman and Glas, 2005</xref>; <xref ref-type="bibr" rid="ref36">Robitzsch, 2021</xref>), while some commonly-recommended methods have been found to perform quite differently in different research contexts (<xref ref-type="bibr" rid="ref7">Dai, 2017</xref>; <xref ref-type="bibr" rid="ref31">Newman, 2003</xref>; <xref ref-type="bibr" rid="ref40">Song et al., 2022</xref>). For example, in <xref ref-type="bibr" rid="ref7">Dai (2017)</xref>&#x2019;s and <xref ref-type="bibr" rid="ref40">Song et al. (2022)</xref>&#x2019;s research based on cognitive diagnosis, the results of some popular methods, such as maximum likelihood (ML) and multiple imputation (MI), did not show obvious differences for MAR and MNAR data, which however should be expected in IRT contexts. For clarity, these commonly-used methods (such as case deletion methods, mean imputation, ML estimation, and MI) are collectively referred to as traditional methods in this article, as opposed to the machine-learning-related approaches that will be introduced later.</p>
<p>Since the early 2000s, a new framework for imputing missing values through machine learning algorithms has emerged, in which a machine learning model is trained based on the data samples with observed values for specific variables and then predict the missing values (<xref ref-type="bibr" rid="ref28">Liu and Gopalakrishnan, 2017</xref>). This type of approach is gaining popularity due to their applicability and effectiveness in handling large datasets (<xref ref-type="bibr" rid="ref45">Thomas and Rajabi, 2021</xref>). According to <xref ref-type="bibr" rid="ref45">Thomas and Rajabi&#x2019;s (2021)</xref> review of machine learning-based missing data imputation techniques during 2010&#x202F;~&#x202F;2020, clustering, instance-based (e.g., k-nearest neighbor or KNN), and ensemble (e.g., random forest) techniques are the most popular algorithms applied for data imputation. These techniques are nonparametric algorithms that make no parametric assumptions about the relationship between variables. Thus, when employed in data imputation, they do not require strong assumptions about the missingness mechanism, and have been found to always perform well (e.g., <xref ref-type="bibr" rid="ref26">Kokla et al., 2019</xref>; <xref ref-type="bibr" rid="ref35">Richman et al., 2009</xref>; <xref ref-type="bibr" rid="ref43">Suresh et al., 2019</xref>). Due to these advantages of such new approaches, how to incorporate them into the measurement models is getting more and more attention.</p>
<p>In response to the issue of missing data in the implementation of CDMs and the increasing popularity of machine learning imputation methods, <xref ref-type="bibr" rid="ref52">You et al. (2023)</xref> proposed a Random Forest Threshold Imputation (RFTI) method, which is an adaption of the Random Forest Imputation (RFI; <xref ref-type="bibr" rid="ref42">Stekhoven and B&#x00FC;ehlmann, 2012</xref>). Simply put, when imputing missing values for items scored as 0&#x2013;1, for each unobserved value, RFI first predicts a probability value within [0,1] based on the random forest algorithms, which indicates the probability of the missing value taking the value of 1. In other words, the closer the probability value is to 1, the unobserved value is more likely to be 1; otherwise, the value is more likely to be 0. The probability value is then converted to a dichotomous value of 0 or 1 using a fixed threshold (e.g., 0.5). In reality, however, as the probability value approaches 0.5, the missing value is more likely to be incorrectly imputed, regardless of whether the imputed value is 0 or 1, due to the high uncertainty in the model prediction. Therefore, <xref ref-type="bibr" rid="ref52">You et al. (2023)</xref> proposed to leave the missing value with high uncertainty still missing and only impute those with predicted probability values close to 0 or 1. So, RFTI utilizes two thresholds for the dichotomous imputed value, of which the lower one is fixed at 0.5 and the upper one is dynamic and determined by an adapted person fit index in CDMs. Therefore, RFTI is designed for CDMs based on a machine learning imputation algorithm and has been found to be superior in the recovery of examinees&#x2019; mastery profiles than RFI and the expectation&#x2013;maximization (EM) algorithm (a general method to perform ML estimation on incomplete data), especially for MNAR and MIXED data and a large missingness proportion (<xref ref-type="bibr" rid="ref52">You et al., 2023</xref>).</p>
<p>The idea of setting a dynamic threshold is worthwhile. However, it can be noticed that the lower threshold is still fixed at 0.5 in RFTI. Although <xref ref-type="bibr" rid="ref52">You et al. (2023)</xref> mention that it is reasonable to consider the unobserved responses as wrong (i.e., replace them with 0) in the educational assessment if the predicted probability is 0.5 or below, there is no substantial evidence to support this claim. Replacing missing values with 0 for which the predicted probability is less than but close to 0.5 is still subject to high uncertainty. It remains unknown that whether this will influence the imputation accuracy and hence the classification accuracy of examinees&#x2019; attribute patterns. Besides, the simulation conditions in <xref ref-type="bibr" rid="ref52">You et al. (2023)</xref> are limited in that they only varied the missing data mechanisms and missingness proportions without taking into account other factors in the actual cognitive diagnostic assessments, and only three methods, including EM, RFI and RFTI, were compared.</p>
<p>Therefore, in this study, we proposed an improved version of RFTI, in which both upper and lower thresholds for converting the predicted probability values to dichotomous values are dynamic to fully account for imputation uncertainty, and we call the new method as Random Forest Dynamic Threshold Imputation (RFDTI) method. In the following sections, we first briefly introduce the four missing data mechanisms and several traditional methods that are commonly used in educational and psychological measurements, especially cognitive diagnosis contexts. Then, we describe the principles and ideas of the RFTI method in detail and lead to the improved version, the RFDTI method, on this basis. Afterward, we show a Monte Carlo simulation study, in which we systematically investigated the performance of the proposed RFDTI method under different conditions from the perspective of the classification accuracy of CDMs and compared it with RFTI as well as several commonly used traditional missing data handling methods. An empirical example is also provided to illustrate the applicability of the proposed method in practice. Finally, we conclude the paper with a discussion.</p>
<sec id="sec2">
<title>Missing data mechanisms and traditional handling methods</title>
<p>Missing data and their treatment would substantially affect the analysis results based on such data (<xref ref-type="bibr" rid="ref4">Cheema, 2014</xref>; <xref ref-type="bibr" rid="ref27">Little and Rubin, 2002</xref>; <xref ref-type="bibr" rid="ref44">Tabachnick and Fidell, 1989</xref>). Therefore, appropriate techniques for handling missing data should be adopted, and the method selection is usually based on the mechanism and proportion of missingness, as well as the purpose and model of data analysis (<xref ref-type="bibr" rid="ref27">Little and Rubin, 2002</xref>; <xref ref-type="bibr" rid="ref44">Tabachnick and Fidell, 1989</xref>).</p>
<p><xref ref-type="bibr" rid="ref27">Little and Rubin (2002)</xref> defined three types of missing data mechanisms, i.e., MCAR, MAR, and MNAR. Under the MCAR mechanism, the probability of missingness is unrelated to both observed and unobserved data, so the missing values can be completely ignored in the analysis. When data are MAR, missing data in a particular variable are related to some measured variables in the dataset but are unrelated to that variable itself. For example, the missingness is conditional on other measurable characteristics of the examinee but not on the item score in which missingness occurs. The MNAR mechanism refers to the situation in which the missingness on a variable is partly or completely related to the unobserved values in that variable. For example, the missingness proportion of a difficult item is high, while that proportion of an easy item is low. Therefore, MNAR is considered nonignorable. In addition to the three mechanisms mentioned above, there is a MIXED type of missing data mechanism that was used in <xref ref-type="bibr" rid="ref11">de Ayala et al. (2001)</xref> and <xref ref-type="bibr" rid="ref7">Dai (2017)</xref>. Based on an empirical dataset, <xref ref-type="bibr" rid="ref11">de Ayala et al. (2001)</xref> found that item responses (correct, incorrect, or an omitted response) of examinees were related to both the person&#x2019;s ability and the items. Because a test-taker may omit an item for different reasons in practice and these factors cannot be explicitly measured currently, we also include the MIXED mechanism in this study.</p>
<p>Based on previous studies (e.g., <xref ref-type="bibr" rid="ref8">Dai and Svetina Valdivia, 2022</xref>; <xref ref-type="bibr" rid="ref40">Song et al., 2022</xref>), here we review four categories of the traditional methods for handling missing data, which may be commonly used in cognitive diagnosis contexts: case deletion, single imputation, ML estimation, and MI (<xref ref-type="bibr" rid="ref17">Gemici et al., 2012</xref>; <xref ref-type="bibr" rid="ref38">Schafer and Graham, 2002</xref>). Case deletion methods, including listwise and pairwise deletion, are popular and easy to implement, but often result in a large amount of information loss, thereby decreasing statistical power. Commonly-used single imputation methods include person mean imputation (PM) and two-way imputation (TW). PM imputes each missing value using the corresponding respondent&#x2019;s mean score across all available items. TW method further takes into account information from the item mean and the grand mean in addition to the person mean. These two methods are also easy to implement and are robust in dealing with missing values in multidimensional data (e.g., <xref ref-type="bibr" rid="ref1">Bernaards and Sijtsma, 2000</xref>). As for the ML estimation, a general method to perform it on incomplete data is EM algorithm, which iterates between an expectation step and a maximization step. In the expectation step, missing values are filled in using the expectation based on the current estimates of unknown parameters, whereas in the maximization step, the parameters are re-estimated from the observed and filled data. Strictly speaking, EM is also a single imputation method, but it is stochastic, unlike the deterministic PM and TW. Another ML method is the direct maximum likelihood (also known as full information maximum likelihood), which maximizes the likelihood function directly based on parameters from a specified distribution, rather than first imputing missing values. Therefore, this method is sometimes labeled as the &#x201C;available cases&#x201D; approach in some software.</p>
<p>MI, as a flexible alternative to likelihood methods, is not a specific imputation method but rather a multi-step imputation framework. In MI, each missing value is substituted by <italic>m</italic> &#x003E;&#x202F;1 simulated values, resulting in <italic>m</italic> imputed datasets. Each of the <italic>m</italic> datasets is then analyzed using the desired statistical analysis method in the same manner. Finally, the results are pooled by simple arithmetic to produce overall estimates and standard errors (<xref ref-type="bibr" rid="ref38">Schafer and Graham, 2002</xref>). Theoretically speaking, any stochastic imputation method (such as EM and regression-based methods) can be used with MI. In general, likelihood methods and MI, both considered model-based methods, have been suggested as the optimal approaches for handling missing data in many situations (<xref ref-type="bibr" rid="ref16">Finch, 2008</xref>; <xref ref-type="bibr" rid="ref38">Schafer and Graham, 2002</xref>; <xref ref-type="bibr" rid="ref46">van Buuren, 2018</xref>; <xref ref-type="bibr" rid="ref49">Wothke, 2000</xref>). Nevertheless, each method for treating missing data has its own features and assumption, and no one method can consistently outperform the others under different circumstances (<xref ref-type="bibr" rid="ref16">Finch, 2008</xref>).</p>
<p>Regardless of the specific limitations of each method, all these traditional missing data handling methods are subject to the following issues. First, these methods require (strong) statistical assumptions, including the assumption that the missingness mechanism is MCAR or MAR, which may not be satisfied in practice. However, traditional methods often perform poorly under the MNAR mechanism, and what is worse is that the MNAR mechanism is difficult to test in advance. Second, most methods can provide desirable results only when the missingness proportion is not high. In previous simulation studies on missing data handling methods, the specified missing data proportions ranged from 2% (<xref ref-type="bibr" rid="ref11">de Ayala et al., 2001</xref>) to 50% (<xref ref-type="bibr" rid="ref19">Glas and Pimentel, 2008</xref>), most of which were between 5 and 30% (<xref ref-type="bibr" rid="ref16">Finch, 2008</xref>). These methods do not work well when the proportion exceeds 20%, and a large bias may occur in the estimation when the proportion reaches above 30%. Third, although a variety of approaches has been developed to deal with the problem of missing responses in educational measurement, most of them are within the IRT framework (<xref ref-type="bibr" rid="ref8">Dai and Svetina Valdivia, 2022</xref>). For other complex measurement models, such as CDMs, there are few missing data handling methods that take into account the characteristics of the model itself.</p>
</sec>
</sec>
<sec id="sec3">
<title>Random forest threshold imputation</title>
<p>The rise of machine learning provides a new paradigm for imputing missing values. Machine learning models can be trained based on the observed data and then used to predict missing values. Due to the underlying machine learning algorithms, this type of imputation method is often not as dependent on assumptions of missingness mechanisms as the traditional methods mentioned above and often performs better (e.g., <xref ref-type="bibr" rid="ref26">Kokla et al., 2019</xref>; <xref ref-type="bibr" rid="ref35">Richman et al., 2009</xref>; <xref ref-type="bibr" rid="ref43">Suresh et al., 2019</xref>).</p>
<p><xref ref-type="bibr" rid="ref52">You et al. (2023)</xref> incorporated the Random Forest Imputation (RFI; <xref ref-type="bibr" rid="ref42">Stekhoven and B&#x00FC;ehlmann, 2012</xref>), a flexible and effective machine learning imputation method, with the features of CDMs to develop a Random Forest Threshold Imputation (RFTI) method. It is specially designed for CDMs and possesses the advantages of machine learning imputation methods.</p>
<sec id="sec4">
<title>Basic procedure</title>
<p>The key idea of RFTI is building on the random forest imputation, that is, it allows some missing values with low certainty of imputation to remain missing, which is realized by setting two thresholds:</p>
<disp-formula id="EQ1">
<label>(1)</label>
<mml:math id="M1">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">{</mml:mo>
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn>
<mml:mo></mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>&#x2265;</mml:mo>
<mml:mi>&#x03C4;</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">NA</mml:mi>
<mml:mspace width="thickmathspace"/>
<mml:mo></mml:mo>
<mml:mn>0.5</mml:mn>
<mml:mo>&#x003C;</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>&#x003C;</mml:mo>
<mml:mi>&#x03C4;</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn>
<mml:mo></mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</disp-formula>
<p>in which <inline-formula>
<mml:math id="M2">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> denotes the imputed response of examinee <italic>i</italic> on item <italic>j</italic>, <inline-formula>
<mml:math id="M3">
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the imputed probability for examinee <italic>i</italic> on item <italic>j</italic>, <italic>NA</italic> represents missingness, <inline-formula>
<mml:math id="M4">
<mml:mi>&#x03C4;</mml:mi>
</mml:math>
</inline-formula> is the dynamic upper threshold and 0.5 is the fixed lower threshold. For the same dataset, <inline-formula>
<mml:math id="M5">
<mml:mi>&#x03C4;</mml:mi>
</mml:math>
</inline-formula> will be substituted for a range of possible values <inline-formula>
<mml:math id="M6">
<mml:mo stretchy="true">{</mml:mo>
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo stretchy="true">}</mml:mo>
</mml:math>
</inline-formula> within a reasonable range of [0.5, 1) in evenly spaced increments (e.g., 0.01), and its final value will be the one yielding the best imputation effect, which is evaluated using an adapted person fit index in CDMs. Therefore, for each <inline-formula>
<mml:math id="M7">
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo>,</mml:mo>
</mml:math>
</inline-formula> the following procedures of imputation and model fit will be repeated.</p>
<p>Suppose that there is an <italic>N</italic>&#x202F;&#x00D7;&#x202F;<italic>M</italic> data matrix <inline-formula>
<mml:math id="M8">
<mml:mi>Y</mml:mi>
</mml:math>
</inline-formula>, where <italic>N</italic> denotes the number of examinees and <italic>M</italic> is the number of variables (i.e., test items). Then it can be viewed as <inline-formula>
<mml:math id="M9">
<mml:mi>Y</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>M</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula>, in which <inline-formula>
<mml:math id="M10">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>m</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the collection of all examinees&#x2019; responses on the <italic>m</italic>th item (<inline-formula>
<mml:math id="M11">
<mml:mi>m</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>M</mml:mi>
</mml:math>
</inline-formula>). Let <inline-formula>
<mml:math id="M12">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
<mml:mspace width="thickmathspace"/>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>m</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> denote an arbitrary variable with missing data, <inline-formula>
<mml:math id="M13">
<mml:msubsup>
<mml:mi>i</mml:mi>
<mml:mi mathvariant="italic">mis</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x2208;</mml:mo>
<mml:mo stretchy="true">{</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">}</mml:mo>
</mml:math>
</inline-formula> denote the examinees with missing values in <inline-formula>
<mml:math id="M14">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>, and <inline-formula>
<mml:math id="M15">
<mml:msubsup>
<mml:mi>i</mml:mi>
<mml:mi mathvariant="italic">obs</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x2208;</mml:mo>
<mml:mo stretchy="true">{</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">}</mml:mo>
</mml:math>
</inline-formula> denote the remaining examinees with observed values in <inline-formula>
<mml:math id="M16">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>. Subsequently, the dataset can be divided into four parts: (1) <inline-formula>
<mml:math id="M17">
<mml:msubsup>
<mml:mi>y</mml:mi>
<mml:mi mathvariant="italic">obs</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>, representing the observed values in variable <inline-formula>
<mml:math id="M18">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>; (2) <inline-formula>
<mml:math id="M19">
<mml:msubsup>
<mml:mi>y</mml:mi>
<mml:mi mathvariant="italic">mis</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>, representing the missing values in <inline-formula>
<mml:math id="M20">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>; (3) <inline-formula>
<mml:math id="M21">
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mi mathvariant="italic">obs</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>, representing the data of examinees <inline-formula>
<mml:math id="M22">
<mml:msubsup>
<mml:mi>i</mml:mi>
<mml:mi mathvariant="italic">obs</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> in all other <inline-formula>
<mml:math id="M23">
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>m</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> variables except <inline-formula>
<mml:math id="M24">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>; (4) <inline-formula>
<mml:math id="M25">
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mi mathvariant="italic">mis</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>, representing the data of examinees <inline-formula>
<mml:math id="M26">
<mml:msubsup>
<mml:mi>i</mml:mi>
<mml:mi mathvariant="italic">mis</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> in all other <inline-formula>
<mml:math id="M27">
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>m</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> variables except <inline-formula>
<mml:math id="M28">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>. The imputation procedure is an iterative process involving the following steps.</p>
<p>The first step is to use a traditional imputation method, such as the item mean imputation, to calculate the initial estimates of all the missing values. Then, sort all variables with missingness, <inline-formula>
<mml:math id="M29">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
<mml:mspace width="thickmathspace"/>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi>m</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula>, in the ascending order of the number of missing values. The imputed matrix is denoted by <inline-formula>
<mml:math id="M30">
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">old</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
<mml:mo>.</mml:mo>
</mml:math>
</inline-formula></p>
<p>The second step is to impute the missing values for each <inline-formula>
<mml:math id="M31">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> through the random forest algorithm, in which converting probability values to dichotomous values using <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref>. Specifically, to conduct imputation for the variable <inline-formula>
<mml:math id="M32">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>, a random forest model is trained using <inline-formula>
<mml:math id="M33">
<mml:msubsup>
<mml:mi>y</mml:mi>
<mml:mi mathvariant="italic">obs</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> as the response and <inline-formula>
<mml:math id="M34">
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mi mathvariant="italic">obs</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> as the predictors. Then, the fitted model is applied to predict the missing values <inline-formula>
<mml:math id="M35">
<mml:msubsup>
<mml:mi>y</mml:mi>
<mml:mi mathvariant="italic">mis</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> using <inline-formula>
<mml:math id="M36">
<mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mi mathvariant="italic">mis</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> as input. Notice that at this time, the predicted probability values provided by the fitted model are converted to dichotomous values (0 or 1) based on the prespecified upper threshold <inline-formula>
<mml:math id="M37">
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
</mml:math>
</inline-formula> and fixed lower threshold 0.5. This process is repeated for all variables with missing values. After completing the imputation for all <inline-formula>
<mml:math id="M38">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi>s</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>, the new imputed matrix obtained is denoted by <inline-formula>
<mml:math id="M39">
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">new</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
</mml:math>
</inline-formula> and then compared to <inline-formula>
<mml:math id="M40">
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">old</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
</mml:math>
</inline-formula>.</p>
<p>If the difference between the two imputed matrices does not meet the stopping criterion, the next iteration will be carried out. In the new iteration, <inline-formula>
<mml:math id="M41">
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">new</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
</mml:math>
</inline-formula> in the previous iteration will be assigned to <inline-formula>
<mml:math id="M42">
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">old</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
</mml:math>
</inline-formula>, and the second imputation step will be repeated to update <inline-formula>
<mml:math id="M43">
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">new</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
</mml:math>
</inline-formula>. The stopping criterion is that the difference between <inline-formula>
<mml:math id="M44">
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">new</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M45">
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">old</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
</mml:math>
</inline-formula> increases for the first time. For the set of <inline-formula>
<mml:math id="M46">
<mml:mi>M</mml:mi>
</mml:math>
</inline-formula> discrete variables, that difference is measured by <inline-formula>
<mml:math id="M47">
<mml:mi>&#x0394;</mml:mi>
</mml:math>
</inline-formula>, which is calculated by <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref>.</p>
<disp-formula id="EQ2">
<label>(2)</label>
<mml:math id="M48">
<mml:mi>&#x0394;</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>M</mml:mi>
</mml:msubsup>
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:msubsup>
<mml:msub>
<mml:mi>I</mml:mi>
<mml:mrow>
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">new</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
<mml:mo>&#x2260;</mml:mo>
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">old</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mo>#</mml:mo>
<mml:mi mathvariant="italic">NA</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:math>
</disp-formula>
<p>in which <inline-formula>
<mml:math id="M49">
<mml:mo>#</mml:mo>
<mml:mi mathvariant="italic">NA</mml:mi>
</mml:math>
</inline-formula> is the number of all missing values in the matrix <inline-formula>
<mml:math id="M50">
<mml:mi>Y</mml:mi>
<mml:mo>,</mml:mo>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M51">
<mml:msub>
<mml:mi>I</mml:mi>
<mml:mrow>
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">new</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
<mml:mo>&#x2260;</mml:mo>
<mml:msubsup>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">old</mml:mi>
<mml:mi mathvariant="italic">imp</mml:mi>
</mml:msubsup>
</mml:mrow>
</mml:msub>
</mml:math>
</inline-formula> is an indicator variable that records whether the imputed value in Row <italic>i,</italic> Column <italic>j</italic> differs between two successive iterations. If that value differs, <inline-formula>
<mml:math id="M52">
<mml:mi>I</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:math>
</inline-formula>; otherwise, <inline-formula>
<mml:math id="M53">
<mml:mi>I</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:math>
</inline-formula>. Therefore, the numerator in <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref> represents the number of imputed values that change between two iterations (<xref ref-type="bibr" rid="ref42">Stekhoven and B&#x00FC;ehlmann, 2012</xref>).</p>
<p>After obtaining a final imputed data matrix related to <inline-formula>
<mml:math id="M54">
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
</mml:math>
</inline-formula>, a CDM selected by researchers is fitted to this data matrix, in which the EM algorithm is used for item parameter estimation and the maximum a posterior (MAP) method is used to estimate latent attribute patterns. The remaining missing values after imputation are simply ignored. Based on the estimated attribute patterns and Q-matrix expectations, an adapted person fit index is calculated for evaluating the imputation accuracy for each <inline-formula>
<mml:math id="M55">
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
</mml:math>
</inline-formula>.</p>
</sec>
<sec id="sec5">
<title>Determination of the upper thresholds <inline-formula>
<mml:math id="M56">
<mml:mi>&#x03C4;</mml:mi>
</mml:math>
</inline-formula></title>
<p>The determination of the upper thresholds (<inline-formula>
<mml:math id="M57">
<mml:mi>&#x03C4;</mml:mi>
</mml:math>
</inline-formula>) is a balance between the imputation proportion and imputation accuracy in actual situations. A higher <inline-formula>
<mml:math id="M58">
<mml:mi>&#x03C4;</mml:mi>
</mml:math>
</inline-formula> will result in fewer but more accurate imputed values. On the other hand, the missingness proportion in the imputed dataset should be low enough (preferably less than 10, 10%&#x202F;~&#x202F;15% sometimes acceptable) (e.g., <xref ref-type="bibr" rid="ref7">Dai, 2017</xref>; <xref ref-type="bibr" rid="ref21">Hair et al., 2010</xref>; <xref ref-type="bibr" rid="ref27">Little and Rubin, 2002</xref>; <xref ref-type="bibr" rid="ref30">Muth&#x00E9;n et al., 2011</xref>), so that simply ignoring these remaining missing values in the subsequent analysis will not bring a substantial bias.</p>
<p>Logically, the imputed values with high certainty should not damage the overall fit between the data and the expectations of the CDM used in the analysis. The more errors in imputed values, the greater the deviation of the imputed data from the ideal response patterns. When the deviation is large enough, the imputation should be stopped. Out of this consideration, <xref ref-type="bibr" rid="ref52">You et al. (2023)</xref> adapted the response conformity index (RCI) proposed by <xref ref-type="bibr" rid="ref6">Cui and Li (2015)</xref>, which is a person fit index in CDMs, to evaluate the deviation of the imputed data using a possible value <inline-formula>
<mml:math id="M59">
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
</mml:math>
</inline-formula> of <inline-formula>
<mml:math id="M60">
<mml:mi>&#x03C4;</mml:mi>
</mml:math>
</inline-formula> from the ideal response patterns based on the current estimated model.</p>
<p>The adapted index is calculated in two steps. In the first step, an <inline-formula>
<mml:math id="M61">
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is calculated for each examinee <italic>i</italic> as</p>
<disp-formula id="EQ3">
<label>(3)</label>
<mml:math id="M62">
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:msubsup>
<mml:mo>&#x2223;</mml:mo>
<mml:mo>ln</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="true">[</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>I</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:mfrac>
<mml:mo stretchy="true">]</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>I</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2223;</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mfrac>
</mml:math>
</disp-formula>
<p>in which <inline-formula>
<mml:math id="M63">
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> (<inline-formula>
<mml:math id="M64">
<mml:mn>0</mml:mn>
<mml:mo>&#x003C;</mml:mo>
<mml:msub>
<mml:mi>m</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:mi>M</mml:mi>
</mml:math>
</inline-formula>) is the number of nonmissing items for examinee <italic>i</italic> in the imputed data matrix with <inline-formula>
<mml:math id="M65">
<mml:mi>M</mml:mi>
</mml:math>
</inline-formula> items; <inline-formula>
<mml:math id="M66">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> denotes the observed or imputed response of examinee <italic>i</italic> on item <italic>j</italic>; <inline-formula>
<mml:math id="M67">
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> represents the estimated attribute profile of examinee <italic>i</italic> since the true profile is unknown in practice; <inline-formula>
<mml:math id="M68">
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> denotes the probability of a correct response to item <italic>j</italic> given <inline-formula>
<mml:math id="M69">
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>, <inline-formula>
<mml:math id="M70">
<mml:msub>
<mml:mi>I</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> is the corresponding ideal response, and <inline-formula>
<mml:math id="M71">
<mml:msub>
<mml:mi>I</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mspace width="thickmathspace"/>
</mml:math>
</inline-formula>only if examinee <italic>i</italic> masters all the attributes required by item <italic>j</italic>, otherwise, <inline-formula>
<mml:math id="M72">
<mml:msub>
<mml:mi>I</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:math>
</inline-formula>. The Q matrix specifies the attributes required for each item. In the second step, the mean of <inline-formula>
<mml:math id="M73">
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is calculated across all examinees, that is,</p>
<disp-formula id="EQ4">
<label>(4)</label>
<mml:math id="M74">
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:msubsup>
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:msub>
<mml:mi>C</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:mfrac>
</mml:math>
</disp-formula>
<p>Therefore, for each <inline-formula>
<mml:math id="M75">
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
</mml:math>
</inline-formula>, a value of <inline-formula>
<mml:math id="M76">
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> can be obtained. More accurate imputations will result in imputed response patterns that are more consistent with expectations, thereby generating a smaller <inline-formula>
<mml:math id="M77">
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula>.</p>
<p>In {<inline-formula>
<mml:math id="M78">
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>T</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msup>
</mml:math>
</inline-formula>}, the value resulting in the smallest <inline-formula>
<mml:math id="M79">
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> is selected as the optimal upper threshold, and the corresponding imputed data matrix is the final imputed result that will be used in the subsequent analysis. In practical application, it is sufficient to export the final imputed data matrix only.</p>
</sec>
</sec>
<sec id="sec6">
<title>Random forest dynamic threshold imputation</title>
<p>RFTI is built on the random forest imputation algorithms while leveraging the characteristics of CDMs to dynamically determine the upper threshold to reduce the imputation errors. However, according to <xref ref-type="disp-formula" rid="EQ1">Equation 1</xref>, the lower threshold is still fixed at 0.5 in RFTI. That is, any missing value with a predicted probability not greater than 0.5 is replaced with 0. When thinking about the starting point of RFTI to reduce estimation uncertainty, this is puzzling. Replacing missing values with 0 for which the predicted probability approaches 0.5 is still subject to a high degree of uncertainty, and so carries the risk of imputation errors. Although <xref ref-type="bibr" rid="ref52">You et al. (2023)</xref> mention that it is reasonable to consider the unobserved responses as wrong (i.e., replace them with 0) in the educational assessment if the predicted probability is 0.5 or below, there is no substantial evidence to support this claim in their study. It remains unknown that whether this will influence the imputation accuracy and hence the classification accuracy of examinees&#x2019; attribute patterns.</p>
<p>Therefore, in this study, we proposed an improved version of RFTI, i.e., Random Forest Dynamic Threshold Imputation (RFDTI). The key difference between them is that in RFDTI, both upper and lower thresholds (<inline-formula>
<mml:math id="M80">
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M81">
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>) for converting the predicted probability values to dichotomous values are dynamic to fully account for imputation uncertainty. In other words, the following <xref ref-type="disp-formula" rid="EQ5">Equation 5</xref> is used instead of <xref ref-type="disp-formula" rid="EQ1">Equation 1</xref> when determine the imputed dichotomous values.</p>
<disp-formula id="EQ5">
<label>(5)</label>
<mml:math id="M82">
<mml:msub>
<mml:mi>Y</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">{</mml:mo>
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn>
<mml:mo></mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>&#x2265;</mml:mo>
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">NA</mml:mi>
<mml:mo></mml:mo>
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mo>&#x003C;</mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>&#x003C;</mml:mo>
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0</mml:mn>
<mml:mo></mml:mo>
<mml:msub>
<mml:mi>p</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>&#x2264;</mml:mo>
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</disp-formula>
<p>in which <inline-formula>
<mml:math id="M83">
<mml:mn>0</mml:mn>
<mml:mo>&#x003C;</mml:mo>
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mo>&#x003C;</mml:mo>
<mml:mn>0.5</mml:mn>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M84">
<mml:mn>0.5</mml:mn>
<mml:mo>&#x2264;</mml:mo>
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
<mml:mo>&#x003C;</mml:mo>
<mml:mn>1</mml:mn>
</mml:math>
</inline-formula>. The dynamic upper and lower thresholds will be simultaneously determined based on the adapted person fit index <inline-formula>
<mml:math id="M85">
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula>, while other procedures are the same as those of RFTI. That is, for each combination of <inline-formula>
<mml:math id="M86">
<mml:msubsup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>l</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mspace width="thickmathspace"/>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M87">
<mml:msubsup>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>u</mml:mi>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mspace width="thickmathspace"/>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>t</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>T</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula>, an imputed data matrix can be obtained according to the predicted probability of missing values and <xref ref-type="disp-formula" rid="EQ5">Equation 5</xref>, for which an <inline-formula>
<mml:math id="M88">
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
<mml:mspace width="thickmathspace"/>
</mml:math>
</inline-formula>can be calculated according to <xref ref-type="disp-formula" rid="EQ3">Equations 3</xref>, <xref ref-type="disp-formula" rid="EQ4">4</xref>. The combination of <inline-formula>
<mml:math id="M89">
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M90">
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> values corresponding to the smallest <inline-formula>
<mml:math id="M91">
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> is selected as the optimal thresholds, and the corresponding imputed data matrix will be used in the subsequent analysis.</p>
<p>To facilitate the application of the proposed RFDTI, we developed the corresponding R package <italic>missForestCDA</italic>, which can be downloaded from <ext-link xlink:href="https://jianlingsoft.oss-cn-beijing.aliyuncs.com/missForestCDA.rar" ext-link-type="uri">https://jianlingsoft.oss-cn-beijing.aliyuncs.com/missForestCDA.rar</ext-link>. This package requires preinstallation of the R package <italic>missForest</italic> (<xref ref-type="bibr" rid="ref41">Stekhoven, 2013</xref>) for implementing the random forest imputation and the R package <italic>CDM</italic> (<xref ref-type="bibr" rid="ref18">George et al., 2016</xref>; <xref ref-type="bibr" rid="ref37">Robitzsch et al., 2017</xref>) for the estimation of CDMs. After installing and loading the <italic>missForestCDA</italic> package, the incomplete dataset can be imputed by simply handing it over to the main function of this package:</p>
<disp-formula id="E1">
<mml:math id="M92">
<mml:mtext mathvariant="italic">missForestCDA</mml:mtext>
<mml:mo stretchy="true">(</mml:mo>
<mml:mtext mathvariant="italic">missData</mml:mtext>
<mml:mo>,</mml:mo>
<mml:mi>Q</mml:mi>
<mml:mo>,</mml:mo>
<mml:mtext mathvariant="italic">missN</mml:mtext>
<mml:mo>,</mml:mo>
<mml:mtext mathvariant="italic">stepV</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>0.05</mml:mn>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</disp-formula>
<p>in which <italic>missData</italic> is the input incomplete dataset, <italic>Q</italic> is the Q matrix that needs to be specified by researchers, <italic>missN</italic> is the number of persons with missing responses, and <italic>stepV</italic> is the increment <inline-formula>
<mml:math id="M93">
<mml:mi>&#x03B4;</mml:mi>
</mml:math>
</inline-formula> used to generate a sequence of possible values for <inline-formula>
<mml:math id="M94">
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>l</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M95">
<mml:msub>
<mml:mi>&#x03C4;</mml:mi>
<mml:mi>u</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>, which is set at 0.05 by default. This function will directly return the final imputed dataset.</p>
<p>In this study, we conducted a Monte Carlo simulation study and an empirical study to investigated the performance of the proposed RFDTI method under different conditions and compared it with RFTI as well as several commonly used traditional missing data handling methods. According to <xref ref-type="disp-formula" rid="EQ3">Equations 3</xref>, <xref ref-type="disp-formula" rid="EQ4">4</xref>, the adapted person fit statistic <inline-formula>
<mml:math id="M96">
<mml:mover accent="true">
<mml:mrow>
<mml:mi mathvariant="italic">RCI</mml:mi>
<mml:mo>_</mml:mo>
<mml:mi>C</mml:mi>
</mml:mrow>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> can be obtained for any CDMs with explicitly defined item response function <inline-formula>
<mml:math id="M97">
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula>. In this study, for the purpose of illustration, the generalized Deterministic Inputs, Noisy and Gate (DINA) model (<xref ref-type="bibr" rid="ref12">de la Torre, 2011</xref>) is used as an example.</p>
<p>Note that in <xref ref-type="bibr" rid="ref52">You et al. (2023)</xref>&#x2019;s study, the superiority of RFTI over RFI and EM was mainly in the classification of attribute profiles, whereas the estimation of item parameters using RFTI was inferior to that of EM. This may be because the training and prediction of the random forest model underlying RFTI are mainly based on the characteristics of individual response patterns across items, while information about responses from different examinees to the same item is rarely used. Based on this, we focus on the classification of attributes&#x2019; mastery status in this study, leaving aside the item parameter estimation temporarily.</p>
</sec>
<sec id="sec7">
<title>Simulation study</title>
<p>A Monte Carlo simulation study was conducted to investigate the imputation effect of the proposed RFDTI method under different missingness conditions and its relative performance compared with the RFTI as well as four commonly used imputation methods in educational assessments.</p>
<sec id="sec8">
<title>Design</title>
<p>A total of 4&#x202F;&#x00D7;&#x202F;5&#x202F;&#x00D7;&#x202F;3&#x202F;&#x00D7;&#x202F;6&#x202F;=&#x202F;360 conditions were created by manipulating four factors, including the missing data mechanism (MIXED, MNAR, MAR, and MCAR), missingness proportion (10, 20, 30, 40, and 50%), sample size (<italic>N</italic>&#x202F;=&#x202F;500, 1,000 and 2000) and the number of attributes (<italic>K</italic>&#x202F;=&#x202F;3, 4, 5, 6, 7, 8). The missing data proportion and the number of attributes were chosen according to common settings in related studies. Specifically, the missing rate reported in the educational measurement literature, as mentioned above, was between 2 and 50%, and most existing CDM studies used three to eight attributes (<xref ref-type="bibr" rid="ref7">Dai, 2017</xref>). In addition, a sample size of 1,000 was widely used (<xref ref-type="bibr" rid="ref7">Dai, 2017</xref>) and was considered sufficient for the DINA model to obtain an accurate parameter estimation (<xref ref-type="bibr" rid="ref14">de la Torre et al., 2010</xref>). Therefore, we considered three levels of sample size centered at 1000. Each simulation condition was replicated 100 times. Each generated dataset was imputed using six approaches, including RFTI, RFDTI, and four frequently used methods in educational assessments, including PM, TW, EM, and MI.</p>
<p>Other specifications reflected the common settings in simulations and empirical studies of CDMs reported in previous literature. According to the review of CDM studies by <xref ref-type="bibr" rid="ref7">Dai (2017)</xref>, the number of items was mostly between 20 and 40, so a test length of 30 items was used here. For simplicity, we assumed that attributes were independent of each other. The Q-matrix reflecting the mapping relationship between attributes and items was randomly generated. Specifically, q-entries in the Q-matrix were randomly drawn from the uniform distribution U(0,1) and then dichotomized by the cut-off point of 0.5. Therefore, each item might measure one or more attributes.</p>
</sec>
<sec id="sec9">
<title>Data generation</title>
<p>Data generation was implemented in R language and involved two steps: generating the complete datasets and then generating the missing data.</p>
<sec id="sec10">
<title>Complete data generation</title>
<p>First, the DINA model was used to simulate the complete dichotomous responses under each condition. In the DINA model, the item response probability is written as:</p>
<disp-formula id="EQ6">
<label>(6)</label>
<mml:math id="M98">
<mml:mtable columnalign="left" displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mi>P</mml:mi>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2223;</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>=</mml:mo>
<mml:msubsup>
<mml:mi>g</mml:mi>
<mml:mi>j</mml:mi>
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
</mml:mrow>
</mml:msubsup>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">{</mml:mo>
<mml:mtable>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo></mml:mo>
<mml:mtext mathvariant="italic">if</mml:mtext>
<mml:mspace width="thickmathspace"/>
<mml:msub>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo></mml:mo>
<mml:mtext mathvariant="italic">if</mml:mtext>
<mml:mspace width="thickmathspace"/>
<mml:msub>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</disp-formula>
<p>in which <inline-formula>
<mml:math id="M99">
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the response of examinee <italic>i</italic> to item <italic>j</italic>, <inline-formula>
<mml:math id="M100">
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi mathvariant="italic">iK</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> is the examinee&#x2019;s attribute profile, <inline-formula>
<mml:math id="M101">
<mml:msub>
<mml:mi>g</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the guessing parameter of item <italic>j</italic>, <inline-formula>
<mml:math id="M102">
<mml:msub>
<mml:mi>s</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the slipping parameter of item <italic>j</italic>, <inline-formula>
<mml:math id="M103">
<mml:msub>
<mml:mi>&#x03B7;</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo movablelimits="false">&#x220F;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>K</mml:mi>
</mml:munderover>
<mml:msubsup>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi mathvariant="italic">ik</mml:mi>
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mi mathvariant="italic">jk</mml:mi>
</mml:msub>
</mml:msubsup>
</mml:math>
</inline-formula> is the ideal response of examinee <italic>i</italic> to item <italic>j</italic>, and <inline-formula>
<mml:math id="M104">
<mml:msub>
<mml:mi>q</mml:mi>
<mml:mi mathvariant="italic">jk</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the element in the Q matrix indicating whether attribute <italic>k</italic> is required for a correct response to item <italic>j</italic>.</p>
<p>Following the literature (e.g., <xref ref-type="bibr" rid="ref5">Cui et al., 2012</xref>; <xref ref-type="bibr" rid="ref8">Dai and Svetina Valdivia, 2022</xref>), examinees&#x2019; attribute profiles (<inline-formula>
<mml:math id="M105">
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>) were generated from a dichotomized multivariate normal distribution <inline-formula>
<mml:math id="M106">
<mml:mi mathvariant="italic">MVN</mml:mi>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mn mathvariant="bold">0</mml:mn>
<mml:mi>K</mml:mi>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mi>&#x03A3;</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula>. Specifically, <inline-formula>
<mml:math id="M107">
<mml:msub>
<mml:mn mathvariant="bold">0</mml:mn>
<mml:mi>K</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is a <inline-formula>
<mml:math id="M108">
<mml:mn>1</mml:mn>
<mml:mo>&#x00D7;</mml:mo>
<mml:mi>K</mml:mi>
</mml:math>
</inline-formula> vector of zeros with <italic>K</italic> being the number of attributes measured by the test, <inline-formula>
<mml:math id="M109">
<mml:mi>&#x03A3;</mml:mi>
</mml:math>
</inline-formula> is a <inline-formula>
<mml:math id="M110">
<mml:mi>K</mml:mi>
<mml:mo>&#x00D7;</mml:mo>
<mml:mi>K</mml:mi>
<mml:mspace width="thickmathspace"/>
</mml:math>
</inline-formula>covariance matrix with all diagonal elements being 1 and all off-diagonal elements being 0.5, as shown in <xref ref-type="disp-formula" rid="EQ7">Equation 7</xref>:</p>
<disp-formula id="EQ7">
<label>(7)</label>
<mml:math id="M111">
<mml:mi>&#x03A3;</mml:mi>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">[</mml:mo>
<mml:mtable displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mn>1</mml:mn>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mn>0.5</mml:mn>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x22EE;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x22F1;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x22EE;</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mn>0.5</mml:mn>
</mml:mtd>
<mml:mtd>
<mml:mo>&#x2026;</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mn>1</mml:mn>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo stretchy="true">]</mml:mo>
</mml:math>
</disp-formula>
<p>and the cut point for each attribute was set to zero. Item parameters in the DINA model, including the slipping parameter <italic>s</italic> and the guessing parameter <italic>g</italic>, were drawn from the uniform distribution [0.05, 0.25]. Then, according to <xref ref-type="disp-formula" rid="EQ6">Equation 6</xref>, the probability of a correct response of examinee <italic>i</italic> to item <italic>j</italic>, <inline-formula>
<mml:math id="M112">
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula>, was calculated and compared to a uniform random number [0, 1]. If <inline-formula>
<mml:math id="M113">
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi>j</mml:mi>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> was not less than the random number, the response <inline-formula>
<mml:math id="M114">
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> was coded as 1 for correct; otherwise, <inline-formula>
<mml:math id="M115">
<mml:msub>
<mml:mi>X</mml:mi>
<mml:mi mathvariant="italic">ij</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
</mml:math>
</inline-formula>.</p>
</sec>
<sec id="sec11">
<title>Missing data generation</title>
<p>Considering that the trained models in RFTI and RFDTI methods are based on examinees with observed data on the target variable, the training accuracy can be improved if there are some examinees with complete response data. Therefore, we randomly selected 80% of the sample (e.g., 800 out of 1,000 simulated examinees) to generate missing values, leaving a small number of examinees with complete data. Note, however, that the application of the two methods do not require some examinees in the sample have complete data.</p>
<p>MCAR missing samples were simulated by randomly removing a specified percentage of responses from the complete dataset. This was achieved by comparing the specified overall missingness proportion (e.g., 30%) with a uniform random number [0, 1] generated for each response. If the random number was greater than or equal to the proportion, the corresponding response was removed as missing.</p>
<p>Generation of missing responses of MAR followed the methods outlined in <xref ref-type="bibr" rid="ref11">de Ayala et al. (2001)</xref>, <xref ref-type="bibr" rid="ref33">Peugh and Enders (2004)</xref>, and <xref ref-type="bibr" rid="ref16">Finch (2008)</xref>. Based on the complete dataset, the number-correct score was calculated for each examinee on all but the target item as an ability-proxy variable. Examinees were divided into seven fractiles based on the 5th, 15th, 30th, 70th, 85th, and 95th percentiles of their normalized scores on the proxy variable. Examinees of each fractile were assigned a missingness probability that was inversely related to their scores (see <xref ref-type="table" rid="tab1">Table 1</xref>), while the average missing rate across fractiles was kept at the desired level. Uniform random numbers [0, 1] were used to select responses for deletion according to the missing rate of each fractile.</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption>
<p>MAR missing rate of each fractile.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Percentile</th>
<th align="center" valign="top">Missing rate (%)</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">0&#x202F;~&#x202F;5th</td>
<td align="center" valign="middle">MR&#x202F;&#x00D7;&#x202F;1.50</td>
</tr>
<tr>
<td align="left" valign="middle">5&#x202F;~&#x202F;15th</td>
<td align="center" valign="middle">MR&#x202F;&#x00D7;&#x202F;1.35</td>
</tr>
<tr>
<td align="left" valign="middle">15&#x202F;~&#x202F;30th</td>
<td align="center" valign="middle">MR&#x202F;&#x00D7;&#x202F;1.15</td>
</tr>
<tr>
<td align="left" valign="middle">30&#x202F;~&#x202F;70th</td>
<td align="center" valign="middle">MR&#x202F;&#x00D7;&#x202F;1.00</td>
</tr>
<tr>
<td align="left" valign="middle">70&#x202F;~&#x202F;85th</td>
<td align="center" valign="middle">MR&#x202F;&#x00D7;&#x202F;0.85</td>
</tr>
<tr>
<td align="left" valign="middle">85&#x202F;~&#x202F;95th</td>
<td align="center" valign="middle">MR&#x202F;&#x00D7;&#x202F;0.65</td>
</tr>
<tr>
<td align="left" valign="middle">95&#x202F;~&#x202F;100th</td>
<td align="center" valign="middle">MR&#x202F;&#x00D7;&#x202F;0.50</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>MR&#x202F;=&#x202F;the desired overall missing rate in each condition. Fractiles are represented by the percentiles of the normalized scores on the proxy variable.</p>
</table-wrap-foot>
</table-wrap>
<p>Following the method outlined in <xref ref-type="bibr" rid="ref7">Dai (2017)</xref>, the MNAR data were generated by calculating the omission probability based on the responses in the complete dataset. Examinees with an incorrect response to an item were assigned a higher probability of omission than those who answered the item correctly. In addition, the missing rate on items increased with the item difficulty. The generation procedure is as follows. The number of omitted responses for each examinee was first calculated based on the desired overall missingness proportion. A probability factor <inline-formula>
<mml:math id="M116">
<mml:mi>&#x03B5;</mml:mi>
</mml:math>
</inline-formula> was then specified for each examinee, with an initial value of 0. A uniform random number [0, 1] was compared with <inline-formula>
<mml:math id="M117">
<mml:mi>p</mml:mi>
<mml:mo>+</mml:mo>
<mml:mi>&#x03B5;</mml:mi>
</mml:math>
</inline-formula> to determine whether to remove a response as an omission, in which <inline-formula>
<mml:math id="M118">
<mml:mi>p</mml:mi>
</mml:math>
</inline-formula> is the probability of a correct response. If the random number was larger, the response was removed. During the procedure, if the number of missing responses for an examinee was greater (or less) than the prespecified number, the value of <inline-formula>
<mml:math id="M119">
<mml:mi>&#x03B5;</mml:mi>
</mml:math>
</inline-formula> would be increased (or decreased). For each examinee, the value of <inline-formula>
<mml:math id="M120">
<mml:mi>&#x03B5;</mml:mi>
</mml:math>
</inline-formula> was constantly adjusted to regenerate the missing data until the number of omitted items was equal to the desired number.</p>
<p>To generate data of the MIXED mechanism, we first adopted the same procedure as generating the MAR data, that is, dividing the sample into seven fractiles and calculating the missing rate of each fractile, in which examinees with higher scores had a lower missingness proportion. Subsequently, the number of missing responses for each examinee could be calculated according to the assigned fractile, and the omissions were generated through the procedure of generating the MNAR data mentioned above.</p>
</sec>
</sec>
<sec id="sec12">
<title>Analysis</title>
<p>All missing data were imputed using the corresponding R packages. Specifically, PM, TW, and EM were conducted using the <italic>TestDataImputation</italic> package (<xref ref-type="bibr" rid="ref10">Dai et al., 2021</xref>). MI was carried out using function mice() in <italic>mice</italic> package (<xref ref-type="bibr" rid="ref47">van Buuren et al., 2021</xref>), in which the specific imputation method used was logistic regression imputation and 20 imputed datasets were created for each incomplete dataset (<xref ref-type="bibr" rid="ref20">Graham et al., 2007</xref>). RFTI and RFDTI were implemented with the <italic>missForestDINA</italic> and <italic>missForestCDA</italic> packages, respectively.</p>
<p>After the imputation, the DINA model was fit to the data using the R package <italic>CDM</italic> (<xref ref-type="bibr" rid="ref37">Robitzsch et al., 2017</xref>). The estimation of examinees&#x2019; attribute profiles, as the focus of this study, was then evaluated across all 100 replications in each condition. Note that when using MI for imputation, since attribute profiles were dichotomous data, their estimation accuracy results (rather than estimates) were pooled by averaging the corresponding results across multiple imputations.</p>
</sec>
<sec id="sec13">
<title>Evaluation criteria</title>
<p>As this study focuses on the classification of attribute mastery status, we adopted two relevant criteria to evaluate the performance of each imputation method: the pattern-wise classification accuracy (PCA) and the attribute-wise classification accuracy (ACA).</p>
<disp-formula id="EQ8">
<label>(8)</label>
<mml:math id="M121">
<mml:mi mathvariant="italic">PCA</mml:mi>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>R</mml:mi>
</mml:munderover>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mi>I</mml:mi>
<mml:mo stretchy="true">[</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">]</mml:mo>
<mml:mo>/</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>R</mml:mi>
<mml:mo>&#x00D7;</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</disp-formula>
<disp-formula id="EQ9">
<label>(9)</label>
<mml:math id="M122">
<mml:mi mathvariant="italic">ACA</mml:mi>
<mml:mo>=</mml:mo>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>R</mml:mi>
</mml:munderover>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>K</mml:mi>
</mml:munderover>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>N</mml:mi>
</mml:munderover>
<mml:mi>I</mml:mi>
<mml:mo stretchy="true">[</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi mathvariant="italic">ik</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi mathvariant="italic">ik</mml:mi>
</mml:msub>
<mml:mo stretchy="true">]</mml:mo>
<mml:mo>/</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>R</mml:mi>
<mml:mo>&#x00D7;</mml:mo>
<mml:mi>K</mml:mi>
<mml:mo>&#x00D7;</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math id="M123">
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>&#x03B1;</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi mathvariant="italic">iK</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M124">
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mi>&#x03B1;</mml:mi>
<mml:mi mathvariant="italic">iK</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> are the estimated and true attribute patterns for examinee <italic>I</italic>, respectively, and <inline-formula>
<mml:math id="M125">
<mml:mi>I</mml:mi>
<mml:mo stretchy="true">[</mml:mo>
<mml:mo>&#x00B7;</mml:mo>
<mml:mo stretchy="true">]</mml:mo>
</mml:math>
</inline-formula> is an indicator function that takes the value of 1 or 0 depending on whether the condition in brackets is met; <italic>R</italic> is the number of successfully converged replications in each condition; <italic>N</italic> is the sample size; and <italic>K</italic> is the number of attributes. PCA measures the average classification accuracy of examinees&#x2019; attribute patterns, and ACA measures the average classification accuracy of the attributes. A larger value of PCA or ACA indicates a more accurate classification of the attribute mastery status.</p>
</sec>
</sec>
<sec sec-type="results" id="sec14">
<title>Results</title>
<sec id="sec15">
<title>Missing rate of data imputed by RFDTI</title>
<p>Considering that RFDTI may not impute all missing values and a low proportion of missing data may be retained and ignored in the following analysis, we first examined the remaining missing rates of the data imputed by RFDTI in different conditions. Due to the same issue faced by RFTI, we also provided the results from RFTI for comparison. Results showed that the missing rate of RFDTI imputed data was mainly affected by the missingness mechanism and proportion, while the sample size and the number of attributes had little effect. Therefore, the missing rates of RFDTI imputed data under different missingness mechanisms and proportions are listed in <xref ref-type="table" rid="tab2">Table 2</xref>.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption>
<p>Remaining missing rates of data imputed by RFDTI and RFTI under different missing mechanisms and proportions.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="top" char="&#x00D7;">Missingness mechanism</th>
<th align="center" valign="top">Missingness proportion</th>
<th align="center" valign="top">RFDTI</th>
<th align="center" valign="top">RFTI</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle" rowspan="5">MCAR</td>
<td align="center" valign="middle">10%</td>
<td align="center" valign="middle">2.44%</td>
<td align="center" valign="middle">1.03%</td>
</tr>
<tr>
<td align="center" valign="middle">20%</td>
<td align="center" valign="middle">4.65%</td>
<td align="center" valign="middle">2.33%</td>
</tr>
<tr>
<td align="center" valign="middle">30%</td>
<td align="center" valign="middle">6.98%</td>
<td align="center" valign="middle">4.32%</td>
</tr>
<tr>
<td align="center" valign="middle">40%</td>
<td align="center" valign="middle">9.99%</td>
<td align="center" valign="middle">6.48%</td>
</tr>
<tr>
<td align="center" valign="middle">50%</td>
<td align="center" valign="middle">13.73%</td>
<td align="center" valign="middle">9.16%</td>
</tr>
<tr>
<td align="left" valign="middle" rowspan="5">MAR</td>
<td align="center" valign="middle">10%</td>
<td align="center" valign="middle">2.47%</td>
<td align="center" valign="middle">1.02%</td>
</tr>
<tr>
<td align="center" valign="middle">20%</td>
<td align="center" valign="middle">4.55%</td>
<td align="center" valign="middle">2.33%</td>
</tr>
<tr>
<td align="center" valign="middle">30%</td>
<td align="center" valign="middle">7.04%</td>
<td align="center" valign="middle">4.15%</td>
</tr>
<tr>
<td align="center" valign="middle">40%</td>
<td align="center" valign="middle">9.94%</td>
<td align="center" valign="middle">6.50%</td>
</tr>
<tr>
<td align="center" valign="middle">50%</td>
<td align="center" valign="middle">13.59%</td>
<td align="center" valign="middle">9.21%</td>
</tr>
<tr>
<td align="left" valign="middle" rowspan="5">MNAR</td>
<td align="center" valign="middle">10%</td>
<td align="center" valign="middle">1.53%</td>
<td align="center" valign="middle">1.10%</td>
</tr>
<tr>
<td align="center" valign="middle">20%</td>
<td align="center" valign="middle">4.15%</td>
<td align="center" valign="middle">2.99%</td>
</tr>
<tr>
<td align="center" valign="middle">30%</td>
<td align="center" valign="middle">8.09%</td>
<td align="center" valign="middle">5.92%</td>
</tr>
<tr>
<td align="center" valign="middle">40%</td>
<td align="center" valign="middle">11.69%</td>
<td align="center" valign="middle">9.81%</td>
</tr>
<tr>
<td align="center" valign="middle">50%</td>
<td align="center" valign="middle">14.30%</td>
<td align="center" valign="middle">14.14%</td>
</tr>
<tr>
<td align="left" valign="middle" rowspan="5">MIXED</td>
<td align="center" valign="middle">10%</td>
<td align="center" valign="middle">1.29%</td>
<td align="center" valign="middle">0.86%</td>
</tr>
<tr>
<td align="center" valign="middle">20%</td>
<td align="center" valign="middle">3.65%</td>
<td align="center" valign="middle">2.29%</td>
</tr>
<tr>
<td align="center" valign="middle">30%</td>
<td align="center" valign="middle">7.61%</td>
<td align="center" valign="middle">4.55%</td>
</tr>
<tr>
<td align="center" valign="middle">40%</td>
<td align="center" valign="middle">12.07%</td>
<td align="center" valign="middle">8.00%</td>
</tr>
<tr>
<td align="center" valign="middle">50%</td>
<td align="center" valign="middle">14.83%</td>
<td align="center" valign="middle">12.60%</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>In general, the remaining missing rate showed an upward trend as the missingness proportion in the original data increased, and it was below or approximate 10% in most conditions considered in this study. Only when the original missing proportion reached 50% under the MCAR and MAR mechanisms, or when the original missing proportion reached 40% under the MNAR or MIXED mechanisms, the remaining missing rate of RFDTI imputed data was about 10%&#x202F;~&#x202F;15%. Therefore, in the subsequent analysis based on the RFDTI imputed data, the remaining missing values after being imputed by RFDTI were temporarily ignored (<xref ref-type="bibr" rid="ref22">Hartz et al., 2002</xref>; <xref ref-type="bibr" rid="ref27">Little and Rubin, 2002</xref>; <xref ref-type="bibr" rid="ref30">Muth&#x00E9;n et al., 2011</xref>). In addition, as expected, the remaining missing rates of RFDTI were slightly higher that those of RFTI.</p>
</sec>
<sec id="sec16">
<title>Classification accuracy of attribute profiles</title>
<p>In general, PCA and ACA results had similar trends between methods or the different levels of design factors, while the differences in ACA values were smaller than those in PCA values. <xref ref-type="fig" rid="fig1">Figure 1</xref> shows the average PCA and ACA of the estimated attribute profiles for the six methods under different missingness mechanisms and proportions. According to <xref ref-type="fig" rid="fig1">Figure 1</xref>, the higher the missingness proportion, the worse the classification accuracy tends to be.</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption>
<p>Average attribute classification accuracy under different missing mechanisms and proportions. <bold>(A&#x2013;D)</bold> PCA values of attribute profiles under four missingness mechanisms. <bold>(E&#x2013;H)</bold> ACA values of attribute profiles under four missingness mechanisms.</p>
</caption>
<graphic xlink:href="fpsyg-16-1487111-g001.tif"/>
</fig>
<p>Then we focused on the comparison between methods. When the missing data was MCAR (<xref ref-type="fig" rid="fig1">Figures 1A</xref>,<xref ref-type="fig" rid="fig1">E</xref>) or MAR (<xref ref-type="fig" rid="fig1">Figures 1B</xref>,<xref ref-type="fig" rid="fig1">F</xref>), RFDTI performed very similarly or even slightly better than RFTI, and both outperformed the other four methods, especially when the missing proportion increased. EM and MI performed slightly better in PCA than TW and PM, while these four methods resulted in quite similar ACA results. Under the MNAR (<xref ref-type="fig" rid="fig1">Figures 1C</xref>,<xref ref-type="fig" rid="fig1">G</xref>) or MIXED (<xref ref-type="fig" rid="fig1">Figures 1D</xref>,<xref ref-type="fig" rid="fig1">H</xref>) mechanism, RFDTI also performed reasonably well. Specifically, RFDTI performed better than the other four traditional methods (i.e., PM, TW, EM, and MI) in terms of PCA. In addition, based on ACA results, RFDTI performed similarly to PM and TW and better than EM and MI. However, for MNAR or MIXED data, the recovery of attribute patterns based on RFDTI was no better than that from RFTI, and the difference between two methods increased with a larger missingness proportion. This might be related to the fact that the remaining missing rate in RFDTI-imputed data was higher than that of RFTI.</p>
<p>The average PCA and ACA values of each method under different missingness mechanism, missing proportions, number of attributes, or sample sizes are provided in <xref ref-type="sec" rid="sec30">Table A1 of Appendix A</xref>. When comparing the results across different missingness mechanisms, we found that the classification accuracy of each method for the MNAR and MIXED data was slightly higher than that for the MAR and MCAR data. This pattern was more apparent for PM, TW, and RFTI. As for the other two design factors (i.e., the number of attributes and the sample size), the classification accuracy for each method tended to decrease when the test measured more attributes, while the sample size had little effect on the classification accuracy for all six methods.</p>
</sec>
</sec>
<sec id="sec17">
<title>Empirical study</title>
<p>In this section, we illustrate the application of the proposed RFDTI method using the data collected from a cognitive diagnosis assessment designed for Chinese seventh-grade students. Missing data were MNAR by design, which will be explained in detail in the subsection <italic>Test administration</italic>. Students&#x2019; academic achievements in Chinese and mathematics in the final examination and their attribute patterns estimated from the complete data in a parallel test were used as the criteria to evaluate the performance of RFDTI.</p>
<sec id="sec18">
<title>Instrument</title>
<p>We adopted a cognitive aptitude test for seventh-grade students developed by the psychometric research center of Beijing Normal University. It contains two parallel test forms (A and B) with identical test length and structure (i.e., Q matrix). Each test form has 50 items and measures five attributes, including verbal reasoning, analogical reasoning, symbolic operation, matrix reasoning, and spatial reasoning. In each form, each item measures only one attribute, and each attribute is measured by 10 items (see <xref ref-type="sec" rid="sec30">Table B1 of Appendix B</xref>). The instruments and assessment procedures were reviewed and approved by the research committee of Beijing Normal University. The school teachers, students, and their parents had a clear understanding of this project and how data was collected. Parents of all student participants approved and signed informed consent forms.</p>
<p>Before using the two forms of the cognitive aptitude test, we performed a prior analysis to examine their instrument quality. We collected response data from 181 and 186 seventh-grade students from Dalian City, Liaoning Province on test forms A and B, respectively. Then, under the classical test theory (CTT) framework, we calculated the difficulty and discrimination of each item, the difficulty and reliability of each attribute and the entire test (see <xref ref-type="sec" rid="sec30">Appendix B</xref>). In general, the difficulty of most items was between 0.3 and 0.7 and the discrimination was between 0.3 and 0.5. The test difficulty of the two forms was 0.421 and 0.471, respectively, and their test reliability was 0.870 and 0.899, respectively.</p>
<p>According to the item difficulty, test form A was divided into two subtests with identical length and structure. The easier subtest A1 was composed of the 5 easiest items for each of the five attributes, totaling 25 items. Subtest A2, the more difficult one, consisted of the remaining 5 items for each attribute.</p>
</sec>
<sec id="sec19">
<title>Test administration</title>
<p>The test administration involved two phases. In the first phase, each student was required to complete test form B within 60&#x202F;min. The responses were then analyzed using the DINA model to estimate the students&#x2019; attribute mastery patterns, which were transformed into attribute mastery scores (i.e., the number of attributes mastered) ranging from 0 to 5. Afterward, all the students were divided into two groups, including a low-level group with attribute mastery scores from 0 to 2, and a high-level group with attribute mastery scores from 3 to 5.</p>
<p>The second phase was conducted two weeks later. The low-level group and the high-level group were administered the easy subtest A1 and the difficult subtest A2, respectively, within 30&#x202F;min. In this case, the missing responses of the low-level group on subtest A2 and the high-level group on subtest A1 could be regarded as MNAR.</p>
</sec>
<sec id="sec20">
<title>Sample</title>
<p>A total of 610 seventh-grade students from a junior middle school in Dalian City, Liaoning Province in China participated in the first phase of the test (completing test form B), of which 52.78% were boys and 47.22% were girls. Only 599 of them participated in the second phase (completing test form A1 or A2), including 271 in the low-level group and 328 in the high-level group. Therefore, the sample used for analysis in this study was 599 students, each of whom responded to just half of the items on test form A.</p>
</sec>
<sec id="sec21">
<title>Analysis</title>
<p>Data analysis consists of two stages: (1) dealing with missing data using different methods and estimating students&#x2019; attribute mastery patterns based on responses of subtests A1 and A2; and (2) evaluating the performance of those methods.</p>
<p>In the first stage, we used six methods, i.e., PM, TW, EM, MI, RFTI, and RFDTI, to impute missing data on test form A, respectively, and fit the DINA model to each of the six imputed datasets to obtain a set of estimates of students&#x2019; attribute patterns. In order to improve the results of different imputation methods, we added the complete data of 181 students on test form A into the current response data, which were collected for quality analysis of the test instrument (see subsection <italic>Instrument</italic>). Therefore, all the available response data of 181&#x202F;+&#x202F;599&#x202F;=&#x202F;780 students on Test A were used and the missingness proportion was 38.40% in this stage.</p>
<p>In the second stage, the performance of different imputation methods was evaluated through two external criteria, i.e., attribute patterns estimated from response data on test B and academic achievements, based on the sample of 599 students.</p>
<p>First, we calculated the consistency between the classifications estimated based on the response data on test form B and the data on test A dealt with by each imputation method. Specifically, the DINA model was fitted to the 599 students&#x2019; responses to test B, estimating their attribute patterns. These estimates were used as criteria to evaluate the imputation accuracy of the six missing data handling methods. PCA and ACA were still used as the evaluation criteria, while the true attribute patterns in <xref ref-type="disp-formula" rid="EQ8">Equations 8</xref>, <xref ref-type="disp-formula" rid="EQ9">9</xref> were replaced with the estimated attribute patterns based on test form B. Therefore, PCA and ACA measured the consistency between the classifications estimated based on the data of the two test forms. The higher the consistency, the better the performance of the imputation method.</p>
<p>Next, we calculated the correlation between the estimated attribute mastery scores and the academic achievements in Chinese and mathematics, and then compared these correlations based on test forms A and B. Specifically, for test form A, we calculated the correlations between the academic achievements and attribute mastery scores obtained by using different missing data handling methods, so each method had a corresponding correlation value. Then, the correlations between the attribute mastery scores from test form B and the academic achievements were taken as the comparison standards. A smaller difference between the correlation of a missing data handling method and that coefficient based on test form B indicated better performance of this method.</p>
<p>Note that, as in the simulation study, MI results were pooled by averaging the corresponding measures of the estimation (i.e., PCA, ACA, or the correlation between the estimated attribute mastery scores and external criteria) across all imputed datasets.</p>
</sec>
<sec id="sec22">
<title>Results</title>
<p><xref ref-type="table" rid="tab3">Table 3</xref> shows the PCA and ACA of each missing data handling method for the whole sample, as well as the two groups with different ability levels. In this MNAR design, RFTI always resulted in highest values of PCA and ACA among the methods, above 0.9, both for the whole sample or for subgroups. The proposed RFDTI could provide similar results as RFTI, of which the PCA and ACA values were close to or above 0.90. Among the remaining four traditional methods, TW performed relatively better and its results were similar between the two ability level groups, while PM performed the worst for the whole sample (both PCA and ACA were lower than 0.6) and quite differentially between two groups. The PCA of PM was even below 0.4 for the high-level group, but exceeded 0.8 for the low-level group.</p>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption>
<p>PCA and ACA of six missing data handling methods.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top" rowspan="2">Method</th>
<th align="center" valign="top" colspan="3">PCA</th>
<th align="center" valign="top" colspan="3">ACA</th>
</tr>
<tr>
<th align="center" valign="top">Whole sample</th>
<th align="center" valign="top">Low-level group</th>
<th align="center" valign="top">High-level group</th>
<th align="center" valign="top">Whole sample</th>
<th align="center" valign="top">Low-level group</th>
<th align="center" valign="top">High-level group</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle">PM</td>
<td align="center" valign="middle">0.576</td>
<td align="center" valign="middle">0.815</td>
<td align="center" valign="middle">0.378</td>
<td align="center" valign="middle">0.581</td>
<td align="center" valign="middle">0.865</td>
<td align="center" valign="middle">0.458</td>
</tr>
<tr>
<td align="left" valign="middle">TW</td>
<td align="center" valign="middle">0.788</td>
<td align="center" valign="middle">0.775</td>
<td align="center" valign="middle">0.799</td>
<td align="center" valign="middle">0.810</td>
<td align="center" valign="middle">0.825</td>
<td align="center" valign="middle">0.836</td>
</tr>
<tr>
<td align="left" valign="middle">EM</td>
<td align="center" valign="middle">0.701</td>
<td align="center" valign="middle">0.749</td>
<td align="center" valign="middle">0.662</td>
<td align="center" valign="middle">0.723</td>
<td align="center" valign="middle">0.796</td>
<td align="center" valign="middle">0.701</td>
</tr>
<tr>
<td align="left" valign="middle">MI</td>
<td align="center" valign="middle">0.650</td>
<td align="center" valign="middle">0.715</td>
<td align="center" valign="middle">0.596</td>
<td align="center" valign="middle">0.683</td>
<td align="center" valign="middle">0.751</td>
<td align="center" valign="middle">0.642</td>
</tr>
<tr>
<td align="left" valign="middle">RFTI</td>
<td align="center" valign="middle">0.920</td>
<td align="center" valign="middle">0.915</td>
<td align="center" valign="middle">0.924</td>
<td align="center" valign="middle">0.941</td>
<td align="center" valign="middle">0.931</td>
<td align="center" valign="middle">0.946</td>
</tr>
<tr>
<td align="left" valign="middle">RFDTI</td>
<td align="center" valign="middle">0.901</td>
<td align="center" valign="middle">0.899</td>
<td align="center" valign="middle">0.905</td>
<td align="center" valign="middle">0.934</td>
<td align="center" valign="middle">0.928</td>
<td align="center" valign="middle">0.932</td>
</tr>
</tbody>
</table>
</table-wrap>
<p><xref ref-type="table" rid="tab4">Table 4</xref> presents the correlation coefficients between the attribute mastery scores based on two test forms and academic achievements in Chinese and mathematics. The pattern of results among six imputation methods was consistent for the two subjects. The estimated attribute mastery scores after using RFDTI or RFTI to deal with missing data were the most strongly correlated with academic achievements, and these correlations were the closest to those based on the complete responses from test B. MI also performed well, while EM was the worst.</p>
<table-wrap position="float" id="tab4">
<label>Table 4</label>
<caption>
<p>Correlation between attribute mastery scores and academic achievements.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Test form</th>
<th align="center" valign="top">Method</th>
<th align="center" valign="top">Chinese</th>
<th align="center" valign="top">Mathematics</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="middle" rowspan="6">A</td>
<td align="left" valign="middle">PM</td>
<td align="center" valign="middle">0.271</td>
<td align="center" valign="middle">0.318</td>
</tr>
<tr>
<td align="left" valign="middle">TW</td>
<td align="center" valign="middle">0.371</td>
<td align="center" valign="middle">0.416</td>
</tr>
<tr>
<td align="left" valign="middle">EM</td>
<td align="center" valign="middle">0.253</td>
<td align="center" valign="middle">0.296</td>
</tr>
<tr>
<td align="left" valign="middle">MI</td>
<td align="center" valign="middle">0.455</td>
<td align="center" valign="middle">0.511</td>
</tr>
<tr>
<td align="left" valign="middle">RFTI</td>
<td align="center" valign="middle">0.484</td>
<td align="center" valign="middle">0.530</td>
</tr>
<tr>
<td align="left" valign="middle">RFDTI</td>
<td align="center" valign="middle">0.494</td>
<td align="center" valign="middle">0.523</td>
</tr>
<tr>
<td align="left" valign="middle">B</td>
<td align="center" valign="middle">/</td>
<td align="center" valign="middle">0.544</td>
<td align="center" valign="middle">0.613</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec sec-type="discussion" id="sec23">
<title>Discussion</title>
<p>In this study, we improved the Random Forest Threshold Imputation method proposed by <xref ref-type="bibr" rid="ref52">You et al. (2023)</xref>, which is designed for handling missing data in the implementation of CDMs and demonstrates superiority for MNAR and MIXED data and a large missingness proportion. Specifically, motivated by the fixed lower threshold in RFTI and related uncertainty of imputation, in this study, we adapted the RFTI method by setting both dynamic upper and lower thresholds to increase the imputation accuracy. For ease of application, we also developed an R package <italic>missForestCDA</italic> for the RFDTI method.</p>
<p>Based on the machine learning algorithm, RFDTI is a nonparametric method, and it relies much less on the assumptions of the distribution or the missingness mechanism of the data compared with traditional methods such as EM and MI. Results of the current simulation and empirical studies also demonstrate the effectiveness of the RFDTI method from the perspective of attribute pattern classification. The attribute profile estimations for RFDTI were consistently more accurate than the four traditional methods (PM, TW, EM, MI), even when the missingness proportion was high (&#x003E;30%). The performance of RFDTI for the empirical MNAR data was also better than the four methods and much closer to the results based on complete data.</p>
<p>However, RFDTI did not show obvious advantages over RFTI. In the simulation, for MCAR or MAR data, RFDTI slightly outperformed RFTI. However, for MIXED and MNAR data, the situation was reversed. The differences between the two methods can be negligible in the case of a small percentage of missingness, but became larger with a higher missingness proportion. This may be related to the percentage of remaining missing data after imputation. When the missing percentage of the original data is higher, the remaining missing percentage after imputation will also be relatively higher, while this part of missing values will not be treated, but just ignored. RFDTI would produce a higher missing percentage than RFTI. According to <xref ref-type="table" rid="tab2">Table 2</xref>, in the case of an original missing percentage of 50%, the remaining missing percentage after imputation using RFDTI could approach 15%, while it is lower in RFTI. Under the MNAR and MIXED missing mechanism, the requirement for the percentage of missing data that can be negligible may be lower. So the accuracy of RFDTI is instead lower than RFTI with a large missing proportion for MNAR or MIXED data.</p>
<p>The findings of this study also indicate the need to pay special attention to the treatment of missing data in CDM applications, which is also one of the starting points of the current study. In the current simulation, the estimation of attribute profiles for each method was better under the MIXED and MNAR mechanisms than under the MAR and MCAR mechanisms. For the traditional missing data handling methods, this finding is inconsistent with previous research results in the IRT context (e.g., <xref ref-type="bibr" rid="ref16">Finch, 2008</xref>; <xref ref-type="bibr" rid="ref48">Wolkowitz and Skorupski, 2013</xref>). The reason is likely that the person parameters to be estimated in CDMs are binary variables (i.e., the classification of attribute mastery status), rather than continuous variables (such as latent ability) as in IRT models. Research has found that the performance of missing data handling methods is related to the missingness mechanism and the relationship relies on the specific research contexts, including the analysis model and data type (categorical or continuous) (<xref ref-type="bibr" rid="ref7">Dai, 2017</xref>; <xref ref-type="bibr" rid="ref31">Newman, 2003</xref>; <xref ref-type="bibr" rid="ref34">Poleto et al., 2011</xref>; <xref ref-type="bibr" rid="ref40">Song et al., 2022</xref>; <xref ref-type="bibr" rid="ref53">Zhuchkova and Rotmistrov, 2022</xref>; <xref ref-type="bibr" rid="ref9001">Fu et al., 2025</xref>; <xref ref-type="bibr" rid="ref9002">Qin et al., 2024</xref>). Accordingly, it is conceivable that the impact of the missing data mechanism on the traditional methods may differ between CDM and IRT. On the other hand, the good performance of the RFDTI method may be related to its greater use of individual response patterns that may provide additional useful information under nonrandom missingness mechanisms. Therefore, the proposed method and its comparison with traditional methods in this study provide users with more choices of missing data handling methods in CDM applications and provide a basis for the method selection.</p>
<p>Although the RFDTI method seems very promising, there are some issues for further study. First, in this study, we only focus on the estimation of the attribute mastery status, leaving the item parameter estimation aside temporarily. However, how the item parameter estimation of CDMs will be affected when using RFDTI to deal with missing data needs further investigation. Second, in this study, we only explored the performance of the RFDTI method in the context of the common DINA model. In future research, the RFDTI method can be applied in combination with other CDMs, such as the fusion model (<xref ref-type="bibr" rid="ref22">Hartz et al., 2002</xref>), the hierarchical DINA model (<xref ref-type="bibr" rid="ref13">de la Torre and Douglas, 2004</xref>; <xref ref-type="bibr" rid="ref9003">Yan et al., 2025</xref>), and the DINO model, so as to explore its performance in more research contexts. Third, simulation conditions related to cognitive diagnosis assessments should be enriched (e.g., varying test lengths and hierarchical relationships of attributes) to more comprehensively demonstrate the performance of the RFDTI approach.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="sec24">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="sec30">Supplementary material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec sec-type="ethics-statement" id="sec25">
<title>Ethics statement</title>
<p>The studies involving humans were approved by Academic Ethics Committee of School of Mathematics and Information Science, Nanchang Normal University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants&#x2019; legal guardians/next of kin in accordance with the national legislation and institutional requirements.</p>
</sec>
<sec sec-type="author-contributions" id="sec26">
<title>Author contributions</title>
<p>XY: Writing &#x2013; original draft, Methodology, Formal analysis. JY: Writing &#x2013; review &#x0026; editing. XX: Writing &#x2013; review &#x0026; editing, Investigation, Funding acquisition, Software.</p>
</sec>
<sec sec-type="funding-information" id="sec27">
<title>Funding</title>
<p>The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by Nanchang Normal University Doctoral Research Start Up Fund, no: NSBSJJ2020013.</p>
</sec>
<ack>
<p>We are very grateful to Yue Xiao and Hongyun Liu for their support and assistance in the preparation of the initial draft of this manuscript.</p>
</ack>
<sec sec-type="COI-statement" id="sec28">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="correction-note" id="sec028">
<title>Correction note</title>
<p>A correction has been made to this article. Details can be found at: <ext-link xlink:href="https://doi.org/10.3389/fpsyg.2025.1686437" ext-link-type="uri">10.3389/fpsyg.2025.1686437</ext-link>.</p>
</sec>
<sec sec-type="disclaimer" id="sec29">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec sec-type="supplementary-material" id="sec30">
<title>Supplementary material</title>
<p>The Supplementary material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1487111/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/fpsyg.2025.1487111/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.pdf" id="SM1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bernaards</surname> <given-names>C. A.</given-names></name> <name><surname>Sijtsma</surname> <given-names>K.</given-names></name></person-group> (<year>2000</year>). <article-title>Influence of imputation and EM methods on factor analysis when item nonresponse in questionnaire data is nonignorable</article-title>. <source>Multivar. Behav. Res.</source> <volume>35</volume>, <fpage>321</fpage>&#x2013;<lpage>364</lpage>. doi: <pub-id pub-id-type="doi">10.1207/S15327906MBR3503_03</pub-id>, PMID: <pub-id pub-id-type="pmid">26745335</pub-id></citation></ref>
<ref id="ref3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chatzopoulou</surname> <given-names>D. I.</given-names></name> <name><surname>Economides</surname> <given-names>A. A.</given-names></name></person-group> (<year>2010</year>). <article-title>Adaptive assessment of student&#x2019;s knowledge in programming courses</article-title>. <source>J. Comput. Assist. Learn.</source> <volume>26</volume>, <fpage>258</fpage>&#x2013;<lpage>269</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1365-2729.2010.00363.x</pub-id></citation></ref>
<ref id="ref4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheema</surname> <given-names>J. R.</given-names></name></person-group> (<year>2014</year>). <article-title>A review of missing data handling methods in education research</article-title>. <source>Rev. Educ. Res.</source> <volume>84</volume>, <fpage>487</fpage>&#x2013;<lpage>508</lpage>. doi: <pub-id pub-id-type="doi">10.3102/0034654314532697</pub-id></citation></ref>
<ref id="ref5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cui</surname> <given-names>Y.</given-names></name> <name><surname>Gierl</surname> <given-names>M. J.</given-names></name> <name><surname>Chang</surname> <given-names>H.-H.</given-names></name></person-group> (<year>2012</year>). <article-title>Estimating classification consistency and accuracy for cognitive diagnostic assessment</article-title>. <source>J. Educ. Meas.</source> <volume>49</volume>, <fpage>19</fpage>&#x2013;<lpage>38</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1745-3984.2011.00158.x</pub-id>, PMID: <pub-id pub-id-type="pmid">40114432</pub-id></citation></ref>
<ref id="ref6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cui</surname> <given-names>Y.</given-names></name> <name><surname>Li</surname> <given-names>L.</given-names></name></person-group> (<year>2015</year>). <article-title>Evaluating person fit for cognitive diagnostic assessment</article-title>. <source>Appl. Psychol. Meas.</source> <volume>39</volume>, <fpage>223</fpage>&#x2013;<lpage>238</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0146621614557272</pub-id>, PMID: <pub-id pub-id-type="pmid">29881004</pub-id></citation></ref>
<ref id="ref7"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Dai</surname> <given-names>S.</given-names></name></person-group> (<year>2017</year>). <source>Investigation of missing responses in implementation of cognitive diagnostic models</source> [Unpublished doctoral dissertation]. <publisher-loc>Bloomington, IN</publisher-loc>: <publisher-name>Indiana University</publisher-name>.</citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dai</surname> <given-names>S.</given-names></name> <name><surname>Svetina Valdivia</surname> <given-names>D.</given-names></name></person-group> (<year>2022</year>). <article-title>Dealing with missing responses in cognitive diagnostic modeling</article-title>. <source>Psych</source> <volume>4</volume>, <fpage>318</fpage>&#x2013;<lpage>342</lpage>. doi: <pub-id pub-id-type="doi">10.3390/psych4020028</pub-id></citation></ref>
<ref id="ref9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dai</surname> <given-names>S.</given-names></name> <name><surname>Svetina Valdivia</surname> <given-names>D.</given-names></name> <name><surname>Chen</surname> <given-names>C.</given-names></name></person-group> (<year>2018</year>). <article-title>Investigation of missing responses in Q-matrix validation</article-title>. <source>Appl. Psychol. Meas.</source> <volume>42</volume>, <fpage>660</fpage>&#x2013;<lpage>676</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0146621618762742</pub-id>, PMID: <pub-id pub-id-type="pmid">30559573</pub-id></citation></ref>
<ref id="ref10"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Dai</surname> <given-names>S.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Svetina</surname> <given-names>D.</given-names></name></person-group> (<year>2021</year>). TestDataImputation: missing item responses imputation for test and assessment data (version 2.3) [R package]. Available online at: <ext-link xlink:href="https://cran.r-project.org/web/packages/TestDataImputation/index.html" ext-link-type="uri">https://cran.r-project.org/web/packages/TestDataImputation/index.html</ext-link></citation></ref>
<ref id="ref11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Ayala</surname> <given-names>R. J.</given-names></name> <name><surname>Plake</surname> <given-names>B. S.</given-names></name> <name><surname>Impara</surname> <given-names>J. C.</given-names></name></person-group> (<year>2001</year>). <article-title>The impact of omitted responses on the accuracy of ability estimation in item response theory</article-title>. <source>J. Educ. Meas.</source> <volume>38</volume>, <fpage>213</fpage>&#x2013;<lpage>234</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1745-3984.2001.tb01124.x</pub-id></citation></ref>
<ref id="ref12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Torre</surname> <given-names>J.</given-names></name></person-group> (<year>2011</year>). <article-title>The generalized DINA model framework</article-title>. <source>Psychometrika</source> <volume>76</volume>, <fpage>179</fpage>&#x2013;<lpage>199</lpage>. doi: <pub-id pub-id-type="doi">10.1007/S11336-011-9207-7</pub-id>, PMID: <pub-id pub-id-type="pmid">40115325</pub-id></citation></ref>
<ref id="ref13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Torre</surname> <given-names>J.</given-names></name> <name><surname>Douglas</surname> <given-names>J. A.</given-names></name></person-group> (<year>2004</year>). <article-title>Higher-order latent trait models for cognitive diagnosis</article-title>. <source>Psychometrika</source> <volume>69</volume>, <fpage>333</fpage>&#x2013;<lpage>353</lpage>. doi: <pub-id pub-id-type="doi">10.1007/BF02295640</pub-id></citation></ref>
<ref id="ref14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Torre</surname> <given-names>J.</given-names></name> <name><surname>Hong</surname> <given-names>Y.</given-names></name> <name><surname>Deng</surname> <given-names>W.</given-names></name></person-group> (<year>2010</year>). <article-title>Factors affecting the item parameter estimation and classification accuracy of the DINA model</article-title>. <source>J. Educ. Meas.</source> <volume>47</volume>, <fpage>227</fpage>&#x2013;<lpage>249</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1745-3984.2010.00110.x</pub-id></citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>de la Torre</surname> <given-names>J.</given-names></name> <name><surname>Minchen</surname> <given-names>N.</given-names></name></person-group> (<year>2014</year>). <article-title>Cognitively diagnostic assessments and the cognitive diagnosis model framework</article-title>. <source>Psicolog&#x00ED;a Educativa</source> <volume>20</volume>, <fpage>89</fpage>&#x2013;<lpage>97</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.pse.2014.11.001</pub-id></citation></ref>
<ref id="ref16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Finch</surname> <given-names>H.</given-names></name></person-group> (<year>2008</year>). <article-title>Estimation of item response theory parameters in the presence of missing data</article-title>. <source>J. Educ. Meas.</source> <volume>45</volume>, <fpage>225</fpage>&#x2013;<lpage>245</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.1745-3984.2008.00062.x</pub-id></citation></ref>
<ref id="ref9001"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fu</surname> <given-names>D. X.</given-names></name> <name><surname>Qin</surname> <given-names>C. Y.</given-names></name> <name><surname>Luo</surname> <given-names>Z. S.</given-names></name> <name><surname>Li</surname> <given-names>Y. J.</given-names></name> <name><surname>Yu</surname> <given-names>X. F.</given-names></name> <name><surname>Ye</surname> <given-names>Z. Y.</given-names></name></person-group> (<year>2025</year>). <article-title>Using regularized methods to validate Q-matrix in cognitive diagnostic assessment</article-title>. <source>J. Educ. Behav. stat.</source> <volume>50</volume>, <fpage>149</fpage>&#x2013;<lpage>179</lpage>. doi: <pub-id pub-id-type="doi">10.3102/10769986241240084</pub-id></citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gemici</surname> <given-names>S.</given-names></name> <name><surname>Bednarz</surname> <given-names>A.</given-names></name> <name><surname>Lim</surname> <given-names>P.</given-names></name></person-group> (<year>2012</year>). <article-title>A primer for handling missing values in the analysis of education and training data</article-title>. <source>Int. J. Train. Res.</source> <volume>10</volume>, <fpage>233</fpage>&#x2013;<lpage>250</lpage>. doi: <pub-id pub-id-type="doi">10.5172/ijtr.2012.10.3.233</pub-id></citation></ref>
<ref id="ref18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>George</surname> <given-names>A. C.</given-names></name> <name><surname>Robitzsch</surname> <given-names>A.</given-names></name> <name><surname>Kiefer</surname> <given-names>T.</given-names></name> <name><surname>Gro&#x00DF;</surname> <given-names>J.</given-names></name> <name><surname>&#x00DC;nl&#x00FC;</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). <article-title>The R package CDM for cognitive diagnosis models</article-title>. <source>J. Stat. Softw.</source> <volume>74</volume>, <fpage>1</fpage>&#x2013;<lpage>24</lpage>. doi: <pub-id pub-id-type="doi">10.18637/jss.v074.i02</pub-id></citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Glas</surname> <given-names>C.</given-names></name> <name><surname>Pimentel</surname> <given-names>J.</given-names></name></person-group> (<year>2008</year>). <article-title>Modeling nonignorable missing data in speeded tests</article-title>. <source>Educ. Psychol. Meas.</source> <volume>68</volume>, <fpage>907</fpage>&#x2013;<lpage>922</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0013164408315262</pub-id></citation></ref>
<ref id="ref20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Graham</surname> <given-names>J. W.</given-names></name> <name><surname>Olchowski</surname> <given-names>A. E.</given-names></name> <name><surname>Gilreath</surname> <given-names>T. D.</given-names></name></person-group> (<year>2007</year>). <article-title>How many imputations are really needed? Some practical clarifications of multiple imputation theory</article-title>. <source>Prev. Sci.</source> <volume>8</volume>, <fpage>206</fpage>&#x2013;<lpage>213</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s11121-007-0070-9</pub-id>, PMID: <pub-id pub-id-type="pmid">17549635</pub-id></citation></ref>
<ref id="ref21"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hair</surname> <given-names>J. F.</given-names></name> <name><surname>Black</surname> <given-names>W.</given-names></name> <name><surname>Babin</surname> <given-names>B. J.</given-names></name> <name><surname>Anderson</surname> <given-names>R. E.</given-names></name></person-group> (<year>2010</year>). <source>Multivariate data analysis</source>. <edition>7th</edition> Edn. <publisher-loc>Edinburgh, England</publisher-loc>: <publisher-name>Pearson</publisher-name>.</citation></ref>
<ref id="ref22"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hartz</surname> <given-names>S.</given-names></name> <name><surname>Roussos</surname> <given-names>L.</given-names></name> <name><surname>Stout</surname> <given-names>W.</given-names></name></person-group> (<year>2002</year>). <source>Skill diagnosis: Theory and practice [computer software user manual for arpeggio software]</source>. <publisher-loc>New Jersey, USA</publisher-loc>: <publisher-name>Educational Testing Service</publisher-name>.</citation></ref>
<ref id="ref23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hattie</surname> <given-names>J.</given-names></name> <name><surname>Timperley</surname> <given-names>H.</given-names></name></person-group> (<year>2007</year>). <article-title>The power of feedback</article-title>. <source>Rev. Educ. Res.</source> <volume>77</volume>, <fpage>81</fpage>&#x2013;<lpage>112</lpage>. doi: <pub-id pub-id-type="doi">10.3102/003465430298487</pub-id></citation></ref>
<ref id="ref24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holman</surname> <given-names>R.</given-names></name> <name><surname>Glas</surname> <given-names>C. A. W.</given-names></name></person-group> (<year>2005</year>). <article-title>Modelling non-ignorable missing-data mechanisms with item response theory models</article-title>. <source>British J. Math. Stat. Psychol.</source> <volume>58</volume>, <fpage>1</fpage>&#x2013;<lpage>17</lpage>. doi: <pub-id pub-id-type="doi">10.1111/j.2044-8317.2005.tb00312.x</pub-id>, PMID: <pub-id pub-id-type="pmid">15969835</pub-id></citation></ref>
<ref id="ref25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ketterlin-Geller</surname> <given-names>L. R.</given-names></name> <name><surname>Yovanoff</surname> <given-names>P.</given-names></name></person-group> (<year>2009</year>). <article-title>Diagnostic assessments in mathematics to support instructional decision making</article-title>. <source>Pract. Assess. Res. Eval.</source> <volume>14</volume>:<fpage>16</fpage>. doi: <pub-id pub-id-type="doi">10.7275/vxrk-3190</pub-id></citation></ref>
<ref id="ref26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kokla</surname> <given-names>M.</given-names></name> <name><surname>Virtanen</surname> <given-names>J.</given-names></name> <name><surname>Kolehmainen</surname> <given-names>M.</given-names></name> <name><surname>Paananen</surname> <given-names>J.</given-names></name> <name><surname>Hanhineva</surname> <given-names>K.</given-names></name></person-group> (<year>2019</year>). <article-title>Random forest-based imputation outperforms other methods for imputing LC-MS metabolomics data: a comparative study</article-title>. <source>BMC Bioinform.</source> <volume>20</volume>:<fpage>492</fpage>. doi: <pub-id pub-id-type="doi">10.1186/s12859-019-3110-0</pub-id>, PMID: <pub-id pub-id-type="pmid">31601178</pub-id></citation></ref>
<ref id="ref27"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Little</surname> <given-names>R.</given-names></name> <name><surname>Rubin</surname> <given-names>D. B.</given-names></name></person-group> (<year>2002</year>). <source>Statistical analysis with missing data</source>. <edition>2nd</edition> Edn. <publisher-loc>New York, USA</publisher-loc>: <publisher-name>Wiley</publisher-name>.</citation></ref>
<ref id="ref28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Gopalakrishnan</surname> <given-names>V.</given-names></name></person-group> (<year>2017</year>). <article-title>An overview and evaluation of recent machine learning imputation methods using cardiac imaging data</article-title>. <source>Data</source> <volume>2</volume>, <fpage>8</fpage>&#x2013;<lpage>23</lpage>. doi: <pub-id pub-id-type="doi">10.3390/data2010008</pub-id>, PMID: <pub-id pub-id-type="pmid">28243594</pub-id></citation></ref>
<ref id="ref29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>X.</given-names></name> <name><surname>Loken</surname> <given-names>E.</given-names></name></person-group> (<year>2025</year>). <article-title>The impact of missing data on parameter estimation: three examples in computerized adaptive testing</article-title>. <source>Educ. Psychol. Meas.</source> doi: <pub-id pub-id-type="doi">10.1177/00131644241306990</pub-id>, PMID: <pub-id pub-id-type="pmid">39780953</pub-id></citation></ref>
<ref id="ref30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Muth&#x00E9;n</surname> <given-names>B.</given-names></name> <name><surname>Asparouhov</surname> <given-names>T.</given-names></name> <name><surname>Hunter</surname> <given-names>A.</given-names></name> <name><surname>Leuchter</surname> <given-names>A.</given-names></name></person-group> (<year>2011</year>). <article-title>Growth modeling with non-ignorable dropout: alternative analyses of the STAR&#x002A;D antidepressant trial</article-title>. <source>Psychol. Methods</source> <volume>16</volume>, <fpage>17</fpage>&#x2013;<lpage>33</lpage>. doi: <pub-id pub-id-type="doi">10.1037/a0022634</pub-id>, PMID: <pub-id pub-id-type="pmid">21381817</pub-id></citation></ref>
<ref id="ref31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Newman</surname> <given-names>D. A.</given-names></name></person-group> (<year>2003</year>). <article-title>Longitudinal modeling with randomly and systematically missing data: a simulation of ad hoc, maximum likelihood, and multiple imputation techniques</article-title>. <source>Organ. Res. Methods</source> <volume>6</volume>, <fpage>328</fpage>&#x2013;<lpage>362</lpage>. doi: <pub-id pub-id-type="doi">10.1177/1094428103254673</pub-id></citation></ref>
<ref id="ref32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parsons</surname> <given-names>S. A.</given-names></name> <name><surname>Vaughn</surname> <given-names>M.</given-names></name> <name><surname>Scales</surname> <given-names>R. Q.</given-names></name> <name><surname>Gallagher</surname> <given-names>M. A.</given-names></name> <name><surname>Parsons</surname> <given-names>A. W.</given-names></name> <name><surname>Davis</surname> <given-names>S. G.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Teachers&#x2019; instructional adaptations: a research synthesis</article-title>. <source>Rev. Educ. Res.</source> <volume>88</volume>, <fpage>205</fpage>&#x2013;<lpage>242</lpage>. doi: <pub-id pub-id-type="doi">10.3102/0034654317743198</pub-id></citation></ref>
<ref id="ref33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peugh</surname> <given-names>J. L.</given-names></name> <name><surname>Enders</surname> <given-names>C. K.</given-names></name></person-group> (<year>2004</year>). <article-title>Missing data in educational research: a review of reporting practices and suggestions for improvement</article-title>. <source>Rev. Educ. Res.</source> <volume>74</volume>, <fpage>525</fpage>&#x2013;<lpage>556</lpage>. doi: <pub-id pub-id-type="doi">10.3102/00346543074004525</pub-id></citation></ref>
<ref id="ref34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Poleto</surname> <given-names>F.</given-names></name> <name><surname>Singer</surname> <given-names>J.</given-names></name> <name><surname>Paulino</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>Missing data mechanisms and their implications on the analysis of categorical data</article-title>. <source>Knowledge-Based Systems.</source> <volume>21</volume>, <fpage>31</fpage>&#x2013;<lpage>43</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s11222-009-9143-x</pub-id>, PMID: <pub-id pub-id-type="pmid">40115325</pub-id></citation></ref>
<ref id="ref9002"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Qin</surname> <given-names>C. Y.</given-names></name> <name><surname>Dong</surname> <given-names>S. H.</given-names></name> <name><surname>Yu</surname> <given-names>X. F.</given-names></name></person-group> (<year>2024</year>). <article-title>Exploration of polytomous-attribute Q-matrix validation in cognitive diagnostic assessment</article-title>. <source>Int. J. Train. Res.</source> <volume>292</volume>:<fpage>111577</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.knosys.2024.111577</pub-id></citation></ref>
<ref id="ref35"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Richman</surname> <given-names>M.</given-names></name> <name><surname>Trafalis</surname> <given-names>T.</given-names></name> <name><surname>Adrianto</surname> <given-names>I.</given-names></name></person-group> (<year>2009</year>). &#x201C;<article-title>Missing data imputation through machine learning algorithms</article-title>&#x201D; in <source>Artificial intelligence methods through machine learning algorithms</source>. eds. <person-group person-group-type="editor"><name><surname>Haupt</surname> <given-names>S. E.</given-names></name> <name><surname>Pasini</surname> <given-names>A.</given-names></name> <name><surname>Marzban</surname> <given-names>C.</given-names></name></person-group>. (<publisher-loc>Dordrecht</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>153</fpage>&#x2013;<lpage>169</lpage>.</citation></ref>
<ref id="ref36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Robitzsch</surname> <given-names>A.</given-names></name></person-group> (<year>2021</year>). <article-title>On the treatment of missing item responses in educational large-scale assessment data: an illustrative simulation study and a case study using PISA 2018 mathematics data</article-title>. <source>Europ. J. Invest. Health Psychol. Educ.</source> <volume>11</volume>, <fpage>1653</fpage>&#x2013;<lpage>1687</lpage>. doi: <pub-id pub-id-type="doi">10.3390/ejihpe11040117</pub-id>, PMID: <pub-id pub-id-type="pmid">34940395</pub-id></citation></ref>
<ref id="ref37"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Robitzsch</surname> <given-names>A.</given-names></name> <name><surname>Kiefer</surname> <given-names>T.</given-names></name> <name><surname>George</surname> <given-names>A. C.</given-names></name> <name><surname>Uenlue</surname> <given-names>A.</given-names></name></person-group> (<year>2017</year>). CDM: cognitive diagnosis modeling (version 6.0-101) [R package]. Available online at: <ext-link xlink:href="https://cran.r-project.org/web/packages/CDM/index.html" ext-link-type="uri">https://cran.r-project.org/web/packages/CDM/index.html</ext-link></citation></ref>
<ref id="ref38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schafer</surname> <given-names>J.</given-names></name> <name><surname>Graham</surname> <given-names>J. W.</given-names></name></person-group> (<year>2002</year>). <article-title>Missing data: our view of the state of the art</article-title>. <source>Psychol. Methods</source> <volume>7</volume>, <fpage>147</fpage>&#x2013;<lpage>177</lpage>. doi: <pub-id pub-id-type="doi">10.1037/1082-989X.7.2.147</pub-id></citation></ref>
<ref id="ref39"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Sia</surname> <given-names>C. J. L.</given-names></name> <name><surname>Lim</surname> <given-names>C. S.</given-names></name></person-group> (<year>2018</year>). &#x201C;<article-title>Cognitive diagnostic assessment: an alternative mode of assessment for learning</article-title>&#x201D; in <source>Classroom assessment in mathematics</source>. eds. <person-group person-group-type="editor"><name><surname>Thompson</surname> <given-names>D. R.</given-names></name> <name><surname>Burton</surname> <given-names>M.</given-names></name> <name><surname>Cusi</surname> <given-names>A.</given-names></name> <name><surname>Wright</surname> <given-names>D.</given-names></name></person-group>. (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>123</fpage>&#x2013;<lpage>137</lpage>.</citation></ref>
<ref id="ref40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Song</surname> <given-names>Z.</given-names></name> <name><surname>Guo</surname> <given-names>L.</given-names></name> <name><surname>Zheng</surname> <given-names>T.</given-names></name></person-group> (<year>2022</year>). <article-title>Comparison of missing data handling methods in cognitive diagnosis: zero replacement, multiple imputation and maximum likelihood estimation</article-title>. <source>Acta Psychol. Sin.</source> <volume>54</volume>, <fpage>426</fpage>&#x2013;<lpage>440</lpage>. [In Chinese]. doi: <pub-id pub-id-type="doi">10.3724/SP.J.1041.2022.00426</pub-id></citation></ref>
<ref id="ref41"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Stekhoven</surname> <given-names>D.</given-names></name></person-group> (<year>2013</year>). MissForest: nonparametric missing value imputation using random forest (version 1.4) [R package]. Available at: <ext-link xlink:href="https://cran.r-project.org/web/packages/missForest/index.html" ext-link-type="uri">https://cran.r-project.org/web/packages/missForest/index.html</ext-link></citation></ref>
<ref id="ref42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stekhoven</surname> <given-names>D.</given-names></name> <name><surname>B&#x00FC;ehlmann</surname> <given-names>P.</given-names></name></person-group> (<year>2012</year>). <article-title>MissForest&#x2013;nonparametric missing value imputation for mixed-type data</article-title>. <source>Bioinformatics</source> <volume>28</volume>, <fpage>112</fpage>&#x2013;<lpage>118</lpage>. doi: <pub-id pub-id-type="doi">10.1093/bioinformatics/btr597</pub-id>, PMID: <pub-id pub-id-type="pmid">22039212</pub-id></citation></ref>
<ref id="ref43"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Suresh</surname> <given-names>M.</given-names></name> <name><surname>Taib</surname> <given-names>R.</given-names></name> <name><surname>Zhao</surname> <given-names>Y.</given-names></name> <name><surname>Jin</surname> <given-names>W.</given-names></name></person-group> (<year>2019</year>). &#x201C;<article-title>Sharpening the BLADE: missing data imputation using supervised machine learning</article-title>&#x201D; in <source>AI 2019: advances in Artificial Intelligence</source>. eds. <person-group person-group-type="editor"><name><surname>Liu</surname> <given-names>J.</given-names></name> <name><surname>Bailey</surname> <given-names>J.</given-names></name></person-group>. (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>215</fpage>&#x2013;<lpage>227</lpage>.</citation></ref>
<ref id="ref44"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Tabachnick</surname> <given-names>B. G.</given-names></name> <name><surname>Fidell</surname> <given-names>L. S.</given-names></name></person-group> (<year>1989</year>). <source>Using multivariate statistics</source>. <edition>2nd</edition> Edn. <publisher-loc>New York, USA</publisher-loc>: <publisher-name>Harper &#x0026; R</publisher-name>.</citation></ref>
<ref id="ref45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname> <given-names>T.</given-names></name> <name><surname>Rajabi</surname> <given-names>E.</given-names></name></person-group> (<year>2021</year>). <article-title>A systematic review of machine learning-based missing value imputation techniques</article-title>. <source>Data Technol. Appl.</source> <volume>55</volume>, <fpage>558</fpage>&#x2013;<lpage>585</lpage>. doi: <pub-id pub-id-type="doi">10.1108/DTA-12-2020-0298</pub-id></citation></ref>
<ref id="ref46"><citation citation-type="book"><person-group person-group-type="author"><name><surname>van Buuren</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <source>Flexible imputation of missing data</source>. <edition>Second</edition> Edn: <publisher-name>Chapman and Hall/CRC</publisher-name>.</citation></ref>
<ref id="ref47"><citation citation-type="other"><person-group person-group-type="author"><name><surname>van Buuren</surname> <given-names>S.</given-names></name> <name><surname>Groothuis-Oudshoorn</surname> <given-names>K.</given-names></name> <name><surname>Robitzsch</surname> <given-names>A.</given-names></name> <name><surname>Vink</surname> <given-names>G.</given-names></name> <name><surname>Schouten</surname> <given-names>R.</given-names></name> <name><surname>Robitzsch</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2021</year>). Mice: multivariate imputation by chained equations (version 3.14.0) [R package]. Available online at: <ext-link xlink:href="https://cran.r-project.org/web/packages/mice/index.html" ext-link-type="uri">https://cran.r-project.org/web/packages/mice/index.html</ext-link></citation></ref>
<ref id="ref48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wolkowitz</surname> <given-names>A. A.</given-names></name> <name><surname>Skorupski</surname> <given-names>W. P.</given-names></name></person-group> (<year>2013</year>). <article-title>A method for imputing response options for missing data on multiple-choice assessments</article-title>. <source>Educ. Psychol. Meas.</source> <volume>73</volume>, <fpage>1036</fpage>&#x2013;<lpage>1053</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0013164413497016</pub-id></citation></ref>
<ref id="ref49"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Wothke</surname> <given-names>W.</given-names></name></person-group> (<year>2000</year>). &#x201C;<article-title>Longitudinal and multigroup modeling with missing data</article-title>&#x201D; in <source>Modeling longitudinal and multilevel data</source>. eds. <person-group person-group-type="editor"><name><surname>Little</surname> <given-names>T. D.</given-names></name> <name><surname>Schnabel</surname> <given-names>K. U.</given-names></name> <name><surname>Baumert</surname> <given-names>J.</given-names></name></person-group>. (<publisher-loc>New York</publisher-loc>: <publisher-name>Psychology Press</publisher-name>), <fpage>205</fpage>&#x2013;<lpage>224</lpage>.</citation></ref>
<ref id="ref50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>L. J.</given-names></name> <name><surname>Chang</surname> <given-names>K.-E.</given-names></name></person-group> (<year>2023</year>). <article-title>Effect of embedding a cognitive diagnosis into the adaptive dynamic assessment of spatial geometry learning</article-title>. <source>Interact. Learn. Environ.</source> <volume>31</volume>, <fpage>890</fpage>&#x2013;<lpage>907</lpage>. doi: <pub-id pub-id-type="doi">10.1080/10494820.2020.1815216</pub-id></citation></ref>
<ref id="ref9003"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yan</surname> <given-names>Y. Z.</given-names></name> <name><surname>Dong</surname> <given-names>S. H.</given-names></name> <name><surname>Yu</surname> <given-names>X. F.</given-names></name></person-group> (<year>2025</year>). <article-title>Using ordering theory to learn attribute hierarchies from examinees&#x2019; attribute profiles</article-title>. <source>J. Educ. Behav. stat.</source> doi: <pub-id pub-id-type="doi">10.3102/10769986241280389</pub-id></citation></ref>
<ref id="ref51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>You</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>M.</given-names></name> <name><surname>Xiao</surname> <given-names>Y.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name></person-group> (<year>2019</year>). <article-title>The feedback of the Chinese learning diagnosis system for personalized learning in classrooms</article-title>. <source>Front. Psychol.</source> <volume>10</volume>:<fpage>1751</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2019.01751</pub-id>, PMID: <pub-id pub-id-type="pmid">31611827</pub-id></citation></ref>
<ref id="ref52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>You</surname> <given-names>X.</given-names></name> <name><surname>Yang</surname> <given-names>J.</given-names></name> <name><surname>Qin</surname> <given-names>C.</given-names></name> <name><surname>Liu</surname> <given-names>H.</given-names></name></person-group> (<year>2023</year>). <article-title>Missing data analysis in cognitive diagnostic models: random forest threshold imputation method</article-title>. <source>Acta Psychologica Sinica</source> <volume>55</volume>, <fpage>1192</fpage>&#x2013;<lpage>1206</lpage>. doi: <pub-id pub-id-type="doi">10.3724/SP.J.1041.2023.01192</pub-id></citation></ref>
<ref id="ref53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhuchkova</surname> <given-names>S.</given-names></name> <name><surname>Rotmistrov</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). <article-title>How to choose an approach to handling missing categorical data: (un)expected findings from a simulated statistical experiment</article-title>. <source>Qual. Quant.</source> <volume>56</volume>, <fpage>1</fpage>&#x2013;<lpage>22</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s11135-021-01114-w</pub-id>, PMID: <pub-id pub-id-type="pmid">40115325</pub-id></citation></ref>
</ref-list>
</back>
</article>