<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Educ.</journal-id>
<journal-title>Frontiers in Education</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Educ.</abbrev-journal-title>
<issn pub-type="epub">2504-284X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/feduc.2022.884635</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Education</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Exploring First Semester Changes in Domain-Specific Critical Thinking</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Nielsen</surname> <given-names>Tine</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1492435/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Mart&#x00ED;nez-Garc&#x00ED;a</surname> <given-names>Inmaculada</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1855726/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Alastor</surname> <given-names>Enrique</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1703268/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Applied Research in Education and Social Sciences, UCL University College [UCL Erhvervsakademi og Professionsh&#x00F8;jskole]</institution>, <addr-line>Odense</addr-line>, <country>Denmark</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Psychology, University of Copenhagen</institution>, <addr-line>Copenhagen</addr-line>, <country>Denmark</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Didactics and School Organization, Faculty of Educational Sciences, University of M&#x00E1;laga</institution>, <addr-line>M&#x00E1;laga</addr-line>, <country>Spain</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Edith Braun, Justus-Liebig Universit&#x00E4;t, Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Kristina Walz, University of Giessen, Germany; Kari Nissinen, University of Jyv&#x00E4;skyl&#x00E4;, Finland</p></fn>
<corresp id="c001">&#x002A;Correspondence: Tine Nielsen, <email>tini@ucl.dk</email></corresp>
<fn fn-type="other" id="fn004"><p>This article was submitted to Higher Education, a section of the journal Frontiers in Education</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>24</day>
<month>06</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>7</volume>
<elocation-id>884635</elocation-id>
<history>
<date date-type="received">
<day>26</day>
<month>02</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>23</day>
<month>05</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2022 Nielsen, Mart&#x00ED;nez-Garc&#x00ED;a and Alastor.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Nielsen, Mart&#x00ED;nez-Garc&#x00ED;a and Alastor</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>Critical thinking is a common aim for higher education students, often described as general competencies to be acquired through entire programs as well as domain-specific skills to be acquired within subjects. The aim of the study was to investigate whether statistics-specific critical thinking changed from the start of the first semester to the start of the second semester of a two-semester statistics course, where the curriculum contains learning objectives and assessment criteria related to critical thinking. The brief version of the Critical Thinking scale (CTh) from the Motivated Strategies of Learning Questionnaire addresses the core aspects of critical thinking common to three different definitions of critical thinking. Students rate item statements in relation to their statistics course using a frequency scale: 1 = never, 2 = rarely, 3 = sometimes, 4 = often, and 5 = always. Participants were two consecutive year-cohorts of full-time Bachelor of Psychology students taking a two-semester long statistics course placed in the first two semesters. Data were collected in class with a paper-pencil survey 1 month into their first semester and again 1 month into the second. The study sample consisted of 336 students (n<sub>cohort 1</sub> = 166, n<sub>cohort 2</sub> = 170) at baseline, the follow-up was completed by 270 students with 165 students who could be matched to their baseline response. To investigate the measurement properties of the CTh scale, item analysis by the Rasch model was conducted on baseline data and subsequently on follow-up data. Change scores at the group level were calculated as the standardized effect size (ES) (i.e., the difference between baseline and follow-up scores relative to the standard deviation of the baseline scores). Data fitted Rasch models at baseline and follow-up. The targeting of the CTh scale to the student sample was excellent at both timepoints. Absolute individual changes on the CTh ranged from &#x2212;5.3 to 5.1 points, thus showing large individual changes in critical thinking. The overall standardized effect was small and negative (&#x2212;0.12), with some variation in student strata defined by, gender, age, perceived adequacy of math knowledge to learn statistics, and expectation to need statistics in future employment.</p>
</abstract>
<kwd-group>
<kwd>critical thinking</kwd>
<kwd>domain-specific</kwd>
<kwd>changes</kwd>
<kwd>higher education</kwd>
<kwd>Rasch model</kwd>
<kwd>statistics in psychology</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="62"/>
<page-count count="13"/>
<word-count count="11461"/>
</counts>
</article-meta>
</front>
<body>
<sec id="S1" sec-type="intro">
<title>Introduction</title>
<p>Critical thinking is a central concept in higher education, and as it has become relevant at both the individual and societal level it will not only improve students&#x2019; academic success but also the quality of education (<xref ref-type="bibr" rid="B52">Ren et al., 2020</xref>). The scientific literature has investigated the responsibility of educational institutions in training students in competencies that enables them to be future citizens ready to be an acting part of the society by making them critical thinkers (<xref ref-type="bibr" rid="B31">Kuhn, 1999</xref>; <xref ref-type="bibr" rid="B48">Paul and Elder, 2005</xref>). Thus, there has been a growing interest in the incorporation of critical thinking in the education curricula making critical thinking one of the main aims (<xref ref-type="bibr" rid="B33">Lau, 2015</xref>; <xref ref-type="bibr" rid="B35">McGuirk, 2021</xref>). With regard to the outcome of higher education, critical thinking is predominantly construed as generic, as it is described in terms of the competencies, students are expected to possess at the completion of a degree program (see <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 2</xref> for the competency description for a degree program in this study). However, in terms of incorporating critical thinking into higher education programs, this appears rarely to be in the form of independent courses teaching critical thinking. More often critical thinking seems to be implemented through teaching methods and specifically designed activities within subject courses thus construing critical thinking as domain-specific, or simply by using the term critical thinking in the curriculum description without clear definitions, program- or course-determined approaches to teaching toward critical thinking (c.f. <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 2</xref> for the current study). These two levels of implementing critical thinking in higher education tie to the discussion of critical thinking as generic/general or domain- and subject-specific.</p>
<p>There are different ways of understanding critical thinking that involve different implications for practice, so there is no consensus on a single definition (<xref ref-type="bibr" rid="B40">Moseley et al., 2005</xref>). Commonly in the literature, there is a distinction between thinking or cognitive skills and dispositional aspects of critical thinking, but as two sides of critical thinking and not separate positions. Two prevalent authors in the field, whose definitions or instruments many draw on in their research, are Facione and Halpern. <xref ref-type="bibr" rid="B11">Facione (1990)</xref> conducted a large Delphi study to narrow down the components of critical thinking, and the panel reached a consensus conceptualization of critical thinking as consisting of two dimensions: cognitive skills and affective dispositions. He further defines critical thinking as &#x201C;<italic>the purposeful, self-regulatory judgment which results in interpretation, analysis, evaluation, and inference, as well as explanation of the evidential, conceptual, methodological, criteriological, or contextual considerations upon which that judgment is based</italic>&#x201D; (<xref ref-type="bibr" rid="B11">Facione, 1990</xref>, p. 3). <xref ref-type="bibr" rid="B16">Halpern (2014)</xref> understand critical thinking as &#x201C;<italic>the deliberate use of skills and strategies that increase the probability of a desirable outcome</italic>&#x201D; (p. 450) and that critical thinking is involved in &#x201C;<italic>solving problems, formulating inferences, calculating likelihoods, and making decisions</italic>&#x201D; (p. 8), and thus also refer to both skills and dispositions. Facione and Halpern also make the distinction of critical thinking skills being assessed as the abilities to demonstrate critical skills in tasks or assignments, while critical thinking dispositions are assessed by self-report instruments. However, in the empirical studies in the field, there is no consensus on this. Thus, studies using self-report instruments have claimed to assess critical thinking skills (e.g., <xref ref-type="bibr" rid="B54">Ricketts and Rudd, 2005</xref>), studies employing critical thinking dispositions self-report instrument has claimed to assess critical thinking skills with this (e.g., <xref ref-type="bibr" rid="B20">Kanbay et al., 2017</xref>), and lastly, studies claiming to assess abilities are doing this through to some degree subjective teacher evaluations using short rubrics<sup><xref ref-type="fn" rid="footnote1">1</xref></sup> (e.g., <xref ref-type="bibr" rid="B50">Ralston and Bays, 2015</xref>). See the following sections for more details on these studies. At a general level, there appears to be a conceptual shift toward using the term skills and then differentiating between assessed and self-report. Thus, in the remainder of this article, we simply use the term critical thinking skills, while recognizing that we use a self-report instrument to assess this, thus assessing students&#x2019; perceptions of their critical thinking skills.</p>
<sec id="S1.SS1">
<title>Critical Thinking as Generic/General Skills or Domain-Specific Skills</title>
<p>One particularly pertinent discussion in the field is whether critical thinking skills are generic/general skills or whether they are domain-/subject-specific (<xref ref-type="bibr" rid="B61">Tiruneh et al., 2017</xref>).</p>
<p>The view of critical thinking as a generic set of skills applicable across domains is based on the common features of critical thinking tasks across a wide variety of domains (e.g., <xref ref-type="bibr" rid="B14">Halpern, 1998</xref>; <xref ref-type="bibr" rid="B31">Kuhn, 1999</xref>). While <xref ref-type="bibr" rid="B14">Halpern (1998)</xref> is a proponent of critical thinking as a set of generic skills, her &#x201C;Four-Part Model for Enhancing Critical Thinking&#x201D; to teach critical thinking acknowledges that critical thinking takes place within a knowledge domain and should be taught within this domain. However, this does not mean that Halpern considers critical thinking as domain-specific, but rather that the domain is the learning context for skills, which can be applied more universally across domains after being mastered. The view of critical thinking as domain-specific emphasizes that different domains have different criteria relating to critical thinking and thus the skills required inevitably vary across domains (e.g., <xref ref-type="bibr" rid="B36">McPeck, 1992</xref>; <xref ref-type="bibr" rid="B39">Moore, 2011</xref>). The issue is more likely not an either/or issue, but an issue of both in combination, as content and critical thinking tasks and skills might differ across domains as they are invariably linked to the domain-knowledge, but there are also commonalities across domains, due to the cognitive processes involved in critical thinking (e.g., <xref ref-type="bibr" rid="B2">Bailin et al., 1999</xref>). As such, critical thinking may be regarded as a set of domain-specific skills of which some also belong to the set of generic or general critical thinking skills. Whether there is in fact a transfer effect from the domain-specific learning of critical thinking skills to other domains or adding on to generic critical thinking skills, as suggested in some of the literature, is another pertinent issue in the critical thinking field. However, this is not a central topic in the current study, as we are concerned with domain-specific critical thinking skills and their development in the first part of university studies.</p>
</sec>
<sec id="S1.SS2">
<title>Critical Thinking Skills in First-Year University Students and Their Development</title>
<p>First-year university students are particularly interesting when it comes to studying critical thinking skills and how they develop, as many higher education teachers and researchers concur that &#x201C;first-year students often enter higher education without the ability to use higher-order thinking skills to master their studies&#x201D; (<xref ref-type="bibr" rid="B10">De Jager, 2012</xref>, p. 1374). Much of the research into critical thinking skills of first-year university students and the development of critical thinking skills during university, has been focused on the development of teaching models and methods to enhance critical thinking, assessing their effects, and comparing how different teaching methods affect the critical thinking of the students. One example is <xref ref-type="bibr" rid="B55">Saenab et al. (2021)</xref> who developed the ReCODE model (Reading, Connecting, Observing, Discussing, and Evaluating) to improve first-year Biology students&#x2019; acquisition of critical thinking. The outcome was positive with regard to enhancing students&#x2019; critical thinking over the course of 3 months, however, it was only used on 38 students. <xref ref-type="bibr" rid="B60">Thomas (2011)</xref> developed the &#x201C;Embedding generic skills in a business curriculum&#x201D;-program consisting of activities and assessment resources for university teachers to develop critical thinking skills with their first-year students, and emphasize that these skills should be developed in the first year. The suggestions were not tested. On a similar note, <xref ref-type="bibr" rid="B18">Hammer and Green (2011)</xref> redesigned a written assessment in the form of a case-based business report for first-year management students in order to facilitate better development of critical thinking as this was part of the requirements for passing. The authors used the percentage of passing students to evaluate the success of the redesign &#x2013; this went from 78.8 to 84% &#x2013; but details of the teachers&#x2019; assessments were not provided, and thus how critical thinking was assessed was not divulged beyond its being a teacher assessment. <xref ref-type="bibr" rid="B50">Ralston and Bays (2015)</xref>, on the other hand, found that Engineering students&#x2019; (<italic>n</italic> = 182) critical thinking increased during the course of their undergraduate studies, which had purposely been designed to incorporate assignments focused on critical thinking. A four-point, holistic critical thinking rubric was designed for the purpose of the study to evaluate domain-specific critical thinking. As a final example, <xref ref-type="bibr" rid="B61">Tiruneh et al. (2017)</xref> compared both domain-specific and general critical thinking skills for first-year students in an introductory Physics course (<italic>n</italic> = 143), using the Halpern Critical Thinking Assessment (HCTA; <xref ref-type="bibr" rid="B17">Halpern, 2015</xref>); a standardized scenario-based instrument with 25 everyday scenarios assessing general critical thinking skills by means of computerized scoring in combination with trained grader scoring. The study compared three different instructional designs and found that students in what they termed immersion and infusion designs (intervention) outperformed students in the control design significantly with regard to domain-specific critical thinking as well as course achievement. However, neither of the intervention designs fostered the acquisition of general critical thinking skills.</p>
<p>It is evident that there is an abundance of studies on various methods to enhance students&#x2019; critical thinking skills in the first year and over the course of university studies. However, critical thinking in first-year students has also been investigated with regard to its &#x201C;natural&#x201D; development over time (i.e., no particular design implemented to enhance critical thinking) and how critical thinking is related to other psychological and educational constructs, e.g., emotional intelligence (<xref ref-type="bibr" rid="B21">Kaya et al., 2017</xref>; <xref ref-type="bibr" rid="B56">Sahanowas and Halder, 2020</xref>) and perceived academic control (<xref ref-type="bibr" rid="B58">Stupnisky et al., 2008</xref>). <xref ref-type="bibr" rid="B56">Sahanowas and Halder (2020)</xref> used the University of Florida - Engagement, Cognitive Maturity and Innovativeness assessment (UF-EMI, <xref ref-type="bibr" rid="B54">Ricketts and Rudd, 2005</xref>), which is a self-report instrument measuring generic critical thinking, in a cross-sectional study with the first-year students in various disciplines (<italic>n</italic> = 500) found that emotional intelligence was positively related to critical thinking. <xref ref-type="bibr" rid="B21">Kaya et al. (2017)</xref> in their study of Nursing students find that they possess a low level of critical thinking at the start of the first academic year, and while critical thinking was positively associated with emotional intelligence at the start, neither developed over the course of the year. <xref ref-type="bibr" rid="B21">Kaya et al. (2017)</xref> made use of a Turkish translation of the California Critical Thinking Disposition Scale (<xref ref-type="bibr" rid="B12">Facione et al., 1998</xref>), which is a self-report measure of generic critical thinking. <xref ref-type="bibr" rid="B58">Stupnisky et al. (2008)</xref> conducted a longitudinal study with Psychology students (<italic>n</italic> = 1,196) with the Motivated Strategies for Learning Questionnaire (MSLQ; <xref ref-type="bibr" rid="B49">Pintrich et al., 1991</xref>), which contains a domain-specific self-report critical thinking scale, and found a reciprocal relationship between critical thinking and perceived academic control, so that students perceived academic control 1 month into the first year predicted critical thinking 6 months later, while critical thinking 1 month into the first year also predicted perceived academic control 6 months later. Another example, of a study on the &#x201C;natural&#x201D; development of critical thinking over time, is <xref ref-type="bibr" rid="B20">Kanbay et al. (2017)</xref> who assessed critical thinking in Nursing students (<italic>n</italic> = 46), with the (California Critical Thinking Disposition Scale, see above) at the start of the first year and at the end of the second, third and fourth years of study. Their results revealed a medium level of critical thinking at the beginning and no improvement in critical thinking across the four-year period of time, not statistically and not at the absolute level. In a qualitative study, <xref ref-type="bibr" rid="B47">&#x00D6;zel&#x00E7;i and &#x00C7;al&#x0131;&#x015F;kan, 2019</xref>, interviewed 11 teacher candidates two times about their critical thinking. The results showed no change in self-perception in critical thinking from the first to the fourth year of study.</p>
</sec>
<sec id="S1.SS3">
<title>Development of Students&#x2019; Statistics-Related Critical Thinking</title>
<p>Turning to the domain-specific concept of statistics-related critical thinking, several studies have been conducted. <xref ref-type="bibr" rid="B6">Bensley et al. (2010)</xref> studied the acquisition of critical thinking skills in instructional different groups of students enrolled in a research methods course by the psychological Critical Thinking Test (<xref ref-type="bibr" rid="B5">Bensley and Baxter, 2006</xref>), which is a domain-specific multiple-choice test. More specifically they compared the acquisition of critical thinking skills for analyses of psychological arguments students who had critical thinking skills infused directly into their course with students where this was not the case. The infusing of critical thinking skills consisted of using a methodologically oriented textbook as well as a critical thinking textbook, as well as examples and practice of critical thinking through exercises and corrective feedback. The non-infusing courses used another textbook that embedded statics instruction within a research design and methodology discussion. The study found that the group of students who had received instruction aimed explicitly at critical thinking showed significantly greater gains in argument analysis skills than the students who had received no explicit critical thinking instruction. Contrary to this, <xref ref-type="bibr" rid="B13">Goode et al. (2018)</xref> compared how critical thinking was expressed in early and late writing assignments using specific critical thinking learning objectives recommended by the American Psychological Association (i.e., effective use of critical thinking, use of reasoning in argument, and problem-solving effectiveness) for psychology students assigned at random to a face-to-face and a blended learning versions of a statistics and research design course. <xref ref-type="bibr" rid="B13">Goode et al. (2018)</xref> developed a domain-specific scoring rubric with three areas being scored from &#x2018;does not meet expectations&#x2019; to &#x2018;far exceeds expectations&#x2019; for the teachers&#x2019; assessment of critical thinking. The difference between the two instructional designs was simply that the blended learning version of the course was taught as a 50/50 flipped hybrid of the face-to-face course. Thus, in the blended learning hybrid, students attended face-to-face classes once a week rather than two, and for the second weekly class, they viewed online lectures and worked with other materials outside of the class setting. There was no significant difference in the development of critical thinking between students in the face-to-face and students in the blended learning design. However, an instructor effect was found, showing that student assigned to classes by two instructors increased their critical thinking significantly more than students assigned to two other instructors, and for one instructor both randomly assigned groups of students had a decline in critical thinking during the course. <xref ref-type="bibr" rid="B57">Setambah et al. (2019)</xref> evaluated how the critical thinking skills of teacher preparation students in their second semester developed in a basic statistics course employing Adventure-Based Learning (ABL) compared to a control group not receiving ABL. They found that after 10 weeks there was no significant difference, while there was weak evidence for a difference favoring the experimental groups after a further 8 weeks. Lastly, <xref ref-type="bibr" rid="B7">Cheng et al. (2018)</xref> showed how the critical thinking of undergraduate students taking introductory statistics classes within various degree programs increased across semester-long courses incorporating assignments, in-class discussion, and Socratic dialog. <xref ref-type="bibr" rid="B7">Cheng et al. (2018)</xref> designed a domain-specific rubric with four dimensions related to critical thinking to be scored by domain-specialists and as well as a student self-report survey to assess students&#x2019; perceptions of improvement in critical thinking.</p>
<p>With regard to the domain-specific statistics-related critical thinking, there appears to be a lack of studies on the &#x201C;natural&#x201D; development over time, i.e., without implementation of any specific teaching methodology. As <xref ref-type="bibr" rid="B61">Tiruneh et al. (2017)</xref> suggest that &#x201C;<italic>meaningful instruction in every subject domain inherently comprises the development of CT skills, and therefore, proficiency in CT skills can be achieved as students construct knowledge of a subject-matter domain without any explicit emphasis on the teaching of general CT skills during instruction</italic>&#x201D; (p. 1067), such studies might contribute to the knowledge of the &#x201C;natural&#x201D; development of statistics-related critical thinking.</p>
</sec>
<sec id="S1.SS4">
<title>The Current Study</title>
<p>Drawing on the previous research, the present study intends to contribute to the field by studying specifically the development of statistics-related critical thinking in first-year psychology undergraduate students in a Danish university, where no particular emphasis on teaching critical thinking skills is reflected in the curriculum, but rather implicit references are given and critical thinking is mentioned in the assessment criteria (c.f. <xref ref-type="bibr" rid="B39">Moore, 2011</xref>). The primary aim of the study is to investigate whether statistics-related critical thinking changes from the start of the first semester to the start of the second semester of a two-semester-long statistics course, where the curriculum contains learning objectives implying critical thinking and assessment criteria explicitly requiring critical thinking.</p>
<p>At the overall level, we expected all students to have an increase in critical thinking, based on the general goal and performance orientation of these students<sup><xref ref-type="fn" rid="footnote2">2</xref></sup> in combination with the implicit mention in the learning objectives and particularly the explicit mention of critical thinking in the assessment criteria for the first semester of the course. However, such an overall change might mask differentiated subgroup changes, and subgroup changes in opposite directions might also result in no change at the overall level. With regard to subgroups, we expected that the overall change in critical thinking would differ for subgroups of students dependent on their baseline perception of the adequacy of their own mathematical knowledge for learning statistics as well as their expectation to need statistics in their future employment, as would the students&#x2019; baseline level of critical thinking. Specifically, we expected:</p>
<p>Students who perceived their mathematical knowledge to be inadequate for learning statistics were less inclined toward critical thinking at baseline compared to students who perceived they had an adequate level of mathematical knowledge, due to their lack of insight into the field. We had no set expectation with regard to the direction of difference in the change in critical thinking dependent on the perception of the adequacy of mathematical knowledge, as this could go both ways. For some students, a perceived lack in the prerequisite knowledge required would be a motivating factor making them engage more and thus possibly increase more in critical thinking compared to the other student group. But the opposite is also likely for some students, i.e., perceiving a lack of prerequisite knowledge might be further dis-engaging leading to a smaller increase in critical thinking or even a decrease for this subgroup. In addition, students perceiving adequacy in prerequisite knowledge could be expected to engage more due to their insight and thus increase more in critical thinking than students perceiving their pre-requisite knowledge as inadequate.</p>
<p>Students who did not believe they would need statistics in their future employment were less inclined toward critical thinking at baseline and compared to students believing they would be needing statistics, as they would not be as likely to engage in the cognitively demanding critical because it would be perceived as unnecessary. We would not expect that students&#x2019; beliefs about their future employment to change much over the cause of the first semester of study, and thus we expected that the largest increase in critical thinking would be seen for the students believing to need statistics in the future, as they would engage more in the subject.</p>
<p>The secondary aim is to investigate further the psychometric properties of the brief version of the Motivated Strategies of Learning Questionnaire critical thinking scale (MSLQ; <xref ref-type="bibr" rid="B49">Pintrich et al., 1991</xref>) resulting from a recent validation study, which critically considered the content and construct validity of this scale as well as its cross-cultural validity (<xref ref-type="bibr" rid="B45">Nielsen et al., 2021</xref>). As this brief critical thinking scale (CTh) was shown to fit the Rasch model both with a Danish and a Spanish sample of psychology students and have reliability for the Danish sample at the level of those obtained with the original scale, we found the CTh scale to be a good candidate for the current study.</p>
</sec>
</sec>
<sec id="S2" sec-type="materials|methods">
<title>Materials and Methods</title>
<sec id="S2.SS1">
<title>Instrument</title>
<p>The Critical Thinking scale (CTh) employed in the present study is a brief version of the critical thinking scale from the Motivated Strategies of Learning Questionnaire (MSLQ; <xref ref-type="bibr" rid="B49">Pintrich et al., 1991</xref>) resulting from a recent validation study, which critically considered the content and construct validity of this scale as well as its cross-cultural validity (<xref ref-type="bibr" rid="B45">Nielsen et al., 2021</xref>). The MSLQ is a multi-scale questionnaire intended to measure aspects of students&#x2019; motivational orientation and learning strategies in high school and higher education (<xref ref-type="bibr" rid="B49">Pintrich et al., 1991</xref>). One of the scales included in the MSLQ is a five-item course-specific critical thinking scale with a seven-point response scale anchored for meaning only at the extremes. Of all the short scales in the MSLQ, the critical thinking scale was originally reported as having one of the highest reliabilities (Cronbach&#x2019;s alpha 0.8) with the development sample of 380 Midwestern college students (<xref ref-type="bibr" rid="B49">Pintrich et al., 1991</xref>). More recently, <xref ref-type="bibr" rid="B19">Holland et al. (2018)</xref> in their meta-analysis found the reliability of the critical thinking scale to be similar across 344 samples (<italic>N</italic> = 27,619) stemming from 32 countries and 14 languages (mean Cronbach&#x2019;s alpha 0.78).</p>
<p>In their study of the cross-cultural validity of the critical thinking scale from the MSLQ, <xref ref-type="bibr" rid="B45">Nielsen et al. (2021)</xref> analyzed thoroughly the content validity of the scale and found that only three items actually measured critical thinking. Content validity was considered both with a theoretically based approach, i.e., analysis of the item content in relation to three different and prevalent definitions of critical thinking (<xref ref-type="bibr" rid="B11">Facione, 1990</xref>; <xref ref-type="bibr" rid="B49">Pintrich et al., 1991</xref>; <xref ref-type="bibr" rid="B15">Halpern, 2003</xref>), and a statistical and psychometric approach, i.e., analysis of local independence and dimensionality (<xref ref-type="bibr" rid="B27">Kreiner and Christensen, 2004</xref>). Both approaches reached the conclusion that two items (the same) should be eliminated in order to improve content validity by eliminating construct contamination.</p>
<p>In addition to eliminating two items, <xref ref-type="bibr" rid="B45">Nielsen et al. (2021)</xref> also employed an adapted five-point response scale with meaning anchors for all categories with the brief CTh scale in order to pre-assign the meaning that respondents should infer from the categories and thus prevent a random assignment of meaning to a row of numbers, which would affect the validity in interpretation and reliability (<xref ref-type="bibr" rid="B30">Krosnick and Fabrigar, 1997</xref>, <xref ref-type="bibr" rid="B34">Maitland, 2009</xref>, <xref ref-type="bibr" rid="B37">Menold et al., 2014</xref>). This approach was further supported empirically in previous validity studies of other scales from the MSLQ, e.g., <xref ref-type="bibr" rid="B42">Nielsen (2018)</xref> with the motivation scales; <xref ref-type="bibr" rid="B43">Nielsen (2020)</xref>, <xref ref-type="bibr" rid="B44">Nielsen et al. (2017</xref>, <xref ref-type="bibr" rid="B46">2022)</xref> with the self-efficacy scale, where a similar adaption of the response scale had no noteworthy effect on the reliability of the scales compared to the original version.</p>
<p>The three-item CTh scale with the adapted response scale (see below) resulting from the study by <xref ref-type="bibr" rid="B45">Nielsen et al. (2021)</xref> had reliability at the level of the original five-item scale with seven response categories for a Danish sample of psychology students (0.82), while slightly lower for a Spanish sample of psychology students (0.73).</p>
<p>The items of the brief CTh scale employed in the present study address the purposeful and inquiring aspect of CTh common to three different definitions of critical thinking (<xref ref-type="bibr" rid="B11">Facione, 1990</xref>; <xref ref-type="bibr" rid="B49">Pintrich et al., 1991</xref>; <xref ref-type="bibr" rid="B15">Halpern, 2003</xref>): how often the student questions things and decide about them (item: I often find myself questioning things I hear or read in this statistics course to decide if I find them convincing); how the student decides about a theory, interpretation or conclusion (item: when a theory, interpretation or conclusion is presented in the statistics course or in the readings, I try to decide if there is good supporting evidence); how the student looks for alternatives (item: whenever I read or hear an assertion or conclusion in this statistics course, I think about possible alternatives) (see also <xref ref-type="supplementary-material" rid="DS1">Supplementary Table A1</xref> in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref>). Thus, the CTh scale does not cover all aspects of critical thinking, but it covers the core aspects, and more importantly, it is not &#x201C;contaminated&#x201D; by items not measuring critical thinking (<xref ref-type="bibr" rid="B45">Nielsen et al., 2021</xref>). As with the MSLQ, students rate how they feel that the item statements in the brief CTh scale describe them in relation to a specified course (in this case statistics) in terms of frequency of the thinking described in the items: 1 = never, 2 = rarely, 3 = sometimes, 4 = often, and 5 = always. The Danish item texts can be seen in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref> with the English equivalents (<xref ref-type="supplementary-material" rid="DS1">Supplementary Table A1</xref>). In this article, CTh items are referenced with their original order from the MSLQ to facilitate comparison to other studies with item-level data.</p>
<p>At baseline, students also provided information on gender and age, whether students perceived their mathematical knowledge to be adequate for learning statistics, and whether they believed they would need statistics in their future employment.</p>
</sec>
<sec id="S2.SS2">
<title>Participants and Data Collection</title>
<p>Participants were two consecutive year-cohorts of first-semester students enrolled in a full-time Bachelor of Psychology program in a major Danish university. The students were all taking a two-semester-long statistics course placed in the first two semesters of the bachelor&#x2019;s program. The course consists of weekly lectures and weakly exercise classes. The learning objectives for the first semester of the course contain implicit references to critical thinking (see <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 2</xref>). The course has a separate exam in each of the two semesters, and the first-semester exam is an on-campus written exam assessed as pass/fail using a set of specified criteria. As part of these criteria are both implicit and explicit references to critical thinking (see <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 2</xref>).</p>
<p>The students completed the CTh scale as part of a larger survey 1 month into their first semester of the course and again 1 month into their second semester of the statistics course. Data were collected in class with a paper-pencil survey. The data collections were arranged with the responsible lecturer before the start of the course. Students were informed ahead of the lecture that the data collection would take place and that it was voluntary to complete the survey. At the point of the data collection, students were informed of the purpose of the overall study, that participation was voluntary, that their data would be treated according to the prevailing data protection regulations, and that they could ask to have their data deleted up to a specified point in time where they would be anonymized. In addition, students were provided with a written information sheet providing the same information as well as contact information for the responsible researcher.</p>
<p>The study sample consisted of 336 students at baseline (n<sub>cohort 1</sub> = 166, n<sub>cohort 2</sub> = 170), while the follow-up was completed by 270 students with 165 students who could be matched to their baseline response. The matching rate was determined by circumstances related to student enrollment (drop-out and new enrollment), the matching design (asking students for their student ID in handwriting if they wanted to participate again), and chance (students present in the lecture where data were collected). Thus, as various factors contributed to the missingness of data at follow-up, it could not with any certainty be determined whether data were missing at random or not, though the number of contributing factors makes it more likely that they were missing at random. Likewise, the missingness could not be considered in terms of selection bias, due to the external contributing factors. The mean age of the students at baseline was 22.7 years (SD 4.99) and 81% of the 336 students in the baseline sample identified as female, which is a close match to the official gender distribution of the student admitted to the two particular year-cohorts was 81.3% female students (<xref ref-type="bibr" rid="B38">Ministry of Higher Education and Science, 2021</xref>). The gender distribution did not change at follow-up, i.e., 82% of the 165 students in the follow-up sample identified as female.</p>
</sec>
<sec id="S2.SS3">
<title>Statistical Analyses</title>
<p>First, we conducted item analysis using the Rasch measurement model (RM; <xref ref-type="bibr" rid="B51">Rasch, 1960</xref>) to establish the psychometric properties of the CTh scale both at baseline and at follow-up. The Rasch model was chosen, as <xref ref-type="bibr" rid="B45">Nielsen et al. (2021)</xref> have shown the CTh scale to fit the Rasch model in both a Danish and a Spanish sample. Second, we assessed the changes in CTh scores from the start of the first to the start of the second semester as standardized effect sizes.</p>
<sec id="S2.SS3.SSS1">
<title>Item Analyses by Rasch Models</title>
<p>To investigate the measurement properties of the CTh scale (the secondary issue of the study), item analysis by the Rasch model was conducted first on the baseline sample and subsequently in the follow-up sample to confirm the results. The RM provides optimal measurement properties of scales fitting it (<xref ref-type="bibr" rid="B24">Kreiner, 2007</xref>, <xref ref-type="bibr" rid="B26">2013</xref>). These properties include:</p>
<list list-type="simple">
<list-item>
<label>1.</label>
<p><italic>Unidimensionality</italic> &#x2013; the scale measures a single latent construct (Critical Thinking).</p>
</list-item>
<list-item>
<label>2.</label>
<p><italic>Local independence of items</italic> (no LD) &#x2013; responses to a CTh item depends only on the level of Critical Thinking and not on responses to any of the other items on the scale.</p>
</list-item>
<list-item>
<label>3.</label>
<p><italic>Optimal reliability</italic>, as items are locally independent.</p>
</list-item>
<list-item>
<label>4.</label>
<p><italic>Absence of differential item functioning</italic> (no DIF) &#x2013; responses to a CTh item depends only on the level of critical thinking and not on persons&#x2019; membership of subgroups such as gender, age, etc.</p>
</list-item>
<list-item>
<label>5.</label>
<p><italic>Homogeneity</italic> &#x2013; the rank order of the item parameters/item difficulty is the same for all persons.</p>
</list-item>
<list-item>
<label>6.</label>
<p><italic>Score sufficiency</italic> &#x2013; the sum score is a sufficient statistic for the person&#x2019;s parameter estimates of Critical Thinking.</p>
</list-item>
</list>
<p>Homogeneity and sufficiency are properties only provided by the Rasch model, not any other IRT model. The property of sufficiency is particularly desirable when using the summed raw score of a scale, as it is the usual case with the CTh scale. However, fit to the Rasch model facilitates the use of the person parameter estimates resulting from the measurement model (sometimes termed Rasch-scores), and thus either these or the raw scores can be used in subsequent analysis, as preferred by the individual researcher for their specific purpose.</p>
<p>The overall tests of global homogeneity by comparison of item parameters in low and high scoring groups and overall tests of invariance were conducted as overall tests of fit using <xref ref-type="bibr" rid="B1">Andersen (1973)</xref> conditional likelihood ratio test (CLR). The fit of individual items to the Rasch model was tested by comparing the observed item-rest-score correlations with the expected item-rest-score correlations under the RM (<xref ref-type="bibr" rid="B25">Kreiner, 2011</xref>). Local independence of items and the assumption of no DIF were tested using <xref ref-type="bibr" rid="B22">Kelderman (1984)</xref> conditional likelihood ratio test. DIF was tested in relation to five background variables year cohort (1, 2), gender (female and male), median-split age groups (21 years and younger, 22 years and older), as well as baseline perception of the adequacy of mathematical knowledge to learn statistics (not adequate, adequate), and baseline expectancy to need statistics in future employment (yes, maybe, and no).</p>
<p>Reliability was calculated as Cronbach&#x2019;s alpha (<xref ref-type="bibr" rid="B9">Cronbach, 1951</xref>). Targeting (whether items provide information in the area of the scale where the sample population is located) was assessed graphically by item maps as well as numerically by two target indices (<xref ref-type="bibr" rid="B29">Kreiner and Christensen, 2013</xref>): the test information target index (the mean test information divided by the maximum test information for theta, and the root mean squared error (RMSE) target index (the minimum standard error of measurement divided by the mean standard error of measurement for theta). Both indices should preferably have a value close to one, as this would indicate the degree to which maximum information and minimum measurement error were obtained, respectively. The target of the observed score and the standard error of measurement (SEM) was also calculated. Items maps are plots of the distribution of the item threshold locations against weighted maximum likelihood estimations of the person parameter locations as well as the person parameters for the population (assuming a normal distribution) and the information function.</p>
<p>Critical values were adjusted for false discovery rate (FDR) arising from conducting multiple statistical tests (i.e., controlling type I errors), whenever appropriate (<xref ref-type="bibr" rid="B4">Benjamini and Hochberg, 1995</xref>). As recommended by <xref ref-type="bibr" rid="B8">Cox et al. (1977)</xref>, we distinguished between weak (<italic>p</italic> &#x003C; 0.05), moderate (<italic>p</italic> &#x003C; 0.01), and strong (<italic>p</italic> &#x003C; 0.001) evidence against the model, rather than applying a deterministic 5% critical limit for <italic>p</italic>-Values.</p>
</sec>
<sec id="S2.SS3.SSS2">
<title>Analysis of Differences at Baseline and Analysis of Change</title>
<p>To investigate the primary issue of the study, namely changes in critical thinking, the person parameter estimates resulting from the Rasch models, which have equal distance between any two values, were rescaled to the score range of the instrument and used for baseline differences and in the analysis of change. Differences in mean scores for subgroups of students at baseline were tested within the framework of multiple analyses of variance framework to be able to include grouping variables with more than two categories and test for interaction effects. The change was tested using a paired samples <italic>t</italic>-test approach and change scores at the group level were calculated as the standardized effect size (ES) (i.e., the difference between baseline and follow-up scores relative to the standard deviation of the baseline scores) (<xref ref-type="bibr" rid="B32">Lakens, 2013</xref>; <xref ref-type="bibr" rid="B3">Beauchamp et al., 2015</xref>). Subgroups of students were defined by our primary independent variables of interest, i.e., perception of the adequacy of their own mathematical knowledge for learning statistics as well as the students&#x2019; expectations to need statistics in their future employment. As secondary subgroupings, we included gender and age groups, in order to show whether there were any effects of these on baseline levels of critical thinking or on changes that might be imposed on the primary issues.</p>
</sec>
<sec id="S2.SS3.SSS3">
<title>Software</title>
<p>The item analyses by Rasch models were conducted using DIGRAM (<xref ref-type="bibr" rid="B23">Kreiner, 2003</xref>; <xref ref-type="bibr" rid="B28">Kreiner and Nielsen, 2013</xref>), while R was used to produce the item maps. Analyses of variance and <italic>t</italic>-tests were conducted using SPSS. Effect sizes were calculated using Excel.</p>
</sec>
</sec>
</sec>
<sec id="S3" sec-type="results">
<title>Results</title>
<sec id="S3.SS1">
<title>Psychometric Properties at Baseline and Follow-Up</title>
<p>Results of the item analyses (the secondary research issue) showed that the baseline data fitted the Rasch model, and this was also the case with the follow-up data. Thus, there was no evidence against global homogeneity or invariance (<xref ref-type="table" rid="T1">Table 1</xref>), nor was there any evidence against the fit of the individual items to the Rasch model (<xref ref-type="table" rid="T2">Table 2</xref>). In addition, we found no evidence against local independence of items (<xref ref-type="supplementary-material" rid="DS1">Supplementary Table A2</xref> in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref>) and no evidence of differential item functioning relative to year cohort, students&#x2019; baseline perception of the adequacy of mathematical knowledge to learn statistics, students&#x2019; baseline expectancy to need statistics in future employment, gender, or age (<xref ref-type="supplementary-material" rid="DS1">Supplementary Table A3</xref> in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref>). Information on Item thresholds, locations, difficulties, targets, and information is also provided in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref> (<xref ref-type="supplementary-material" rid="DS1">Supplementary Table A4</xref>).</p>
<table-wrap position="float" id="T1">
<label>TABLE 1</label>
<caption><p>Global tests of homogeneity and invariance for the Critical Thinking Scale at baseline and follow-up.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Tests of fit</td>
<td valign="top" align="center" colspan="3">Baseline<hr/></td>
<td valign="top" align="center" colspan="3">Follow-up<hr/></td>
</tr>
<tr>
<td/>
<td valign="top" align="center">CLR</td>
<td valign="top" align="center">df</td>
<td valign="top" align="center"><italic>p</italic></td>
<td valign="top" align="center">CLR</td>
<td valign="top" align="center">df</td>
<td valign="top" align="center"><italic>p</italic></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Global homogeneity<italic><xref ref-type="table-fn" rid="t1fna"><sup>a</sup></xref></italic></td>
<td valign="top" align="center">9.0</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.622</td>
<td valign="top" align="center">6.4</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.844</td>
</tr>
<tr>
<td valign="top" align="left"><bold>Invariance</bold></td>
<td/>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Year cohort</td>
<td valign="top" align="center">16.4</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.128</td>
<td valign="top" align="center">9.8</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.553</td>
</tr>
<tr>
<td valign="top" align="left">Math adequacy</td>
<td valign="top" align="center">10.9</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.449</td>
<td valign="top" align="center">7.4</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.768</td>
</tr>
<tr>
<td valign="top" align="left">Stat in Future work</td>
<td valign="top" align="center">40.8</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">0.009<xref ref-type="table-fn" rid="t1fnb"><sup>+</sup></xref></td>
<td valign="top" align="center">30.2</td>
<td valign="top" align="center">22</td>
<td valign="top" align="center">0.113</td>
</tr>
<tr>
<td valign="top" align="left">Gender</td>
<td valign="top" align="center">14.2</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.220</td>
<td valign="top" align="center">7.3</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.775</td>
</tr>
<tr>
<td valign="top" align="left">Age groups</td>
<td valign="top" align="center">17.4</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.097</td>
<td valign="top" align="center">6.7</td>
<td valign="top" align="center">11</td>
<td valign="top" align="center">0.824</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><italic>CTh, Critical Thinking Scale; CLR, Conditional Likelihood Ratio test.</italic></p></fn>
<fn id="t1fna"><p><italic><sup>a</sup>The test of homogeneity is a test of the hypothesis that item parameters are the same for persons with low or high scores.</italic></p></fn>
<fn id="t1fnb"><p><italic><sup>+</sup>The Benjamini-Hochberg adjusted critical level for false discovery rate at the 5% level was p = 0.0083 and at the 1% level p = 0.0017.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
<table-wrap position="float" id="T2">
<label>TABLE 2</label>
<caption><p>Item fit statistics for the Critical Thinking Scale at baseline and follow-up.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left"></td>
<td valign="top" align="center" colspan="3">Baseline<hr/></td>
<td valign="top" align="center" colspan="3">Follow-up<hr/></td>
</tr>
<tr>
<td valign="top" align="left">Items</td>
<td valign="top" align="center">Observed &#x03B3;</td>
<td valign="top" align="center">Expected &#x03B3;</td>
<td valign="top" align="center"><italic>P</italic></td>
<td valign="top" align="center">Observed &#x03B3;</td>
<td valign="top" align="center">Expected &#x03B3;</td>
<td valign="top" align="center"><italic>p</italic></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">CTh1</td>
<td valign="top" align="center">0.47</td>
<td valign="top" align="center">0.51</td>
<td valign="top" align="center">0.438</td>
<td valign="top" align="center">0.61</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">0.665</td>
</tr>
<tr>
<td valign="top" align="left">CTh2</td>
<td valign="top" align="center">0.62</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">0.035<xref ref-type="table-fn" rid="tfn1"><sup>+</sup></xref></td>
<td valign="top" align="center">0.64</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">0.336</td>
</tr>
<tr>
<td valign="top" align="left">CTh5</td>
<td valign="top" align="center">0.47</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">0.389</td>
<td valign="top" align="center">0.51</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">0.246</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><italic>&#x03B3; = Item-rest score correlations in the form of Goodman and Kruskal&#x2019;s rank correlation for ordinal items.</italic></p></fn>
<fn id="tfn1"><p><italic><sup>+</sup>The Benjamini-Hochberg adjusted critical level for false discovery rate at the 5% level was p = 0.0111 and at the 1% level p = 0.0011.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
<p>The targeting of the CTh scale to the student sample was excellent at both baseline and follow-up; slightly better at follow-up with a target information index of 86% at follow-up versus 83% at baseline (<xref ref-type="supplementary-material" rid="DS1">Supplementary Table A5</xref> in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref>). The level of information is highest where most students are located on the CTh scale at both time points (<xref ref-type="supplementary-material" rid="DS1">Supplementary Figure A1</xref> in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref>). The reliability of the CTh scale was satisfactory for the purpose of statistical analyses at both baseline and follow-up; 0.72 and 0.75 respectively (<xref ref-type="supplementary-material" rid="DS1">Supplementary Table A5</xref> in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref>).</p>
<p>The conversion from the summed raw scores of the CTh scale to the estimated person parameters resulting from the Rasch model, as well as these person parameters, estimate rescaled to the original range of the CTh scale are provided in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref> (<xref ref-type="supplementary-material" rid="DS1">Supplementary Table A6</xref>). This allows users of the scales to choose between using the sum scores, which uses the unit of the scale, or to convert these to any of the person parameters estimates, which are continuous and equidistant scores, as preferred for whatever purpose of use.</p>
</sec>
<sec id="S3.SS2">
<title>Differences in Statistics-Related Critical Thinking at Baseline and Changes in Critical Thinking</title>
<p>The primary research question of the study concerned changes in statistics-related critical thinking from the start of the first semester (baseline) to the start of the second semester (follow-up). As we expected the overall change in critical thinking to differ for subgroups of students dependent on their baseline perception of the adequacy of their own mathematical knowledge for learning statistics as well as their expectation to need statistics in their future employment, we first tested baseline differences. To test whether the expected baseline subgroup difference in critical thinking could be confirmed, we conducted a multivariate analysis of variance using a backward models search strategy, which included the primary independent variables (i.e., perception of mathematical knowledge as adequate or not and expectation to need statistics in future employment) as well as gender and age and all possible two-way interactions between the independent variables. The results showed that only the two primary independent variables defined significant differences for subgroups of students, and there was no interaction effect. Thus, we present simple tests for differences in critical thinking mean scores for subgroups defined by all four of the background variables in <xref ref-type="table" rid="T3">Table 3</xref>. As expected, students who perceived their mathematical knowledge to be inadequate for learning statistics scored lower on statistics-related critical thinking scores at baseline compared to the students who perceived they had an adequate level of mathematical knowledge (<italic>p</italic> &#x003C; 0.001). Also as expected, students who did not believe they would need statistics in their future employment scored the lowest on statistics-related critical thinking compared to students who thought they might need or would definitely need statistics in future employment (<italic>p</italic> &#x003C; 0.001).</p>
<table-wrap position="float" id="T3">
<label>TABLE 3</label>
<caption><p>Mean statistics-related critical thinking scores at baseline.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Group (n)</td>
<td valign="top" align="center">Mean</td>
<td valign="top" align="center">SD</td>
<td valign="top" align="center"><italic>p</italic></td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">All students (336)</td>
<td valign="top" align="center">8.05</td>
<td valign="top" align="center">1.90</td>
<td/>
</tr>
<tr>
<td valign="top" align="left"><bold>Gender</bold></td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Male (51)</td>
<td valign="top" align="center">8.36</td>
<td valign="top" align="center">1.73</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Female (272)</td>
<td valign="top" align="center">7.97</td>
<td valign="top" align="center">1.92</td>
<td valign="top" align="center">0.172</td>
</tr>
<tr>
<td valign="top" align="left"><bold>Age groups</bold></td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">21 years and younger (199)</td>
<td valign="top" align="center">8.13</td>
<td valign="top" align="center">1.82</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">22 years and older (131)</td>
<td valign="top" align="center">7.99</td>
<td valign="top" align="center">2.00</td>
<td valign="top" align="center">0.500</td>
</tr>
<tr>
<td valign="top" align="left"><bold>Math knowledge to learn statistics</bold></td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Not adequate (53)</td>
<td valign="top" align="center">7.01</td>
<td valign="top" align="center">1.89</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Adequate (281)</td>
<td valign="top" align="center">8.25</td>
<td valign="top" align="center">1.85</td>
<td valign="top" align="center">&#x003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left"><bold>Expect to need statistics in future work life</bold></td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left">Yes (86)</td>
<td valign="top" align="center">8.49</td>
<td valign="top" align="center">1.69</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">Maybe (196)</td>
<td valign="top" align="center">8.07</td>
<td valign="top" align="center">1.81</td>
<td/>
</tr>
<tr>
<td valign="top" align="left">No (52)</td>
<td valign="top" align="center">7.13</td>
<td valign="top" align="center">2.06</td>
<td valign="top" align="center">&#x003C;0.001<italic><xref ref-type="table-fn" rid="t3fna"><sup>a</sup></xref></italic></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><italic>p-Values for &#x201C;math knowledge to learn statistics&#x201D; and &#x201C;expect to need statistics in future work life&#x201D; are one-sided, due to expectations on the direction of differences.</italic></p></fn>
<fn id="t3fna"><p><italic><sup>a</sup>Post hoc pairwise tests showed that it was the group not expecting to need statistics in their future employment that differed significantly from the remaining two groups.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
<p>We then proceeded to analyze the changes in critical thinking. Absolute individual changes on the CTh scale ranged from &#x2212;5.3 to 5.1 points on the rescaled logit scale (<xref ref-type="supplementary-material" rid="DS1">Supplementary Table A6</xref> in <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 1</xref>), thus showing large individual changes in critical thinking from the first to the second semester (<xref ref-type="fig" rid="F1">Figure 1</xref>). The overall standardized effect was small and negative (&#x2212;0.12), and while there were some variations for student strata defined gender, age, perceived adequacy of math knowledge to learn statistics, and expectation to need statistics in future employment, effect sizes remained small for all subgroups (<xref ref-type="table" rid="T4">Table 4</xref>). Thus, while there were large absolute changes in the equidistant scores resulting from the Rasch models at the individual level, effect size estimates show that there were only very small and predominantly negative effects. Our expectation that students overall would increase in critical thinking was rejected. The same was the case with our expectation that students, who at baseline did not expect to need statistics in their future employment would increase less in critical thinking than students expecting to need statistics. Only two subgroups of students showed an increase, though small, in critical thinking. These were the male students and students who at baseline perceived their mathematical knowledge as inadequate for learning statistics.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption><p>Distribution of differences in Critical Thinking scores (rescaled person parameter estimates) from baseline to follow-up. Differences are shown as follow-up minus baseline so that positive values show an increase and negative values show a decline in critical thinking over time. Distances between any two scores are equal.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="feduc-07-884635-g001.tif"/>
</fig>
<table-wrap position="float" id="T4">
<label>TABLE 4</label>
<caption><p>Overall and stratified mean differences in critical thinking and effect sizes over time.</p></caption>
<table cellspacing="5" cellpadding="5" frame="hsides" rules="groups">
<thead>
<tr>
<td valign="top" align="left">Group (n)</td>
<td valign="top" align="center">Mean difference (<italic>p</italic>)</td>
<td valign="top" align="center">Effect size</td>
<td valign="top" align="center">(95% CI)</td>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">All students (165)</td>
<td valign="top" align="center">0.23 (0.050)</td>
<td valign="top" align="center">&#x2013;0.12</td>
<td valign="top" align="center">(&#x2212;0.33 to 0.10)</td>
</tr>
<tr>
<td valign="top" align="left"><bold>Gender</bold></td>
<td/>
<td valign="top" align="center"/><td/>
</tr>
<tr>
<td valign="top" align="left">Male (25)</td>
<td valign="top" align="center">0.23 (0.313)</td>
<td valign="top" align="center">0.09</td>
<td valign="top" align="center">(&#x2212;0.46 to 0.65)</td>
</tr>
<tr>
<td valign="top" align="left">Female (136)</td>
<td valign="top" align="center">&#x2212;0.29 (0.018)</td>
<td valign="top" align="center">&#x2013;0.16</td>
<td valign="top" align="center">(&#x2212;0.40 to 0.07)</td>
</tr>
<tr>
<td valign="top" align="left"><bold>Age groups</bold></td>
<td/>
<td valign="top" align="center"/><td/>
</tr>
<tr>
<td valign="top" align="left">21 years and younger (104)</td>
<td valign="top" align="center">&#x2212;0.26 (0.069)</td>
<td valign="top" align="center">&#x2013;0.14</td>
<td valign="top" align="center">(&#x2212;0.41 to 0.14)</td>
</tr>
<tr>
<td valign="top" align="left">22 years and older (60)</td>
<td valign="top" align="center">&#x2212;0.16 (0.233)</td>
<td valign="top" align="center">&#x2013;0.08</td>
<td valign="top" align="center">(&#x2212;0.44 to 0.27)</td>
</tr>
<tr>
<td valign="top" align="left"><bold>Math knowledge to learn statistics</bold></td>
<td/>
<td valign="top" align="center"/><td/>
</tr>
<tr>
<td valign="top" align="left">Not adequate (29)</td>
<td valign="top" align="center">0.09 (0.405)</td>
<td valign="top" align="center">0.05</td>
<td valign="top" align="center">(&#x2212;0.46 to 0.57)</td>
</tr>
<tr>
<td valign="top" align="left">Adequate (136)</td>
<td valign="top" align="center">&#x2212;0.29 (0.021)</td>
<td valign="top" align="center">&#x2013;0.15</td>
<td valign="top" align="center">(&#x2212;0.39 to 0.09)</td>
</tr>
<tr>
<td valign="top" align="left"><bold>Expect to need statistics in future work life</bold></td>
<td/>
<td valign="top" align="center"/><td/>
</tr>
<tr>
<td valign="top" align="left">Yes (45)</td>
<td valign="top" align="center">&#x2212;0.46 (0.089)</td>
<td valign="top" align="center">&#x2013;0.24</td>
<td valign="top" align="center">(&#x2212;0.66 to 0.17)</td>
</tr>
<tr>
<td valign="top" align="left">Maybe (101)</td>
<td valign="top" align="center">&#x2212;0.09 (0.283)</td>
<td valign="top" align="center">&#x2013;0.05</td>
<td valign="top" align="center">(&#x2212;0.32 to 0.23)</td>
</tr>
<tr>
<td valign="top" align="left">No (19)</td>
<td valign="top" align="center">&#x2212;0.40 (0.145)</td>
<td valign="top" align="center">&#x2013;0.22</td>
<td valign="top" align="center">(&#x2212;0.85 to 0.42)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn><p><italic>Mean differences are shown as follow-up minus baseline so that positive values show an increase and negative values show a decline in critical thinking over time. P-values are one-sided). CI, Confidence Interval. Effect sizes are calculated using the rescaled person parameter estimates, as the distance between any two scores is equal.</italic></p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec id="S4" sec-type="discussion">
<title>Discussion</title>
<p>The main aim of the study was to explore changes in statistics-related critical thinking from the start of the first semester to the start of the second semester of a two-semester-long statistics course, where the curriculum contains learning objectives implying critical thinking and assessment criteria explicitly requiring critical thinking. The results showed that the student group as a whole has a low mean score of statistics-related critical thinking at baseline (i.e., a mean score of 8.05 within the possible range of 3 to 15) and that there were no significant differences related to gender or age at baseline. In a previous cross-cultural study employing the same instrument, statistics-related critical thinking scores were reported at the same level for both Danish and Spanish psychology students, while the mean personality psychology-related critical thinking scores were markedly higher for Danish psychology students, but not the Spanish students (<xref ref-type="bibr" rid="B45">Nielsen et al., 2021</xref>). This might very tentatively suggest that domain-specific critical thinking at the start of a semester course varies not only with specific domains with the same academic discipline but also with culture. Two other studies report statistics-related critical thinking at higher levels at the start of a semester course in statistics using different instruments. <xref ref-type="bibr" rid="B6">Bensley et al. (2010)</xref> report medium-level scores on one of their subscales for critical thinking, i.e., the argument analysis scale, at the start of a semester prior to introducing different instructional methods to enhance critical thinking in a research methods course for psychology students. <xref ref-type="bibr" rid="B7">Cheng et al. (2018)</xref> report high baseline scores on four single items tapping into four dimensions of critical thinking at the start of introductory statistics classes for students from various academic disciplines. The current results open interesting new avenues of research into domain-specific critical thinking in higher education and its development, both within and between academic disciplines, and across cultures.</p>
<p>Furthermore, we found strong evidence that the baseline statistics-related critical thinking scores differed dependent on students&#x2019; perception of the adequacy of their mathematical knowledge for learning statistics as well as whether they expected to need statistics in their future work life. Thus, students who perceived their mathematical knowledge to be inadequate for learning statistics had a lower level of critical thinking than students perceiving their mathematical knowledge as adequate. The Danish psychology program requires level B mathematics<sup><xref ref-type="fn" rid="footnote3">3</xref></sup> for being admitted to the program but does not require a particular grade for admittance, and thus students can enter with a &#x201C;just pass&#x201D;-grade of 02 (see <xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 2</xref> for the Danish grading scale). As the psychology program is very hard to get into and there is a fixed number of places available, however, only students with a very high-grade point average get in. We assumed that the lack of insight into the field of statistics presents just 1 month into the statistics course and their first semester in the Bachelor of Psychology program might be reflected in their perception of their mathematical basis as adequate or inadequate for learning statistics, and thus also for their inclination toward statistics-related critical thinking at this early point. However, in hindsight, more information on this issue should have been gathered. With regard to baseline differences dependent on the students&#x2019; expectations to need statistics in their future work life, results were also in line with our assumption, i.e., that confidence in needing statistics in the future would be associated with an enhanced inclination toward statistics-related critical thinking compared to students who were confident they would not need statistics in the future. The results not only confirmed our assumptions but also showed that it was the group of students that were certain to not need statistics in their future work life, who had significantly lowered inclination toward critical thinking compared to both students thinking they might need statistics and students who were sure they would need statistics in the future. The results even showed that there was an ordered relationship in the mean scores for the three groups so that students who expected to need statistics in their future employment had the highest CTh scores, and students who did <italic>not</italic> expect to need statistics in their future employment had the lowest CTh scores and students who thought they might need statistics scored in between. This finding leads us to suggest that future research might explore how the interaction between expectancy-to-need statistics and initial inclination toward statistics-related critical thinking might be related to the outcome of statistics courses, but also to the actual need for statistics in the first employment of the graduates.</p>
<p>Turning to the main results of the study, namely the lack of an overall increase in statistics-related critical thinking in the first semester, this was the opposite of what we expected. Previous research on the development of statistics-related critical thinking has mainly focused on comparing teaching methods designed to enhance critical thinking with &#x201C;usual&#x201D; teaching methods not designed for this purpose, or by simply evaluating the enhancing effect of purposely designed teaching methods. While methods for measuring statistics-related critical thinking differ across studies as does the teaching methods evaluated results are also ambiguous, as some find no effect of the purposely designed teaching compared to the usual teaching without clarifying whether this means there was an effect or no effect for both groups (<xref ref-type="bibr" rid="B13">Goode et al., 2018</xref>; <xref ref-type="bibr" rid="B57">Setambah et al., 2019</xref>), and others a positive effect for only the students receiving the purposely designed teaching and no change for the students receiving the usual teaching (e.g., <xref ref-type="bibr" rid="B6">Bensley et al., 2010</xref>). On the same note, one study evaluating just the effect of a purposely designed teaching method in itself found this to enhance the statistics-related critical thinking of the students (<xref ref-type="bibr" rid="B7">Cheng et al., 2018</xref>). The lack of increase in the statistics-related critical thinking in the current study is thus only supported by <xref ref-type="bibr" rid="B6">Bensley et al. (2010)</xref>, who did not find any change for their control group of psychology students. The current study is not enough to refute that meaningful instruction within a subject domain inherently will entail the development of critical thinking skills even if these are not purposely targeted with teaching activities, as suggested by <xref ref-type="bibr" rid="B61">Tiruneh et al. (2017)</xref>. However, the current study does show that even the students have a low level of critical thinking at baseline and thus ample room for improvement, one semester&#x2019;s worth of university-level teaching in statistics with lectures as well as small exercise classes, where assessment criteria explicitly mention critical thinking (<xref ref-type="supplementary-material" rid="DS1">Supplementary Appendix 2</xref>), does not enhance the critical thinking of the students as a whole. Thus, <xref ref-type="bibr" rid="B61">Tiruneh et al.&#x2019;s (2017)</xref> notion cannot be supported by our research, as we do not find an overall positive effect on statistics-related critical thinking over the semester. Our study, however, points to the need for developing further research to explore the factors involved in the development of statistics-related critical thinking skills.</p>
<p>The subgroup results in the current study also showed small effects for all subgroups, and thus did not divulge any clear patterns with regard to student factors related to the development of statistics-related critical thinking. The findings, which might suggest areas of interest for future research are the differences in the direction of the development in statistics-related critical thinking found across gender and across perceptions of the adequacy of mathematical knowledge for learning statistics, even if these differences in direction might be random results due to small group sizes. Thus, future research should include additional student characteristics to explore this further, e.g., characteristics such as dispositional characteristics such as personality, e.g., conscientiousness which has consistently been found to be positively associated with academic success in higher education (<xref ref-type="bibr" rid="B53">Richardson et al., 2012</xref>; <xref ref-type="bibr" rid="B62">Vedel, 2014</xref>), an association, which in relation to learning statistics might very well be mediated by statistics-related critical thinking. Motivation and academic self-efficacy, as both have been linked to student performance (<xref ref-type="bibr" rid="B53">Richardson et al., 2012</xref>) and student anxiety (<xref ref-type="bibr" rid="B59">Tahmassian and Jalali Moghadam, 2011</xref>; <xref ref-type="bibr" rid="B41">Nguyen and Deci, 2016</xref>) and statistics-related anxiety is well-documented among students from other disciplines taking statistics courses and the detrimental effect of anxiety on learning is well-known. We thus propose that motivational factors as well as the belief in one&#x2019;s own ability to learn statistics might moderate the development of statistics-related critical thinking and that this is certainly worth investigating in the future.</p>
<p>Dispositional measures and other student characteristics might also be successfully employed in future studies of increases and decreases at the individual level, and preferably with more points of measurement (three to six), as they might then contribute to explaining individual student trajectories with regard to statistics-related critical thinking and whether these are one-directional across multiple points of measurement. Such student characteristics might also be useful with larger samples to explore whether certain student profiles are associated with an increase and certain profiles with a decrease in statistics-related critical thinking. In addition, future studies might link to the current research and expand these by including students from other academic disciplines than psychology.</p>
<p>The study has four major strengths. The first strength is that the results concerning change stand on a very strong psychometric foundation as the CTh scale fitted the Rasch model both at baseline and at follow-up and as the scale was very well targeted to the study population of first-year Danish Psychology Bachelor students taking their statistics course. As such, we know that the CTh scale possesses the psychometric properties, we aimed for and that the results of the change analyses and both the differences at baseline and the effect sizes are not biased due to a general lack of invariance or differential item function. The second strength lies in the use of standardized effect sizes to assess changes in statistics-related critical thinking, as this makes it possible for future studies using the same instrument under different conditions to compare the results. The third strength of the study is its contribution to the body of knowledge on the so-called &#x201C;natural&#x201D; development of domain-specific critical thinking, by showing that there was no overall increase in critical thinking. The contribution is important, as it showed that even though critical thinking was explicitly mentioned in the assessment criteria and implicitly in the learning objectives for the course as well as the overall competencies to be achieved through the program, no overall increase was found nor were there subgroup-specific increases of any significance. However, equally important is the finding that there were rather large absolute changes in critical thinking at the individual level, both in the form of increases and decreases, as are the findings of baseline differences dependent on students&#x2019; perception of the adequacy of their mathematical knowledge for learning statistics and their expectancy to need statistics in their future work life.</p>
<p>Likewise, the study has three limitations. The first is the sample size and the subgroup distributions in the longitudinal sample, as this did not allow us to explore any possible interaction effects by stratifying on more than one grouping variable at a time. Thus, it was not possible to explore with any certainty how the differences in statistics-related critical thinking at baseline might affect the development. The second limitation might be considered to be the CTh scale itself, as it only comprises three items covering the purposeful and inquiring aspect of CTh common to three major definitions of critical thinking. However, as thoroughly demonstrated with the content and construct validity analyses conducted by <xref ref-type="bibr" rid="B45">Nielsen et al. (2021)</xref>, there is no loss in content validity by eliminating two items from the original scale from the MLSQ, as these did in fact not measure critical thinking &#x2013; not content-wise nor when considering the dimensionality issue. As the brief version, we employed in this study, furthermore fitted the strictest measurement model (i.e., the Rasch model) and was well targeted to the student population in this study and the cross-cultural sample in the study by <xref ref-type="bibr" rid="B45">Nielsen et al. (2021)</xref>, we do not find the brief version to be inferior to the five-item version from the MSLQ, rather the contrary. However, we do recognize that other and longer instruments might be preferred by other researchers and that such instruments, if appropriately validated, can offer more precise measurement. The third limitation is that we did not collect any qualitative and detailed information from the professor or the students, which might have contributed to a better understanding of the lack of overall increase in statistics-related critical thinking as well as the results at the individual level.</p>
</sec>
<sec id="S5" sec-type="data-availability">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are publicly available. This data can be found here: <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.5281/zenodo.6401225">10.5281/zenodo.6401225</ext-link>.</p>
</sec>
<sec id="S6">
<title>Ethics Statement</title>
<p>Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.</p>
</sec>
<sec id="S7">
<title>Author Contributions</title>
<p>TN construed the study and conducted the analyses. All authors listed have made a substantial, direct, and intellectual contribution to the work, and approved it for publication.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="pudiscl1" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<sec id="S8" sec-type="funding-information">
<title>Funding</title>
<p>The authors have received to funding for the study.</p>
</sec>
<ack><p>The authors thank the students and the professor who kindly lend their time to provide data, Martin Andersen for collecting part of the data, Pedro Henrique Ribeiro Santiago for providing the R-code for the item maps, and Mary McGovern for translating the part of the curricula descriptions which was only available in Danish.</p>
</ack>
<sec id="S10" sec-type="supplementary-material">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/feduc.2022.884635/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/feduc.2022.884635/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.pdf" id="DS1" mimetype="application/pdf" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Andersen</surname> <given-names>E. B.</given-names></name></person-group> (<year>1973</year>). <article-title>A goodness of fit test for the Rasch model.</article-title> <source><italic>Psychometrika</italic></source> <volume>38</volume> <fpage>123</fpage>&#x2013;<lpage>140</lpage>. <pub-id pub-id-type="doi">10.1007/BF02291180</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bailin</surname> <given-names>S.</given-names></name> <name><surname>Case</surname> <given-names>R.</given-names></name> <name><surname>Coombs</surname> <given-names>J. R.</given-names></name> <name><surname>Daniels</surname> <given-names>L. B.</given-names></name></person-group> (<year>1999</year>). <article-title>Common misconceptions of critical thinking.</article-title> <source><italic>J. Curric. Stud.</italic></source> <volume>31</volume> <fpage>269</fpage>&#x2013;<lpage>283</lpage>. <pub-id pub-id-type="doi">10.1080/002202799183124</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Beauchamp</surname> <given-names>M. K.</given-names></name> <name><surname>Jette</surname> <given-names>A. M.</given-names></name> <name><surname>Ward</surname> <given-names>R. E.</given-names></name> <name><surname>Kurlinski</surname> <given-names>L. A.</given-names></name> <name><surname>Kiely</surname> <given-names>D.</given-names></name> <name><surname>Latham</surname> <given-names>N. K.</given-names></name><etal/></person-group> (<year>2015</year>). <article-title>Predictive validity and responsiveness of patient-reported and performance-based measures of function in the Boston RISE study.</article-title> <source><italic>J. Gerontol. Med. Sci.</italic></source> <volume>70</volume> <fpage>616</fpage>&#x2013;<lpage>622</lpage>. <pub-id pub-id-type="doi">10.1093/gerona/glu227</pub-id> <pub-id pub-id-type="pmid">25512569</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benjamini</surname> <given-names>Y.</given-names></name> <name><surname>Hochberg</surname> <given-names>Y.</given-names></name></person-group> (<year>1995</year>). <article-title>Controlling the False Discovery Rate: a Practical and Powerful Approach to Multiple Testing.</article-title> <source><italic>J. R. Statist. Soc. Series B</italic></source> <volume>57</volume> <fpage>289</fpage>&#x2013;<lpage>300</lpage>. <pub-id pub-id-type="doi">10.1111/j.2517-6161.1995.tb02031.x</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bensley</surname> <given-names>D. A.</given-names></name> <name><surname>Baxter</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <source><italic>The Critical Thinking in Psychology Test. Unpublished manuscript.</italic></source> <publisher-loc>Frostburg, MD</publisher-loc>: <publisher-name>Frostburg State Univerisity</publisher-name>.</citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bensley</surname> <given-names>D. A.</given-names></name> <name><surname>Crowe</surname> <given-names>D. S.</given-names></name> <name><surname>Bernhardt</surname> <given-names>P.</given-names></name> <name><surname>Buckner</surname> <given-names>C.</given-names></name> <name><surname>Allman</surname> <given-names>A. L.</given-names></name></person-group> (<year>2010</year>). <article-title>Teaching and Assessing Critical Thinking Skills for Argument Analysis in Psychology.</article-title> <source><italic>Teach. Psychol.</italic></source> <volume>37</volume> <fpage>91</fpage>&#x2013;<lpage>96</lpage>. <pub-id pub-id-type="doi">10.1080/00986281003626656</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>S.</given-names></name> <name><surname>Ferris</surname> <given-names>M.</given-names></name> <name><surname>Perolio</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>An innovative classroom approach for developing critical thinkers in the introductory statistics course.</article-title> <source><italic>Am. Statist.</italic></source> <volume>72</volume> <fpage>354</fpage>&#x2013;<lpage>358</lpage>. <pub-id pub-id-type="doi">10.1080/00031305.2017.1305293</pub-id></citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cox</surname> <given-names>D. R.</given-names></name> <name><surname>Spj&#x00F8;tvoll</surname> <given-names>E.</given-names></name> <name><surname>Johansen</surname> <given-names>S.</given-names></name> <name><surname>van Zwet</surname> <given-names>W. R.</given-names></name> <name><surname>Bithell</surname> <given-names>J. F.</given-names></name> <name><surname>Barndorff-Nielsen</surname> <given-names>O.</given-names></name></person-group> (<year>1977</year>). <article-title>The Role of Significance Tests [with Discussion and Reply].</article-title> <source><italic>Scand. J. Stat.</italic></source> <volume>4</volume> <fpage>49</fpage>&#x2013;<lpage>70</lpage>.</citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cronbach</surname> <given-names>L. J.</given-names></name></person-group> (<year>1951</year>). <article-title>Coefficient alpha and the internal structure of tests.</article-title> <source><italic>Psychometrika</italic></source> <volume>16</volume> <fpage>297</fpage>&#x2013;<lpage>334</lpage>. <pub-id pub-id-type="doi">10.1016/0020-7489(93)90092-9</pub-id> <pub-id pub-id-type="pmid">8449658</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>De Jager</surname> <given-names>T.</given-names></name></person-group> (<year>2012</year>). <article-title>Can first year students&#x2019; critical thinking skills develop in a space of three months?. Procedia.</article-title> <source><italic>Soc. Behav. Sci.</italic></source> <volume>47</volume> <fpage>1374</fpage>&#x2013;<lpage>1381</lpage>. <pub-id pub-id-type="doi">10.1016/j.sbspro.2012.06.829</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Facione</surname> <given-names>P.</given-names></name></person-group> (<year>1990</year>). <source><italic>Critical thinking: A statement of expert consensus for purposes of educational assessment and instruction. Research findings and recommendations.</italic></source> <publisher-loc>Newark, NJ</publisher-loc>: <publisher-name>American Philosophical Association</publisher-name>.</citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Facione</surname> <given-names>P. A.</given-names></name> <name><surname>Facione</surname> <given-names>N. C.</given-names></name> <name><surname>Giancarlo</surname> <given-names>C. A. F.</given-names></name></person-group> (<year>1998</year>). <source><italic>The California Critical Thinking Disposition Inventory.</italic></source> <publisher-loc>California</publisher-loc>: <publisher-name>Academic Press</publisher-name>, <fpage>67</fpage>&#x2013;<lpage>79</lpage>.</citation></ref>
<ref id="B13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Goode</surname> <given-names>C. T.</given-names></name> <name><surname>Lamoreaux</surname> <given-names>M.</given-names></name> <name><surname>Atchison</surname> <given-names>K. J.</given-names></name> <name><surname>Jeffress</surname> <given-names>E. C.</given-names></name> <name><surname>Lynch</surname> <given-names>H. L.</given-names></name> <name><surname>Sheehan</surname> <given-names>E.</given-names></name></person-group> (<year>2018</year>). <article-title>Quantitative Skills, Critical Thinking, and Writing Mechanics in Blended Versus Face-to-Face Versions of a Research Methods and Statistics Course.</article-title> <source><italic>Teach. Psychol.</italic></source> <volume>45</volume> <fpage>124</fpage>&#x2013;<lpage>131</lpage>. <pub-id pub-id-type="doi">10.1177/0098628318762873</pub-id></citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Halpern</surname> <given-names>D. F.</given-names></name></person-group> (<year>1998</year>). <article-title>Teaching critical thinking for transfer across domains: Disposition, skills, structure training, and metacognitive monitoring.</article-title> <source><italic>Am. Psychol.</italic></source> <volume>53</volume> <fpage>449</fpage>&#x2013;<lpage>455</lpage>. <pub-id pub-id-type="doi">10.1037/0003-066X.53.4.449</pub-id> <pub-id pub-id-type="pmid">9572008</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Halpern</surname> <given-names>D. F.</given-names></name></person-group> (<year>2003</year>). <source><italic>Thought and knowledge: An introduction to critical thinking.</italic></source> <publisher-loc>Mahwah, NJ</publisher-loc>: <publisher-name>Erlbaum</publisher-name>.</citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Halpern</surname> <given-names>D. F.</given-names></name></person-group> (<year>2014</year>). <source><italic>Thought and knowledge: An introduction to critical thinking. 5th Edn</italic></source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Psychology Press Taylor &#x0026; Francis Group.</publisher-name></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Halpern</surname> <given-names>D. F.</given-names></name></person-group> (<year>2015</year>). <source><italic>Halpern critical thinking assessment.</italic></source> <publisher-loc>Austria</publisher-loc>: <publisher-name>Schuhfried GmbH</publisher-name>.</citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hammer</surname> <given-names>S. J.</given-names></name> <name><surname>Green</surname> <given-names>W.</given-names></name></person-group> (<year>2011</year>). <article-title>Critical thinking in a first year management unit: the relationship between disciplinary learning, academic literacy and learning progression.</article-title> <source><italic>Higher Educ. Res. Dev.</italic></source> <volume>30</volume> <fpage>303</fpage>&#x2013;<lpage>315</lpage>. <pub-id pub-id-type="doi">10.1080/07294360.2010.501075</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holland</surname> <given-names>D. F.</given-names></name> <name><surname>Kraha</surname> <given-names>A.</given-names></name> <name><surname>Zientek</surname> <given-names>L. R.</given-names></name> <name><surname>Nimon</surname> <given-names>K.</given-names></name> <name><surname>Fulmore</surname> <given-names>J. A.</given-names></name> <name><surname>Johnson</surname> <given-names>U. Y.</given-names></name><etal/></person-group> (<year>2018</year>). <article-title>Reliability Generalization of the Motivated Strategies for Learning Questionnaire: a Meta-Analytic View of Reliability Estimates.</article-title> <source><italic>SAGE Open</italic></source> <volume>8</volume> <fpage>1</fpage>&#x2013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1177/2158244018802334</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kanbay</surname> <given-names>Y.</given-names></name> <name><surname>Isik</surname> <given-names>E.</given-names></name> <name><surname>Aslan</surname> <given-names>O.</given-names></name> <name><surname>Tektas</surname> <given-names>P.</given-names></name> <name><surname>Kilic</surname> <given-names>N.</given-names></name></person-group> (<year>2017</year>). <article-title>Critical Thinking Skill and Academic Achievement Development in Nursing Students: four-year Longitudinal Study.</article-title> <source><italic>Am. J. Educ. Res. Rev.</italic></source> <volume>2</volume>;<fpage>12</fpage>. <pub-id pub-id-type="doi">10.28933/ajerr-2017-12-0501</pub-id></citation></ref>
<ref id="B21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kaya</surname> <given-names>H.</given-names></name> <name><surname>&#x015E;enyuva</surname> <given-names>E.</given-names></name> <name><surname>Bodur</surname> <given-names>G.</given-names></name></person-group> (<year>2017</year>). <article-title>Developing critical thinking disposition and emotional intelligence of nursing students: a longitudinal research.</article-title> <source><italic>Nurse Educ. Today</italic></source> <volume>48</volume> <fpage>72</fpage>&#x2013;<lpage>77</lpage>. <pub-id pub-id-type="doi">10.1016/j.nedt.2016.09.011</pub-id> <pub-id pub-id-type="pmid">27721088</pub-id></citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kelderman</surname> <given-names>H.</given-names></name></person-group> (<year>1984</year>). <article-title>Loglinear Rasch model tests.</article-title> <source><italic>Psychometrika</italic></source> <volume>49</volume> <fpage>223</fpage>&#x2013;<lpage>245</lpage>.</citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kreiner</surname> <given-names>S.</given-names></name></person-group> (<year>2003</year>). <source><italic>Introduction to DIGRAM.</italic></source> <publisher-loc>Copenhagen</publisher-loc>: <publisher-name>Department of Biostatistics, University of Copenhagen</publisher-name>.</citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kreiner</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>Validity and objectivity. Reflections on the role and nature of Rasch Models.</article-title> <source><italic>Nordic Psychol.</italic></source> <volume>59</volume> <fpage>268</fpage>&#x2013;<lpage>298</lpage>. <pub-id pub-id-type="doi">10.1027/1901-2276.59.3.268</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kreiner</surname> <given-names>S.</given-names></name></person-group> (<year>2011</year>). <article-title>A Note on Item-Restscore Association in Rasch Models.</article-title> <source><italic>Appl. Psycholog. Meas.</italic></source> <volume>35</volume> <fpage>557</fpage>&#x2013;<lpage>561</lpage>. <pub-id pub-id-type="doi">10.1177/014662161141022</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kreiner</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). &#x201C;<article-title>The Rasch model for dichotomous items</article-title>,&#x201D; in <source><italic>Rasch Models in Health</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Christensen</surname> <given-names>K. B.</given-names></name> <name><surname>Kreiner</surname> <given-names>S.</given-names></name> <name><surname>Mesbah</surname> <given-names>M.</given-names></name></person-group> (<publisher-loc>London</publisher-loc>: <publisher-name>ISTE Ltd, Wiley</publisher-name>), <fpage>5</fpage>&#x2013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1002/9781118574454.ch1</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kreiner</surname> <given-names>S.</given-names></name> <name><surname>Christensen</surname> <given-names>K. B.</given-names></name></person-group> (<year>2004</year>). <article-title>Analysis of local dependence and multidimensionality in graphical loglinear Rasch models.</article-title> <source><italic>Commun. Stat. Theory Methods</italic></source> <volume>33</volume> <fpage>1239</fpage>&#x2013;<lpage>1276</lpage>. <pub-id pub-id-type="doi">10.1081/sta-120030148</pub-id></citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kreiner</surname> <given-names>S.</given-names></name> <name><surname>Nielsen</surname> <given-names>T.</given-names></name></person-group> (<year>2013</year>). <source><italic>Item analysis in DIGRAM 3.04. Part I: Guided tours. Research report 2013/06.</italic></source> <publisher-loc>Denmark</publisher-loc>: <publisher-name>University of Copenhagen, Department of Public Health</publisher-name>.</citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kreiner</surname> <given-names>S.</given-names></name> <name><surname>Christensen</surname> <given-names>K. B.</given-names></name></person-group> (<year>2013</year>). &#x201C;<article-title>Person Parameter Estimation and Measurement in Rasch Models</article-title>&#x201D;, in <source><italic>Rasch Models Health</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Christensen</surname> <given-names>K.B.</given-names></name> <name><surname>Kreiner</surname> <given-names>S.</given-names></name> <name><surname>Mesbah</surname> <given-names>M.</given-names></name></person-group>. (<publisher-loc>London</publisher-loc>, <publisher-name>ISTE and John Wiley &#x0026; Sons, Inc</publisher-name>.) <fpage>63</fpage>&#x2013;<lpage>78</lpage>. <pub-id pub-id-type="doi">10.1002/9781118574454.ch4</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krosnick</surname> <given-names>J. A.</given-names></name> <name><surname>Fabrigar</surname> <given-names>L. R.</given-names></name></person-group> (<year>1997</year>). &#x201C;<article-title>Designing rating scales for effective measurement in surveys</article-title>,&#x201D; in <source><italic>Survey measurement and process quality</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Lyberg</surname> <given-names>L.</given-names></name> <name><surname>Biemer</surname> <given-names>P.</given-names></name> <name><surname>Collins</surname> <given-names>M.</given-names></name> <name><surname>de Leeuw</surname> <given-names>E.</given-names></name> <name><surname>Dippo</surname> <given-names>C.</given-names></name> <name><surname>Schwarz</surname> <given-names>N.</given-names></name><etal/></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>John Wiley</publisher-name>), <fpage>141</fpage>&#x2013;<lpage>164</lpage>. <pub-id pub-id-type="doi">10.1002/9781118490013.ch6</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kuhn</surname> <given-names>D.</given-names></name></person-group> (<year>1999</year>). <article-title>A developmental model of critical thinking.</article-title> <source><italic>Educ. Res.</italic></source> <volume>28</volume> <fpage>16</fpage>&#x2013;<lpage>25</lpage>. <pub-id pub-id-type="doi">10.2307/1177186</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lakens</surname> <given-names>D.</given-names></name></person-group> (<year>2013</year>). <article-title>Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs.</article-title> <source><italic>Front. Psychol.</italic></source> <volume>4</volume>:<fpage>863</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2013.00863</pub-id> <pub-id pub-id-type="pmid">24324449</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lau</surname> <given-names>J. Y. F.</given-names></name></person-group> (<year>2015</year>). &#x201C;<article-title>Metacognitive education: Going beyond critical thinking</article-title>,&#x201D; in <source><italic>The palgrave handbook of critical thinking in higher education</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Davies</surname> <given-names>M.</given-names></name> <name><surname>Barnett</surname> <given-names>R.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Palgrave Macmillan</publisher-name>), <fpage>373</fpage>&#x2013;<lpage>389</lpage>. <pub-id pub-id-type="doi">10.1057/9781137378057</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maitland</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>Should I label all scale points or just the end points for attitudinal questions?</article-title> <source><italic>Survey Pract.</italic></source> <volume>4</volume> <fpage>1</fpage>&#x2013;<lpage>4</lpage>. <pub-id pub-id-type="doi">10.29115/SP-2009-0014</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McGuirk</surname> <given-names>J.</given-names></name></person-group> (<year>2021</year>). <article-title>Embedded rationality and the contextualization of critical thinking.</article-title> <source><italic>J. Philosop. Educ.</italic></source> <volume>55</volume> <fpage>606</fpage>&#x2013;<lpage>620</lpage>. <pub-id pub-id-type="doi">10.1111/1467-9752.12563</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McPeck</surname> <given-names>J.</given-names></name></person-group> (<year>1992</year>). &#x201C;<article-title>Thoughts on subject specificity</article-title>,&#x201D; in <source><italic>The generalizability of critical thinking: Multiple perspectives on an educational ideal</italic></source>, <role>ed.</role> <person-group person-group-type="editor"><name><surname>Norris</surname> <given-names>S.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Teachers College Press</publisher-name>), <fpage>198</fpage>&#x2013;<lpage>205</lpage>.</citation></ref>
<ref id="B37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Menold</surname> <given-names>N.</given-names></name> <name><surname>Kaczmirek</surname> <given-names>L.</given-names></name> <name><surname>Lenzner</surname> <given-names>T.</given-names></name> <name><surname>Neusar</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>How Do Respondents Attend to Verbal Labels in Rating Scales?</article-title> <source><italic>Field Methods</italic></source> <volume>26</volume> <fpage>21</fpage>&#x2013;<lpage>39</lpage>. <pub-id pub-id-type="doi">10.1177/1525822X13508270</pub-id></citation></ref>
<ref id="B38"><citation citation-type="journal"><collab>Ministry of Higher Education and Science</collab> (<year>2021</year>). <source><italic>Ans&#x00F8;gere og optagne fordelt p&#x00E5; k&#x00F8;n, alder og adgangsgrundlag.</italic></source> Available online at: <ext-link ext-link-type="uri" xlink:href="https://ufm.dk/uddannelse/statistik-og-analyser/sogning-og-optag-pa-videregaende-uddannelser/grundtal-om-sogning-og-optag/ansogere-og-optagne-fordelt-pa-kon-alder-og-adgangsgrundlag">https://ufm.dk/uddannelse/statistik-og-analyser/sogning-og-optag-pa-videregaende-uddannelser/grundtal-om-sogning-og-optag/ansogere-og-optagne-fordelt-pa-kon-alder-og-adgangsgrundlag</ext-link> <comment>(accessed date 23.12.2021)</comment>.</citation></ref>
<ref id="B39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moore</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>Critical thinking and disciplinary thinking: a continuing debate.</article-title> <source><italic>High. Educ. Res. Dev.</italic></source> <volume>30</volume> <fpage>261</fpage>&#x2013;<lpage>274</lpage>. <pub-id pub-id-type="doi">10.1080/07294360.2010.501328</pub-id></citation></ref>
<ref id="B40"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moseley</surname> <given-names>D.</given-names></name> <name><surname>Baumfield</surname> <given-names>V.</given-names></name> <name><surname>Elliott</surname> <given-names>J.</given-names></name> <name><surname>Higgins</surname> <given-names>S.</given-names></name> <name><surname>Miller</surname> <given-names>J.</given-names></name> <name><surname>Newton</surname> <given-names>D. P.</given-names></name><etal/></person-group> (<year>2005</year>). <source><italic>Frameworks for thinking: A handbook for teaching and learning.</italic></source> <publisher-loc>Cambridge, UK</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="B41"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nguyen</surname> <given-names>T. T.</given-names></name> <name><surname>Deci</surname> <given-names>E. L.</given-names></name></person-group> (<year>2016</year>). <article-title>Can it be good to set the bar high? The role of motivational regulation in moderating the link from high standards to academic well-being.</article-title> <source><italic>Learn. Indiv. Diff.</italic></source> <volume>45</volume> <fpage>245</fpage>&#x2013;<lpage>251</lpage>. <pub-id pub-id-type="doi">10.1016/j.lindif.2015.12.020</pub-id></citation></ref>
<ref id="B42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nielsen</surname> <given-names>T.</given-names></name></person-group> (<year>2018</year>). <article-title>The intrinsic and extrinsic motivation subscales of the Motivated Strategies for Learning Questionnaire: a Rasch-based construct validity study.</article-title> <source><italic>Cog. Educ.</italic></source> <volume>5</volume> <fpage>1</fpage>&#x2013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.1080/2331186X.2018.1504485</pub-id></citation></ref>
<ref id="B43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nielsen</surname> <given-names>T.</given-names></name></person-group> (<year>2020</year>). <article-title>The Specific Academic Learning Self-efficacy and the Specific Academic Exam Self-Efficacy scales: construct and criterion validity revisited using Rasch models.</article-title> <source><italic>Cog. Educ.</italic></source> <volume>7</volume> <fpage>1</fpage>&#x2013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.1080/2331186X.2020.1840009</pub-id></citation></ref>
<ref id="B44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nielsen</surname> <given-names>T.</given-names></name> <name><surname>Makransky</surname> <given-names>G.</given-names></name> <name><surname>Vang</surname> <given-names>M. L.</given-names></name> <name><surname>Dammeyer</surname> <given-names>J.</given-names></name></person-group> (<year>2017</year>). <article-title>How specific is specific self-efficacy? A construct validity study using Rasch measurement models.</article-title> <source><italic>Stud. Educ. Eval.</italic></source> <volume>57</volume> <fpage>87</fpage>&#x2013;<lpage>97</lpage>. <pub-id pub-id-type="doi">10.1016/j.stueduc.2017.04.003</pub-id></citation></ref>
<ref id="B45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nielsen</surname> <given-names>T.</given-names></name> <name><surname>Mart&#x00ED;nez-Garc&#x00ED;a</surname> <given-names>I.</given-names></name> <name><surname>Alastor</surname> <given-names>E.</given-names></name></person-group> (<year>2021</year>). <article-title>Critical Thinking of Psychology Students: a Within- and Cross-Cultural Study using Rasch models.</article-title> <source><italic>Scand. J. Psychol.</italic></source> <volume>62</volume> <fpage>426</fpage>&#x2013;<lpage>435</lpage>. <pub-id pub-id-type="doi">10.1111/sjop.12714</pub-id> <pub-id pub-id-type="pmid">33586175</pub-id></citation></ref>
<ref id="B46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nielsen</surname> <given-names>T.</given-names></name> <name><surname>Mart&#x00ED;nez-Garc&#x00ED;a</surname> <given-names>I.</given-names></name> <name><surname>Alastor</surname> <given-names>E.</given-names></name></person-group> (<year>2022</year>). &#x201C;<article-title>Psychometric properties of the Spanish translation of the Specific Academic Learning Self-Efficacy and the Specific Academic Exam Self-Efficacy scales in a higher education context</article-title>,&#x201D; in <source><italic>Academic Self-efficacy in Education: Nature, Measurement, and Research</italic></source>, <role>eds</role> <person-group person-group-type="editor"><name><surname>Khine</surname> <given-names>M. S.</given-names></name> <name><surname>Nielsen</surname> <given-names>T.</given-names></name></person-group> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>71</fpage>&#x2013;<lpage>96</lpage>. <pub-id pub-id-type="doi">10.1007/978-981-16-8240-7_5</pub-id></citation></ref>
<ref id="B47"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>&#x00D6;zel&#x00E7;i</surname> <given-names>S. Y.</given-names></name> <name><surname>&#x00C7;al&#x0131;&#x015F;kan</surname> <given-names>G.</given-names></name></person-group> (<year>2019</year>). <article-title>What is critical thinking? A longitudinal study with teacher candidates.</article-title> <source><italic>Internat. J. Eval. Res. Educ.</italic></source> <volume>8</volume> <fpage>495</fpage>&#x2013;<lpage>509</lpage>. <pub-id pub-id-type="doi">10.11591/ijere.v8i3.20254</pub-id></citation></ref>
<ref id="B48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paul</surname> <given-names>R.</given-names></name> <name><surname>Elder</surname> <given-names>L.</given-names></name></person-group> (<year>2005</year>). <source><italic>A guide for educators to Critical Thinking Competency Standards.</italic></source> <publisher-loc>Santa Barbara, CA</publisher-loc>: <publisher-name>Foundation for Critical Thinking</publisher-name>.</citation></ref>
<ref id="B49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pintrich</surname> <given-names>P. R.</given-names></name> <name><surname>Smith</surname> <given-names>D. A. F.</given-names></name> <name><surname>Garcia</surname> <given-names>T.</given-names></name> <name><surname>McKeachie</surname> <given-names>W. J.</given-names></name></person-group> (<year>1991</year>). <source><italic>A manual for the use of the Motivated Strategies for Learning Questionnaire (MSLQ). (Technical Report No. 91-8-004).</italic></source> <publisher-loc>Ann Arbor, MI</publisher-loc>: <publisher-name>The Regents of the University of Michigan</publisher-name>.</citation></ref>
<ref id="B50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ralston</surname> <given-names>P. A.</given-names></name> <name><surname>Bays</surname> <given-names>C. L.</given-names></name></person-group> (<year>2015</year>). <article-title>Critical thinking development in undergraduate engineering students from Freshman Through Senior Year: a 3-Cohort Longitudinal Study.</article-title> <source><italic>Am. J. Eng. Educ.</italic></source> <volume>6</volume> <fpage>85</fpage>&#x2013;<lpage>98</lpage>. <pub-id pub-id-type="doi">10.19030/ajee.v6i2.9504</pub-id></citation></ref>
<ref id="B51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rasch</surname> <given-names>G.</given-names></name></person-group> (<year>1960</year>). <source><italic>Probabilistic models for some intelligence and attainment tests.</italic></source> <publisher-loc>Copenhagen</publisher-loc>: <publisher-name>Danish Institute for Educational Research</publisher-name>.</citation></ref>
<ref id="B52"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ren</surname> <given-names>X.</given-names></name> <name><surname>Tong</surname> <given-names>Y.</given-names></name> <name><surname>Peng</surname> <given-names>P.</given-names></name> <name><surname>Wang</surname> <given-names>T.</given-names></name></person-group> (<year>2020</year>). <article-title>Critical thinking predicts academic performance beyond general cognitive ability: evidence from adults and children.</article-title> <source><italic>Intelligence</italic></source> <volume>82</volume>:<fpage>101487</fpage>. <pub-id pub-id-type="doi">10.1016/j.intell.2020.101487</pub-id></citation></ref>
<ref id="B53"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Richardson</surname> <given-names>M.</given-names></name> <name><surname>Abraham</surname> <given-names>C.</given-names></name> <name><surname>Bond</surname> <given-names>R.</given-names></name></person-group> (<year>2012</year>). <article-title>Psychological correlates of university students&#x2019; academic performance: a systematic review and meta-analysis.</article-title> <source><italic>Psychol. Bull.</italic></source> <volume>138</volume> <fpage>353</fpage>&#x2013;<lpage>387</lpage>. <pub-id pub-id-type="doi">10.1037/a0026838</pub-id> <pub-id pub-id-type="pmid">22352812</pub-id></citation></ref>
<ref id="B54"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ricketts</surname> <given-names>J. C.</given-names></name> <name><surname>Rudd</surname> <given-names>R. D.</given-names></name></person-group> (<year>2005</year>). <article-title>Critical thinking of selected youth leaders: the efficacy of critical thinking dispositions, leadership and academic performance.</article-title> <source><italic>J. Agricult. Educ.</italic></source> <volume>46</volume> <fpage>32</fpage>&#x2013;<lpage>43</lpage>.</citation></ref>
<ref id="B55"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saenab</surname> <given-names>S.</given-names></name> <name><surname>Zubaidah</surname> <given-names>S.</given-names></name> <name><surname>Mahanal</surname> <given-names>S.</given-names></name> <name><surname>Lestari</surname> <given-names>S. R.</given-names></name></person-group> (<year>2021</year>). <article-title>ReCODE to Re-Code: an instructional model to accelerate students&#x2019; critical thinking skills.</article-title> <source><italic>Educ. Sci.</italic></source> <volume>11</volume>:<fpage>2</fpage>. <pub-id pub-id-type="doi">10.3390/educsci11010002</pub-id></citation></ref>
<ref id="B56"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sahanowas</surname> <given-names>S. K.</given-names></name> <name><surname>Halder</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Critical thinking disposition of undergraduate students in relation to emotional intelligence: gender as moderator.</article-title> <source><italic>Heliyon</italic></source> <volume>6</volume>:<fpage>e05477</fpage>. <pub-id pub-id-type="doi">10.1016/j.heliyon.2020.e05477</pub-id> <pub-id pub-id-type="pmid">33294656</pub-id></citation></ref>
<ref id="B57"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Setambah</surname> <given-names>M. A. B.</given-names></name> <name><surname>Tajudin</surname> <given-names>N. M.</given-names></name> <name><surname>Yaakob</surname> <given-names>M. F. M.</given-names></name> <name><surname>Saad</surname> <given-names>M. I. M.</given-names></name></person-group> (<year>2019</year>). <article-title>Adventure Learning in Basics Statistics: impact on Students Critical Thinking.</article-title> <source><italic>Internat. J. Instruct.</italic></source> <volume>12</volume> <fpage>151</fpage>&#x2013;<lpage>166</lpage>. <pub-id pub-id-type="doi">10.29333/iji.2019.12310a</pub-id></citation></ref>
<ref id="B58"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stupnisky</surname> <given-names>R. H.</given-names></name> <name><surname>Renaud</surname> <given-names>R. D.</given-names></name> <name><surname>Daniels</surname> <given-names>L. M.</given-names></name> <name><surname>Haynes</surname> <given-names>T. L.</given-names></name> <name><surname>Perry</surname> <given-names>R. P.</given-names></name></person-group> (<year>2008</year>). <article-title>The interrelation of first-year college students&#x2019; critical thinking disposition, perceived academic control and academic achievement.</article-title> <source><italic>Res. High. Educ.</italic></source> <volume>49</volume> <fpage>513</fpage>&#x2013;<lpage>530</lpage>. <pub-id pub-id-type="doi">10.1007/s11162-008-9093-8</pub-id></citation></ref>
<ref id="B59"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tahmassian</surname> <given-names>K.</given-names></name> <name><surname>Jalali Moghadam</surname> <given-names>N.</given-names></name></person-group> (<year>2011</year>). <article-title>Relationship between self-efficacy and symptoms of anxiety, depression, worry and social avoidance in a normal sample of students.</article-title> <source><italic>Iran. J. Psychiatry Behav. Sci.</italic></source> <volume>5</volume> <fpage>91</fpage>&#x2013;<lpage>98</lpage>.</citation></ref>
<ref id="B60"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thomas</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>Developing first year students&#x2019; critical thinking skills.</article-title> <source><italic>Asian Soc. Sci.</italic></source> <volume>7</volume> <fpage>26</fpage>&#x2013;<lpage>35</lpage>. <pub-id pub-id-type="doi">10.5539/ass.v7n4p26</pub-id></citation></ref>
<ref id="B61"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tiruneh</surname> <given-names>D. T.</given-names></name> <name><surname>De Cock</surname> <given-names>M.</given-names></name> <name><surname>Elen</surname> <given-names>J.</given-names></name></person-group> (<year>2017</year>). <article-title>Designing Learning Environments for Critical Thinking: examining Effective Instructional Approaches.</article-title> <source><italic>Internat. J. Sci. Mathem. Educ.</italic></source> <volume>16</volume> <fpage>1065</fpage>&#x2013;<lpage>1089</lpage>. <pub-id pub-id-type="doi">10.1007/s10763-017-9829-z</pub-id></citation></ref>
<ref id="B62"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vedel</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>The Big Five and tertiary academic performance: a systematic review and metaanalysis.</article-title> <source><italic>Personal. Indiv. Diff.</italic></source> <volume>71</volume> <fpage>66</fpage>&#x2013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1016/j.paid.2014.07.011</pub-id></citation></ref>
</ref-list>
<fn-group>
<fn id="footnote1">
<label>1</label>
<p>Subjective in the sense that the rubrics have so few categories and the descriptions of categories are so general that even an identical scorings can ensure that the behavior or products rated by the teachers as indicating critical thinking is the same (c.f. <xref ref-type="bibr" rid="B50">Ralston and Bays, 2015</xref>).</p></fn>
<fn id="footnote2">
<label>2</label>
<p>In Denmark, psychology is one of the top-10 most difficult higher education degree program to be admitted to, as there are a limited number of vacancies to compete for, and thus it requires almost perfect grades to be admitted.</p></fn>
<fn id="footnote3">
<label>3</label>
<p>Levels are A, B and C, with A being the highest.</p></fn>
</fn-group>
</back>
</article>