<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Educ.</journal-id>
<journal-title>Frontiers in Education</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Educ.</abbrev-journal-title>
<issn pub-type="epub">2504-284X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">772832</article-id>
<article-id pub-id-type="doi">10.3389/feduc.2021.772832</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Education</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Examining the Validity of Adaptive Comparative Judgment for Peer Evaluation in a Design Thinking Course</article-title>
<alt-title alt-title-type="left-running-head">Mentzer et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Exploring Validity of Adaptive Comparative Judgment</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Mentzer</surname>
<given-names>Nathan</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1526582/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Lee</surname>
<given-names>Wonki</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1518108/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Bartholomew</surname>
<given-names>Scott Ronald</given-names>
</name>
<xref ref-type="aff" rid="aff3">
<sup>3</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1285972/overview"/>
</contrib>
</contrib-group>
<aff id="aff1">
<label>
<sup>1</sup>
</label>Purdue Polytechnic Institute, Purdue University, <addr-line>West Lafayette</addr-line>, <addr-line>IN</addr-line>, <country>United&#x20;States</country>
</aff>
<aff id="aff2">
<label>
<sup>2</sup>
</label>College of Education, Curriculum and Instruction, Purdue University, <addr-line>West Lafayette</addr-line>, <addr-line>IN</addr-line>, <country>United&#x20;States</country>
</aff>
<aff id="aff3">
<label>
<sup>3</sup>
</label>School of Technology, Brigham Young University, <addr-line>Provo</addr-line>, <addr-line>UT</addr-line>, <country>United&#x20;States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/413062/overview">Tine Van Daal</ext-link>, University of Antwerp, Belgium</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1413722/overview">Jessica To</ext-link>, Nanyang Technological University, Singapore</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/390918/overview">Rosemary Hipkins</ext-link>, New&#x20;Zealand Council for Educational Research, New&#x20;Zealand</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Nathan Mentzer, <email>nmentzer@purdue.edu</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Assessment, Testing and Applied Measurement, a section of the journal Frontiers in Education</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>16</day>
<month>12</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>6</volume>
<elocation-id>772832</elocation-id>
<history>
<date date-type="received">
<day>08</day>
<month>09</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>11</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Mentzer, Lee and Bartholomew.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Mentzer, Lee and Bartholomew</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>Adaptive comparative judgment (ACJ) is a holistic judgment approach used to evaluate the quality of something (e.g., student work) in which individuals are presented with pairs of work and select the better item from each pair. This approach has demonstrated high levels of reliability with less bias than other approaches, hence providing accurate values in summative and formative assessment in educational settings. Though ACJ itself has demonstrated significantly high reliability levels, relatively few studies have investigated the validity of peer-evaluated ACJ in the context of design thinking. This study explored peer-evaluation, facilitated through ACJ, in terms of construct validity and criterion validity (concurrent validity and predictive validity) in the context of a design thinking course. Using ACJ, undergraduate students (<italic>n</italic>&#x20;&#x3d; 597) who took a design thinking course during Spring 2019 were invited to evaluate design point-of-view (POV) statements written by their peers. As a result of this ACJ exercise, each POV statement attained a specific parameter value, which reflects the quality of POV statements. In order to examine the construct validity, researchers conducted a content analysis, comparing the contents of the 10 POV statements with highest scores (parameter values) and the 10 POV statements with the lowest scores (parameter values)&#x2014;as derived from the ACJ session. For the criterion validity, we studied the relationship between peer-evaluated ACJ and grader&#x2019;s rubric-based grading. To study the concurrent validity, we investigated the correlation between peer-evaluated ACJ parameter values and grades assigned by course instructors for the same POV writing task. Then, predictive validity was studied by exploring if peer-evaluated ACJ of POV statements were predictive of students&#x2019; grades on the final project. Results showed that the contents of the statements with the highest parameter values were of better quality compared to the statements with the lowest parameter values. Therefore, peer-evaluated ACJ showed construct validity. Also, though peer-evaluated ACJ did not show concurrent validity, it did show moderate predictive validity.</p>
</abstract>
<kwd-group>
<kwd>adaptive comparative judgement</kwd>
<kwd>comparative judgement</kwd>
<kwd>design education</kwd>
<kwd>validity and reliability</kwd>
<kwd>technology and engineering education</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Design is believed to be the core of technology and engineering, which promotes experiential learning towards the development of a robust understanding (<xref ref-type="bibr" rid="B16">Dym et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B3">Atman et&#x20;al., 2008</xref>). Design situates learning in real life contexts, involving ambiguity and multiple potentially viable solutions (<xref ref-type="bibr" rid="B35">Lammi and Becker, 2013</xref>), and thus promotes the development of students to adapt rapidly to diverse, complicated, and changing requirements (<xref ref-type="bibr" rid="B16">Dym et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B35">Lammi and Becker, 2013</xref>). Generally, design thinking in the context of technology and engineering settings follows five stages (<xref ref-type="bibr" rid="B18">Erickson et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B36">Lindberg et&#x20;al., 2010</xref>): Empathy, define, ideate, prototype, and test. In the stage of empathy, students learn about the users for whom they are designing. Then, they redefine and articulate their specific design problem based on the findings from the empathy stage. Later, students brainstorm creative solutions, build prototypes of ideas, and test prototypes with the original/possible user group to assess their ideas. In the design thinking process, defining the problem is a critical step to capturing what the students are attempting to accomplish through the design. The Point-Of-View (POV) statement (<xref ref-type="fig" rid="F1">Figure&#x20;1</xref>), which includes three parts (user, need, insight), is one element of problem definition; this artifact often arises during the define stage and serves as a guideline during the entire design process (<xref ref-type="bibr" rid="B60">Sohaib et&#x20;al., 2019</xref>).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>An example of a Point of View (POV) from course reading (<xref ref-type="bibr" rid="B55">Rikke Friis and Teo Yu, 2020</xref>).</p>
</caption>
<graphic xlink:href="feduc-06-772832-g001.tif"/>
</fig>
<p>In the context of the design thinking course in which this research took place, students worked in groups to write a POV statement to address one or more problem(s) their potential user(s) may confront, by combining user, needs, and insights into a 1-2 sentence statement. Students were instructed that a good problem statement is human-centered, reflecting specific users&#x2019; insights, broad enough for creative freedom but not too narrowly focused to explore creative ideas, and narrow enough to make it manageable and feasible within a given timeframe (<xref ref-type="bibr" rid="B55">Rikke Friis and Teo Yu, 2020</xref>). Hence, a good POV statement is considered a &#x201c;meaningful and actionable&#x201d; problem statement (<xref ref-type="bibr" rid="B55">Rikke Friis and Teo Yu, 2020</xref>), which guides people to foreground insights about the emotion and experiences of possible user groups (<xref ref-type="bibr" rid="B31">Karjalainen, 2016</xref>). It is a crucial step which defines the right challenge to situate the ideation process in a goal-oriented manner (<xref ref-type="bibr" rid="B68">Woolery, 2019</xref>) and inspires a team to generate multiple quality solutions (<xref ref-type="bibr" rid="B33">Kernbach and Nabergoj, 2018</xref>). Further, effective POV statements facilitate the ideation process by helping an individual to better communicate one&#x2019;s vision to team members or other stakeholders (<xref ref-type="bibr" rid="B31">Karjalainen, 2016</xref>).</p>
<p>To encourage students to write well-defined and focused POV statements, design thinking instructors have highlighted the importance of teaching detailed, explicit criteria of good POV statements based on a specific grading rubric (<xref ref-type="bibr" rid="B20">Gettens et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B56">Riofr&#xed;o et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B21">Gettens and Spotts, 2018</xref>; <xref ref-type="bibr" rid="B23">Haolin et&#x20;al., 2019</xref>). Though competent use of scoring rubrics is believed to ensure reliability and validity of performance assessments, there are inherent difficulties in carrying out rubric-based assessments on summative assignments (<xref ref-type="bibr" rid="B30">Jonsson and Svingby, 2007</xref>). Further, this assessment becomes especially difficult in the context of collaborative, project-based design thinking assignments which demand a high level of creativity (<xref ref-type="bibr" rid="B40">Mahboub et&#x20;al., 2004</xref>), especially in terms of organizing the content and structure of the rubric (<xref ref-type="bibr" rid="B11">Chapman and Inman, 2009</xref>). Bartholomew et&#x20;al. have also noted that traditional teacher-centric assessment models (e.g., rubrics) are not always effective at facilitating students&#x2019; learning in a meaningful way (<xref ref-type="bibr" rid="B5">Bartholomew et&#x20;al., 2020a</xref>) and other studies have raised questions about the reliability and validity of the rubric-based assessment, such as subjectivity bias of the graders (<xref ref-type="bibr" rid="B24">Hoge and Butcher, 1984</xref>), one&#x2019;s leniency or severity (<xref ref-type="bibr" rid="B38">Lunz and Stahl, 1990</xref>; <xref ref-type="bibr" rid="B39">Lunz et&#x20;al., 1990</xref>; <xref ref-type="bibr" rid="B61">Spooren, 2010</xref>), and halo effect due to the broader knowledge of some students (<xref ref-type="bibr" rid="B66">Wilson and Wright, 1993</xref>).</p>
<p>In contrast to rubrics, Adaptive comparative judgement (ACJ) has been implemented as an efficient and statistically sound measure to assess the relative quality of each student&#x2019;s work (<xref ref-type="bibr" rid="B7">Bartholomew et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B5">Bartholomew et&#x20;al., 2020a</xref>). In ACJ, an individual compares and evaluates pairs of items (e.g., the POV statements) and chooses the better of the two; this process is repeated&#x2014;with different pairings of items&#x2014;until a rank order of all items is created (<xref ref-type="bibr" rid="B62">Thurstone, 1927</xref>). The pairwise comparison process is iterative and multiple judges can make comparative decisions on multiple sets of work (<xref ref-type="bibr" rid="B62">Thurstone, 1927</xref>), with the final ordering of items&#x2014;from strongest to weakest&#x2014;calculated using multifaceted Rasch modeling (<xref ref-type="bibr" rid="B53">Rasch, 1980</xref>). In addition to a ranking, the judged quality of the items results in the creation of parameter values&#x2014;which specify both the rank and the magnitude of differences between items&#x2014;based on the outcome of the judgments (<xref ref-type="bibr" rid="B49">Pollitt, 2012b</xref>). Thus, the ACJ approach differs fundamentally from a traditional rubric-based approach in that it allows summative assessment without subjective point assigning (<xref ref-type="bibr" rid="B49">Pollitt, 2012b</xref>; <xref ref-type="bibr" rid="B4">Bartholomew and Jones, 2021</xref>).</p>
<p>For ACJ, there is no predetermined specific criteria like rubric-based assessments. Rather, in ACJ, holistic statement, or basis for judgment, is used. This provides the rationale for judges&#x2019; decisions and is considered a critical theoretical underpinning for reliability and validity (<xref ref-type="bibr" rid="B64">Van Daal et&#x20;al., 2019</xref>). To achieve a level of consensus in ACJ, professionally trained judges&#x2019; with collective expertise are often considered ideal; however, studies have also demonstrated that students&#x2014;with less preparation and/or expertise&#x2014;can also be proficient judges with levels of reliability and validity similar to professionals (<xref ref-type="bibr" rid="B28">Jones and Alcock, 2014</xref>). For examples, studies investigating concurrent validity of peer-evaluated ACJ showed that the results generated by peer-evaluated ACJ had a high correlation with the results of experts (e.g., professionally trained instructors, graders) (<xref ref-type="bibr" rid="B28">Jones and Alcock, 2014</xref>; <xref ref-type="bibr" rid="B5">Bartholomew et&#x20;al., 2020a</xref>). Jones and Alcock (<xref ref-type="bibr" rid="B28">Jones and Alcock, 2014</xref>) conducted peer-evaluated ACJ in the field of mathematics, to see the conceptual understanding of multivariable calculus. The results indicated mean peer and mean expert scores of ACJ had high correlation (<italic>r</italic>&#x20;&#x3d; 0.77), and also had significant correlation with summative assessments. Similarly, Bartholomew and others (<xref ref-type="bibr" rid="B5">Bartholomew et&#x20;al., 2020a</xref>) compared the results of professional, experienced instructors&#x2019; ACJ with student-evaluated ACJ results. Though peer-evaluated ACJ showed non-normality, results suggested strong correlation between peer-evaluated ACJ and instructor-evaluated&#x20;ACJ.</p>
<p>The present study aims to investigate whether peer-evaluated ACJ can yield sound validity in design thinking. More specifically, the validity of ACJ was studied from two perspectives: construct validity and criterion validity (as investigated through both concurrent and predictive validity). The construct validity was studied based on the holistic nature of ACJ.&#x20;Three researchers with professional backgrounds evaluated POV statements, studying whether the results of ACJ (parameter values) appropriately reflected general criteria of good POV statement. Following the construct validity, criterion validity was studied. First, researchers investigated concurrent validity of peer-evaluated ACJ by studying the relationships of peer-evaluated ACJ and instructors&#x2019; rubric-based grading. Second, the researchers studied the predictive validity of peer-evaluated ACJ by studying the relationships of peer-evaluated ACJ and students&#x2019; final grades. By doing so, we explored the validity of implementing peer-evaluated ACJ in design thinking context.</p>
</sec>
<sec id="s2">
<title>Literature Review</title>
<p>In this section, we first will start by introducing the concept of a POV statement and the importance of a good POV statement in a design thinking context. Then, two assessments implemented to evaluate POV statements will be presented: rubric-based grading and ACJ.&#x20;To explore the potential of ACJ as an effective and efficient alternative to rubric-based grading widely implemented in design thinking context, we share a brief review of existing literature on the reliability and validity prior to making our contribution to the knowledge base through this research.</p>
<sec id="s2-1">
<title>Point-Of-View Statements</title>
<p>The problem definition stage of design thinking explores the problem space and creates a meaningful and actionable problem statement (<xref ref-type="bibr" rid="B55">Rikke Friis and Teo Yu, 2020</xref>). Dam and Siang asserted that a good POV statement has three major traits (<xref ref-type="bibr" rid="B14">Dam and Siang, 2018</xref>). First, the POV needs to be human-oriented. This means the problem statement students write should focus on the specific users, from whom they learn the needs and insights through the empathy stage. Also, a human-centered POV statement is required to be about the people who are stakeholders in the design problem rather than the technology, monetary return, and/or product improvement. Second, the problem statement should be broad enough for creative freedom meaning the problem statement should be devoid of a specific method or solution. When the statement is framed around a narrowly defined solution, or with a possible solution in mind, it restricts the creativity of the ideation process (<xref ref-type="bibr" rid="B65">Wedell-Wedellsborg, 2017</xref>). The final trait of a strong problem statement is that it should be narrow enough to make it viable with the available resources. The third trait complements the second trait, which suggests that the POV statements should possess appropriate parameters for the scope of the problem, avoiding extreme narrowness or ambiguity. A good POV statement, equipped with all three traits, can contribute to delivering attention, providing sound framework for the problem, motivating students working on the problem, and providing informational guidelines (<xref ref-type="bibr" rid="B60">Sohaib et&#x20;al., 2019</xref>).</p>
</sec>
<sec id="s2-2">
<title>Assessment of Point-Of-View Statements With Rubrics</title>
<p>One trend among assessments in higher education is a shift from traditional knowledge-based tests towards assessment to support learning (<xref ref-type="bibr" rid="B15">Dochy et&#x20;al., 2006</xref>). In order to capture students&#x2019; higher-order thinking, a credible, trustworthy assessment, which is both valid and reliable, is needed. The historic development of a rubric as a scoring tool for the assessment of students&#x2019; authentic and complex work, including what counts (e.g., user, needs, insights are what count in POV statements) and for how much, has traditionally centered on 1) articulating the expectations of quality for each task and 2) describing the gradation of quality (e.g., excellent to poor, proficient to novice) for each element (<xref ref-type="bibr" rid="B11">Chapman and Inman, 2009</xref>; <xref ref-type="bibr" rid="B54">Reddy and Andrade, 2010</xref>). Three factors are included in a rubric: evaluation criteria, quality definitions, and a scoring strategy. The analytic rubric used in the Design thinking course to grade POV statements is included below (<xref ref-type="table" rid="T1">Table&#x20;1</xref>). The rubric-based evaluation of competency is made through analytical reflections by graders, in which the representation of the ability is scored on a set of established categories of criteria (<xref ref-type="bibr" rid="B13">Coenen et&#x20;al., 2018</xref>).</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Grading rubrics of POV statements from the design thinking course.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Evaluation criteria</th>
<th align="center">Proficient</th>
<th align="center">Adequate</th>
<th align="center">Novice</th>
<th align="center">Criterion score</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="2" align="left">Detail for USER and NEEDS</td>
<td align="left">(6 points)</td>
<td align="left">(3 points)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">6</td>
</tr>
<tr>
<td align="left">Student work includes adjectives and details to describe the users and their needs. 1 USER and 1 NEED are identified. USERS and NEEDS are clear and concise, actionable, and provide a solid framework for a problem</td>
<td align="left">Fewer than the required number of USERS and NEEDS have been generated. USERS and NEEDS are too vague to be useful</td>
<td align="left">None</td>
</tr>
<tr>
<td rowspan="2" align="left">INSIGHT</td>
<td align="left">(4 points)</td>
<td align="left">(1 point)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">4</td>
</tr>
<tr>
<td align="left">Student work shows evidence of considering multiple insights based on the USER and NEEDS. INSIGHTS are surprising and inspirational</td>
<td align="left">Evidence for only single INSIGHT was shown. INSIGHT is not based on the USERS or NEEDS; they are uninspiring or obvious</td>
<td align="left">None</td>
</tr>
<tr>
<td rowspan="2" align="left">POV</td>
<td align="left">(5 points)</td>
<td align="left">(2.5 points)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">5</td>
</tr>
<tr>
<td align="left">Students generated 1 POV statement stemming from the USERS and NEEDS generated. The statement is synthesized, clear, and actionable</td>
<td align="left">USER, NEED, and INSIGHT are not aligned with each other or the problem. The POV is too vague to be useful, it is unclear, and/ or not actionable</td>
<td align="left">None</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s2-3">
<title>Adaptive Comparative Judgment</title>
<p>Adaptive comparative judgment (ACJ) is an evaluation approach accomplished through multiple comparisons. In 1927, Thurstone presented the &#x201c;Law of Comparative Judgment&#x201d; (<xref ref-type="bibr" rid="B62">Thurstone, 1927</xref>) as an alternative to the existing measurement scales, aimed at increasing reliability. Thurstone specifically argued that making decisions using holistic comparative judgments can increase reliability compared to decisions made from predetermined rubric criteria (<xref ref-type="bibr" rid="B62">Thurstone, 1927</xref>). Years later, based on Thurstone&#x2019;s law of comparative judgement, Pollitt outlined the potential for ACJ, seeking the possibility of implementing the comparative judgment approach in marking a wide range of educational assessments (<xref ref-type="bibr" rid="B49">Pollitt, 2012b</xref>), with statistically sound measurements in terms of accuracy and consistency (<xref ref-type="bibr" rid="B4">Bartholomew and Jones, 2021</xref>). The adaptive attribute of ACJ is based on an algorithm embedded within the approach which pairs similarly ranked items as the judge makes progress in the comparative judgement process&#x2014;an approach aimed at expediting the process of achieving an acceptable level of reliability (<xref ref-type="bibr" rid="B34">Kimbell, 2008</xref>; <xref ref-type="bibr" rid="B7">Bartholomew et&#x20;al., 2019</xref>).</p>
<p>We choose to use a software titled RMCompare to facilitate adaptive comparative judgment enabling students to make a series of judgments with an outcome consisting of several helpful data, including: a rank order of the items judged, parameter values (statistical values representing the relative quality of each item), judgment time of each comparison, a misfit statistic of judges and items (showing consistency, or lack thereof, among judgments), and judge-provided rationale for the comparative decisions (<xref ref-type="bibr" rid="B49">Pollitt, 2012b</xref>). Previous research has shown that utilizing these data can provide educators with a host of possibilities including insight into students&#x2019; judgment criteria, consensus, and their processing/understanding of the given task. In a design thinking process scenario specifically, ACJ&#x2014;though originally designed for expert assessment&#x2014;has demonstrated through educational research efforts to be a helpful measure for students who participate in the task because it promotes learning and engagement (<xref ref-type="bibr" rid="B59">Seery et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B7">Bartholomew et&#x20;al., 2019</xref>). Specifically, Bartholomew et&#x20;al. noted that ACJ can efficiently facilitate learning among students studying design and innovation by including students as judges (<xref ref-type="bibr" rid="B5">Bartholomew et&#x20;al., 2020a</xref>).</p>
</sec>
<sec id="s2-4">
<title>Validity of Adaptive Comparative Judgment</title>
<sec id="s2-4-1">
<title>Construct Validity of Adaptive Comparative Judgment: Holistic Approach</title>
<p>The traditional concept of validity was established by Kelley (<xref ref-type="bibr" rid="B32">Kelley, 1927</xref>), who claimed that validity is the extent to which a test measures what it is supposed to measure. Construct validity pertains to &#x201c;the degree to which the measure of a content sufficiently measures the intended concept&#x201d; (<xref ref-type="bibr" rid="B44">O&#x2019;Leary-Kelly and Vokurka, 1998</xref>, <italic>p</italic>. 387). The validity estimate has to be considered in the context of its use, and needs evidence of the relevance and the utility of the score inferences and actions (<xref ref-type="bibr" rid="B42">Messick, 1994</xref>). In other words, researchers need to take into account the context, with adequate construct validity evidence, to support the inferences made from a measure (<xref ref-type="bibr" rid="B26">Hubley and Zumbo, 2011</xref>).</p>
<p>Since ACJ requires holistic assessment, researchers examining the validity of comparative judgement have highlighted the importance of an agreed upon set of criteria (<xref ref-type="bibr" rid="B47">Pollitt, 2012a</xref>) and shared consensus across judges (<xref ref-type="bibr" rid="B47">Pollitt, 2012a</xref>; <xref ref-type="bibr" rid="B29">Jones et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B64">Van Daal et&#x20;al., 2019</xref>). In terms of an agreed upon criteria for judgment, in some instances, rather than following a predetermined specific criterion for the assessment, judges in ACJ have followed a general description regarding the assessment. For instance, Pollitt (<xref ref-type="bibr" rid="B47">Pollitt, 2012a</xref>) used the &#x201c;Importance Statements&#x201d; published on England&#x2019;s National Curriculum to assess design thinking portfolios:</p>
<list list-type="simple">
<list-item>
<p>&#xa0;In design and technology pupils combine practical and technological skills with creative thinking to design and make products and systems that meet human needs. They learn to use current technologies and consider the impact of future technological developments. They learn to think creatively and intervene to improve the quality of life, solving problems as individuals and members of a&#x20;team.</p>
</list-item>
<list-item>
<p>&#xa0;Working in stimulating contexts that provide a range of opportunities and draw on the local ethos, community and wider world, pupils identify needs and opportunities. They respond with ideas, products and systems, challenging expectations where appropriate. They combine practical and intellectual skills with an understanding of aesthetic, technical, cultural, health, social, emotional, economic, industrial, and environmental issues. As they do so, they evaluate present and past design and technology, and its uses and effects. Through design and technology pupils develop confidence in using practical skills and become discriminating users of products. They apply their creative thinking and learn to innovate. (<xref ref-type="bibr" rid="B52">QCDA., 1999</xref>).</p>
</list-item>
</list>
<p>The shared consensus among judges, facilitated through the ACJ process, underpins the validity of ACJ, because each artifact is systematically evaluated in various pairings across multiple judges. Through the process of judgement, a shared conceptualization of quality and collective expertise of judges is then reflected in the final rank order (<xref ref-type="bibr" rid="B64">Van Daal et&#x20;al., 2019</xref>). Though the majority of studies initially limited the judges to trained graders/instructors, recent work has explored students&#x2019; (or other untrained judges&#x2019;) competence as judges in ACJ (<xref ref-type="bibr" rid="B57">Rowsome et&#x20;al., 2013</xref>; <xref ref-type="bibr" rid="B28">Jones and Alcock, 2014</xref>; <xref ref-type="bibr" rid="B46">Palisse et&#x20;al., 2021</xref>). Findings suggest that, in many cases, students&#x2014;and even out-of-class-professionals (e.g., practicing engineers; see <xref ref-type="bibr" rid="B69">Strimel et&#x20;al., 2021</xref>) can reach similar consensus to that reached by trained judges or classroom teachers suggesting a shared quality consensus across different judge groups.</p>
<p>Considering the curriculum, goals, and educational setting of design thinking, our research team postulated that when implementing ACJ to assess POV statements of the students in the design thinking course, the high score of parameter values should reasonably be interpreted as one&#x2019;s ability to write a good POV statement, while a low score of parameter values can be understood as one&#x2019;s low ability, or lack of ability, to write a good POV statement.</p>
</sec>
<sec id="s2-4-2">
<title>Validity of Adaptive Comparative Judgment: Criterion Validity</title>
<p>In classical views of validity, criterion validity concerns &#x201c;the correlation with a measure and a standard regarded as a representative of the construct under consideration&#x201d; (<xref ref-type="bibr" rid="B12">Clemens et&#x20;al., 2018</xref>). If the measure shows a correlation with an assessment in the same time frame, it is termed concurrent validity. If the measure shows a correlation with a future assessment, it is termed predictive validity. The criterion validity evidence is related to how accurately one measure predicts the outcome of another criterion measure. Criterion validity is useful for predicting performance of an individual in different context (e.g., past, present, future) (<xref ref-type="bibr" rid="B9">Borrego et&#x20;al., 2009</xref>).</p>
<p>Although the unique, holistic characteristics of ACJ provides meaningful insights, concurrent validity of ACJ also has been studied with great importance (<xref ref-type="bibr" rid="B28">Jones and Alcock, 2014</xref>; <xref ref-type="bibr" rid="B29">Jones et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B8">Bisson et&#x20;al., 2016</xref>). There has been several efforts to establish criterion validity of ACJ, which mostly concentrated on the concurrent validity (<xref ref-type="bibr" rid="B28">Jones and Alcock, 2014</xref>; <xref ref-type="bibr" rid="B29">Jones et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B8">Bisson et&#x20;al., 2016</xref>). These studies compared the results of ACJ with the results of other validated assessments to investigate the conceptual understanding. Examining the criterion validity is crucial to implement ACJ in various educational contexts as an effective alternative. Considering that ACJ can be rapidly applied to target concepts, it has the potential to effectively and efficiently evaluate various artifacts in a wide range of contexts with high validity and reliability (<xref ref-type="bibr" rid="B8">Bisson et&#x20;al., 2016</xref>).</p>
<p>Informed by previous studies, this study examines the validity of peer-evaluated ACJ in design thinking context. Though it has relatively high and stable reliability, coming from its adaptive nature, empirical evidence regarding ACJ&#x2019;s predictive validity is limited (<xref ref-type="bibr" rid="B59">Seery et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B64">Van Daal et&#x20;al., 2019</xref>). Delving into predictive validity is necessary for demonstrating the technical adequacy and practical utility of ACJ (<xref ref-type="bibr" rid="B12">Clemens et&#x20;al., 2018</xref>). Therefore, investigating the validity of ACJ may provide another potentially strong peer assessment measure in design thinking context, where most of the assignments are portfolios, thus hard to operationalize explicit assessment criteria using traditional rubric based approaches (<xref ref-type="bibr" rid="B5">Bartholomew et&#x20;al., 2020a</xref>). Not only may ACJ be a viable assessment tool but, it may also be a valuable learning experience for students who engage in the peer evaluation process (<xref ref-type="bibr" rid="B5">Bartholomew et&#x20;al., 2020a</xref>).</p>
</sec>
</sec>
</sec>
<sec id="s3">
<title>Research Question</title>
<p>The ACJ-produced rank order and standardized scores (i.e.,&#x20;parameter values) reflect the relative work quality of students&#x2019; POV statements according to the ACJ judges. Therefore, researchers assumed that POV statements with higher parameter values were better in quality when compared to the POV statements with lower parameter values. The first research question investigated in this study will qualitatively explore how students&#x2019; shared consensus reflects the general and broad criteria of good POV statement.</p>
<p>RQ 1. What is the construct validity of ACJ? Does peer-reviewed ACJ reflect general criteria of good POV statements?</p>
<p>Taking its effectiveness and efficiency into consideration, studies already explored ACJ&#x2019;s theoretical promise in educational setting as a new approach with acceptable statistical evidence (<xref ref-type="bibr" rid="B28">Jones and Alcock, 2014</xref>; <xref ref-type="bibr" rid="B5">Bartholomew et&#x20;al., 2020a</xref>). This study aims to investigate the criterion validity of ACJ.&#x20;More specifically, concurrent validity and predictive validity of ACJ were examined by comparing the results of ACJ with rubric-based grading.</p>
<p>RQ 2. What is the criterion validity of ACJ? Does peer-reviewed ACJ correlate with existing assessment?</p>
<p>RQ 2-1. What is the concurrent validity of ACJ? Does peer-reviewed ACJ correlate with instructors&#x2019; rubric-based grading on the same assignment?</p>
<p>RQ 2-2. What is the predictive validity of ACJ? Does peer-reviewed ACJ predict instructors&#x2019; rubric-based grading on the key final project deliverable?</p>
</sec>
<sec sec-type="methods" id="s4">
<title>Methods</title>
<sec id="s4-1">
<title>Participants</title>
<p>Study participants were 597 technology students out of 621 students enrolled in a first-year Design Thinking Course at a large Midwestern university in the United&#x20;States during Spring 2019. These students are subset of entire Polytechnic population (<italic>N</italic>&#x20;&#x3d; 4,480). This research was approved by the university&#x2019;s Institutional Research Board. Sociodemographic information of the participants is provided in <xref ref-type="table" rid="T2">Table&#x20;2</xref>.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Sociodemographic characteristics of participants.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Socio-demographic variables</th>
<th align="center">Number</th>
<th align="center">Percent</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="3" align="left">Gender</td>
</tr>
<tr>
<td align="left">&#x2003;Female</td>
<td align="center">147</td>
<td align="char" char=".">24.62</td>
</tr>
<tr>
<td align="left">&#x2003;Male</td>
<td align="center">446</td>
<td align="char" char=".">74.71</td>
</tr>
<tr>
<td align="left">&#x2003;Prefer not to answer</td>
<td align="center">4</td>
<td align="char" char=".">0.67</td>
</tr>
<tr>
<td colspan="3" align="left">Residency</td>
</tr>
<tr>
<td align="left">&#x2003;Foreign</td>
<td align="center">52</td>
<td align="char" char=".">8.72</td>
</tr>
<tr>
<td align="left">&#x2003;Non-Resident</td>
<td align="center">207</td>
<td align="char" char=".">34.67</td>
</tr>
<tr>
<td align="left">&#x2003;Resident</td>
<td align="center">334</td>
<td align="char" char=".">55.95</td>
</tr>
<tr>
<td align="left">&#x2003;Prefer not to answer</td>
<td align="center">4</td>
<td align="char" char=".">0.67</td>
</tr>
<tr>
<td colspan="3" align="left">Race</td>
</tr>
<tr>
<td align="left">&#x2003;Multiracial</td>
<td align="center">17</td>
<td align="char" char=".">2.85</td>
</tr>
<tr>
<td align="left">&#x2003;Alaskan Native</td>
<td align="center">1</td>
<td align="char" char=".">0.17</td>
</tr>
<tr>
<td align="left">&#x2003;Asian</td>
<td align="center">53</td>
<td align="char" char=".">8.88</td>
</tr>
<tr>
<td align="left">&#x2003;Black/African American</td>
<td align="center">14</td>
<td align="char" char=".">2.35</td>
</tr>
<tr>
<td align="left">&#x2003;Hispanic/ Latino</td>
<td align="center">41</td>
<td align="char" char=".">6.87</td>
</tr>
<tr>
<td align="left">&#x2003;Native American</td>
<td align="center">1</td>
<td align="char" char=".">0.17</td>
</tr>
<tr>
<td align="left">&#x2003;Unknown</td>
<td align="center">8</td>
<td align="char" char=".">1.34</td>
</tr>
<tr>
<td align="left">&#x2003;White</td>
<td align="center">406</td>
<td align="char" char=".">68.01</td>
</tr>
<tr>
<td align="left">&#x2003;Prefer not to answer</td>
<td align="center">56</td>
<td align="char" char=".">9.38</td>
</tr>
<tr>
<td colspan="3" align="left">Rank by credit hour</td>
</tr>
<tr>
<td align="left">&#x2003;Freshman</td>
<td align="center">182</td>
<td align="char" char=".">30.49</td>
</tr>
<tr>
<td align="left">&#x2003;Sophomore</td>
<td align="center">235</td>
<td align="char" char=".">39.36</td>
</tr>
<tr>
<td align="left">&#x2003;Junior</td>
<td align="center">124</td>
<td align="char" char=".">20.77</td>
</tr>
<tr>
<td align="left">&#x2003;Senior</td>
<td align="center">52</td>
<td align="char" char=".">8.71</td>
</tr>
<tr>
<td align="left">&#x2003;Prefer not to answer</td>
<td align="center">4</td>
<td align="char" char=".">0.67</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4-2">
<title>Research Process</title>
<sec id="s4-2-1">
<title>Research Design</title>
<p>The research design of this study is graphically depicted by <xref ref-type="fig" rid="F2">Figure&#x20;2</xref>. First, students wrote the POV statements during the project 3 as a team. Researchers collated and anonymized the total 124 POV statements. Followed by this process, students performed ACJ on their peer&#x2019;s POV statements (Assessment 1, peer-evaluated ACJ). Concurrently, instructors graded the same POV statement using rubrics (Assessment 2, <xref ref-type="table" rid="T1">Table&#x20;1</xref>). After project 3, instructors, who worked as graders assigned grades to final deliverables of project 3 (Assessment 3). To study the construct validity, researchers qualitatively analyzed ACJ statements using content analysis. Before analyzing the criterion validity, we analyzed the descriptive statistics of all three assessments. For the concurrent validity, we studied correlation between the peer-evaluated ACJ (Assessment 1) and instructors&#x2019; grading based on rubric (Assessment 2). Finally, for the predictive validity, we examined if peer-evaluated ACJ (Assessment 1) predicts final deliverables (Assessment&#x20;3).</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Research design of this&#x20;study.</p>
</caption>
<graphic xlink:href="feduc-06-772832-g002.tif"/>
</fig>
</sec>
<sec id="s4-2-2">
<title>Study Context and Point-Of-View Statement Writing</title>
<p>In the semester-long, three credit design thinking course, 597 students from 14 sections designed and developed solutions to real problems, voluntarily forming 124 groups in alignment with their current interests or major within each section of the course. During the course, students fostered their own foundational understanding of design thinking by participating in three projects, in which they could create, optimize, and prepare innovative solutions for people. The first project was designed to provide overview and theoretical descriptions with simple hands-on projects about the design thinking process and lasted about a week. The second course project was a more real-life based group project, and took approximately 4&#xa0;weeks, following the five stages of design thinking: empathize, define the problem, ideate, prototype, and test (retest).</p>
<p>The final project spanned about 8&#xa0;weeks and engaged students in addressing a problem related to a self-selected grand challenge of engineering (<xref ref-type="bibr" rid="B43">National Academy of Engineering, 2008</xref>). In this study, we observed the &#x201c;define&#x201d; stage of the third project, when we hypothesized that students would have had enough experience with the design thinking process, including the POV statements, to work comfortably through the designing approach. At this point in class these students had already written four POV statements, two as an individual during the first project, and two as a team during the second project. As a part of the define stage during the third project, the course instructors utilized one 50-min class concentrating on POV creation, highlighting essential components of quality POV statements (user, needs, and insights), structures of POV statements, essential criteria for producing a good POV statement, and importance of writing a good POV statement for this project. During and after this class session, the students wrote a definition of their problem as a team using a provided format for POV statements [User . . . (descriptive)] needs [need . . . (verb)] because [insight. . . (compelling)].</p>
</sec>
<sec id="s4-2-3">
<title>Measures</title>
<p>This study used three types of assessments: peer-evaluated ACJ of POVs (Assessment 1), rubric-based grading of POV(Assessment 2), and rubric-based grading of final deliverables (Assessment 3). First, we compared two types of assessments: Assessment 1 and Assessment 2. For both rubric based and ACJ based assessments, all the POV statements from the 124 teams written at the beginning of the final project were included in the dataset. Then, researchers included the rubric-based grading of final deliverables (Assessment 3) to see if the peer-evaluated ACJ can predict the future achievements.</p>
<p>Assessment 1. Peer-Evaluated ACJ of the POV Statements.</p>
<p>For the peer-evaluated ACJ, the POV statements were collated, anonymized, and uploaded into the ACJ software called <italic>RMCompare</italic> for evaluation. Near the end of the final project, in preparation for presenting their design projects, students were challenged to evaluate the POV statements using the <italic>RMCompare</italic> interface by selecting the POV statement they believed was holistically better between the pairs displayed to them. For the holistic judgment prompt, students were reminded of general qualities of good POV statements (<xref ref-type="bibr" rid="B55">Rikke Friis and Teo Yu, 2020</xref>), which were already familiar to them. Students previously used these same criteria (<xref ref-type="bibr" rid="B55">Rikke Friis and Teo Yu, 2020</xref>) as class material to learn the notion of POV statement. Each student (550 of 597) compared approximately 8 pairs of POV statements written by their peers. The subsequent ACJ judgments resulted in all 124 POV statements being compared at least 12&#x20;times to other increasingly similarly ranked POV statements in line with the adaptive nature of the software. As a result, the rank and parameter value for each POV statement was automatically calculated using the embedded Rasch multifaceted model (see <xref ref-type="bibr" rid="B49">Pollitt, 2012b</xref>; <xref ref-type="bibr" rid="B48">Pollitt, 2015</xref> for more details).</p>
<p>Assessment 2. Instructor&#x2019;s Rubric-Based Grading of the POV Statements.</p>
<p>Rubric based grading was performed based on assigned criteria (<xref ref-type="table" rid="T1">Table&#x20;1</xref>). Graders are currently working as course instructors of design thinking course, who were pursuing a MS or Ph.D. degree in relevant fields (e.g., engineering, polytechnic, or education) at the time of study. Each grader assessed two sections, in which around 40 students enrolled. As a result, the numerical grading value (total 15&#xa0;pts) were provided.</p>
<p>Assessment 3. Final Project Deliverables.</p>
<p>Student teams submitted their final prototypes as one of the significant final project deliverables. They plan, implement, and reflect on testing scenarios for their prototypes, and present prototypes for the purpose of receiving feedback from the peers. Instructors (same as Assessment 2) grade the prototypes as a key final deliverable based on assigned criteria (see <xref ref-type="table" rid="T3">Table&#x20;3</xref>). As a result, the numerical grading value (total 35&#xa0;pts) were provided.</p>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Rubrics of the final project deliverable.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Criteria</th>
<th align="center">Proficient</th>
<th align="center">Adequate</th>
<th align="center">Novice</th>
<th align="center">Criterion score</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="2" align="left">Sketches of how it will work provided</td>
<td align="left">(4 points)</td>
<td align="left">(2 points)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">4</td>
</tr>
<tr>
<td align="left">Sketches illustrating how it works</td>
<td align="left">Sketches provided for prototype are provided but are misaligned and/or unclear</td>
<td align="left">Sketches entirely lacking</td>
</tr>
<tr>
<td rowspan="2" align="left">Area of concern/ functionality investigated by prototype described</td>
<td align="left">(4 points)</td>
<td align="left">(2 points)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">4</td>
</tr>
<tr>
<td align="left">Robust description provided for prototype</td>
<td align="left">Descriptions are provided but muddled/unclear</td>
<td align="left">Insufficient descriptions provided</td>
</tr>
<tr>
<td rowspan="2" align="left">Picture of prototype included; Description of how prototype was&#x20;built included</td>
<td align="left">(4 points)</td>
<td align="left">(2 points)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">4</td>
</tr>
<tr>
<td align="left">Pictures and robust description provided for prototype</td>
<td align="left">Some pictures provided; descriptions are provided but muddled/unclear</td>
<td align="left">Picture lacking; Insufficient descriptions provided</td>
</tr>
<tr>
<td rowspan="2" align="left">Pictures provided of prototype &#x201c;in&#x20;use&#x201d;; description of relevant test&#x20;conditions</td>
<td align="left">(4 points)</td>
<td align="left">(2 points)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">4</td>
</tr>
<tr>
<td align="left">Pictures and robust description provided for prototype</td>
<td align="left">Pictures included; descriptions provided for prototype; descriptions are provided but muddled/unclear</td>
<td align="left">Pictures lacking; Insufficient descriptions provided</td>
</tr>
<tr>
<td rowspan="2" align="left">Test results provided</td>
<td align="left">(5 points)</td>
<td align="left">(2.5 points)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">5</td>
</tr>
<tr>
<td align="left">Test results included; results are primarily quantitative with supplemental qualitative results included</td>
<td align="left">Test results included but results primarily observational or anecdotal</td>
<td align="left">Test results either lacking, or extremely insufficient</td>
</tr>
<tr>
<td rowspan="2" align="left">Most comparable existing product pictured; differences described</td>
<td align="left">(4 points)</td>
<td align="left">(2 points)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">4</td>
</tr>
<tr>
<td align="left">Pictures included; differences provided</td>
<td align="left">Pictures provided; differences provided but are muddled/unclear</td>
<td align="left">Pictures lacking; Insufficient differences provided</td>
</tr>
<tr>
<td rowspan="2" align="left">Prototype Functions</td>
<td align="left">(10 points)</td>
<td align="left">(5 points)</td>
<td align="left">(0 points)</td>
<td rowspan="2" align="center">10</td>
</tr>
<tr>
<td align="left">The group&#x2019;s prototype functions properly</td>
<td align="left">The prototype partially function</td>
<td align="left">The prototype does not function</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec id="s4-3">
<title>Analysis</title>
<sec id="s4-3-1">
<title>Construct Validity</title>
<sec id="s4-3-1-1">
<title>Qualitative Content Analysis (QCA)</title>
<p>Content analysis is an analytic method frequently adopted in both quantitative and qualitative research for the systematic reduction of text or video data (<xref ref-type="bibr" rid="B25">Hsieh and Shannon, 2005</xref>; <xref ref-type="bibr" rid="B41">Mayring, 2015</xref>). Qualitative content analysis, QCA is one of the recognized research methods in the field of education. It is a method for &#x201c;the subjective interpretation of the content of text data through the systematic classification process of coding and identifying themes or patterns&#x201d; (<xref ref-type="bibr" rid="B25">Hsieh and Shannon, 2005</xref>, <italic>p</italic>. 1278). We used directive (qualitative) content analysis to extend the findings of ACJ, therefore enriching the findings (<xref ref-type="bibr" rid="B50">Potter and Levine-Donnerstein, 1999</xref>). The focus of current study was on validating ACJ from analyzing the key concepts of POV statements (e.g., structure, user, needs, and insights). Researchers began the research by identifying the key concepts POV statements. Then, researchers begin coding immediately with the predetermined codes. We articulated four categories based on the discussion: framework (alignment, logic), user, needs, and insights.</p>
<p>Two major approaches are frequently used for the validity and reliability of QCA: Quantitative and qualitative (<xref ref-type="bibr" rid="B41">Mayring, 2015</xref>). Quantitative approach measures inter-coder reliability and agreement using the quantitative methods (<xref ref-type="bibr" rid="B42">Messick, 1994</xref>). Qualitative approach adopts a consensus process in which multiple coders independently code the data, compare their coding, and discuss and resolve discrepancies when they arise, rather than measuring them (<xref ref-type="bibr" rid="B58">Schreier, 2012</xref>; <xref ref-type="bibr" rid="B41">Mayring, 2015</xref>). The qualitative validation approach is preferred to the quantitative research because it provides reason with reflexivity, the critical thinking of researchers&#x2019; own assumptions and perspective (<xref ref-type="bibr" rid="B58">Schreier, 2012</xref>). This is particularly important during the negotiation process because coders meet to discuss their own rationale used in coding. In this study context, researchers compared, reviewed, and revisited coding process before reaching consensus on the codes (<xref ref-type="bibr" rid="B25">Hsieh and Shannon, 2005</xref>; <xref ref-type="bibr" rid="B19">Forman and Damschroder, 2007</xref>; <xref ref-type="bibr" rid="B58">Schreier, 2012</xref>).</p>
</sec>
<sec id="s4-3-1-2">
<title>Sample Selections of Point-Of-View Statements</title>
<p>To provide validation to ACJ data (parameter values), researchers selectively analyzed 20 POV statements out of the 124 POV statements as was done in a previous related study (<xref ref-type="bibr" rid="B6">Bartholomew et&#x20;al., 2020b</xref>). Based on ACJ, we selectively analyzed the 10 POV statements with the highest parameter values and the 10 POV statements with the lowest parameter values to provide contrasting cases. Using the rubrics implemented in the grading system (<xref ref-type="table" rid="T1">Table&#x20;1</xref>), researchers analyzed whether the parameter values were aligned with the criteria for a strong POV statement. More specifically, in an effort to explore the construct validity of the ACJ results, we investigated if the 10 POV statements with high parameter values better reflect the required criteria for good POV statements and if the 10 POV statements with low parameter values fail to meet the criteria required of the student groups.</p>
</sec>
<sec id="s4-3-1-3">
<title>Criterion Validity Analysis</title>
<p>The software program RStudio Version 1.3.959 was used for our criterion validity analysis.</p>
</sec>
<sec id="s4-3-1-4">
<title>Preliminary Data Analysis</title>
<p>Prior to running the statistical analysis, researchers screened the data for missing values and outliers. Participants with missing data on a variable were excluded from the analysis. For instance, if there was a missing value either in grader&#x2019;s grading in POV statements or final deliverables, the data were not included in the statistical analysis. As a result, 26 participants were removed from data. Values greater than 4 SD from the mean on any measures were considered as outliers and thus removed. The results of ACJ demonstrated a high level of interrater reliability (<italic>r</italic>&#x20;&#x3d; 0.94), with none of the judges showing significant misalignment.</p>
</sec>
<sec id="s4-3-1-5">
<title>Descriptive Statistics</title>
<p>We analyzed the rubric based grading of POV statements (POV Grading), ACJ on the same POV statements (ACJ), and rubric-based grading on the final deliverables (Final Deliverable) (<xref ref-type="table" rid="T4">Table&#x20;4</xref>).</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>Descriptive statistics.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left"/>
<th rowspan="2" align="center">N</th>
<th rowspan="2" align="center">Min</th>
<th rowspan="2" align="center">Max</th>
<th rowspan="2" align="center">Mean</th>
<th rowspan="2" align="center">SD</th>
<th colspan="2" align="center">Skewness</th>
<th colspan="2" align="center">Kurtosis</th>
</tr>
<tr>
<th align="center">Statistic</th>
<th align="center">SE</th>
<th align="center">Statistic</th>
<th align="center">SE</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">POV Grading</td>
<td align="center">576</td>
<td align="char" char=".">0.00</td>
<td align="char" char=".">15.00</td>
<td align="char" char=".">13.08</td>
<td align="char" char=".">3.74</td>
<td align="char" char=".">&#x2212;2.70</td>
<td align="char" char=".">0.10</td>
<td align="char" char=".">6.62</td>
<td align="char" char=".">0.20</td>
</tr>
<tr>
<td align="left">ACJ</td>
<td align="center">576</td>
<td align="char" char=".">&#x2212;1.80</td>
<td align="char" char=".">1.23</td>
<td align="char" char=".">0.01</td>
<td align="char" char=".">0.56</td>
<td align="char" char=".">&#x2212;0.53</td>
<td align="char" char=".">0.10</td>
<td align="char" char=".">0.68</td>
<td align="char" char=".">0.20</td>
</tr>
<tr>
<td align="left">Final Deliverable</td>
<td align="center">576</td>
<td align="char" char=".">16.25</td>
<td align="center">35</td>
<td align="center">20</td>
<td align="char" char=".">2.65</td>
<td align="char" char=".">1.78</td>
<td align="char" char=".">0.10</td>
<td align="char" char=".">6.94</td>
<td align="char" char=".">0.21</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4-3-1-6">
<title>Correlation and Regression Analysis</title>
<p>Specifically, both Spearman&#x2019;s <inline-formula id="inf1">
<mml:math id="m1">
<mml:mi>&#x3c1;</mml:mi>
</mml:math>
</inline-formula> and linear regression statistical techniques were employed to test the concurrent validity and predictive validity. We adopted Spearman&#x2019;s <inline-formula id="inf2">
<mml:math id="m2">
<mml:mi>&#x3c1;</mml:mi>
</mml:math>
</inline-formula> because the POV grading was negatively skewed.</p>
</sec>
</sec>
</sec>
</sec>
<sec sec-type="results" id="s5">
<title>Results</title>
<sec id="s5-1">
<title>Construct Validity of Peer-Evaluated Adaptive Comparative Judgment</title>
<p>The POV statements with the highest parameter values (<xref ref-type="table" rid="T5">Table&#x20;5</xref>) and the lowest parameter values (<xref ref-type="table" rid="T6">Table&#x20;6</xref>) are presented based on their rank order and referenced in the following discussion.</p>
<table-wrap id="T5" position="float">
<label>TABLE 5</label>
<caption>
<p>POV statements with the highest parameter values.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Rank order</th>
<th align="center">Point-of-view statement</th>
<th align="center">Parameter value</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">&#x23;1</td>
<td align="left">The school of aviation and transportation technology needs to utilize a more accessible, personalized and interactive method for giving safety meetings because currently they lack motivation and differently levels of complexity within the class environment</td>
<td align="char" char=".">1.23</td>
</tr>
<tr>
<td align="left">&#x23;2</td>
<td align="left">People utilizing automobiles and transportation vehicles need a way to reduce the amount of CO<sub>2</sub> emissions of the transportation sector because the transportation sector is the largest emitter of CO<sub>2</sub> as of 2018, which leads to more impacts of global warming</td>
<td align="char" char=".">1.17</td>
</tr>
<tr>
<td align="left">&#x23;3</td>
<td align="left">College aged students need a way to learn about the importance of recycling, by reusing wasted materials in an effective manner because it will reduce the carbon footprint that college campuses leave</td>
<td align="char" char=".">0.94</td>
</tr>
<tr>
<td align="left">&#x23;4</td>
<td align="left">The people of (Name of the City) should be offered an incentive to recycle responsibly, because recycling is being done the wrong way which hurts the environment more than it helps</td>
<td align="char" char=".">0.91</td>
</tr>
<tr>
<td align="left">&#x23;5</td>
<td align="left">College students need technology and social networks as an alternative form of learning about reading mainly in English classes so that students have access to alternative forms of non-discriminatory educational methods</td>
<td align="char" char=".">0.90</td>
</tr>
<tr>
<td align="left">&#x23;6</td>
<td align="left">(Name of the University) students need a way of navigating (Name of the University&#x2019;s) flooded sidewalks without getting their feet soaked in snow or ice because walking into class with cold and wet boots because it is both unsanitary and potentially dangerous, especially in the winter months</td>
<td align="char" char=".">0.87</td>
</tr>
<tr>
<td align="left">&#x23;7</td>
<td align="left">Due to time and accessibility constraints, students on campus need a means to achieve a healthier lifestyle without spending too much extra time and money, because better health is very important to busy and stressed college students</td>
<td align="char" char=".">0.87</td>
</tr>
<tr>
<td align="left">&#x23;8</td>
<td align="left">Junior High students need an interactive method of teaching fundamental ideas of STEM because the current system of teaching lacks the support, motivation, and exposure students need to grow intellectually</td>
<td align="char" char=".">0.81</td>
</tr>
<tr>
<td align="left">&#x23;9</td>
<td align="left">University members need a consistently secure authentication service because hacked accounts can lead to data leakage and theft</td>
<td align="char" char=".">0.79</td>
</tr>
<tr>
<td align="left">&#x23;10</td>
<td align="left">Local business owners need a cheap and efficient way to cool their data lefts, and reuse the energy because the current technology involving air conditioning and water cooling is very expensive and wasteful to the environment</td>
<td align="char" char=".">0.76</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>&#x2a;Note: Original statements are as written by students.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="T6" position="float">
<label>TABLE 6</label>
<caption>
<p>POV statements with the lowest parameter values.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Rank order</th>
<th align="center">Point-of-view statement</th>
<th align="center">Parameter value</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">&#x23;115</td>
<td align="left">People need to become more educated on the topics of stereotyping and cultural diffusion because ignorance can lead to discrimination</td>
<td align="char" char=".">&#x2212;0.85</td>
</tr>
<tr>
<td align="left">&#x23;116</td>
<td align="left">People in the (Name of the University) university need assist to find parking spots because currently there is no helpful approach improve the shortage of parking slots</td>
<td align="char" char=".">&#x2212;0.86</td>
</tr>
<tr>
<td align="left">&#x23;117</td>
<td align="left">People at (Name of the University) University do not have access to cheap, healthy food for an unknown reason</td>
<td align="char" char=".">&#x2212;0.88</td>
</tr>
<tr>
<td align="left">&#x23;118</td>
<td align="left">Pedestrians need signage to prevent vehicle users in the bike lanes from hitting them because there is a high risk of accidents in that area</td>
<td align="char" char=".">&#x2212;0.97</td>
</tr>
<tr>
<td align="left">&#x23;119</td>
<td align="left">Anyone involved in scientific or technological labs currently have no access to virtual lab spaces to practice techniques or methods that are otherwise difficult to obtain physically</td>
<td align="char" char=".">&#x2212;0.98</td>
</tr>
<tr>
<td align="left">&#x23;120</td>
<td align="left">The VR market is growing rapidly since 2012, but it has not yet reached a mature market. We are going to explore challenges Virtual Reality needs to overcome in order to be more adaptable for people, especially for educational purposes. People who are in the education system need a way to incorporate Virtual Reality into teaching and learning because VR provides a new way to share immersive information in an affordable way</td>
<td align="char" char=".">&#x2212;1.00</td>
</tr>
<tr>
<td align="left">&#x23;121</td>
<td align="left">People who live in urban areas need a sustainable source of foods because it decreases their reliance on imports</td>
<td align="char" char=".">&#x2212;1.13</td>
</tr>
<tr>
<td align="left">&#x23;122</td>
<td align="left">The Food Industry needs to waste less because the environment is suffering due to excessive usage of natural resources</td>
<td align="char" char=".">&#x2212;1.35</td>
</tr>
<tr>
<td align="left">&#x23;123</td>
<td align="left">We will implement lights above each parking spots in parking garage, and they will glow either green or red depending on whether it&#x2019;s available or not</td>
<td align="char" char=".">&#x2212;1.69</td>
</tr>
<tr>
<td align="left">&#x23;124</td>
<td align="left">Infrastructure at (Name of the University) needs to be improved because parts of (Name of the University) are overcrowded</td>
<td align="char" char=".">&#x2212;1.80</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>&#x2a;Note: Original statements are as written by students.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<sec id="s5-1-1">
<title>Framework of Point-Of-View Statements</title>
<sec id="s5-1-1-1">
<title>Structure and Length</title>
<p>To articulate their user, needs, and insights to solve the current challenges users are facing, the assignment required students to make a POV statement using the sentence structure: [User . . . (descriptive)] needs [need . . . (verb)] because [insight. . . (compelling)] (<xref ref-type="bibr" rid="B55">Rikke Friis and Teo Yu, 2020</xref>). Though most of the POV statements with high parameter values followed the basic structures, some of the POV statements with low parameter values deviated from the basic POV statement structure. For instance, the POV &#x23;117 and &#x23;119 statements omitted insights resulting in their POV statements not leading to an actionable statement. The &#x23;120 statement included unnecessary background information prior to the POV statement which may be distracting and hinder the readers&#x2019; understanding of the POV statement itself. In the &#x23;123 statement, a specific solution was presented instead of the POV statement and a problem statement like this, framed with a certain solution in mind, might restrict the creativity of problem-solving (<xref ref-type="bibr" rid="B65">Wedell-Wedellsborg, 2017</xref>). Therefore, based on our analysis, the judges perceived that good POV statements should include the required information with all the necessary components (i.e.,&#x20;user, needs, insights) in a concise manner with the necessary details.</p>
<p>In terms of the length, researchers found the POV statements of low parameter values were notably shorter than the POV statements with high parameter values, except for the statement &#x23;120. It provides insights to the researchers that the students produced POV statements with lower parameter values are not clearly specifying the user, need and insight. Therefore, short length reflects the lack of thorough description to understand the context in which the POV statements are based on. Also, when we took a more detailed analysis on the statement &#x23;120, we found that this statement included introductory sentence as part of their POV statement. The inclusion of introductory sentences can either be interpreted as students&#x2019; misunderstanding of the structure of POV statement, or lack of writing skills to integrate all the necessary detailed information in the structure of POV statement.</p>
</sec>
<sec id="s5-1-1-2">
<title>Alignment and Logic</title>
<p>The user, needs, and insights should be aligned and actionable to increase the likelihood of success during the follow-up designing process. Well-aligned POV statements enhance the team&#x2019;s ability to assist the users in meeting their goals and objectives in an efficient and effective way (<xref ref-type="bibr" rid="B67">Wolcott et&#x20;al., 2021</xref>). Compared to the high parameter value statements, our research team agreed that the low parameter value statements typically showed less logically aligned user, needs, and insights. In most of the cases, the less cohesive POV statements came from stating the user and needs in a manner that was too broad, vague, or less clarified. Statement &#x23;121, &#x23;122, &#x23;124 were direct examples of this problem. For instance, the statement &#x23;121 fell short of a detailed illustration about why &#x201c;people who live in urban areas&#x201d; needed a &#x201c;sustainable source of foods&#x201d;. Too broad of a user group, like &#x201c;people live in urban areas&#x201d;, was not cohesively related to the need of &#x201c;sustainable foods&#x201d;, and this statement did not articulate what were the &#x201c;sustainable foods&#x201d;. Thus, it appeared difficult to determine whether it was hard to gain sustainable sources of food in urban areas, or whether the struggles were due to the socio-economic status of the residents in urban districts that more sustainable sources of food were needed. Moreover, the insights did not clarify the range and definition of &#x201c;imports&#x201d;, and why it was important and/or positive to decrease the reliance on imports.</p>
<p>POV statements lacking alignment between the user, need and insight were not logical and/or easy to follow. These kinds of statements appeared unfounded or unsupported. For instance, statement &#x23;117, &#x23;119, &#x23;120, &#x23;121, and &#x23;122 could face rebuttal because the user group was not well aligned with the needs. As an example, the statement &#x23;122 insisted that the &#x201c;Food industry&#x201d; &#x201c;waste less&#x201d;, to prevent &#x201c;excessive usage of natural resources&#x201d;. Not only were the contents of this statement not written in the way POV statements required, but it also lacked a logical explanation of why the food industry needed to waste less, while there could be many possible factors/ subjects excessively wasting natural resources. Overall, not including the components of a POV statement (user, need and insight) or including them in ways that are not well aligned yield POV statements that are marginally actionable and vague. Additionally, the lower quality POV statements often framed the users&#x2019; needs as oriented towards a specific solution rather than focusing on the problem at&#x20;hand.</p>
</sec>
</sec>
<sec id="s5-1-2">
<title>Components of Point-Of-View Statements</title>
<sec id="s5-1-2-1">
<title>User</title>
<p>Although these were broad in some senses, the user defined in both the POV statements with high parameter values and low parameter values were narrowed down with descriptive explanations, though the degree of specification differed from statement to statement. Specifically, some of the POV statements with low parameter values revealed limitations when defining users. For instance, the statement &#x23;115 defined &#x201c;People&#x201d; as a user group but did not narrow down the user and not provide any illustrated details about the user group they are targeting. The user group of the statement &#x23;118 was &#x201c;pedestrians&#x201d;, which was not any different from &#x201c;people&#x201d;, failing to narrow it down enough. The statement &#x23;123 did not designate any user group, therefore making the targeted user group remain unspecified. By failing to define user groups from the specific user&#x2019;s perspective in the problem-solving, these teams fell short of solutions with quantity and higher quality.</p>
</sec>
<sec id="s5-1-2-2">
<title>Needs</title>
<p>The needs are something essential or important, and are required for targeted users (<xref ref-type="bibr" rid="B27">Interaction Design Foundation, 2020</xref>). Though it still could have been improved, compared to the low parameter value statements, most of the high parameter value statements incorporated adjectives and details specific to the user group. For instance, the statement &#x23;1 and &#x23;2 proposed the needs pertinent to the user group. The statement &#x23;1 proposed a need for an &#x201c;accessible, personalized and interactive&#x201d; method for safety meetings. When limited to the user and needs, this statement did not seem to provide sufficient information due to the vague depiction of the user group. However, considering their insights illustrated the current situation of the statement &#x23;1 user group, it seemed to reflect the current needs the user group was confronting. The statement &#x23;2 also showed needs of &#x201c;reducing the CO2 emissions&#x201d; relevant to the user group utilizing the automobiles and transportation vehicles. Also, the user group of &#x23;6 was students who had constraints on time and accessibility on campus. The needs of these user groups were stated as a &#x201c;means to achieve a healthier lifestyle without spending too much extra time and money&#x201d;. The proposed need of an efficient, healthy lifestyle was well aligned with the busy user group on campus.</p>
<p>Compared to the high parameter value statements, the low parameter statements were less pertinent to the user group because either the user group was too general and not specified enough or the needs were too broad and vague. For the statements like &#x23;115 and &#x23;119, it was hard to connect the user and needs because the user was &#x201c;people&#x201d; or &#x201c;anyone involved in scientific or technology labs&#x201d;. Like these two statements, either too broad or user groups without any detailed information, hindered the cohesive alignment of user group and their needs. Statement &#x23;122 and &#x23;124 showed the examples of too vague and broad needs: &#x201c;To waste less (&#x23;122)&#x2019; and &#x2018;to be improved (&#x23;124)&#x201d; lacked adjectives and details to enhance the needs. For the needs of the statement &#x23;122, missing details of &#x201c;what&#x201d; was wasted and &#x201c;how much&#x201d; it should or could be less wasted made the statement less strong. The statement &#x23;124 was not only less related to the user group in that it did not provide how the infrastructure(s) could be improved, but also the user, &#x201c;infrastructure at (The name of University)&#x201d; was not clarified enough among the broad notion of infrastructure (e.g., system or organization, clinical facilities, offices, centers, communities) (<xref ref-type="bibr" rid="B37">Longtin, 2014</xref>).</p>
<p>The high parameter value POV statements identified the user groups&#x2019; needs and goals in, or with, a verb form so that users could see the choices they could make and choose among the options. In contrast, some of the low parameter value statements&#x2019; needs provided the needs in a noun form, which described the solution relying on technology, money/funding, a product (specifications), and/or a system (e.g., &#x23;117, &#x23;118, &#x23;119, &#x23;120, &#x23;121). Although these statements proposed possible solutions, those were limited, predetermined solutions from the perspectives of the writers, not allowing the alternatives from the user&#x2019;s stance. For example, the statement &#x23;118 suggested &#x201c;signage&#x201d; as a need of their user group to reduce the risk of accidents in the bike lanes. However, this need was a solution and did not include various other possible solutions and the actual needs designers might consider, obviously excluding the possibility that the signage itself might not be the only best solution for the pedestrians.</p>
<p>Another problem found in the low parameter value statements was the interpretation of &#x201c;need&#x201d; itself. While most of the high parameter value statements concentrated on the goals and needs user groups experience, some of the low parameter value statements regarded the needs of user groups according to the dictionary definition, as a requirement, necessary duty, or obligation instead of user&#x2019;s goals. This particular type of need misinterpretation can be found in statement &#x23;115, &#x23;122, and &#x23;124. For example, statement &#x23;115 highlighted a necessary moral, educational duty of people to be culturally sensitive, statement &#x23;122 also emphasized that the user group (food industry) waste less to protect the environment, and statement &#x23;124 called for the upgrade of the infrastructure to resolve the overcrowded campus issue. These examples of misinterpretation appeared to affect the insights. Specifically, these misinterpretations appear to lead to a misunderstanding of the problems and current issues specific to the insights for the&#x20;users.</p>
</sec>
<sec id="s5-1-2-3">
<title>Insights</title>
<p>A good insight provides the result of meeting the needs, which should be based on the empathy (<xref ref-type="bibr" rid="B22">Gibbons, 2019</xref>). It provides the goals user groups can accomplish by solving the current needs, among the multiple possible solutions (<xref ref-type="bibr" rid="B51">Pressman, 2018</xref>). In terms of insights, both the high parameter value statements and the low parameter value statements mostly provided the current problem without resolving their current needs, except for statements &#x23;2, &#x23;3, &#x23;5, and &#x23;120. These statements provided the positive side the user group could achieve when finding the appropriate solution of the user needs. However, other statements failed to meet this criterion and got high parameter scores regardless of the contents of their insights. For instance, the statement &#x23;1 proposed &#x201c;currently the users lack motivation and different levels of complexity within the class environment&#x201d; as their insights. However, this was the problem the current situation reveals, not the goal the user group (the school of aviation and transportation technology) are trying to accomplish. The low parameter value statements provided positive goals the user group could achieve but showed the lower parameter value compared to the statement &#x23;1. Based on these findings it appeared that, when judging the POV statements, there was a high chance the students did not take the notion of good insights into account. Thus, in terms of insights, the parameter value was not always aligned with the actual quality of the insights.</p>
</sec>
</sec>
</sec>
<sec id="s5-2">
<title>Summary of the Findings From Construct Validity Analysis</title>
<p>
<xref ref-type="table" rid="T7">Table&#x20;7</xref> provides the summary of the findings from construct validity analysis.</p>
<table-wrap id="T7" position="float">
<label>TABLE 7</label>
<caption>
<p>Summary of findings.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left"/>
<th align="center">Highest parameter values</th>
<th align="center">Lowest parameter values</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td colspan="3" align="left">
<bold>Framework</bold>
</td>
</tr>
<tr>
<td rowspan="3" align="left">&#x2003;Structure and length</td>
<td rowspan="3" align="left">- Following basic structures with all necessary components (i.e.,&#x20;user, needs, insights) in a concise manner with necessary details</td>
<td align="left">- Not leading to an actionable statement (e.g., omitted insights)</td>
</tr>
<tr>
<td align="left">- Include unnecessary information</td>
</tr>
<tr>
<td align="left">- Short POV statements due to the lack of description</td>
</tr>
<tr>
<td rowspan="3" align="left">&#x2003;Alignment and logic</td>
<td rowspan="3" align="left">- Aligned and actionable</td>
<td align="left">- Lacks alignment, not logical</td>
</tr>
<tr>
<td align="left">- Not actionable due to the vagueness (e.g., waste less)</td>
</tr>
<tr>
<td align="left">- Frame the user needs as a specific solution (e.g., implement lights in the parking garage)</td>
</tr>
<tr>
<td colspan="3" align="left">
<bold>Components of POV statements</bold>
</td>
</tr>
<tr>
<td align="left">&#x2003;User</td>
<td align="left">- Narrowed down with description about the users</td>
<td align="left">- Some of them lacks illustration (e.g., people, pedestrians)</td>
</tr>
<tr>
<td rowspan="1" align="left">&#x2003;Needs</td>
<td align="left">- Incorporated adjectives and details specific to the user group - Identified the user groups&#x2019; needs and goals in, or with, a verb form so that users could see the choices</td>
<td align="left">- Less pertinent to the user group because either the user group was too general (e.g., people need to become more educated) - Not specified enough (e.g., Infrastructures need to be improved) - Misinterpretation of &#x2018;need&#x2019; itself (e.g., As a requirement, necessary duty, or obligation instead of user&#x2019;s goals)</td>
</tr>
<tr>
<td rowspan="2" align="left">&#x2003;Insights</td>
<td colspan="2" align="left">- Both groups showed limitation: parameter value was not always aligned with the actual quality of the insights</td>
</tr>
<tr>
<td colspan="2" align="left">- Provided the current problem without resolving their current needs (e.g., because it will reduce the carbon footprint that college campuses leave)</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s5-3">
<title>Criterion Validity of Adaptive Comparative Judgment</title>
<sec id="s5-3-1">
<title>Concurrent Validity of Adaptive Comparative Judgment</title>
<p>To measure concurrent validity, a correlation was run between the parameter values from conducting the peer reviewed ACJ assessment and the instructors&#x2019; rubric based grade assignments on the POV statements. The peer-evaluated ACJ was not significantly correlated (<italic>r</italic>&#x20;&#x3d; 0.08, <italic>p</italic>&#x20;&#x3d; 0.51) with graders&#x2019; grading based on rubric. Therefore, the potential concurrent validity of peer-evaluation using ACJ with POV statements is not supported by these results in the context of design thinking.</p>
</sec>
<sec id="s5-3-2">
<title>Predictive Validity of Adaptive Comparative Judgment</title>
<p>As seen in <xref ref-type="table" rid="T8">Table&#x20;8</xref>, A simple linear regression was calculated to predict grades of final deliverables (Assessment 3) based on the parameter values of peer-evaluated ACJ (Assessment 1). A significant regression was found (<italic>F</italic> (1, 575) &#x3d; 63.057, <italic>p</italic>&#x20;&#x3c; 0.001), with an <inline-formula id="inf3">
<mml:math id="m3">
<mml:mrow>
<mml:msup>
<mml:mi>R</mml:mi>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> of 0.101. Students&#x2019; predicted grades of final deliverables (Assessment 3) is equal to 20.95 &#x2b; 1.50 (parameter values). The grades of final deliverables (Assessment 3) increased 1.50 for each point of parameter values of peer-evaluated ACJ (Assessment 1). Therefore, peer-reviewed ACJ showed predictive validity in the context of design thinking.</p>
<table-wrap id="T8" position="float">
<label>TABLE 8</label>
<caption>
<p>Regression results using Assessment 3 (Grades of final deliverable) as the criterion.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left">Predictor</th>
<th rowspan="2" align="center">
<italic>b</italic>
</th>
<th align="center">
<italic>b</italic>
</th>
<th rowspan="2" align="center">
<italic>beta</italic>
</th>
<th align="center">
<italic>beta</italic>
</th>
<th rowspan="2" align="center">
<italic>sr</italic>
<sup>
<italic>2</italic>
</sup>
</th>
<th align="center">
<italic>sr</italic>
<sup>
<italic>2</italic>
</sup>
</th>
<th rowspan="2" align="center">
<italic>r</italic>
</th>
<th rowspan="2" align="center">Fit</th>
</tr>
<tr>
<th align="center">95% CI [LL, UL]</th>
<th align="center">95%S CI [LL, UL]</th>
<th align="center">95% CI [LL, UL]</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">(Intercept)</td>
<td align="char" char=".">20.95&#x2a;&#x2a;</td>
<td align="center">(20.74, 21.16)</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
</tr>
<tr>
<td align="left">Parameter Values</td>
<td align="char" char=".">1.50&#x2a;&#x2a;</td>
<td align="center">(1.13, 1.87)</td>
<td align="char" char=".">0.32</td>
<td align="center">(0.24, 0.40)</td>
<td align="char" char=".">0.10</td>
<td align="center">(0.06, 0.15)</td>
<td align="char" char=".">0.32&#x2a;&#x2a;</td>
<td align="center">&#x2014;</td>
</tr>
<tr>
<td align="left">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">&#x2014;</td>
<td align="center">
<italic>R</italic>
<sup>
<italic>2</italic>
</sup> &#x3d; 0.101&#x2a;&#x2a; 95% CI (0.06,0.15)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>Note. A significant <italic>b</italic>-weight indicates the beta-weight and semi-partial correlation are also significant. <italic>b</italic> represents unstandardized regression weights. <italic>beta</italic> indicates the standardized regression weights. <italic>sr</italic>
<sup>
<italic>2</italic>
</sup> represents the semi-partial correlation squared. <italic>r</italic> represents the zero-order correlation. <italic>LL</italic> and <italic>UL</italic> indicate the lower and upper limits of a confidence interval, respectively. &#x2a; indicates <italic>p</italic>&#x20;&#x3c; 0.05. &#x2a;&#x2a; indicates <italic>p</italic>&#x20;&#x3c; 0.01.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
</sec>
<sec sec-type="discussion" id="s6">
<title>Discussion</title>
<p>Our research questions guiding the inquiry were: 1) What is the construct validity of ACJ? Does peer-reviewed ACJ reflect general criteria of good POV statements? 2) What is the criterion validity of ACJ? By doing so, this study aimed to validate peer-evaluated ACJ in the design thinking education context. First, this study analyzed ten high parameter value statements and ten low parameter value statements based on the criteria of &#x201c;good&#x201d; POV statements (<xref ref-type="bibr" rid="B27">Interaction Design Foundation, 2020</xref>; <xref ref-type="bibr" rid="B55">Rikke Friis and Teo Yu, 2020</xref>) to examine the construct validity of ACJ.&#x20;Second, this study examined criterion validity: Concurrent validity and predictive validity. Concurrent validity was studied using correlation between the parameter values and grades on the same POV assignment. Then, the study on the predictive validity was followed to see the parameter values on POV statement can predict future achievement of students, the grades of final deliverables.</p>
<p>The results revealed that peer-evaluated ACJ demonstrated construct validity. The parameter values reflect the quality of POV statements in terms of content structure, needs, user, and insights. The POV statements with higher parameter values showed better quality compared to the POV statements with lower parameter values. This finding is aligned with the findings from previous studies, which reported that ACJ completed by students can be a sound measure for evaluation of self and peer work (<xref ref-type="bibr" rid="B28">Jones and Alcock, 2014</xref>; <xref ref-type="bibr" rid="B5">Bartholomew et&#x20;al., 2020a</xref>). Further, the results suggested that peer-evaluated ACJ had predictive validity, but not concurrent validity. When assessing the same POV statements, the results of peer-evaluated ACJ (parameter values) and rubric-based grading by instructors did not show significant correlation. However, the results of peer-evaluated ACJ moderately predicted students&#x2019; final grades in project&#x20;3.</p>
<p>As mentioned in previous studies, peer-evaluated ACJ is not proficient nor professional enough compared to instructors&#x2019; ACJ (<xref ref-type="bibr" rid="B28">Jones and Alcock, 2014</xref>). This may potentially affect the lack of correlation between peer-evaluated ACJ and rubric-based grading of instructors. The lack of correlation between peer evaluated ACJ results and the instructors&#x2019; rubric based grading may potentially be due to the distributions of the variables as opposed to a lack of concurrent validity. We note that the instructors&#x2019; rubric based scores are negatively skewed&#x2014;which we attribute to the criterion-referenced evaluation. Thus, many POV statements may have scored high and similarly to each other on the rubric while in fact there was a noticeable difference between them as discussed in our criterion validity analysis. The ACJ approach yields a norm referenced output which includes a normal distribution regardless of the POV statements meeting the quality standards (or&#x20;not).</p>
<p>ACJ offers researchers and practitioners in design thinking an effective quality assessment tool that is valid and reliable. As could be seen in the comparison between two groups (i.e.,&#x20;POV statement with high parameter values and POV statements with low parameter values), the results of ACJ displayed the quality of student assignments in a more conspicuous way. The outlier POV statements, such as those generated by teams who failed to progress or high-achiever groups were more notable when using the ACJ, due to its rank system. Early detection of struggling students (or groups) is important for both supporting student&#x2019;s academic achievement in following task and keeping students from dropping out. Instructors could provide timely educational intervention to the student groups who received low parameter values in their task. For instance, if the instructor could support student groups who were struggling in POV statement, he or she could facilitate iteration and revision before student group make a progress using poor-quality POV statement, which might deleteriously affect following design thinking process. Additionally, instructors also could benefit from evaluating the quality of formative assessment during the design projects because goal-oriented, competitive students who were interested in developing one&#x2019;s project in a more excellent manner would be motivated from the results of&#x20;ACJ.</p>
<p>This study is not without limitations. First, while ACJ provided reliable and valid assessment method, the parameter value highly depends on the relative quality/level of the objects which were being assessed compared. If everyone performs well in the assignment, some students will get low parameter value and rank although the submission successfully meet overall criteria of good POV statements. Therefore, educators should bear the learning objectives and expected outcomes in mind when using ACJ and pay attention to the difference between the higher and lower ranked items. Second, the goal of assessment should be clarified. The rubric based assessment yielded a measure comparing work against a minimum standard where every team could have succeeded. The ACJ measure provided a rank order where one team&#x2019;s POV was strongest, while another weakest. This means that both the strongest and weakest POV&#x2019;s may or may not have met the minimum standards for a good POV statement. Further, peers are students and may not&#x20;be as proficient as trained graduate students or instructors though they were nearly finished with the course at the time of&#x20;assessment and the previously-noted work has pointed to the&#x20;potential for students to complete judgments similarly to experts.</p>
</sec>
<sec id="s7">
<title>Future Implications</title>
<p>We suspect that an additional benefit of ACJ during the design thinking process was the opportunity for students to learn from both 1) the judgment process and 2) the POV statement examples of their teammates. During the comparative judgment of the POV statements, students had to cognitively internalize criteria to select &#x201c;better&#x201d; POV statement and applied those perceptions of quality. Also, the process required students to take a careful look at other students&#x2019; works as examples of POV statements. Examples resemble the given task and illustrate how the POV-writing task can be completed in the form of near transfer (<xref ref-type="bibr" rid="B17">Eiriksdottir and Catrambone, 2011</xref>). Studies revealed that simply being exposed to good examples did not lead to actual transfer (e.g., specify the criteria of good POV statement, explicitly articulate the principles of good POV statement, produce a good POV statement based on what student(s) learn from the POV statements) because learners often do not actively engage in cognitive strategies which help them learning better (<xref ref-type="bibr" rid="B17">Eiriksdottir and Catrambone, 2011</xref>). In other words, simply providing good POV examples to the students may not lead to the ability to judge or produce a good POV statement, because students did not use the knowledge from the examples to direct their POV judging/writing process. Educators who were interested in implementing ACJ in the course were required to adopt teaching strategies to enhance transfer of learning from examples such as emphasizing subgoals (<xref ref-type="bibr" rid="B10">Catrambone, 1994</xref>; <xref ref-type="bibr" rid="B2">Atkinson et&#x20;al., 2000</xref>) (e.g., articulate main components of POV statements, narrow down the user, set insights as ultimate goal of users), self-explanation (e.g., add detailed explanation about their judging criteria) (<xref ref-type="bibr" rid="B1">Anderson et&#x20;al., 1997</xref>) and group discussion (<xref ref-type="bibr" rid="B45">Olivera and Straus, 2004</xref>; <xref ref-type="bibr" rid="B63">Van Blankenstein et&#x20;al., 2011</xref>) (e.g., discuss comparative judgement criteria with peers).</p>
</sec>
</body>
<back>
<sec id="s8">
<title>Data Availability Statement</title>
<p>The datasets presented in this article are not readily available because data is restricted to use by the investigators as per the IRB agreement. Requests to access the datasets should be directed to <email>nmentzer@purdue.edu</email>.</p>
</sec>
<sec id="s9">
<title>Ethics Statement</title>
<p>The studies involving human participants were reviewed and approved by the Purdue University Institutional Review Board. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.</p>
</sec>
<sec id="s10">
<title>Author Contributions</title>
<p>NM contributed to the research implementation and methodology of this project. WL contributed to the writing, literature review, and statistical analysis. SB contributed to the overall research design and expertise in adaptive comparative judgment.</p>
</sec>
<sec id="s11">
<title>Funding</title>
<p>This material is based on work supported by the National Science Foundation under Grant Number DRL-2101235.</p>
</sec>
<sec sec-type="COI-statement" id="s12">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s13">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Anderson</surname>
<given-names>J.&#x20;R.</given-names>
</name>
<name>
<surname>Fincham</surname>
<given-names>J.&#x20;M.</given-names>
</name>
<name>
<surname>Douglass</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>1997</year>). <article-title>The Role of Examples and Rules in the Acquisition of a Cognitive Skill</article-title>. <source>J.&#x20;Exp. Psychol. Learn. Mem. Cogn.</source> <volume>23</volume>, <fpage>932</fpage>&#x2013;<lpage>945</lpage>. <pub-id pub-id-type="doi">10.1037//0278-7393.23.4.932</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Atkinson</surname>
<given-names>R. K.</given-names>
</name>
<name>
<surname>Derry</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Renkl</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Wortham</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Learning from Examples: Instructional Principles from the Worked Examples Research</article-title>. <source>Rev. Educ. Res.</source> <volume>70</volume>, <fpage>181</fpage>&#x2013;<lpage>214</lpage>. <pub-id pub-id-type="doi">10.3102/00346543070002181</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Atman</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Kilgore</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>McKenna</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Characterizing Design Learning: A Mixed-Methods Study of Engineering Designers&#x27; Use of Language</article-title>. <source>J.&#x20;Eng. Educ.</source> <volume>97</volume>, <fpage>309</fpage>&#x2013;<lpage>326</lpage>. <pub-id pub-id-type="doi">10.1002/j.2168-9830.2008.tb00981.x</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bartholomew</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>Hawkins</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Orton</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>A Systematized Review of Research with Adaptive Comparative Judgment (ACJ) in Higher Education</article-title>. <source>Int. J.&#x20;Technol. Des. Educ.</source>, <fpage>1</fpage>&#x2013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.5296/jet.v9i1.19046</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bartholomew</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Mentzer</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Sherman</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Baniya</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2020a</year>). <article-title>Learning by Evaluating (LbE) through Adaptive Comparative Judgment</article-title>. <source>Int. J.&#x20;Technol. Des. Educ.</source> <volume>2020</volume>, <fpage>1</fpage>&#x2013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.1007/s10798-020-09639-1</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bartholomew</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Ruesch</surname>
<given-names>E. Y.</given-names>
</name>
<name>
<surname>Hartell</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Strimel</surname>
<given-names>G. J.</given-names>
</name>
</person-group> (<year>2020b</year>). <article-title>Identifying Design Values across Countries through Adaptive Comparative Judgment</article-title>. <source>Int. J.&#x20;Technol. Des. Educ.</source> <volume>30</volume>, <fpage>321</fpage>&#x2013;<lpage>347</lpage>. <pub-id pub-id-type="doi">10.1007/s10798-019-09506-8</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bartholomew</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Strimel</surname>
<given-names>G. J.</given-names>
</name>
<name>
<surname>Yoshikawa</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Using Adaptive Comparative Judgment for Student Formative Feedback and Learning during a Middle School Design Project</article-title>. <source>Int. J.&#x20;Technol. Des. Educ.</source> <volume>29</volume>, <fpage>363</fpage>&#x2013;<lpage>385</lpage>. <pub-id pub-id-type="doi">10.1007/s10798-018-9442-7</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bisson</surname>
<given-names>M.-J.</given-names>
</name>
<name>
<surname>Gilmore</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Inglis</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>I.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Measuring Conceptual Understanding Using Comparative Judgement</article-title>. <source>Int. J.&#x20;Res. Undergrad. Math. Ed.</source> <volume>2</volume>, <fpage>141</fpage>&#x2013;<lpage>164</lpage>. <pub-id pub-id-type="doi">10.1007/s40753-016-0024-3</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Borrego</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Douglas</surname>
<given-names>E. P.</given-names>
</name>
<name>
<surname>Amelink</surname>
<given-names>C. T.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Quantitative, Qualitative, and Mixed Research Methods in Engineering Education</article-title>. <source>J.&#x20;Eng. Educ.</source> <volume>98</volume>, <fpage>53</fpage>&#x2013;<lpage>66</lpage>. <pub-id pub-id-type="doi">10.1002/j.2168-9830.2009.tb01005.x</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Catrambone</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Improving Examples to Improve Transfer to Novel Problems</article-title>. <source>Mem. Cognit.</source> <volume>22</volume>, <fpage>606</fpage>&#x2013;<lpage>615</lpage>. <pub-id pub-id-type="doi">10.3758/bf03198399</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chapman</surname>
<given-names>V. G.</given-names>
</name>
<name>
<surname>Inman</surname>
<given-names>M. D.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>A Conundrum: Rubrics or Creativity/metacognitive Development?</article-title> <source>Educ. Horiz.</source>, <fpage>198</fpage>&#x2013;<lpage>202</lpage>. </citation>
</ref>
<ref id="B12">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Clemens</surname>
<given-names>N. H.</given-names>
</name>
<name>
<surname>Ragan</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Christopher</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2018</year>). &#x201c;<article-title>Predictive Validity</article-title>,&#x201d; in <source>The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation</source>. Editor <person-group person-group-type="editor">
<name>
<surname>Frey</surname>
<given-names>B. B.</given-names>
</name>
</person-group> (<publisher-loc>Thousand Oaks: California</publisher-loc>: <publisher-name>SAGE</publisher-name>), <fpage>1289</fpage>&#x2013;<lpage>1291</lpage>. </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Coenen</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Coertjens</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Vlerick</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Lesterhuis</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Mortier</surname>
<given-names>A. V.</given-names>
</name>
<name>
<surname>Donche</surname>
<given-names>V.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>An Information System Design Theory for the Comparative Judgement of Competences</article-title>. <source>Eur. J.&#x20;Inf. Syst.</source> <volume>27</volume>, <fpage>248</fpage>&#x2013;<lpage>261</lpage>. <pub-id pub-id-type="doi">10.1080/0960085x.2018.1445461</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Dam</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Siang</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2018</year>). <source>Design Thinking: Get Started with Prototyping</source>. <publisher-loc>Denmark</publisher-loc>: <publisher-name>Interact. Des. Found</publisher-name>. </citation>
</ref>
<ref id="B15">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Dochy</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Gijbels</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Segers</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2006</year>). &#x201c;<article-title>Learning and the Emerging New Assessment Culture</article-title>,&#x201d; in <source>Instructional Psychology: Past, Present, and Future Trends</source>. Editors <person-group person-group-type="editor">
<name>
<surname>Verschaffel</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Dochy</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Boekaerts</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Vosniadou</surname>
<given-names>S.</given-names>
</name>
</person-group> (<publisher-loc>Amsterdam</publisher-loc>: <publisher-name>Elsevier</publisher-name>), <fpage>191</fpage>&#x2013;<lpage>206</lpage>. </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dym</surname>
<given-names>C. L.</given-names>
</name>
<name>
<surname>Agogino</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Eris</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Frey</surname>
<given-names>D. D.</given-names>
</name>
<name>
<surname>Leifer</surname>
<given-names>L. J.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Engineering Design Thinking, Teaching, and Learning</article-title>. <source>J.&#x20;Eng. Educ.</source> <volume>94</volume>, <fpage>103</fpage>&#x2013;<lpage>120</lpage>. <pub-id pub-id-type="doi">10.1002/j.2168-9830.2005.tb00832.x</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Eiriksdottir</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Catrambone</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Procedural Instructions, Principles, and Examples: How to Structure Instructions for Procedural Tasks to Enhance Performance, Learning, and Transfer</article-title>. <source>Hum. Factors</source> <volume>53</volume>, <fpage>749</fpage>&#x2013;<lpage>770</lpage>. <pub-id pub-id-type="doi">10.1177/0018720811419154</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Erickson</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lyytinen</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Siau</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Agile Modeling, Agile Software Development, and Extreme Programming</article-title>. <source>J.&#x20;Database Manag.</source> <volume>16</volume>, <fpage>88</fpage>&#x2013;<lpage>100</lpage>. <pub-id pub-id-type="doi">10.4018/jdm.2005100105</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Forman</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Damschroder</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2007</year>). &#x201c;<article-title>Qualitative Content Analysis</article-title>,&#x201d; in <source>Empirical Methods For Bioethics: A Primer Advances in Bioethics.</source> Editors <person-group person-group-type="editor">
<name>
<surname>Jacoby</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Siminoff</surname>
<given-names>L. A.</given-names>
</name>
</person-group> (<publisher-loc>Bingley, UK</publisher-loc>: <publisher-name>Emerald Group Publishing Limited</publisher-name>), <fpage>39</fpage>&#x2013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1016/S1479-3709(07)11003-7</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Gettens</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Riofr&#xed;o</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Spotts</surname>
<given-names>H.</given-names>
</name>
</person-group> <year>2015</year>, &#x201c;<article-title>Opportunity Thinktank: Laying a Foundation for the Entrepreneurially Minded Engineer</article-title>.&#x201d; in <conf-name>ASEE Conferences</conf-name>, <conf-loc>Seattle, Washington</conf-loc>, <conf-date>June 14-17, 2015</conf-date>. <pub-id pub-id-type="doi">10.18260/p.24545</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Gettens</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Spotts</surname>
<given-names>H. E.</given-names>
</name>
</person-group> (<year>2018</year>). &#x201c;<article-title>Workshop: Problem Definition and Concept Ideation, an Active-Learning Approach in a Multi-Disciplinary Setting</article-title>&#x201d;, in <conf-name>ASEE Conferences</conf-name>, <conf-loc>Glassboro, New Jersey</conf-loc>, <conf-date>July 24-26, 2018</conf-date>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://peer.asee.org/31440%20">https://peer.asee.org/31440</ext-link>
</comment>. </citation>
</ref>
<ref id="B22">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Gibbons</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>User Need Statements: The &#x2018;Define&#x2019; Stage in Design Thinking</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.nngroup.com/articles/user-need-statements/">https://www.nngroup.com/articles/user-need-statements/</ext-link>.</comment> </citation>
</ref>
<ref id="B23">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Haolin</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Alicia</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Gary</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2019</year>). &#x201c;<article-title>Full Paper: Assessment of Entrepreneurial Mindset Coverage in an Online First Year Design Course</article-title>.&#x201d; in <conf-name>2019 FYEE Conference</conf-name>, <conf-loc>Penn State University, Pennsylvania</conf-loc>. <conf-date>July 28-30, 2019</conf-date> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hoge</surname>
<given-names>R. D.</given-names>
</name>
<name>
<surname>Butcher</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>1984</year>). <article-title>Analysis of Teacher Judgments of Pupil Achievement Levels</article-title>. <source>J.&#x20;Educ. Psychol.</source> <volume>76</volume>, <fpage>777</fpage>&#x2013;<lpage>781</lpage>. <pub-id pub-id-type="doi">10.1037/0022-0663.76.5.777</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hsieh</surname>
<given-names>H. F.</given-names>
</name>
<name>
<surname>Shannon</surname>
<given-names>S. E.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Three Approaches to Qualitative Content Analysis</article-title>. <source>Qual. Health Res.</source> <volume>15</volume>, <fpage>1277</fpage>&#x2013;<lpage>1288</lpage>. <pub-id pub-id-type="doi">10.1177/1049732305276687</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hubley</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Zumbo</surname>
<given-names>B. D.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Validity and the Consequences of Test Interpretation and Use</article-title>. <source>Soc. Indic. Res.</source> <volume>103</volume>, <fpage>219</fpage>&#x2013;<lpage>230</lpage>. <pub-id pub-id-type="doi">10.1007/s11205-011-9843-4</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="web">
<collab>Interaction Design Foundation</collab> (<year>2020</year>). <article-title>Point of View - Problem Statement</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.interaction-design.org/literature/topics/problem-statements">https://www.interaction-design.org/literature/topics/problem-statements</ext-link>
</comment>. </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Alcock</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Peer Assessment without Assessment Criteria</article-title>. <source>Stud. Higher Edu.</source> <volume>39</volume>, <fpage>1774</fpage>&#x2013;<lpage>1787</lpage>. <pub-id pub-id-type="doi">10.1080/03075079.2013.821974</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jones</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Swan</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Pollitt</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Assessing Mathematical Problem Solving Using Comparative Judgement</article-title>. <source>Int. J.&#x20;Sci. Math. Educ.</source> <volume>13</volume>, <fpage>151</fpage>&#x2013;<lpage>177</lpage>. <pub-id pub-id-type="doi">10.1007/s10763-013-9497-6</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jonsson</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Svingby</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>The Use of Scoring Rubrics: Reliability, Validity and Educational Consequences</article-title>. <source>Educ. Res. Rev.</source> <volume>2</volume>, <fpage>130</fpage>&#x2013;<lpage>144</lpage>. <pub-id pub-id-type="doi">10.1016/j.edurev.2007.05.002</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Karjalainen</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2016</year>). &#x201c;<article-title>Design Thinking in Teaching: Product Concept Creation in the Devlab Program</article-title>&#x201d;, <conf-name>European Conference on Innovation and Entrepreneurship</conf-name>, <conf-loc>Karjalainen, Janne</conf-loc>, <conf-date>September 18, 2016</conf-date>. (<publisher-name>Academic Conferences International Limited</publisher-name>), <fpage>359</fpage>&#x2013;<lpage>364</lpage>. </citation>
</ref>
<ref id="B32">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Kelley</surname>
<given-names>T. L.</given-names>
</name>
</person-group> (<year>1927</year>). <source>Interpretation of Educational Measurements</source>. <publisher-loc>Oxford, England</publisher-loc>: <publisher-name>World Book Co</publisher-name>. </citation>
</ref>
<ref id="B33">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Kernbach</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Nabergoj</surname>
<given-names>A. S.</given-names>
</name>
</person-group> (<year>2018</year>). &#x201c;<article-title>Visual Design Thinking: Understanding the Role of Knowledge Visualization in the Design Thinking Process</article-title>&#x201d;, <conf-name>2018 22nd International Conference Information Visualisation (IV)</conf-name>, <conf-loc>Fisciano, Italy</conf-loc>, <conf-date>July 10-13, 2018</conf-date> (<publisher-name>IEEE</publisher-name>), <fpage>362</fpage>&#x2013;<lpage>367</lpage>. <pub-id pub-id-type="doi">10.1109/iv.2018.00068</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kimbell</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>E-Assessment in Project E-Scape</article-title>. <source>Des. Technol. Educ. Int. J.</source> <volume>12</volume>, <fpage>66</fpage>&#x2013;<lpage>76</lpage>. </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lammi</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Becker</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Engineering Design Thinking</article-title>. <source>J.&#x20;Technol. Educ.</source> <volume>24</volume>, <fpage>55</fpage>&#x2013;<lpage>77</lpage>. <pub-id pub-id-type="doi">10.21061/jte.v24i2.a.5</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Lindberg</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Meinel</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wagner</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2010</year>). &#x201c;<article-title>Design Thinking: A Fruitful Concept for IT Development?</article-title>&#x201d; in <source>Design Thinking. Understanding Innovation</source> (<publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>3</fpage>&#x2013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-13757-0_1</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Longtin</surname>
<given-names>S. E.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Using the College Infrastructure to Support Students on the Autism Spectrum</article-title>. <source>J.&#x20;Postsecond. Educ. Disabil.</source> <volume>27</volume>, <fpage>63</fpage>&#x2013;<lpage>72</lpage>. </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lunz</surname>
<given-names>M. E.</given-names>
</name>
<name>
<surname>Stahl</surname>
<given-names>J.&#x20;A.</given-names>
</name>
</person-group> (<year>1990</year>). <article-title>Judge Consistency and Severity across Grading Periods</article-title>. <source>Eval. Health Prof.</source> <volume>13</volume>, <fpage>425</fpage>&#x2013;<lpage>444</lpage>. <pub-id pub-id-type="doi">10.1177/016327879001300405</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lunz</surname>
<given-names>M. E.</given-names>
</name>
<name>
<surname>Wright</surname>
<given-names>B. D.</given-names>
</name>
<name>
<surname>Linacre</surname>
<given-names>J.&#x20;M.</given-names>
</name>
</person-group> (<year>1990</year>). <article-title>Measuring the Impact of Judge Severity on Examination Scores</article-title>. <source>Appl. Meas. Edu.</source> <volume>3</volume>, <fpage>331</fpage>&#x2013;<lpage>345</lpage>. <pub-id pub-id-type="doi">10.1207/s15324818ame0304_3</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mahboub</surname>
<given-names>K. C.</given-names>
</name>
<name>
<surname>Portillo</surname>
<given-names>M. B.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Chandraratna</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Measuring and Enhancing Creativity</article-title>. <source>Eur. J.&#x20;Eng. Edu.</source> <volume>29</volume>, <fpage>429</fpage>&#x2013;<lpage>436</lpage>. <pub-id pub-id-type="doi">10.1080/03043790310001658541</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Mayring</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2015</year>). &#x201c;<article-title>Qualitative Content Analysis: Theoretical Background and Procedures</article-title>,&#x201d; in <source>Approaches to Qualitative Research in Mathematics Education</source> (<publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>365</fpage>&#x2013;<lpage>380</lpage>. <pub-id pub-id-type="doi">10.1007/978-94-017-9181-6_13</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Messick</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>The Interplay of Evidence and Consequences in the Validation of Performance Assessments</article-title>. <source>Educ. Res.</source> <volume>23</volume>, <fpage>13</fpage>&#x2013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.2307/1176219</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="web">
<collab>National Academy of Engineering</collab> (<year>2008</year>). <article-title>Grand Challenges for Engineering</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="http://www.engineeringchallenges.org/challenges.aspx">http://www.engineeringchallenges.org/challenges.aspx</ext-link>
</comment>. </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>O&#x2019;Leary-Kelly</surname>
<given-names>S. W.</given-names>
</name>
<name>
<surname>Vokurka</surname>
<given-names>R. J.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>The Empirical Assessment of Construct Validity</article-title>. <source>J.&#x20;Oper. Manag.</source> <volume>16</volume>, <fpage>387</fpage>&#x2013;<lpage>405</lpage>. </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Olivera</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Straus</surname>
<given-names>S. G.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Group-to-Individual Transfer of Learning: Cognitive and Social Factors</article-title>. <source>Small Group Res.</source> <volume>35</volume>, <fpage>440</fpage>&#x2013;<lpage>465</lpage>. <pub-id pub-id-type="doi">10.1177/1046496404263765</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Palisse</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>King</surname>
<given-names>D. M.</given-names>
</name>
<name>
<surname>MacLean</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Comparative Judgement and the Hierarchy of Students&#x27; Choice Criteria</article-title>. <source>Int. J.&#x20;Math. Edu. Sci. Tech.</source>, <fpage>1</fpage>&#x2013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1080/0020739x.2021.1962553</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pollitt</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2012a</year>). <article-title>Comparative Judgement for Assessment</article-title>. <source>Int. J.&#x20;Technol. Des. Educ.</source> <volume>22</volume>, <fpage>157</fpage>&#x2013;<lpage>170</lpage>. <pub-id pub-id-type="doi">10.1007/s10798-011-9189-x</pub-id> </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pollitt</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>On &#x2018;Reliability&#x2019; Bias in ACJ</article-title>. <source>Camb. Exam Res.</source> <volume>10</volume>, <fpage>1</fpage>&#x2013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.13140/RG.2.1.4207.3047</pub-id> </citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pollitt</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2012b</year>). <article-title>The Method of Adaptive Comparative Judgement</article-title>. <source>Assess. Educ. Principles, Pol. Pract.</source> <volume>19</volume>, <fpage>281</fpage>&#x2013;<lpage>300</lpage>. <pub-id pub-id-type="doi">10.1080/0969594x.2012.665354</pub-id> </citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Potter</surname>
<given-names>W. J.</given-names>
</name>
<name>
<surname>Levine&#x2010;Donnerstein</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Rethinking Validity and Reliability in Content Analysis</article-title>. <source>J.&#x20;Appl. Commun. Res.</source> <volume>27</volume>, <fpage>258</fpage>&#x2013;<lpage>284</lpage>. <pub-id pub-id-type="doi">10.1080/00909889909365539</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Pressman</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2018</year>). <source>Design Thinking: A Guide to Creative Problem Solving for Everyone</source>. <publisher-loc>Oxfordshire</publisher-loc>: <publisher-name>Routledge</publisher-name>. </citation>
</ref>
<ref id="B52">
<citation citation-type="web">
<collab>QCDA</collab> (<year>1999</year>). <article-title>Importance of Design and Technology Key Stage 3</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="http://archive.teachfind.com/qcda/curriculum.qcda.gov.uk/key-stages-3-and-4/subjects/key-stage-3/design-and-technology/programme-of-study/index.html">http://archive.teachfind.com/qcda/curriculum.qcda.gov.uk/key-stages-3-and-4/subjects/key-stage-3/design-and-technology/programme-of-study/index.html</ext-link>.</comment> </citation>
</ref>
<ref id="B53">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Rasch</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>1980</year>). <source>Probabilistic Models for Some Intelligence and Attainment Tests</source>. <edition>expanded edition</edition>. <publisher-loc>Chicago</publisher-loc>: <publisher-name>The University of Chicago Press</publisher-name>. </citation>
</ref>
<ref id="B54">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reddy</surname>
<given-names>Y. M.</given-names>
</name>
<name>
<surname>Andrade</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>A Review of Rubric Use in Higher Education</article-title>. <source>Assess. Eval. Higher Edu.</source> <volume>35</volume>, <fpage>435</fpage>&#x2013;<lpage>448</lpage>. <pub-id pub-id-type="doi">10.1080/02602930902862859</pub-id> </citation>
</ref>
<ref id="B55">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Rikke Friis</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Teo Yu</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Stage 2 in the Design Thinking Process: Define the Problem and Interpret the Results. Interact. Des. Found.</article-title> <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.interaction-design.org/literature/article/stage-2-in-the-design-thinking-process-define-the-problem-and-interpret-the-results">https://www.interaction-design.org/literature/article/stage-2-in-the-design-thinking-process-define-the-problem-and-interpret-the-results</ext-link>
</comment>. </citation>
</ref>
<ref id="B56">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Riofr&#xed;o</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Gettens</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Santamaria</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Keyser</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Musiak</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Spotts</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2015</year>, &#x201c;<article-title>Innovation to Entrepreneurship in the First Year Engineering Experience</article-title>.&#x201d; in <conf-name>ASEE Conferences</conf-name>, <conf-loc>Seattle, Washington</conf-loc>, <conf-date>June 14-17, 2015</conf-date>. <pub-id pub-id-type="doi">10.18260/p.24306</pub-id> </citation>
</ref>
<ref id="B57">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Rowsome</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Seery</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Lane</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2013</year>). &#x201c;<article-title>The Development of Pre-service Design Educator&#x2019;s Capacity to Make Professional Judgments on Design Capability Using Adaptive Comparative Judgment</article-title>&#x201d;. in <conf-name>2013 ASEE Annual Conference &#x26; Exposition</conf-name>, <conf-loc>Atlanta, Georgia</conf-loc>, <conf-date>June 23-26, 2013</conf-date>. <fpage>1</fpage>&#x2013;<lpage>10</lpage>. </citation>
</ref>
<ref id="B58">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Schreier</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2012</year>). <source>Qualitative Content Analysis in Practice</source>. <publisher-loc>NY, US</publisher-loc>. <publisher-name>Sage publications</publisher-name>. </citation>
</ref>
<ref id="B59">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Seery</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Canty</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Phelan</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>The Validity and Value of Peer Assessment Using Adaptive Comparative Judgement in Design Driven Practical Education</article-title>. <source>Int. J.&#x20;Technol. Des. Educ.</source> <volume>22</volume>, <fpage>205</fpage>&#x2013;<lpage>226</lpage>. <pub-id pub-id-type="doi">10.1007/s10798-011-9194-0</pub-id> </citation>
</ref>
<ref id="B60">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sohaib</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Solanki</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Dhaliwa</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Hussain</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Asif</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Integrating Design Thinking into Extreme Programming</article-title>. <source>J.&#x20;Ambient Intell. Hum. Comput</source> <volume>10</volume>, <fpage>2485</fpage>&#x2013;<lpage>2492</lpage>. <pub-id pub-id-type="doi">10.1007/s12652-018-0932-y</pub-id> </citation>
</ref>
<ref id="B61">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Spooren</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>On the Credibility of the Judge: A Cross-Classified Multilevel Analysis on Students&#x2019; Evaluation of Teaching</article-title>. <source>Stud. Educ. Eval.</source> <volume>36</volume>, <fpage>121</fpage>&#x2013;<lpage>131</lpage>. <pub-id pub-id-type="doi">10.1016/j.stueduc.2011.02.001</pub-id> </citation>
</ref>
<ref id="B69">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Strimel</surname>
<given-names>G. J.</given-names>
</name>
<name>
<surname>Bartholomew</surname>
<given-names>S. R.</given-names>
</name>
<name>
<surname>Purzer</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Ruesch</surname>
<given-names>E. Y.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Informing Engineering Design Through Adaptive Comparative Judgment</article-title>. <source>Eur. J. Eng. Educ.</source> <volume>46</volume>, <fpage>227</fpage>&#x2013;<lpage>246</lpage>. </citation>
</ref>
<ref id="B62">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Thurstone</surname>
<given-names>L. L.</given-names>
</name>
</person-group> (<year>1927</year>). <article-title>A Law of Comparative Judgment</article-title>. <source>Psychol. Rev.</source> <volume>34</volume>, <fpage>273</fpage>&#x2013;<lpage>286</lpage>. <pub-id pub-id-type="doi">10.1037/h0070288</pub-id> </citation>
</ref>
<ref id="B63">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Van Blankenstein</surname>
<given-names>F. M.</given-names>
</name>
<name>
<surname>Dolmans</surname>
<given-names>D. H. J.&#x20;M.</given-names>
</name>
<name>
<surname>van der Vleuten</surname>
<given-names>C. P. M.</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>H. G.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Which Cognitive Processes Support Learning during Small-Group Discussion? the Role of Providing Explanations and Listening to Others</article-title>. <source>Instr. Sci.</source> <volume>39</volume>, <fpage>189</fpage>&#x2013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1007/s11251-009-9124-7</pub-id> </citation>
</ref>
<ref id="B64">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Van Daal</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Lesterhuis</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Coertjens</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Donche</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>De Maeyer</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Validity of Comparative Judgement to Assess Academic Writing: Examining Implications of its Holistic Character and Building on a Shared Consensus</article-title>.&#x20;<source>Assess. Educ. Principles, Pol. Pract.</source> <volume>26</volume>, <fpage>59</fpage>&#x2013;<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1080/0969594x.2016.1253542</pub-id> </citation>
</ref>
<ref id="B65">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wedell-Wedellsborg</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Are You Solving the Right Problems</article-title>. <source>Harv. Bus. Rev.</source> <volume>95</volume>, <fpage>76</fpage>&#x2013;<lpage>83</lpage>. </citation>
</ref>
<ref id="B66">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wilson</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wright</surname>
<given-names>C. R.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>The Predictive Validity of Student Self- Evaluations, Teachers&#x27; Assessments, and Grades for Performance on the Verbal Reasoning and Numerical Ability Scales of the Differential Aptitude Test for a Sample of Secondary School Students Attj7Ending Rural Appalachia Schools</article-title>. <source>Educ. Psychol. Meas.</source> <volume>53</volume>, <fpage>259</fpage>&#x2013;<lpage>270</lpage>. <pub-id pub-id-type="doi">10.1177/0013164493053001029</pub-id> </citation>
</ref>
<ref id="B67">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wolcott</surname>
<given-names>M. D.</given-names>
</name>
<name>
<surname>McLaughlin</surname>
<given-names>J.&#x20;E.</given-names>
</name>
<name>
<surname>Hubbard</surname>
<given-names>D. K.</given-names>
</name>
<name>
<surname>Rider</surname>
<given-names>T. R.</given-names>
</name>
<name>
<surname>Umstead</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Twelve Tips to Stimulate Creative Problem-Solving with Design Thinking</article-title>. <source>Med. Teach.</source> <volume>43</volume>, <fpage>501</fpage>&#x2013;<lpage>508</lpage>. <pub-id pub-id-type="doi">10.1080/0142159X.2020.1807483</pub-id> </citation>
</ref>
<ref id="B68">
<citation citation-type="web">
<person-group person-group-type="author">
<name>
<surname>Woolery</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Design Thinking Handbook</article-title>. <comment>Available at: <ext-link ext-link-type="uri" xlink:href="https://www.designbetter.co/design-thinking">https://www.designbetter.co/design-thinking</ext-link> April</comment>. </citation>
</ref>
</ref-list>
</back>
</article>