<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychology</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychology</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Research Foundation</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2012.00229</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Hypothesis and Theory</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Calibration Research: Where Do We Go from Here?</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Bol</surname> <given-names>Linda</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001">&#x0002A;</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Hacker</surname> <given-names>Douglas J.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Educational Foundations and Leadership, Old Dominion University</institution> <country>Norfolk, VA, USA</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Educational Psychology, University of Utah, Salt Lake City</institution> <country>UT, USA</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Steve Myran, Old Dominion University, USA</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Anastasia Kitsantas, George Mason University, USA; Heidi Andrade, University at Albany, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Linda Bol, Department of Educational Foundations and Leadership, College of Education, Old Dominion University, Norfolk, VA 23529, USA. e-mail: <email>lbol&#x00040;odu.edu</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Frontiers in Educational Psychology, a specialty of Frontiers in Psychology.</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>09</day>
<month>07</month>
<year>2012</year>
</pub-date>
<pub-date pub-type="collection">
<year>2012</year>
</pub-date>
<volume>3</volume>
<elocation-id>229</elocation-id>
<history>
<date date-type="received">
<day>01</day>
<month>05</month>
<year>2012</year>
</date>
<date date-type="accepted">
<day>19</day>
<month>06</month>
<year>2012</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2012 Bol and Hacker.</copyright-statement>
<copyright-year>2012</copyright-year>
<license license-type="open-access" xlink:href="http://www.frontiersin.org/licenseagreement"><p>This is an open-access article distributed under the terms of the <uri xlink:href="http://creativecommons.org/licenses/by/3.0/">Creative Commons Attribution License</uri>, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.</p></license>
</permissions>
<abstract>
<p>Research on calibration remains a popular line of inquiry. Calibration is the degree of fit between a person&#x02019;s judgment of performance and his or her actual performance. Given the continued interest in this topic, the questions posed in this article are fruitful directions to pursue to help address gaps in calibration research. In this article, we have identified six research directions that if productively pursued, could greatly expand our knowledge of calibration. The six research directions are: (a) what are the effects of varying the anchoring mechanisms from which calibration judgments are made, (b) how does calibration accuracy differ as a function of incentives and task authenticity, (c) how do students self-report the basis of their calibration judgments, (d) how do group interactions and social comparisons affect calibration accuracy, (e) what is the relation between absolute and relative accuracy, and (f) to what extent does calibration accuracy predict achievement? To help point the way to where we go from here in calibration research, we provide these research questions, propose research methods designed to address them, and identify prior, related studies that have shown promise in leading the way to fill these gaps in the literature.</p>
</abstract>
<kwd-group>
<kwd>calibration</kwd>
<kwd>metacognition</kwd>
<kwd>self-regulated learning</kwd>
<kwd>social cognition</kwd>
<kwd>research methods</kwd>
</kwd-group>
<counts>
<fig-count count="1"/>
<table-count count="1"/>
<equation-count count="0"/>
<ref-count count="39"/>
<page-count count="6"/>
<word-count count="5598"/>
</counts>
</article-meta>
</front>
<body>
<p>Calibration has been defined as the degree of fit between a person&#x02019;s judgment of performance and his or her actual performance (Keren, <xref ref-type="bibr" rid="B17">1991</xref>). As such, calibration reflects a metacognitive monitoring process that provides information about the status of one&#x02019;s knowledge or strategies at a cognitive level (Nelson, <xref ref-type="bibr" rid="B26">1996</xref>). Based on this information, control at a metacognitive level can be exerted to regulate one&#x02019;s knowledge or strategies. Therefore, greater accuracy in a person&#x02019;s judgments of performance (i.e., being well calibrated) creates greater potential for self-regulation (Zimmerman and Moylan, <xref ref-type="bibr" rid="B39">2009</xref>).</p>
<p>The broad research literature in educational psychology and the more specific literature on self-regulated learning reveal a growing interest in calibration that is well-warranted. For instance, students studying for a test need to be accurate in monitoring their knowledge acquisition and retention if they hope to successfully control further study. On one hand, students may develop a false sense of their mastery of studied material and overestimate how well they will perform. These students&#x02019; positive biases could lead to premature termination of study and place them at risk for failure (Hacker et al., <xref ref-type="bibr" rid="B12">2008a</xref>). On the other hand, students may underestimate how well they will perform. These negative biases also can be detrimental to academic performance because students may fail to disengage from studying and misallocate study time if they assume the material is not yet mastered. When students demonstrate strong biases in their calibration judgments, they may not take the remedial steps necessary to improve or evaluate their responses during or after an exam (Hacker et al., <xref ref-type="bibr" rid="B13">2008b</xref>).</p>
<p>Although an exhaustive review of the research on calibration is beyond the scope of this paper, there are some consistent findings. Many studies have indicated that calibration accuracy is linked to achievement level (e.g., Hacker et al., <xref ref-type="bibr" rid="B14">2000</xref>; Grimes, <xref ref-type="bibr" rid="B11">2002</xref>; Bol et al., <xref ref-type="bibr" rid="B3">2005</xref>; Nietfeld et al., <xref ref-type="bibr" rid="B27">2005</xref>). In general, higher-achieving students tend to be more accurate but more underconfident when compared to their lower-achieving counterparts. Another consistent finding is that postdictions are typically more accurate than predictions (Pressley and Ghatala, <xref ref-type="bibr" rid="B29">1990</xref>; Maki and Serra, <xref ref-type="bibr" rid="B22">1992</xref>). This phenomenon makes intuitive sense because a person should be better able to judge how he or she performed after the completion of the task due to familiarity and exposure to the task itself (Hacker et al., <xref ref-type="bibr" rid="B14">2000</xref>). However, task difficulty also influences calibration accuracy. Juslin et al. (<xref ref-type="bibr" rid="B16">2000</xref>) identified the <italic>hard-easy effect</italic> in which students tend to be more accurate but underconfident on easy items and less accurate but overconfident on difficult items.</p>
<p>However, other findings have been less consistent and some areas of investigation have not yet been broached. Our purpose is to propose a research agenda that will shed light on the inconsistent findings and address those areas of research that have not yet received attention. We will propose our agenda using Zimmerman and colleagues&#x02019; social cognitive model of self-regulation, specifically their personal feedback loop, as a theoretical foundation on which research can be guided (Schunk and Zimmerman, <xref ref-type="bibr" rid="B33">1997</xref>; Zimmerman, <xref ref-type="bibr" rid="B38">2008</xref>; Zimmerman and Moylan, <xref ref-type="bibr" rid="B39">2009</xref>). Briefly stated, self-regulation depends on this personal feedback loop, which provides a person with the necessary information about the status of one&#x02019;s knowledge or strategies. The self-regulatory feedback consists of three cyclical phases: forethought, performance, and self-reflection (see Figure <xref ref-type="fig" rid="F1">1</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>Zimmerman&#x02019;s (<xref ref-type="bibr" rid="B37">2000</xref>) cyclical model of self-regulation</bold>.</p></caption>
<graphic xlink:href="fpsyg-03-00229-g001.tif"/>
</fig>
<p>The forethought phase sets the stage for action by providing information about the components of the task at hand, what goals and strategies need to be initiated, and whether the learner has the self-efficacy and self-motivation to accomplish the task. Learners may have a difficult time accurately self-assessing each of these areas of forethought for several reasons, two of which we address here. Estimates of performance have been shown to be biased toward some initial anchor, and learners often do not adequately adjust from these anchors. Therefore, knowing the psychological bases for these anchors and how learners can debias their judgments is not a trivial matter (research question a). In addition, because studies on the effects of incentives on motivation to achieve greater accuracy have been mixed, greater attention needs to be focused on how motivation can be manipulated (research question b).</p>
<p>During the performance phase, the learners gain feedback concerning self-control and self-observation, processes that are essential for continued attention to and action on a task. Learners self-explanations about how and why they use their self-observations to self-control and whether those self-explanations are mediated or moderated by other factors such as attributional style (research question c) or social influences (research question d) are critical areas of investigation. Maintaining attention and action on a task also requires that ongoing performance is being judged accurately. Performance can be judged at a global level (e.g., How prepared is a learner for an upcoming test?) and at local levels (e.g., Is the answer to this question correct?). Knowing whether there is a relation between these global and local levels of judgment can provide insights into the psychological mechanisms upon which they are made (research question e).</p>
<p>Finally, during the self-reflection phase, the learner makes self-judgments and self-reactions on their performance. This feedback on whether actual achievement, the end product of self-regulation, is high or low, satisfactory or unsatisfactory, or simply just good enough, then exerts an influence on whether further action will be taken in the forethought phase. Therefore, knowing whether there is a payoff to self-regulation is instrumental to continued self-regulation (research question f).</p>
<sec>
<title>Proposed Directions</title>
<p>Our attention now turns more specifically to our proposed research agenda on how knowledge of calibration can be promoted in further lines of inquiry. Table <xref ref-type="table" rid="T1">1</xref> presents the six research areas and questions already addressed as well as designs and variables aligned with these questions. In addition, we identify prior, related studies that have shown promise in filling these gaps in the literature.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Proposed questions, design, and exemplar studies in calibration research</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left">Research questions</th>
<th align="left">Design</th>
<th align="left">Variables</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left">What are the effects of varying the anchoring mechanisms from which calibration judgments are made?</td>
<td align="left">True experimental</td>
<td align="left"><italic>Treatment</italic>: manipulating initial judgments<break/><italic>Measures</italic>: extent and type of adjustments for judgments</td>
</tr>
<tr>
<td align="left">How does calibration accuracy differ as a function of incentives and task authenticity?</td>
<td align="left">Comparative</td>
<td align="left"><italic>Independent variables</italic>: type of task, and incentive<break/><italic>Measure</italic>: calibration accuracy</td>
</tr>
<tr>
<td align="left">How do students self-report the basis for their calibration judgments?</td>
<td align="left">Qualitative</td>
<td align="left"><italic>Measures</italic>: interviews and think-aloud protocols</td>
</tr>
<tr>
<td align="left">How do group interactions and social comparisons affect calibration accuracy?</td>
<td align="left">Experimental, factorial</td>
<td align="left"><italic>Treatment</italic>: individual or group settings with or without social comparisons</td>
</tr>
<tr>
<td align="left"/>
<td align="left"/>
<td align="left"><italic>Measures</italic>: calibration accuracy, group interactions</td>
</tr>
<tr>
<td align="left">What is the relationship between absolute and relative accuracy?</td>
<td align="left">Correlational</td>
<td align="left"><italic>Measures</italic>: absolute and relative accuracy on items, topics/concepts, and overall performance</td>
</tr>
<tr>
<td align="left">To what extent does calibration accuracy predict achievement?</td>
<td align="left">Correlational</td>
<td align="left"><italic>Measures</italic>: calibration accuracy and achievement</td>
</tr>
</tbody>
</table>
</table-wrap>
<sec>
<title>What are the effects of varying the anchoring mechanisms from which calibration judgments are made?</title>
<p>In their seminal article, <italic>Judgment under Uncertainty: Heuristics and Biases</italic>, Tversky and Kahneman (<xref ref-type="bibr" rid="B34">1974</xref>) proposed that people make estimates of their performance by starting from some initial value and make adjustments that are biased toward that initial value. Their claim was that these adjustments are often insufficient so that the estimates continue to appear biased. The anchoring-and-adjustment effect has been widely recognized in the decision-making literature (e.g., Mussweiler et al., <xref ref-type="bibr" rid="B25">2000</xref>; Epley and Gilovich, <xref ref-type="bibr" rid="B9">2004</xref>, <xref ref-type="bibr" rid="B10">2005</xref>), and it has been used to explain underconfidence in calibration research (e.g., Scheck et al., <xref ref-type="bibr" rid="B30">2004</xref>).</p>
<p>Researchers of anchoring-and-adjustment effects make a distinction about who sets the initial value from which subsequent adjustments are made. In some cases, the initial value is set by another person (e.g., a salesperson setting the price for a new car) or by oneself (e.g., when guessing how long it takes Mars to orbit the sun, people often select an anchor on Earth&#x02019;s orbit and then adjust from that value; Epley and Gilovich, <xref ref-type="bibr" rid="B9">2004</xref>). These self-generated anchors are the ones that we believe could potentially influence people&#x02019;s calibration judgments.</p>
<p>Scheck and Nelson (<xref ref-type="bibr" rid="B31">2005</xref>) used anchoring-and-adjustment as an explanation for the underconfidence-with-practice (UWP) effect proposed by Koriat et al. (<xref ref-type="bibr" rid="B18">2002</xref>). The UWP effect is a robust finding in which people initially show overconfident calibration when making judgments of learning (JOLs) but subsequently become underconfident after their second study trial. Scheck and Nelson (<xref ref-type="bibr" rid="B31">2005</xref>) hypothesized that people form a psychological anchor for their JOLs somewhere between 30 and 50% of correct recall and adjust their JOLs either upwards or downwards based on whether performance is above or below this band. They found that when performance was above 50% after the second study trial, participants adjusted both immediate and delayed JOLs downward in relation to recall, thereby appearing to be underconfident. When performance was below 30% after the second study trial, participants adjusted their delayed JOLs upwards in relation to recall, thereby appearing to be overconfident; and when performance was at 30%, their immediate JOLs were near perfect. If accuracy of calibration is a necessary condition for self-regulated learning, then a potent research question is how can higher achievers overcome their underconfident JOLs and lower achievers overcome their overconfident JOLs without adversely impacting students who are between the two groups?</p>
<p>Research into this question can be informed by research conducted by Epley and Gilovich (<xref ref-type="bibr" rid="B10">2005</xref>) in which insufficient adjustments were compensated by providing financial incentives that motivated participants to make additional adjustments. Also, forewarning participants that initial adjustments are inadequate helped them to engage in additional adjustments, resulting in greater accuracy. Finally, Zhao and Linderholm (<xref ref-type="bibr" rid="B36">2011</xref>) found that providing information about peer performance on a task can serve as an anchor for metacomprehension judgments and can be used to debias judgments. We propose initiating research on this topic by posing the following question, &#x0201C;What are the effects of varying the anchoring mechanism from which calibration judgments are made?&#x0201D; Experimental work following any one of these lines of research could be pursued to demonstrate viable ways to encourage adjustment away from initial judgments and toward greater calibration accuracy.</p>
</sec>
<sec>
<title>How does calibration accuracy differ as a function of incentives and task authenticity?</title>
<p>Granted, motivation is a broad construct, but as in other areas of education research, calibration cannot be thoroughly understood without reference to motivational variables. To narrow the focus of this line of inquiry, we have selected incentives and task authenticity as starting points. These represent reasonable starting points because they represent salient constructs in the forethought phase of self-regulated learning and can be readily generalized to classroom contexts. In our conceptualization, incentives would be some type of course credit, and task authenticity would be learning course content. Focusing on learning course content stands in stark contrast to typically used tasks in calibration research, such as learning paired associates in a remote tribal language to control for prior learning.</p>
<p>Studies examining the impact of incentives on calibration are rare. In the preceding section, we referenced Epley and Gilovich&#x02019;s (<xref ref-type="bibr" rid="B10">2005</xref>) study in which financial incentives were employed to motivate participants to adjust their metacognitive judgments. In another more ecologically valid study, we manipulated incentives and reflection in a fully crossed quasi-experiment conducted in college course (Hacker et al., <xref ref-type="bibr" rid="B12">2008a</xref>). Students in an extrinsic reward condition were told they would receive one to four additional points on each of three exams, depending on their calibration accuracy, with more points given for greater accuracy. We found that incentives significantly improved calibration accuracy but only among lower-achieving students, which may have been the result of greater motivation on their part to be accurate to earn the additional points. Schraw et al. (<xref ref-type="bibr" rid="B32">1993</xref>) also provided evidence for the effectiveness of incentives for promoting calibration accuracy. In their procedure, students also received extra credit either for improving performance on a test or increasing their calibration accuracy. Though performance improved in both incentive conditions, the findings revealed that incentivizing accurate calibration was more effective than incentivizing improved performance.</p>
<p>Thus, the research question we propose is how does calibration accuracy differ as a function of incentive or task authenticity? Because it would be difficult to isolate these variables while controlling for all other influences, comparative studies might be more appropriate initially. In future research, the influence of both incentives and task authenticity could be investigated in the same study. For example, in a within-subjects design, calibration accuracy might be compared for students completing more authentic versus contrived tasks in conditions where incentives are or are not present.</p>
</sec>
<sec>
<title>How do students self-report the basis for their calibration judgments?</title>
<p>Earlier we posed a research question related to how individuals might anchor and then adjust their calibration. What we have not adequately addressed is how students self-report the basis for calibration judgments. Gathering self-report data from students regarding their calibration judgments is rare but not unprecedented. In a mixed methods study, Hacker et al. (<xref ref-type="bibr" rid="B12">2008a</xref>) asked college students to identify factors that influenced the accuracy of their predictions and postdictions. Attributional style was used as the theoretical framework to organize the data. The most frequent explanations focused on internal, student-centered constructs. Students were most likely to attribute discrepancies between their scores and calibration judgments to how much or how well they studied or how well they felt they knew the material. Another frequently reported factor was test-taking ability and prior performance on tests as well as expectations for test content and difficulty. Similar results were reported in a study with middle school students (Bol et al., <xref ref-type="bibr" rid="B5">2010</xref>). In this study, immediately after making their predictions, students were asked why they predicted that score. The most frequent categories of responses centered on time and effort spent studying, global perceptions of their own abilities, and past performance. After making their postdictions, students again were asked to explain why these were accurate or inaccurate. Explanations focused on knowing the number answered correctly, their expectations of test difficulty, the effort exerted in studying, and their global sense of self-confidence.</p>
<p>Bandura&#x02019;s model of reciprocal determinism has also been used to categorize responses (Dinsmore and Parkinson, <xref ref-type="bibr" rid="B7">2012</xref>). Participants were asked to explain how they arrived at or what was considered when making confidence judgments. Instances of the <italic>a priori</italic> categories observed in student responses included prior knowledge, characteristics of the text and the items, and guessing. Students often cited a combination of personal and task characteristics. The combination of how students explain their calibration judgments and how judgments are measured may be important for understanding how judgments contribute to performance.</p>
<p>Although the results from a few studies have helped us understand how students describe the basis for their calibration judgments, more research is warranted. As employed in the studies just described, a qualitative approach grounded in a theoretical framework would be most appropriate and revealing. Qualitative data could be collected via open-ended responses to surveys and think-aloud protocols in which real-time data would be collected as students are considering, making, and explaining their calibration judgments.</p>
<p>Threats to validity in self-reported data cannot be avoided. Consequently, in the qualitative tradition, we might rely on triangulation strategies to support the credibility or transferability of findings. Researchers (Azevedo et al., <xref ref-type="bibr" rid="B1">2010</xref>; Winne, <xref ref-type="bibr" rid="B35">2010</xref>) recommend combining self-report data about self-regulated learning with trace evidence that reflects students&#x02019; cognitive operations (e.g., highlighting text). Data collected in computer-based learning environments should facilitate these types of studies. For example, if students attribute their lack of understanding of a topic to lack of study time, actual time spent studying the topic could be calculated. We might also follow their navigation patterns to determine whether they return to content judged to be in need of further study.</p>
</sec>
<sec>
<title>How do group interactions and social comparisons affect calibration accuracy?</title>
<p>Experimental manipulations centered on group interactions and social comparisons have already shown promise in improving calibration accuracy. In our recent factorial experiment with high school biology students (Bol et al., <xref ref-type="bibr" rid="B4">2012</xref>), half of the students practiced calibration in groups while the other half practiced calibration individually. The second treatment variable was whether students used guidelines to gage their judgments of how well they mastered the content. We found both group settings and guidelines to be effective in promoting calibration accuracy and achievement. Other studies have demonstrated that the combination of group learning contexts and guiding questions promoted metacognitive skills and achievement (Kramarski and Mevarech, <xref ref-type="bibr" rid="B20">2003</xref>; Kramarski and Dudai, <xref ref-type="bibr" rid="B19">2009</xref>).</p>
<p>Group work logically elicits implicit or explicit social comparisons. Carvalho and Yuzawa (<xref ref-type="bibr" rid="B6">2001</xref>) manipulated social comparisons by presenting some participants with information concerning the mean percentage of correctly answered questions that a fictitious group of fellow students presumably scored. This information was presented prior to participants making their own metacognitive judgments on their performance. The results indicated that social comparisons did impact the magnitude of metacognitive judgments with greater magnitude in judgments associated with higher performance. Other results suggested that participants with little confidence in their judgments may be particularly susceptible to social influences.</p>
<p>Given that group settings and social comparisons can influence metacognitive judgments, the next logical step would be to manipulate both of these variables in a factorial experiment. The question posed is how do group interactions and social comparisons affect calibration accuracy? Students would be asked to calibrate in group or individual settings and would do so with or without social comparisons. Because earlier studies suggest that guidelines promote accuracy in metacognitive judgments (Kramarski and Dudai, <xref ref-type="bibr" rid="B19">2009</xref>; Bol et al., <xref ref-type="bibr" rid="B4">2012</xref>), all four groups would receive guidelines. The social comparisons could be presented to half of the students as part of the guidelines employed by students, and could take the form of mean accuracy scores achieved by low, middle, and high achievers. However, rather than using fictitious scores as in the Carvalho and Yuzawa (<xref ref-type="bibr" rid="B6">2001</xref>) research, actual scores could be presented, which may help students differentiate between more and less reasonable calibration judgments and how they may be tied to achievement levels. This may be especially beneficial for lower-achieving students who tend to overestimate their performance (e.g., Bol et al., <xref ref-type="bibr" rid="B5">2010</xref>). Noting the group interactions among students assigned to this condition may further illuminate how social interactions or comparisons may affect calibration judgments.</p>
</sec>
<sec>
<title>What is the relationship between absolute and relative accuracy?</title>
<p>Calibration (aka absolute accuracy) provides estimates of overall memory retrieval (Lichtenstein et al., <xref ref-type="bibr" rid="B21">1982</xref>; Keren, <xref ref-type="bibr" rid="B17">1991</xref>; Nietfeld et al., <xref ref-type="bibr" rid="B28">2006</xref>), and relative accuracy (aka discrimination) provides estimates of whether a person&#x02019;s judgments can predict the likelihood of correct performance of one item relative to another (Nelson, <xref ref-type="bibr" rid="B26">1996</xref>). For instance, a student can judge that overall he or she will get 85% of the items on a test correct and in fact get 85% correct (i.e., perfect calibration accuracy), but upon closer examination, the student may have given high confidence judgments to items answered incorrectly and low confidence to items answered correctly, in which case, relative accuracy may be close to chance. Both types of accuracy are important for students to self-regulate their learning; however, whether there are shared psychological processes contributing to both is an important area for investigation (Maki et al., <xref ref-type="bibr" rid="B23">2005</xref>).</p>
<p>Accuracy of both global and item-level calibration judgments plays an important role in current and future study efforts because poor judgments can lead to either premature or protracted termination of study of general and specific content. Current research of metacomprehension judgments has shown that there is little or no relation between absolute and relative accuracy (Maki et al., <xref ref-type="bibr" rid="B23">2005</xref>). However, other areas of metacognitive research have not received much attention. For example, we are not aware of any research that has examined absolute and relative accuracy for a typical classroom exam, consisting of multiple-choice or true/false items. If there is a relation between the two, students&#x02019; overall global judgments about what they know about the to be tested material may be based on an appraisal of knowing specific and well-defined concepts from that material. In that case, the psychological processes that contribute to one may help to inform the other. However, if there is no relation between the two, then either there is a mismatch between what students believe is to be tested and what is actually tested, or the global judgments may not be helpful because they provide no indication of how students will perform on specific test knowledge.</p>
<p>Therefore, future correlational research could examine whether there a relation between absolute accuracy at the test-level and relative accuracy at the item-level. If such a relation exists, whether it varies by item difficulty or type would be important to examine further. Moreover, if this relation exists, developing interventions that could capitalize on it and provide students with ways of better judging item-level and test-level knowledge to prepare for tests would be important contributions to self-regulated learning.</p>
</sec>
<sec>
<title>To what extent does calibration accuracy predict achievement?</title>
<p>Although widespread acceptance has been given to the theoretical argument that accurate metacognitive monitoring is essential to self-regulated learning, the empirical question of whether achievement is enhanced because of accurate monitoring has received surprisingly scant attention. Correlational and experimental studies have established that monitoring can positively impact decisions about what to study (e.g., Metcalfe and Finn, <xref ref-type="bibr" rid="B24">2008</xref>; Hines et al., <xref ref-type="bibr" rid="B15">2009</xref>); but whether that studying leads to gains in achievement is a question in need of further support (Dunlosky and Rawson, <xref ref-type="bibr" rid="B8">2011</xref>). Some calibration studies have shown a positive relation between calibration accuracy and achievement level (e.g., Hacker et al., <xref ref-type="bibr" rid="B14">2000</xref>; Bol and Hacker, <xref ref-type="bibr" rid="B2">2001</xref>). In addition, Nietfeld et al. (<xref ref-type="bibr" rid="B28">2006</xref>) and Bol et al. (<xref ref-type="bibr" rid="B4">2012</xref>) demonstrated that students who participated in interventions to increase calibration accuracy realized higher gains in achievement than students who did not participate in them. However, in both these studies, treatment and classroom assignment were conflated, leaving open the question of whether internal validity was potentially compromised.</p>
<p>Dunlosky and Rawson (<xref ref-type="bibr" rid="B8">2011</xref>) experimentally manipulated judgment accuracy by asking participants to study key-term definitions, one group used an idea-unit standard in which the participants were shown their responses with the idea units contained in the correct answer, and another group used their responses but without access to the correct answer. After being shown their responses, participants made a self-score judgment about whether their answer was correct. The test-judge-study trails continued until a definition was judged as correct three times. Two days later, all participants were administered a retention test. Findings indicated that greater accuracy was related to greater retention. Moreover, they found that participants who were overconfident in their judgments prematurely terminated study, and as a consequence their retention suffered.</p>
<p>Similar experimental manipulations of accuracy need to be conducted to more firmly establish the link between calibration accuracy and achievement. Empirical findings provide multiple ways to manipulate accuracy: study guidelines used in group settings (Bol et al., <xref ref-type="bibr" rid="B4">2012</xref>), self-assessment with feedback (Nietfeld et al., <xref ref-type="bibr" rid="B28">2006</xref>), and feedback with idea-unit standards (Dunlosky and Rawson, <xref ref-type="bibr" rid="B8">2011</xref>). Employing these accuracy manipulations and measuring subsequent retention could provide valuable support for the importance of accurate monitoring and provide information about whether specific manipulations of accuracy lead to greater retention. In addition, the kinds of learning could be manipulated. The tasks used in an experiment could be varied from simple paired associates to multiple-choice tests to text comprehension. Firmly establishing the link between monitoring accuracy and achievement is a critical goal for calibration research.</p>
</sec>
</sec>
<sec>
<title>Summary and Conclusion</title>
<p>We do not assume that these are the only fruitful research directions to pursue in order to more thoroughly understand calibration and its role in promoting self-regulated learning. However, they represent a good start. The research agenda outlined is based on the social cognitive model of self-regulation developed by Zimmerman and his colleagues (Schunk and Zimmerman, <xref ref-type="bibr" rid="B33">1997</xref>; Zimmerman, <xref ref-type="bibr" rid="B38">2008</xref>; Zimmerman and Moylan, <xref ref-type="bibr" rid="B39">2009</xref>). In alignment with the forethought phase we propose investigating the effects of anchoring, incentives, and task authenticity on calibration judgments to reflect psychological processes linked to self-efficacy and motivation. Research questions focused on social influences, self-explanations, and the basis for metacognitive judgments are represented in the performance phase of the model. More specifically, we propose addressing student explanations for calibration judgments, the impact of group interactions and social comparison on calibration accuracy, and the relationship between absolute and relative accuracy. In the self-reflection phase, learners judge and react to their performance or achievement. The final question reflects the extent to which calibration accuracy predicts achievement. Feedback on performance and self-reflection influences subsequent, cyclical phases of self-regulated learning.</p>
<p>Similarly, the methods we propose are not exhaustive and reflect examples of how these questions may be pursued. The methods we propose range from qualitative approaches investigating how students explain their calibrations judgments, comparing motivational factors linked to task characteristics, correlating absolute and relative accuracy, predicting achievement based on calibration accuracy, and manipulating anchoring and adjustment effects as well as social interactions in controlled experiments.</p>
<p>Calibration research will be further advanced when we identify patterns of findings guided by sound theoretical models and based on precise descriptions of terms, measures, contexts, tasks, and populations. As we have argued previously (Hacker et al., <xref ref-type="bibr" rid="B13">2008b</xref>), calibration has been measured in different ways but largely studied in more contrived contexts using college students. Granted, we must consider the trade-off between internal and external validity as we move into more naturalistic settings, such as classrooms and employ more authentic tasks. Various research methods with varying levels of control will better inform our questions overall.</p>
</sec>
<sec>
<title>Conflict of Interest Statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Azevedo</surname> <given-names>R.</given-names></name> <name><surname>Moos</surname> <given-names>D. C.</given-names></name> <name><surname>Johnson</surname> <given-names>A. M.</given-names></name> <name><surname>Chauncey</surname> <given-names>A. D.</given-names></name></person-group> (<year>2010</year>). <article-title>Measuring cognitive and regulatory processes during hypermedia learning: issues and challenges</article-title>. <source>Educ. Psychol.</source> <volume>45</volume>, <fpage>210</fpage>&#x02013;<lpage>223</lpage>.<pub-id pub-id-type="doi">10.1080/00461520.2010.515934</pub-id></citation></ref>
<ref id="B2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bol</surname> <given-names>L.</given-names></name> <name><surname>Hacker</surname> <given-names>D.</given-names></name></person-group> (<year>2001</year>). <article-title>A comparison of the effects of practice tests and traditional review on performance and calibration</article-title>. <source>J. Exp. Educ.</source> <volume>69</volume>, <fpage>133</fpage>&#x02013;<lpage>151</lpage>.<pub-id pub-id-type="doi">10.1080/00220970109600653</pub-id></citation></ref>
<ref id="B3"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bol</surname> <given-names>L.</given-names></name> <name><surname>Hacker</surname> <given-names>D. J.</given-names></name> <name><surname>O&#x02019;Shea</surname> <given-names>P.</given-names></name> <name><surname>Allen</surname> <given-names>D.</given-names></name></person-group> (<year>2005</year>). <article-title>The influence of overt practice, achievement level, and explanatory style on calibration accuracy and performance</article-title>. <source>J. Exp. Educ.</source> <volume>73</volume>, <fpage>269</fpage>&#x02013;<lpage>290</lpage>.<pub-id pub-id-type="doi">10.3200/JEXE.73.4.269-290</pub-id></citation></ref>
<ref id="B4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bol</surname> <given-names>L.</given-names></name> <name><surname>Hacker</surname> <given-names>D. J.</given-names></name> <name><surname>Walck</surname> <given-names>C.</given-names></name> <name><surname>Nunnery</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>The effect of individual or group guidelines on the calibration accuracy and achievement of high school biology students</article-title>. <source>Contemp. Educ. Psychol</source>.<pub-id pub-id-type="doi">10.1016/j.cedpsych.2012.02.004</pub-id><pub-id pub-id-type="pmid">22711971</pub-id></citation></ref>
<ref id="B5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bol</surname> <given-names>L.</given-names></name> <name><surname>Riggs</surname> <given-names>R.</given-names></name> <name><surname>Hacker</surname> <given-names>D. J.</given-names></name> <name><surname>Nunnery</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>The calibration accuracy of middle school students in math classes</article-title>. <source>J. Res. Educ.</source> <volume>21</volume>, <fpage>81</fpage>&#x02013;<lpage>96</lpage>.</citation></ref>
<ref id="B6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carvalho</surname> <given-names>M. K. F.</given-names></name> <name><surname>Yuzawa</surname> <given-names>M.</given-names></name></person-group> (<year>2001</year>). <article-title>The effects of social cues on confidence judgments mediated by knowledge and regulation of cognition</article-title>. <source>J. Exp. Educ.</source> <volume>69</volume>, <fpage>325</fpage>&#x02013;<lpage>343</lpage>.<pub-id pub-id-type="doi">10.1080/00220970109599491</pub-id></citation></ref>
<ref id="B7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dinsmore</surname> <given-names>D.</given-names></name> <name><surname>Parkinson</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>What are confidence judgments made of? Students&#x02019; explanations for their confidence ratings and what that means for calibration</article-title>. <source>Instr. Psychol</source>.</citation></ref>
<ref id="B8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dunlosky</surname> <given-names>J.</given-names></name> <name><surname>Rawson</surname> <given-names>K. A.</given-names></name></person-group> (<year>2011</year>). <article-title>Overconfidence produces underachievement: inaccurate self-evaluations undermine students&#x02019; learning and retention</article-title>. <source>Learn. Instr.</source> <volume>22</volume>, <fpage>271</fpage>&#x02013;<lpage>280</lpage>.<pub-id pub-id-type="doi">10.1016/j.learninstruc.2011.08.003</pub-id></citation></ref>
<ref id="B9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Epley</surname> <given-names>N.</given-names></name> <name><surname>Gilovich</surname> <given-names>T.</given-names></name></person-group> (<year>2004</year>). <article-title>Are adjustments insufficient?</article-title> <source>Pers. Soc. Psychol. Bull.</source> <volume>30</volume>, <fpage>447</fpage>&#x02013;<lpage>460</lpage>.<pub-id pub-id-type="doi">10.1177/0146167203261889</pub-id><pub-id pub-id-type="pmid">15070474</pub-id></citation></ref>
<ref id="B10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Epley</surname> <given-names>N.</given-names></name> <name><surname>Gilovich</surname> <given-names>T.</given-names></name></person-group> (<year>2005</year>). <article-title>When effortful thinking influences judgmental anchoring: differential effects of forewarning and incentives on self-generated and externally provided anchors</article-title>. <source>J. Behav. Decis. Mak.</source> <volume>18</volume>, <fpage>199</fpage>&#x02013;<lpage>212</lpage>.<pub-id pub-id-type="doi">10.1002/bdm.495</pub-id></citation></ref>
<ref id="B11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grimes</surname> <given-names>P. W.</given-names></name></person-group> (<year>2002</year>). <article-title>The overconfident principles of economics students: an examination of a metacognitive skill</article-title>. <source>J. Econ. Educ.</source> <volume>33</volume>, <fpage>15</fpage>&#x02013;<lpage>30</lpage>.<pub-id pub-id-type="doi">10.1080/00220480209596121</pub-id></citation></ref>
<ref id="B12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hacker</surname> <given-names>D.</given-names></name> <name><surname>Bol</surname> <given-names>L.</given-names></name> <name><surname>Bahbahani</surname> <given-names>K.</given-names></name></person-group> (<year>2008a</year>). <article-title>Explaining calibration accuracy in classroom contexts: the effects of incentives, reflection, and explanatory style</article-title>. <source>Metacogn. Learn.</source> <volume>3</volume>, <fpage>101</fpage>&#x02013;<lpage>121</lpage>.<pub-id pub-id-type="doi">10.1007/s11409-008-9021-5</pub-id></citation></ref>
<ref id="B13"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Hacker</surname> <given-names>D. J.</given-names></name> <name><surname>Bol</surname> <given-names>L.</given-names></name> <name><surname>Keener</surname> <given-names>M. C.</given-names></name></person-group> (<year>2008b</year>). <article-title>&#x0201C;Metacognition in education: a focus on calibration,&#x0201D;</article-title> in <source>Handbook of Memory and Metacognition</source>, eds <person-group person-group-type="editor"><name><surname>Dunlosky</surname> <given-names>J.</given-names></name> <name><surname>Bjork</surname> <given-names>R.</given-names></name></person-group> (<publisher-loc>Mahwah, NJ</publisher-loc>: <publisher-name>Lawrence Erlbaum Associates</publisher-name>), <fpage>411</fpage>&#x02013;<lpage>455</lpage>.</citation></ref>
<ref id="B14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hacker</surname> <given-names>D. J.</given-names></name> <name><surname>Bol</surname> <given-names>L.</given-names></name> <name><surname>Horgan</surname> <given-names>D.</given-names></name> <name><surname>Rakow</surname> <given-names>E. A.</given-names></name></person-group> (<year>2000</year>). <article-title>Test prediction and performance in a classroom context</article-title>. <source>J. Educ. Psychol.</source> <volume>92</volume>, <fpage>160</fpage>&#x02013;<lpage>170</lpage>.<pub-id pub-id-type="doi">10.1037/0022-0663.92.1.160</pub-id></citation></ref>
<ref id="B15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hines</surname> <given-names>J. C.</given-names></name> <name><surname>Touron</surname> <given-names>D. R.</given-names></name> <name><surname>Hertzog</surname> <given-names>C.</given-names></name></person-group> (<year>2009</year>). <article-title>Metacognitive influences on study time allocation in an associative recognition task: an analysis of adult age differences</article-title>. <source>Psychol. Aging</source> <volume>24</volume>, <fpage>462</fpage>&#x02013;<lpage>475</lpage>.<pub-id pub-id-type="doi">10.1037/a0014417</pub-id><pub-id pub-id-type="pmid">19485662</pub-id></citation></ref>
<ref id="B16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Juslin</surname> <given-names>P.</given-names></name> <name><surname>Winman</surname> <given-names>A.</given-names></name> <name><surname>Olsson</surname> <given-names>H.</given-names></name></person-group> (<year>2000</year>). <article-title>Na&#x000EF;ve empiricism and dogmatisim in confidence research: a critical examination of the hard-easy effect</article-title>. <source>Psychol. Rev.</source> <volume>107</volume>, <fpage>384</fpage>&#x02013;<lpage>396</lpage>.<pub-id pub-id-type="doi">10.1037/0033-295X.107.2.384</pub-id><pub-id pub-id-type="pmid">10789203</pub-id></citation></ref>
<ref id="B17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Keren</surname> <given-names>G.</given-names></name></person-group> (<year>1991</year>). <article-title>Calibration and probability judgments: conceptual and methodological issues</article-title>. <source>Acta Psychol. (Amst.)</source> <volume>77</volume>, <fpage>217</fpage>&#x02013;<lpage>273</lpage>.<pub-id pub-id-type="doi">10.1016/0001-6918(91)90036-Y</pub-id></citation></ref>
<ref id="B18"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Koriat</surname> <given-names>A.</given-names></name> <name><surname>Sheffer</surname> <given-names>L.</given-names></name> <name><surname>Ma&#x02019;ayan</surname> <given-names>H.</given-names></name></person-group> (<year>2002</year>). <article-title>Comparing objective and subjective learning curves: judgments of learning exhibit increased underconfidence with practice</article-title>. <source>J. Exp. Psychol. Gen.</source> <volume>131</volume>, <fpage>147</fpage>&#x02013;<lpage>162</lpage>.<pub-id pub-id-type="doi">10.1037/0096-3445.131.2.147</pub-id><pub-id pub-id-type="pmid">12049237</pub-id></citation></ref>
<ref id="B19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kramarski</surname> <given-names>B.</given-names></name> <name><surname>Dudai</surname> <given-names>V.</given-names></name></person-group> (<year>2009</year>). <article-title>Group metacognitive support for on-line inquiry in mathematics with differential self-questioning</article-title>. <source>J. Educ. Comput. Res.</source> <volume>40</volume>, <fpage>377</fpage>&#x02013;<lpage>404</lpage>.<pub-id pub-id-type="doi">10.2190/EC.40.4.a</pub-id></citation></ref>
<ref id="B20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kramarski</surname> <given-names>B.</given-names></name> <name><surname>Mevarech</surname> <given-names>Z. R.</given-names></name></person-group> (<year>2003</year>). <article-title>Enhancing mathematical reasoning in the classroom: the effects of cooperative learning and metacognitive training</article-title>. <source>Am. Educ. Res. J.</source> <volume>40</volume>, <fpage>281</fpage>&#x02013;<lpage>310</lpage>.<pub-id pub-id-type="doi">10.3102/00028312040001281</pub-id></citation></ref>
<ref id="B21"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Lichtenstein</surname> <given-names>S.</given-names></name> <name><surname>Fischhoff</surname> <given-names>B.</given-names></name> <name><surname>Phillips</surname> <given-names>L. D.</given-names></name></person-group> (<year>1982</year>). <article-title>&#x0201C;Calibration of probabilities: the state of the art to 1980,&#x0201D;</article-title> in <source>Judgment Under Uncertainty: Heuristics and Biases</source>, eds <person-group person-group-type="editor"><name><surname>Kahneman</surname> <given-names>D.</given-names></name> <name><surname>Slovic</surname> <given-names>P.</given-names></name> <name><surname>Tversky</surname> <given-names>A.</given-names></name></person-group> (<publisher-loc>Hillsdale, NJ</publisher-loc>: <publisher-name>Erlbaum</publisher-name>), <fpage>306</fpage>&#x02013;<lpage>334</lpage>.</citation></ref>
<ref id="B22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maki</surname> <given-names>R. H.</given-names></name> <name><surname>Serra</surname> <given-names>M.</given-names></name></person-group> (<year>1992</year>). <article-title>The basis of test predictions for text material</article-title>. <source>J. Exp. Psychol. Learn. Mem. Cogn.</source> <volume>18</volume>, <fpage>116</fpage>&#x02013;<lpage>126</lpage>.<pub-id pub-id-type="doi">10.1037/0278-7393.18.1.116</pub-id></citation></ref>
<ref id="B23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maki</surname> <given-names>R. H.</given-names></name> <name><surname>Shields</surname> <given-names>M.</given-names></name> <name><surname>Wheller</surname> <given-names>A. E.</given-names></name> <name><surname>Zacchilli</surname> <given-names>T. L.</given-names></name></person-group> (<year>2005</year>). <article-title>Individual differences in absolute and relative metacomprehension accuracy</article-title>. <source>J. Educ. Psychol.</source> <volume>97</volume>, <fpage>723</fpage>&#x02013;<lpage>731</lpage>.<pub-id pub-id-type="doi">10.1037/0022-0663.97.4.723</pub-id></citation></ref>
<ref id="B24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Metcalfe</surname> <given-names>J.</given-names></name> <name><surname>Finn</surname> <given-names>B.</given-names></name></person-group> (<year>2008</year>). <article-title>Evidence that judgments of learning are causally related to study choice</article-title>. <source>Psychon. Bull. Rev.</source> <volume>15</volume>, <fpage>174</fpage>&#x02013;<lpage>179</lpage>.<pub-id pub-id-type="doi">10.3758/PBR.15.1.174</pub-id><pub-id pub-id-type="pmid">18605499</pub-id></citation></ref>
<ref id="B25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mussweiler</surname> <given-names>T.</given-names></name> <name><surname>Strack</surname> <given-names>F.</given-names></name> <name><surname>Pfeiffer</surname> <given-names>T.</given-names></name></person-group> (<year>2000</year>). <article-title>Overcoming the inevitable anchoring effect: considering the opposite compensates for selective accessibility</article-title>. <source>Pers. Soc. Psychol. Bull.</source> <volume>26</volume>, <fpage>1142</fpage>&#x02013;<lpage>1150</lpage>.<pub-id pub-id-type="doi">10.1177/01461672002611010</pub-id></citation></ref>
<ref id="B26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nelson</surname> <given-names>T. O.</given-names></name></person-group> (<year>1996</year>). <article-title>Gamma is a measure of the accuracy of predicting performance on one item relative to another item, not of the absolute performance on an individual item</article-title>. <source>Appl. Cogn. Psychol.</source> <volume>10</volume>, <fpage>257</fpage>&#x02013;<lpage>260</lpage>.<pub-id pub-id-type="doi">10.1002/(SICI)1099-0720(199606)10:3&#x0003C;257::AID-ACP400&#x0003E;3.0.CO;2-9</pub-id></citation></ref>
<ref id="B27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nietfeld</surname> <given-names>J. L.</given-names></name> <name><surname>Cao</surname> <given-names>L.</given-names></name> <name><surname>Osborne</surname> <given-names>J. W.</given-names></name></person-group> (<year>2005</year>). <article-title>Metacognitive monitoring accuracy and student performance in the postsecondary classroom</article-title>. <source>J. Exp. Educ.</source> <volume>74</volume>, <fpage>7</fpage>&#x02013;<lpage>28</lpage>.</citation></ref>
<ref id="B28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nietfeld</surname> <given-names>J. L.</given-names></name> <name><surname>Cao</surname> <given-names>L.</given-names></name> <name><surname>Osborne</surname> <given-names>J. W.</given-names></name></person-group> (<year>2006</year>). <article-title>The effect of distributed monitoring exercises and feedback on performance, monitoring accuracy, and self-efficacy</article-title>. <source>Metacogn. Learn.</source> <volume>1</volume>, <fpage>159</fpage>&#x02013;<lpage>179</lpage>.<pub-id pub-id-type="doi">10.1007/s10409-006-9595-6</pub-id></citation></ref>
<ref id="B29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pressley</surname> <given-names>M.</given-names></name> <name><surname>Ghatala</surname> <given-names>E. S.</given-names></name></person-group> (<year>1990</year>). <article-title>Self-regulated learning: monitoring learning from text</article-title>. <source>Educ. Psychol.</source> <volume>25</volume>, <fpage>19</fpage>&#x02013;<lpage>33</lpage>.<pub-id pub-id-type="doi">10.1207/s15326985ep2501_3</pub-id></citation></ref>
<ref id="B30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scheck</surname> <given-names>P.</given-names></name> <name><surname>Meeter</surname> <given-names>M.</given-names></name> <name><surname>Nelson</surname> <given-names>T. O.</given-names></name></person-group> (<year>2004</year>). <article-title>Anchoring effects in the absolute accuracy of immediate versus delayed judgments of learning</article-title>. <source>J. Mem. Lang.</source> <volume>51</volume>, <fpage>71</fpage>&#x02013;<lpage>79</lpage>.<pub-id pub-id-type="doi">10.1016/j.jml.2004.03.004</pub-id></citation></ref>
<ref id="B31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scheck</surname> <given-names>P.</given-names></name> <name><surname>Nelson</surname> <given-names>T. O.</given-names></name></person-group> (<year>2005</year>). <article-title>Lack of pervasiveness of the under confidence-with-practice effect: boundary conditions and an explanation via anchoring</article-title>. <source>J. Exp. Psychol. Gen.</source> <volume>134</volume>, <fpage>124</fpage>&#x02013;<lpage>128</lpage>.<pub-id pub-id-type="doi">10.1037/0096-3445.134.1.124</pub-id><pub-id pub-id-type="pmid">15702968</pub-id></citation></ref>
<ref id="B32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schraw</surname> <given-names>G.</given-names></name> <name><surname>Potenza</surname> <given-names>M. T.</given-names></name> <name><surname>Nebelsick-Gullet</surname> <given-names>L.</given-names></name></person-group> (<year>1993</year>). <article-title>Constraints on the calibration of performance</article-title>. <source>Contemp. Educ. Psychol.</source> <volume>18</volume>, <fpage>455</fpage>&#x02013;<lpage>463</lpage>.<pub-id pub-id-type="doi">10.1006/ceps.1993.1034</pub-id></citation></ref>
<ref id="B33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schunk</surname> <given-names>D.</given-names></name> <name><surname>Zimmerman</surname> <given-names>B. J.</given-names></name></person-group> (<year>1997</year>). <article-title>Social origins of self-regulatory competence</article-title>. <source>Educ. Psychol.</source> <volume>32</volume>, <fpage>195</fpage>&#x02013;<lpage>208</lpage>.<pub-id pub-id-type="doi">10.1207/s15326985ep3204_1</pub-id></citation></ref>
<ref id="B34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tversky</surname> <given-names>A.</given-names></name> <name><surname>Kahneman</surname> <given-names>D.</given-names></name></person-group> (<year>1974</year>). <article-title>Judgment under uncertainty: heuristics and biases</article-title>. <source>Science</source> <volume>185</volume>, <fpage>1124</fpage>&#x02013;<lpage>1131</lpage>.<pub-id pub-id-type="doi">10.1126/science.185.4157.1124</pub-id><pub-id pub-id-type="pmid">17835457</pub-id></citation></ref>
<ref id="B35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Winne</surname> <given-names>P. H.</given-names></name></person-group> (<year>2010</year>). <article-title>Improving measurements of self-regulated learning</article-title>. <source>Educ. Psychol.</source> <volume>45</volume>, <fpage>267</fpage>&#x02013;<lpage>276</lpage>.<pub-id pub-id-type="doi">10.1080/00461520.2010.517150</pub-id></citation></ref>
<ref id="B36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>Q.</given-names></name> <name><surname>Linderholm</surname> <given-names>T.</given-names></name></person-group> (<year>2011</year>). <article-title>Anchoring effects on prospective and retrospective metacomprehension judgments as a function of peer performance information</article-title>. <source>Metacogn. Learn.</source> <volume>6</volume>, <fpage>25</fpage>&#x02013;<lpage>43</lpage>.<pub-id pub-id-type="doi">10.1007/s11409-010-9065-1</pub-id></citation></ref>
<ref id="B37"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Zimmerman</surname> <given-names>G. J.</given-names></name></person-group> (<year>2000</year>). <article-title>&#x0201C;Attaining self-regulation: a social cognitive perspective,&#x0201D;</article-title> in <source>Handbook of Self-Regulation</source>, eds <person-group person-group-type="editor"><name><surname>Boekaerts</surname> <given-names>M.</given-names></name> <name><surname>Pintrich</surname> <given-names>P. R.</given-names></name> <name><surname>Zinder</surname> <given-names>M.</given-names></name></person-group> (<publisher-loc>San Diego</publisher-loc>: <publisher-name>Academic Press</publisher-name>), <fpage>13</fpage>&#x02013;<lpage>39</lpage>.</citation></ref>
<ref id="B38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zimmerman</surname> <given-names>B. J.</given-names></name></person-group> (<year>2008</year>). <article-title>Investigating self-regulation and motivation: historical background, methodological developments, and future prospects</article-title>. <source>Am. Educ. Res. J.</source> <volume>45</volume>, <fpage>166</fpage>&#x02013;<lpage>118</lpage>.<pub-id pub-id-type="doi">10.3102/0002831207312909</pub-id></citation></ref>
<ref id="B39"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Zimmerman</surname> <given-names>B. J.</given-names></name> <name><surname>Moylan</surname> <given-names>A. R.</given-names></name></person-group> (<year>2009</year>). <article-title>&#x0201C;Self-regulation: where metacognition and motivation intersect,&#x0201D;</article-title> in <source>Handbook of Metacognition in Education</source>, eds <person-group person-group-type="editor"><name><surname>Hacker</surname> <given-names>D. J.</given-names></name> <name><surname>Dunlosky</surname> <given-names>J.</given-names></name> <name><surname>Graesser</surname> <given-names>A. C.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Routledge</publisher-name>), <fpage>299</fpage>&#x02013;<lpage>315</lpage>.</citation></ref>
</ref-list>
</back>
</article>