<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2023.1116386</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>The effect of level-marked mathematics tasks on students&#x2019; self-efficacy: An experimental study</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Herset</surname>
<given-names>Maria</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<xref rid="c001" ref-type="corresp"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/2127237/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>El Ghami</surname>
<given-names>Mohamed</given-names>
</name>
<xref rid="aff1" ref-type="aff"><sup>1</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/2117613/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Bjerke</surname>
<given-names>Annette Hessen</given-names>
</name>
<xref rid="aff2" ref-type="aff"><sup>2</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/1929818/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Faculty of Education and Art, Nord University</institution>, <addr-line>Nesna</addr-line>, <country>Norway</country></aff>
<aff id="aff2"><sup>2</sup><institution>Faculty of Education and International Studies, OsloMet</institution>, <addr-line>Oslo</addr-line>, <country>Norway</country></aff>
<author-notes>
<fn id="fn0001" fn-type="edited-by"><p>Edited by: Yusuf F. Zakariya, University of Agder, Norway</p></fn>
<fn id="fn0002" fn-type="edited-by"><p>Reviewed by: Francisco Alegre, University of Jaume I, Spain; Maria M. Nascimento, University of Tr&#x00E1;s-os-Montes e Alto Douro, Portugal; Ioannis Georgakopoulos, University of West Attica, Greece</p></fn>
<corresp id="c001">&#x002A;Correspondence: Maria Herset, <email>maria.herset@nord.no</email></corresp>
<fn id="fn0003" fn-type="other"><p>This article was submitted to Educational Psychology, a section of the journal Frontiers in Psychology</p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>17</day>
<month>03</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1116386</elocation-id>
<history>
<date date-type="received">
<day>05</day>
<month>12</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>01</day>
<month>03</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2023 Herset, El Ghami and Bjerke.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Herset, El Ghami and Bjerke</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>This study investigates whether and to what extent students&#x2019; self-efficacy in mathematics is affected by level-marked mathematics tasks. An online survey with an experimental design was used to collect data from lower secondary school students in Norway (<italic>n</italic>&#x2009;=&#x2009;436). The effect of level-marked mathematics tasks was measured by comparing students&#x2019; responses to tasks with no level marking with their responses to the same tasks marked as being easy, medium or difficult. The study&#x2019;s design was set up carefully, featuring experimental and control groups. A Wilcoxon test showed a significant gap in students&#x2019; self-efficacy when approaching the same tasks without level marking and with difficult-level marking. In addition, a Friedman test showed that the gap between students&#x2019; self-efficacy when encountering the same task with and without level marking expanded significantly with increasing difficulty markings. This result has implications for students in terms of their mathematics learning and for mathematics teachers in terms of their future differentiation initiatives.</p>
</abstract>
<kwd-group>
<kwd>self-efficacy</kwd>
<kwd>differentiated instruction</kwd>
<kwd>tiered lessons</kwd>
<kwd>mathematics tasks</kwd>
<kwd>mathematics education</kwd>
<kwd>textbook in mathematics</kwd>
</kwd-group>
<counts>
<fig-count count="3"/>
<table-count count="6"/>
<equation-count count="0"/>
<ref-count count="51"/>
<page-count count="10"/>
<word-count count="7917"/>
</counts>
</article-meta>
</front>
<body>
<sec id="sec1" sec-type="intro">
<label>1.</label>
<title>Introduction</title>
<p>The question of how to ensure high-quality mathematics experiences for all students that specifically meet their individual needs challenges teachers around the world. This challenge calls for differentiating initiatives that provide &#x201C;equal opportunities to participate, and engage&#x201D; (<xref ref-type="bibr" rid="ref7">Christenson and Wager, 2012</xref>, p. 194). The purpose of differentiation is to tailor instruction so that there are &#x201C;multiple options for taking in information&#x201D; (<xref ref-type="bibr" rid="ref47">Tomlinson, 2001</xref>, p. 1) to achieve an optimal learning experience and to improve self-efficacy in students (<xref ref-type="bibr" rid="ref24">Mathiassen, 2009</xref>; <xref ref-type="bibr" rid="ref28">NOU, 2016</xref>, p. 62).</p>
<p>According to <xref ref-type="bibr" rid="ref47">Tomlinson (2001)</xref>, there is a need to differentiate instruction in terms of content (what students learn), process (how they make sense of ideas and information), and product (how students demonstrate what they have learned). Here, we focus on differentiated instruction based on content and readiness by using level-marked mathematics tasks, as in tiered teaching (<xref ref-type="bibr" rid="ref34">Pierce and Adams, 2005</xref>). We know that level-marked tasks feature in mathematics teachers&#x2019; accounts of their teaching (<xref ref-type="bibr" rid="ref4">Br&#x00E4;ndstr&#x00F6;m, 2005</xref>; <xref ref-type="bibr" rid="ref9">Czegl&#x00E9;dy and Sz&#x00E1;sz, 2005</xref>; <xref ref-type="bibr" rid="ref11">Eriksen et al., 2022</xref>) and are used extensively as differentiation initiatives in mathematics classrooms (<xref ref-type="bibr" rid="ref15">Grave and Pepin, 2015</xref>). In this regard, many mathematics textbooks have a system for marking the difficulty of tasks to help students &#x201C;find their way&#x201D; through them (<xref ref-type="bibr" rid="ref20">Imsen, 2020</xref>, p. 421). Mathematics textbooks have long held a strong position as the main resource for planning and executing the teaching of mathematics (<xref ref-type="bibr" rid="ref36">Robitaille and Travers, 1992</xref>; <xref ref-type="bibr" rid="ref18">Howson, 1995</xref>; <xref ref-type="bibr" rid="ref41">Stein et al., 2007</xref>; <xref ref-type="bibr" rid="ref21">Jablonka and Johansson, 2010</xref>) and recent studies have confirmed their persistent use (<xref ref-type="bibr" rid="ref10">Dolonen et al., 2016</xref>).</p>
<p>An appropriate level of difficulty in mathematics is important for ensuring mastery experiences for students, and it is therefore necessary for textbooks and teachers to take differentiated instruction into account (<xref ref-type="bibr" rid="ref39">Skaalvik and Fossen, 1995</xref>). However, there is a need to examine the interaction between students&#x2019; self-efficacy and teachers&#x2019; differentiation initiatives more closely (<xref ref-type="bibr" rid="ref16">Herset, 2014</xref>; <xref ref-type="bibr" rid="ref26">McNeill and Polly, 2023</xref>; <xref ref-type="bibr" rid="ref17">Herset and El Ghami, 2022</xref>). To the best of our knowledge, no research has reported on how the extensive use of level-marked tasks affects students&#x2019; mathematics self-efficacy. Hence, since self-efficacy &#x2013; a person&#x2019;s &#x201C;beliefs in one&#x2019;s capabilities to organise and execute the courses of action required to produce given attainments&#x201D; (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>, p. 3) &#x2013; is a future-oriented construct that correlates with achievement (<xref ref-type="bibr" rid="ref32">Pajares and Miller, 1995</xref>; <xref ref-type="bibr" rid="ref30">Pajares, 1996</xref>), we aimed to report the results from a Norwegian study investigating the effects of level-marked mathematics tasks on students&#x2019; mathematics self-efficacy. While previous research has focused on students&#x2019; changes in self-efficacy over time, making it hard to say exactly why these changes took place (<xref ref-type="bibr" rid="ref44">Street et al., 2022a</xref>), the current study investigates how self-efficacy is affected by level-marked tasks within a short time span (allowing no other factors to influence their change in self-efficacy, if present). In this way, this paper sheds new light on tiered teaching according to readiness.</p>
</sec>
<sec id="sec2">
<label>2.</label>
<title>Theoretical framework and research question</title>
<p>Before we examine previous research on differentiated instruction in mathematics and the role of the textbook and its use of level-marked tasks, we begin this section by providing a more detailed account of self-efficacy, its sources and its importance for mathematics learning in individuals.</p>
<sec id="sec3">
<label>2.1.</label>
<title>Self-efficacy beliefs</title>
<p>According to <xref ref-type="bibr" rid="ref1">Bandura (1997)</xref>, self-efficacy beliefs differ in level, strength and generality. <italic>Level</italic> refers to whether a person perceives a given task as easy or difficult, and is a personal opinion that affects one&#x2019;s choice of task or activity, one&#x2019;s effort and one&#x2019;s persistence (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>). People with low self-efficacy for accomplishing a task may avoid the task, while a more efficacious person will persist longer when encountering difficulties, with more motivation to prepare for and put effort into completing the task at hand (<xref ref-type="bibr" rid="ref37">Schunk, 1991</xref>). <xref ref-type="bibr" rid="ref43">Street et al. (2017)</xref> claimed that students&#x2019; perceptions of difficulty levels differ and may not reflect the actual difficulty of the task. How students perceive task difficulty is important because this perception affects their self-efficacy (<xref ref-type="bibr" rid="ref6">Chen and Zimmerman, 2007</xref>; <xref ref-type="bibr" rid="ref45">Street et al., 2022b</xref>).</p>
<p>Self-efficacy can also vary in <italic>strength</italic>, revealing how strong a person&#x2019;s beliefs are that they can complete a given task, and <italic>generality</italic>, which refers to a person&#x2019;s breadth of knowledge and mastery of various topics. <xref ref-type="bibr" rid="ref1">Bandura (1997)</xref> therefore distinguished between specific self-efficacy and general self-efficacy, as self-efficacy can vary depending on the specific task, theme or subject. This was also supported by <xref ref-type="bibr" rid="ref44">Street et al.&#x2019;s (2022a)</xref> study, in which students&#x2019; self-efficacy in geometry and algebra differed. In this paper, we are mostly concerned with measuring the strength of students&#x2019; self-efficacy, while also revealing some aspects of their level of self-efficacy, as the two constructs are clearly related (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>).</p>
<p><xref ref-type="bibr" rid="ref1">Bandura (1997)</xref> proposed four sources as crucial in fostering self-efficacy in individuals. <italic>Mastery experience</italic>, which is about interpreting the results of one&#x2019;s own previous attainment, was considered by <xref ref-type="bibr" rid="ref1">Bandura (1997)</xref> to be the most powerful source, a statement repeatedly confirmed and reported in a growing body of research (e.g., <xref ref-type="bibr" rid="ref42">Stevens et al., 2006</xref>; <xref ref-type="bibr" rid="ref48">Usher and Pajares, 2009</xref>; <xref ref-type="bibr" rid="ref22">Jo&#x00EB;t et al., 2011</xref>; <xref ref-type="bibr" rid="ref5">Butz and Usher, 2015</xref>). Mastery experiences have been found necessary for students to develop and preserve expectations of mastery (<xref ref-type="bibr" rid="ref40">Skaalvik and Skaalvik, 2018</xref>, p. 197). <italic>Vicarious experience</italic> is derived from observing others performing a task, which is important in building self-efficacy beliefs in individuals (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>). In mathematics, if students watch others who are similar to them, such as classmates, accomplishing a difficult task, it may convince them that they are able to succeed as well (<xref ref-type="bibr" rid="ref37">Schunk, 1991</xref>). However, previous research has shown contradictory results when it comes to the relationship between self-efficacy and vicarious experience; for example, <xref ref-type="bibr" rid="ref22">Jo&#x00EB;t et al. (2011)</xref> found no significant correlation between vicarious experience and self-efficacy, while <xref ref-type="bibr" rid="ref48">Usher and Pajares (2009)</xref> suggested the opposite. What seems to be uncontested is that information obtained vicariously typically has a weaker effect on self-efficacy than students&#x2019; own performance-based information (<xref ref-type="bibr" rid="ref37">Schunk, 1991</xref>).</p>
<p>The third source, <italic>social persuasion</italic>, involves evaluative feedback from others and is based on the assumption that encouragement from others can enhance students&#x2019; beliefs in their capability to perform a given task at a certain level (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>). Several studies have shown a significant correlation between self-efficacy and social persuasion (e.g., <xref ref-type="bibr" rid="ref42">Stevens et al., 2006</xref>; <xref ref-type="bibr" rid="ref48">Usher and Pajares, 2009</xref>; <xref ref-type="bibr" rid="ref22">Jo&#x00EB;t et al., 2011</xref>), but this source&#x2019;s contribution to enhanced self-efficacy has been found to be temporary if a subsequent effort leads to poor results (<xref ref-type="bibr" rid="ref37">Schunk, 1991</xref>). In light of social persuasion&#x2019;s limited ability to create enduring improvements in self-efficacy, <xref ref-type="bibr" rid="ref1">Bandura (1997)</xref> viewed it as a comparatively weak source. The final source, <italic>physiological and affective states</italic>, refers to the influence of anxiety, mood, stress and fatigue on self-efficacy beliefs (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>). For example, students with high anxiety levels may undermine their beliefs about their own abilities. Previous studies vary in their reports on the relationship between physiological and affective states and self-efficacy; for example, <xref ref-type="bibr" rid="ref42">Stevens et al. (2006)</xref> and <xref ref-type="bibr" rid="ref48">Usher and Pajares (2009)</xref> found significant correlations, while <xref ref-type="bibr" rid="ref22">Jo&#x00EB;t et al. (2011)</xref> did not. <xref ref-type="bibr" rid="ref1">Bandura (1997)</xref> viewed this particular source of self-efficacy information as the least influential, as it does not reliably diagnose capability.</p>
<p>According to <xref ref-type="bibr" rid="ref1">Bandura (1997)</xref>, self-efficacy is important because it influences motivational, decisional, cognitive and emotional processes. He asserted that a person with high self-efficacy would think more strategically and optimistically than a person with low self-efficacy. In addition, he found that self-efficacy influenced people&#x2019;s choices, realisation of accomplishments, levels of stress and depression, effort, persistence, goals and achievement (<xref ref-type="bibr" rid="ref2">Bandura, 2006</xref>). This has also been found in the body of literature reporting on self-efficacy in the context of learning mathematics, in which self-efficacy may influence task choice, effort, persistence, self-evaluation, resilience and achievement (<xref ref-type="bibr" rid="ref51">Zimmerman and Martinez-Pons, 1990</xref>; <xref ref-type="bibr" rid="ref32">Pajares and Miller, 1995</xref>; <xref ref-type="bibr" rid="ref30">Pajares, 1996</xref>; <xref ref-type="bibr" rid="ref35">Ramdass and Zimmerman, 2008</xref>; <xref ref-type="bibr" rid="ref38">Schunk and Mullen, 2012</xref>; <xref ref-type="bibr" rid="ref50">Zakariya, 2021</xref>), and is an even better predictor of achievement when students are accurate in judging their self-efficacy (<xref ref-type="bibr" rid="ref6">Chen and Zimmerman, 2007</xref>).</p>
<p>When measuring self-efficacy, it is important to measure self-efficacy close in time to the given task (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>). Moreover, <xref ref-type="bibr" rid="ref2">Bandura (2006)</xref> recommended not using a &#x2018;one-measure-fits-all&#x2019; approach since it is often too general, but rather, to measure perceived self-efficacy as tailored to the object of interest. This is supported by several researchers who claim that, to increase prediction, measuring self-efficacy should be task-specific and measured before the task is performed (<xref ref-type="bibr" rid="ref32">Pajares and Miller, 1995</xref>; <xref ref-type="bibr" rid="ref49">Zakariya et al., 2019</xref>). While taking all these considerations into account, additionally, since mathematics self-efficacy is concerned with perceived capability, in the current study, we use the phrase &#x201C;can do&#x201D; instead of &#x201C;will do&#x201D;, as recommended by <xref ref-type="bibr" rid="ref2">Bandura (2006)</xref>. <xref ref-type="bibr" rid="ref1">Bandura (1997)</xref> pointed out that &#x201C;will&#x201D; is about intention and is not a measure of a person&#x2019;s judgement of their capabilities.</p>
</sec>
<sec id="sec4">
<label>2.2.</label>
<title>Differentiation in mathematics textbooks</title>
<p>As discussed in the introduction, mathematics textbooks hold a strong position as the main resource for planning and executing mathematics teaching (<xref ref-type="bibr" rid="ref36">Robitaille and Travers, 1992</xref>; <xref ref-type="bibr" rid="ref18">Howson, 1995</xref>; <xref ref-type="bibr" rid="ref41">Stein et al., 2007</xref>; <xref ref-type="bibr" rid="ref21">Jablonka and Johansson, 2010</xref>; <xref ref-type="bibr" rid="ref10">Dolonen et al., 2016</xref>) and are known to be extensively used in mathematics education across the world (<xref ref-type="bibr" rid="ref14">Glasnovi&#x0107; Gracin, 2014</xref>). For example, in <xref ref-type="bibr" rid="ref13">Glasnovic Gracin&#x2019;s (2011)</xref> study, textbooks were found to have an important place in mathematics teaching and learning in lower secondary education; teachers used them extensively to prepare lessons, both for using the methodology presented and as the main source for students&#x2019; practice. However, another example from a study investigating education in Estonia, Finland and Norway indicated that &#x201C;almost 45% of the teachers use the textbook simply as an exercise book&#x201D; (<xref ref-type="bibr" rid="ref23">Lepik et al., 2015</xref>, p. 129). These findings, in combination with the need for differentiation initiatives in mathematics teaching (<xref ref-type="bibr" rid="ref47">Tomlinson, 2001</xref>), highlight the need to investigate differentiation in textbooks to determine whether they are doing the job.</p>
<p>Differentiation in mathematics varies between countries (<xref ref-type="bibr" rid="ref33">Pepin and Haggerty, 2003</xref>; <xref ref-type="bibr" rid="ref19">Howson, 2013</xref>). A comparative study of mathematics textbook use conducted by <xref ref-type="bibr" rid="ref33">Pepin and Haggerty (2003)</xref> revealed how France, England and Germany approached differentiation differently. In France, teachers used the same textbook for all students of the same age. While the content of the lessons was the same, the tasks were differentiated, and the teachers were responsible for selecting tasks from the textbook for the different students according to their abilities. In England, students were divided into three groups according to ability; each group had their own books, with tasks adjusted to their level. In Germany, students were grouped into different school types based on their prior achievements in school. Approaches also varied between school types, as textbooks were used as a framework and support for learning in low-achieving students but were used to a lesser extent amongst high-achieving students. Accordingly, <xref ref-type="bibr" rid="ref33">Pepin and Haggerty (2003)</xref> found that concerns related to differentiation differed amongst the three countries.</p>
<p>Similarly, <xref ref-type="bibr" rid="ref23">Lepik et al. (2015</xref>, p. 142) found that textbooks were used quite differently in Estonia, Finland and Norway based on how teachers saw their endeavour to differentiate; in Norway, 64% of the teachers agreed that the tasks in the textbook were adapted to both weak and strong students, while only half of the Estonian teachers and 46% of the Finnish teachers agreed with this statement. <xref ref-type="bibr" rid="ref4">Br&#x00E4;ndstr&#x00F6;m (2005)</xref> also reported on the use of mathematics textbooks in Sweden and found that the textbooks themselves seemed to guide the differentiation. Students often started on the same page, which described the theory and presented a set of tasks, and then undertook a diagnostic test before being divided into different levels based on the results of the test. In summary, even if textbooks&#x2019; structures and teachers&#x2019; use of textbooks differ between countries, textbooks consistently play a significant role in differentiation initiatives. The body of literature seems to support <xref ref-type="bibr" rid="ref9">Czegl&#x00E9;dy and Sz&#x00E1;sz (2005)</xref>, who asserted that the appropriate use of textbooks supports differentiation.</p>
<p>In line with <xref ref-type="bibr" rid="ref14">Glasnovi&#x0107; Gracin (2014)</xref>, who drew attention to the need for research on the content and structure of textbooks, we were unable to find research reporting on the composition of textbooks and the distribution of different content components (such as the proportion of level-marked tasks). Therefore, knowing that selecting tasks is an essential part of teachers&#x2019; interactions with mathematics textbooks (<xref ref-type="bibr" rid="ref25">Matic and Glasnovic Gracin, 2016</xref>), the first author of this paper took a closer look at the three most commonly used lower secondary mathematics textbooks in Norway (<xref ref-type="bibr" rid="ref46">Tesfamicael and Lundeby, 2019</xref>) and found that between 60% and 98% of the tasks in these textbooks were level-marked tasks. While this study was conducted more out of curiosity than for the purpose of research, the high proportion of level-marked tasks suggests that they are worthy of further investigation.</p>
<p>In this paper, we aim to investigate whether the use of level-marked tasks as a differentiation initiative affects students&#x2019; beliefs about their ability to accomplish a given task. Against this backdrop, this paper advances the following research question: To what extent does the level marking of mathematics tasks affect students&#x2019; self-efficacy?</p>
</sec>
</sec>
<sec id="sec5" sec-type="materials|methods">
<label>3.</label>
<title>Materials and methods</title>
<p>To investigate the effect of level-marked tasks on students&#x2019; self-efficacy, an online survey with a complex design was developed by the first author for a larger research project. The purpose of the larger project was to investigate the effect of level-marked tasks on students&#x2019; self-efficacy and to explore whether and how level marking affects motivational, decisional, cognitive and/or emotional processes. Hence, 11 tasks from the topic &#x201C;arithmetic and algebra&#x201D; formed the basis for an online survey. Of these, nine were retrieved from a national test in mathematics (<xref ref-type="bibr" rid="ref27">Norwegian Directorate for Education and Training, n.d.</xref>), one was chosen, with some adjustment, from a mathematics website (<xref ref-type="bibr" rid="ref29">Omtvei, n.d.</xref>), and one unsolvable task was created by the first author of this paper. The difficulty level of task A-I follows from the national test, and the difficulty level of Task J was marked as &#x201C;hard&#x201D; since only 17% of the students in a pilot study solved it correctly (<xref ref-type="bibr" rid="ref17">Herset and El Ghami, 2022</xref>). <xref rid="fig1" ref-type="fig">Figure 1</xref> illustrates the difficulty level of each of the 11 included tasks.</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption>
<p>Difficulty levels for the 11 tasks in the larger project.</p>
</caption>
<graphic xlink:href="fpsyg-14-1116386-g001.tif"/>
</fig>
<sec id="sec6">
<label>3.1.</label>
<title>Selected tasks and design</title>
<p>To answer the research question in this paper, we analysed the responses given to Tasks A&#x2013;C. They were chosen because they are similar in terms of difficulty level, topic and word length. This similarity is important when comparing students&#x2019; self-efficacy between tasks. To avoid a floor or ceiling effect (<xref ref-type="bibr" rid="ref12">Everitt, 2002</xref>), it was important to choose tasks at an appropriate level&#x2014;that is, tasks that were not too difficult or too easy. According to <xref ref-type="bibr" rid="ref3">Bj&#x00F6;rnsson (2016)</xref>, 70% of Norwegian students are within the range of mastery levels 3&#x2013;5, and 10% of students are at mastery level 1 in the national test in mathematics. For this reason, we chose tasks at mastery level 2 (Tasks A&#x2013;C) for this study. The tasks are shown in <xref rid="tab1" ref-type="table">Table 1</xref>. The students were asked to read the task and respond to the question, &#x201C;How certain are you that you can solve this problem correctly?,&#x201D; using a 100-point scale ranging from &#x201C;Not certain at all&#x201D; (0) to &#x201C;Absolutely certain&#x201D; (100), as recommended in the literature (<xref ref-type="bibr" rid="ref31">Pajares et al., 2001</xref>; <xref ref-type="bibr" rid="ref2">Bandura, 2006</xref>; <xref ref-type="bibr" rid="ref49">Zakariya, 2019</xref>).</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption>
<p>The three selected tasks (authors&#x2019; translation).</p>
</caption>
<table frame="hsides" rules="groups">
<tbody>
<tr>
<td align="left" valign="top">Task A</td>
<td align="left" valign="top">In <italic>Barcelona</italic>, you find the not-yet-completed church known as the Sagrada Fam&#x00ED;lia. They started building it in 1882, and it is supposed to be finished in 2026. How many years do they expect it will take to build the Sagrada Fam&#x00ED;lia?</td>
</tr>
<tr>
<td align="left" valign="top">Task B</td>
<td align="left" valign="top"><italic>Rita</italic> is on holiday in <italic>Greece</italic>. She wants to rent a scooter. It costs NOK 25 per 5&#x2009;min. How much does it cost to rent the scooter for 1&#x2009;h?</td>
</tr>
<tr>
<td align="left" valign="top">Task C</td>
<td align="left" valign="top"><italic>Silja</italic> wants to take a swimming test. To do that, she has to swim 200&#x2009;m without taking a break. The length of the pool is 12.5&#x2009;m. How many lengths does <italic>Silja</italic> have to swim?</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Because we utilised only selected parts of the collected data here, we describe only the aspects of the online survey that enabled us to gather these data. When students signed in to the survey, they were randomly assigned to one of four groups: the control group (CG) or to one of three experimental groups (EG<sub>i</sub>, i&#x2009;=&#x2009;1, 2, 3). Once assigned to a group, the students received two sets of tasks (see <xref rid="fig2" ref-type="fig">Figure 2</xref>). Set 1 was identical for all four groups, while Set 2 was different in terms of the labelling of the tasks (and are labelled 2<sub>a</sub>, 2<sub>b</sub>, 2<sub>c</sub>, and 2<sub>d</sub> accordingly).</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption>
<p>Outline of the design.</p>
</caption>
<graphic xlink:href="fpsyg-14-1116386-g002.tif"/>
</fig>
<p>In Set 1, none of the tasks were level marked. This was true for all four groups. In Set 2<sub>a</sub>, CG participants received Tasks A&#x2013;C again and none of the tasks were level marked. In Sets 2<sub>b</sub>, 2<sub>c</sub>, and 2<sub>d</sub>, the students were presented with Tasks A&#x2013;C again, but this time they were marked as &#x201C;easy,&#x201D; &#x201C;medium&#x201D; and &#x201C;difficult,&#x201D; and the marking changed between groups (see <xref rid="fig2" ref-type="fig">Figure 2</xref>). In all four editions of Set 2, to avoid the tasks being identical to Set 1, the words in italics in <xref rid="tab1" ref-type="table">Table 1</xref> were replaced to give the tasks a new &#x201C;outlook&#x201D; (e.g., in Task B, <italic>Rita</italic> was replaced with <italic>Alex</italic>, <italic>Greece</italic> was replaced with <italic>France</italic>, and <italic>she</italic> was replaced with <italic>he</italic>). As shown in <xref rid="fig2" ref-type="fig">Figure 2</xref>, we marked the tasks in Set 2 with an apostrophe (A&#x2032;, B&#x2032; and C&#x2032;) to illustrate that they got a new &#x201C;outlook&#x201D; without changing the content.</p>
<p>To clarify the design, <xref rid="fig3" ref-type="fig">Figure 3</xref> shows an example of how Task C appeared for EG<sub>1</sub> in Set 1 and Set 2<sub>b</sub>. As shown, everything appears similar apart from the names (&#x201C;Daria&#x201D; and &#x201C;Silja&#x201D;) and in addition, in Set 2<sub>b</sub>, Task C&#x2032; is marked as &#x201C;difficult&#x201D;.</p>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption>
<p>An example of how Task C appears for EG<sub>1</sub> (translated by the first author).</p>
</caption>
<graphic xlink:href="fpsyg-14-1116386-g003.tif"/>
</fig>
<p>In this study, following <xref ref-type="bibr" rid="ref8">Cohen et al. (2018)</xref>, we viewed reliability as equivalence, consistency and stability. The design of our study enabled a comparison between how students responded to similar tasks, and even the same task, with and without level markings. The CG was included for reliability purposes only, and Wilcoxon tests revealed no significant change in self-efficacy scores between Set 1 and Set 2<sub>a</sub> (both sets without level-marked tasks) for any of the tasks; thus, reliability as equivalence was considered to have been achieved. Reliability as consistency was tested in the CG, where a Friedman repeated test showed no significant difference (which was exactly what we wanted) when comparing students&#x2019; difference in self-efficacy (Set 2 &#x2013; Set 1) between each of the three tasks A&#x2013;C. We did not use the instrument repeatedly over time, so stability was not evaluated, which could be considered a limitation of our cross-sectional study.</p>
</sec>
<sec id="sec7">
<label>3.2.</label>
<title>Participants</title>
<p>Since the population is large and widely dispersed, we used cluster sampling (<xref ref-type="bibr" rid="ref8">Cohen et al., 2018</xref>). After the first author had randomly chosen schools across Norway to participate in this study, students in grades 8 and 9 (i.e., aged 13&#x2013;15&#x2009;years) were recruited by first contacting the chosen schools&#x2019; principals. If they were willing to participate, they encouraged the school&#x2019;s mathematics teachers to facilitate their students&#x2019; participation. Because of COVID-19, some of the randomly chosen schools were not able to participate and were replaced by other schools. The students responded to the survey during class, and the teachers made sure that the data were collected following a set of predetermined instructions (e.g., students shall not collaborate) and that ethical guidelines were followed (e.g., no student shall feel obligated to participate).</p>
<p>An analysis of missing patterns suggested that some of the data were incomplete or monotone, indicating that participants had skipped items; hence, 84 responses were removed. In addition, three response strings were detected as outliers, of which two were deleted because of extreme values and the third was removed because the participant spent an unrealistic amount of time on the survey. The final sample used in this analysis included <italic>n</italic>&#x2009;=&#x2009;349 students, of which 172 (49.3%) were female and 177 (50.7%) were male, coming from 23 schools from all regions in Norway (47% from Northern Norway, 10% from Mid Norway, 9% from Western Norway, 4% from Southern Norway and 30% from Eastern Norway). The students were distributed as follows: <italic>n</italic>&#x2009;=&#x2009;90 in CG, <italic>n</italic>&#x2009;=&#x2009;94 in EG<sub>1</sub>, <italic>n</italic>&#x2009;=&#x2009;74 in EG<sub>2</sub>, <italic>n</italic>&#x2009;=&#x2009;91 in EG<sub>3</sub> (see <xref rid="fig2" ref-type="fig">Figure 2</xref>).</p>
</sec>
<sec id="sec8">
<label>3.3.</label>
<title>Statistical methods</title>
<p>In response to the call by <xref ref-type="bibr" rid="ref26">McNeill and Polly (2023)</xref> for more research examining the interaction between students&#x2019; self-efficacy and teachers&#x2019; differentiation initiatives, and in line with our research question, our data collection design enabled us to investigate both how and to what extent differentiation in the form of level-marked tasks affects students&#x2019; self-efficacy. The survey design allowed us to investigate how the different level markings of tasks affected students&#x2019; responses. Hence, we formulated the following two hypotheses:</p>
<disp-quote>
<p><italic>H1</italic>: There is a gap in students&#x2019; self-efficacy when approaching the same tasks with and without level marking.</p>
</disp-quote>
<disp-quote>
<p><italic>H2</italic>: The gap between students&#x2019; self-efficacy when encountering the same tasks with and without level marking expands with increasing difficulty markings.</p>
</disp-quote>
<p>The hypotheses are formulated in such a way that H2 makes sense only if our data support H1. To test H1 and H2, we merged all student responses to easy-marked tasks and did the same for medium-marked and difficult-marked tasks. H1 was tested by comparing the medians of students&#x2019; self-efficacy scores when receiving the same task with and without level marking. We used two-tailed test as suggested by <xref ref-type="bibr" rid="ref8">Cohen et al. (2018)</xref> because the non-directional hypothesis indicates only difference, and not whether self-efficacy would be positively or negatively affected by level-marked tasks. Because the data were nonparametric, we used a series of Wilcoxon tests. To test H2, we used the Friedman test to check whether the difference in students&#x2019; self-efficacy when receiving tasks with and without level markings was significantly different between easy-, medium- and difficult-marked tasks.</p>
<p>The overall project was given full ethics approval by the Norwegian Social Science Data Service, ensuring the interests of the participants. We are aware of the limitations of this study, which are mainly connected to the small sample size and skewed distribution of the participating schools across Norway. We are mindful of the limits on the generalisability of our results.</p>
</sec>
</sec>
<sec id="sec9" sec-type="results">
<label>4.</label>
<title>Results</title>
<p>Our research question and associated hypotheses were formulated on the basis of the reviewed literature. Taken together, if both hypotheses held, we would have an argument for the effect of level-marked mathematics tasks on students&#x2019; self-efficacy. Descriptive statistics related to tasks A, B and C are presented in <xref rid="tab2" ref-type="table">Table 2</xref>.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption>
<p>Students&#x2019; self-efficacy in Set 1 and Set 2.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th rowspan="2"/>
<th align="left" valign="top" rowspan="2">Set 1</th>
<th colspan="3"/>
<th align="left" valign="top" colspan="4">Set 2</th>
<th/>
</tr>
<tr>
<th align="center" valign="top">Median</th>
<th align="center" valign="top">Mean</th>
<th align="center" valign="top">SD</th>
<th align="center" valign="top">Median</th>
<th align="center" valign="top">Mean</th>
<th align="center" valign="top">SD</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="3">Task A</td>
<td align="left" valign="top">EG<sub>1</sub></td>
<td align="char" valign="top" char=".">100.00</td>
<td align="char" valign="top" char=".">87.89</td>
<td align="char" valign="top" char=".">19.94</td>
<td align="left" valign="top">EG<sub>1(Easy)</sub></td>
<td align="char" valign="top" char=".">99.50</td>
<td align="char" valign="top" char=".">85.11</td>
<td align="char" valign="top" char=".">21.48</td>
</tr>
<tr>
<td align="left" valign="top">EG<sub>2</sub></td>
<td align="char" valign="top" char=".">98.50</td>
<td align="char" valign="top" char=".">87.09</td>
<td align="char" valign="top" char=".">20.74</td>
<td align="left" valign="top">EG<sub>2(Medium)</sub></td>
<td align="char" valign="top" char=".">96.50</td>
<td align="char" valign="top" char=".">86.24</td>
<td align="char" valign="top" char=".">21.33</td>
</tr>
<tr>
<td align="left" valign="top">EG<sub>3</sub></td>
<td align="char" valign="top" char=".">97.00</td>
<td align="char" valign="top" char=".">85.76</td>
<td align="char" valign="top" char=".">21.48</td>
<td align="left" valign="top">EG<sub>3(Difficult)</sub></td>
<td align="char" valign="top" char=".">95.00</td>
<td align="char" valign="top" char=".">82.15</td>
<td align="char" valign="top" char=".">23.08</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Task B</td>
<td align="left" valign="top">EG<sub>3</sub></td>
<td align="char" valign="top" char=".">100.00</td>
<td align="char" valign="top" char=".">86.90</td>
<td align="char" valign="top" char=".">21.22</td>
<td align="left" valign="top">EG<sub>3(Easy)</sub></td>
<td align="char" valign="top" char=".">100.00</td>
<td align="char" valign="top" char=".">86.08</td>
<td align="char" valign="top" char=".">20.28</td>
</tr>
<tr>
<td align="left" valign="top">EG<sub>1</sub></td>
<td align="char" valign="top" char=".">97.50</td>
<td align="char" valign="top" char=".">86.26</td>
<td align="char" valign="top" char=".">19.56</td>
<td align="left" valign="top">EG<sub>1(Medium)</sub></td>
<td align="char" valign="top" char=".">94.50</td>
<td align="char" valign="top" char=".">83.53</td>
<td align="char" valign="top" char=".">20.28</td>
</tr>
<tr>
<td align="left" valign="top">EG<sub>2</sub></td>
<td align="char" valign="top" char=".">99.00</td>
<td align="char" valign="top" char=".">88.19</td>
<td align="char" valign="top" char=".">20.40</td>
<td align="left" valign="top">EG<sub>2(Difficult)</sub></td>
<td align="char" valign="top" char=".">95.00</td>
<td align="char" valign="top" char=".">84.32</td>
<td align="char" valign="top" char=".">23.00</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="3">Task C</td>
<td align="left" valign="top">EG<sub>2</sub></td>
<td align="char" valign="top" char=".">90.00</td>
<td align="char" valign="top" char=".">79.38</td>
<td align="char" valign="top" char=".">25.73</td>
<td align="left" valign="top">EG<sub>2(Easy)</sub></td>
<td align="char" valign="top" char=".">92.00</td>
<td align="char" valign="top" char=".">80.50</td>
<td align="char" valign="top" char=".">26.09</td>
</tr>
<tr>
<td align="left" valign="top">EG<sub>3</sub></td>
<td align="char" valign="top" char=".">88.00</td>
<td align="char" valign="top" char=".">80.65</td>
<td align="char" valign="top" char=".">20.91</td>
<td align="left" valign="top">EG<sub>3(Medium)</sub></td>
<td align="char" valign="top" char=".">90.00</td>
<td align="char" valign="top" char=".">80.54</td>
<td align="char" valign="top" char=".">22.31</td>
</tr>
<tr>
<td align="left" valign="top">EG<sub>1</sub></td>
<td align="char" valign="top" char=".">90.00</td>
<td align="char" valign="top" char=".">78.22</td>
<td align="char" valign="top" char=".">25.88</td>
<td align="left" valign="top">EG<sub>1(Difficult)</sub></td>
<td align="char" valign="top" char=".">80.00</td>
<td align="char" valign="top" char=".">73.25</td>
<td align="char" valign="top" char=".">27.16</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>EG<sub>1-3</sub>: Tasks without level marking (Set 1). EG<sub>1-3(easy/medium/difficult)</sub>: Tasks with level markings (Set 2).</p>
</table-wrap-foot>
</table-wrap>
<p>When comparing the two &#x201C;mean&#x201D; columns (columns 4 and 8) and the two &#x201C;median&#x201D; columns (columns 3 and 7) in <xref rid="tab2" ref-type="table">Table 2</xref>, we see how reported self-efficacy declines as tasks go from no level marking to being marked as difficult.</p>
<p>Because of the way in which this study was designed, all students in the EGs received the three similar tasks twice. This means that all students, regardless of which EG they were in, received three tasks in Set 2 with different level markings: easy, medium and difficult (see <xref rid="fig2" ref-type="fig">Figure 2</xref> in the methods section). Hence, we had 259 student responses (i.e., one response from each of the [94&#x2009;+&#x2009;74&#x2009;+&#x2009;91] students in all three EGs) to easy-marked tasks, medium-marked tasks and difficult-marked tasks. This enabled us, in hypothesis testing, to examine the differences in self-efficacy of the responses between no level marking and easy-level marking, between no level marking and medium-level marking, and between no level marking and difficult-level marking. We found that the effect of difficult-level marking was the largest, as illustrated in <xref rid="tab3" ref-type="table">Table 3</xref>.</p>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption>
<p>Mean difference in students&#x2019; self-efficacy between Set 2 (easy-, medium- and difficult-level marking) and Set 1 (without level marking).</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Set 2 &#x2013; Set 1</th>
<th align="center" valign="top">Mean difference in students&#x2019; self-efficacy</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Easy-level marking &#x2013; Without level marking</td>
<td align="char" valign="top" char=".">&#x2212;0.98</td>
</tr>
<tr>
<td align="left" valign="top">Medium-level marking &#x2013; Without level marking</td>
<td align="char" valign="top" char=".">&#x2212;1.27</td>
</tr>
<tr>
<td align="left" valign="top">Difficult-level marking &#x2013; Without level marking</td>
<td align="char" valign="top" char=".">&#x2212;4.18</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>As the same students&#x2019; in the EGs answered Sets 1 and 2, the sample is dependent, and the Wilcoxon test was used because the data were not normally distributed (<xref ref-type="bibr" rid="ref8">Cohen et al., 2018</xref>). As shown in <xref rid="tab4" ref-type="table">Table 4</xref>, a Wilcoxon test revealed that students&#x2019; self-efficacy was significantly lower when tasks were marked as difficult, <italic>z</italic>&#x2009;=&#x2009;&#x2212;4.033, <italic>p</italic>&#x2009;&#x003C;&#x2009;0.001. There was no significant difference between no level marking and medium-level marking (<italic>z</italic>&#x2009;=&#x2009;&#x2212;0.930, <italic>p</italic>&#x2009;=&#x2009;0.353) or between no level marking and easy-level marking (<italic>z</italic>&#x2009;=&#x2009;&#x2212;0.233, <italic>p</italic>&#x2009;=&#x2009;0.824).</p>
<table-wrap position="float" id="tab4">
<label>Table 4</label>
<caption>
<p>Wilcoxon test of the difference in students&#x2019; self-efficacy (Set 2&#x2013;Set 1).</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th/>
<th/>
<th align="center" valign="top"><italic>N</italic></th>
<th align="center" valign="top">Mean rank</th>
<th align="center" valign="top">Sum of ranks</th>
<th align="center" valign="top"><italic>Z</italic></th>
<th align="center" valign="top">Two-tailed value of <italic>p</italic></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="4">Set 2 (easy-level marking) &#x2013; Set 1 (without level marking)</td>
<td align="left" valign="top">Negative ranks</td>
<td align="center" valign="top">61<xref rid="tfn1" ref-type="table-fn"><sup>a</sup></xref></td>
<td align="char" valign="top" char=".">74.66</td>
<td align="char" valign="top" char=".">4554.50</td>
<td align="char" valign="top" char=".">&#x2212;0.233<xref rid="tfn4" ref-type="table-fn"><sup>d</sup></xref></td>
<td align="char" valign="top" char=".">0.824</td>
</tr>
<tr>
<td align="left" valign="top">Positive ranks</td>
<td align="center" valign="top">72<xref rid="tfn2" ref-type="table-fn"><sup>b</sup></xref></td>
<td align="char" valign="top" char=".">60.51</td>
<td align="char" valign="top" char=".">4356.50</td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Ties</td>
<td align="center" valign="top">126<xref rid="tfn3" ref-type="table-fn"><sup>c</sup></xref></td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Total</td>
<td align="center" valign="top">256</td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">Set 2 (medium-level marking) &#x2013; Set 1 (without level marking)</td>
<td align="left" valign="top">Negative ranks</td>
<td align="center" valign="top">79d<xref rid="tfn1" ref-type="table-fn"><sup>a</sup></xref></td>
<td align="char" valign="top" char=".">79.99</td>
<td align="char" valign="top" char=".">6319.00</td>
<td align="char" valign="top" char=".">&#x2212;0.930<xref rid="tfn4" ref-type="table-fn"><sup>d</sup></xref></td>
<td align="char" valign="top" char=".">0.353</td>
</tr>
<tr>
<td align="left" valign="top">Positive ranks</td>
<td align="center" valign="top">73<xref rid="tfn2" ref-type="table-fn"><sup>b</sup></xref></td>
<td align="char" valign="top" char=".">72.73</td>
<td align="char" valign="top" char=".">5309.00</td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Ties</td>
<td align="center" valign="top">107<xref rid="tfn3" ref-type="table-fn"><sup>c</sup></xref></td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Total</td>
<td align="center" valign="top">259</td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">Set 2 (difficult-level marking) &#x2013; Set 1 (without level marking)</td>
<td align="left" valign="top">Negative ranks</td>
<td align="center" valign="top">101<xref rid="tfn1" ref-type="table-fn"><sup>a</sup></xref></td>
<td align="char" valign="top" char=".">88.19</td>
<td align="char" valign="top" char=".">8907.50</td>
<td align="char" valign="top" char=".">&#x2212;4.033<xref rid="tfn4" ref-type="table-fn"><sup>d</sup></xref></td>
<td align="char" valign="top" char=".">&#x003C;0.001&#x002A;&#x002A;</td>
</tr>
<tr>
<td align="left" valign="top">Positive ranks</td>
<td align="center" valign="top">60<xref rid="tfn2" ref-type="table-fn"><sup>b</sup></xref></td>
<td align="char" valign="top" char=".">68.89</td>
<td align="char" valign="top" char=".">4133.50</td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Ties</td>
<td align="center" valign="top">98<xref rid="tfn3" ref-type="table-fn"><sup>c</sup></xref></td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Total</td>
<td align="center" valign="top">259</td>
<td/>
<td/>
<td/>
<td/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>&#x002A;&#x002A;The difference is significant at the 0.01 level.</p>
<fn id="tfn1">
<label>a</label>
<p>Set2&#x2009;&#x003C;&#x2009;Set1.</p>
</fn>
<fn id="tfn2">
<label>b</label>
<p>Set2&#x2009;&#x003E;&#x2009;Set1.</p>
</fn>
<fn id="tfn3">
<label>c</label>
<p>Set2&#x2009;=&#x2009;Set1.</p>
</fn>
<fn id="tfn4">
<label>d</label>
<p>Based on positive ranks.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>To test H2&#x2014;that is, to determine whether the differences highlighted in <xref rid="tab3" ref-type="table">Table 3</xref> were statistically significant&#x2014;Friedman tests were carried out (see <xref rid="tab5" ref-type="table">Table 5</xref>). This revealed a significant effect of the level marking on students&#x2019; self-efficacy, <italic>&#x03C7;</italic><sup>2</sup> (2, <italic>n</italic>&#x2009;=&#x2009;259)&#x2009;=&#x2009;11.413, <italic>p</italic>&#x2009;=&#x2009;0.003, &#x003C;0.01. The medians indicated that students&#x2019; differences in self-efficacy were highest when the tasks were marked as difficult, followed by medium- and easy-level marking.</p>
<table-wrap position="float" id="tab5">
<label>Table 5</label>
<caption>
<p>Friedman test.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th/>
<th align="center" valign="top"><italic>N</italic></th>
<th align="center" valign="top">Mean rank</th>
<th align="center" valign="top"><italic>&#x03C7;</italic><sup>2</sup></th>
<th align="center" valign="top">df</th>
<th align="center" valign="top">Value of <italic>p</italic></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Easy</td>
<td align="center" valign="top">259</td>
<td align="char" valign="top" char=".">1.90</td>
<td align="char" valign="top" char=".">11.413</td>
<td align="center" valign="top">2</td>
<td align="char" valign="top" char=".">0.003&#x002A;&#x002A;</td>
</tr>
<tr>
<td align="left" valign="top">Medium</td>
<td align="center" valign="top">259</td>
<td align="char" valign="top" char=".">1.96</td>
<td/>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Difficult</td>
<td align="center" valign="top">259</td>
<td align="char" valign="top" char=".">2.14</td>
<td/>
<td/>
<td/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>&#x002A;&#x002A;The difference is significant at the 0.01 level.</p>
</table-wrap-foot>
</table-wrap>
<p>Further analyses with Friedman tests were conducted to follow up pairwise comparisons. These pairs were set up in the following manner: Pair 1 compared <italic>x</italic> and <italic>y</italic>, where <italic>x</italic> is the difference in median between &#x201C;self-efficacy with no level marking&#x201D; and &#x201C;self-efficacy with easy-level marking&#x201D; and <italic>y</italic> is the difference in median between &#x201C;self-efficacy with no level marking&#x201D; and &#x201C;self-efficacy with medium-level marking.&#x201D; In the same manner, Pair 2 dealt with students&#x2019; responses to medium- and difficult-level marked tasks and Pair 3 with easy- and difficult-marked tasks (see <xref rid="tab6" ref-type="table">Table 6</xref>).</p>
<table-wrap position="float" id="tab6">
<label>Table 6</label>
<caption>
<p>Pairwise comparisons.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th/>
<th align="left" valign="top">Level marking</th>
<th align="center" valign="top"><italic>N</italic></th>
<th align="center" valign="top">Test statistic</th>
<th align="center" valign="top">Std. error</th>
<th align="center" valign="top">Two-tailed value of <italic>p</italic></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Pair 1</td>
<td align="left" valign="top">Easy</td>
<td align="center" valign="top">259</td>
<td align="char" valign="top" char=".">&#x2212;0.066</td>
<td align="char" valign="top" char=".">0.088</td>
<td align="char" valign="top" char="." rowspan="2">0.455</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">Medium</td>
<td align="center" valign="top">259</td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Pair 2</td>
<td align="left" valign="top">Medium</td>
<td align="center" valign="top">259</td>
<td align="char" valign="top" char=".">&#x2212;0.176</td>
<td align="char" valign="top" char=".">0.088</td>
<td align="char" valign="top" char="." rowspan="2">0.046&#x002A;</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">Difficult</td>
<td align="center" valign="top">259</td>
<td/>
<td/>
</tr>
<tr>
<td align="left" valign="top">Pair 3</td>
<td align="left" valign="top">Easy</td>
<td align="center" valign="top">259</td>
<td align="char" valign="top" char=".">&#x2212;0.241</td>
<td align="char" valign="top" char=".">0.088</td>
<td align="char" valign="top" char="." rowspan="2">0.006&#x002A;&#x002A;</td>
</tr>
<tr>
<td/>
<td align="left" valign="top">Difficult</td>
<td align="center" valign="top">259</td>
<td/>
<td/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>&#x002A;The difference is significant at the 0.05 level. &#x002A;&#x002A;The difference is significant at the 0.01 level.</p>
</table-wrap-foot>
</table-wrap>
<p>Overall, the results in <xref rid="tab6" ref-type="table">Table 6</xref> show that the effect on students&#x2019; self-efficacy was significant when testing Pair 2 (going from no level marking to difficult-marked tasks, compared to going from no level marking to medium-marked tasks; <italic>p</italic>&#x2009;=&#x2009;0.046), and Pair 3 (going from no level marking to difficult-marked tasks, compared to going from no level marking to easy-marked tasks; <italic>p</italic>&#x2009;=&#x2009;0.006). The trend also applied in testing Pair 1 (going from no level marking to medium-marked tasks, compared to going from no level marking to easy-marked tasks), but this difference was not significant (<italic>p</italic>&#x2009;=&#x2009;0.455). However, the effect on students&#x2019; self-efficacy was significantly larger when going from no level marking to difficult-marked tasks, compared to going from no level marking to easy- and medium-marked tasks. Taken together, this shows that the gap between students&#x2019; self-efficacy when encountering the same tasks with and without level marking expands going from easy- to difficult-marked tasks and from medium to difficult-marked tasks.</p>
</sec>
<sec id="sec10">
<label>5.</label>
<title>Discussion and concluding remarks</title>
<p>When encountering a mathematics task, most people are affected by additional information, such as information about the task&#x2019;s level of difficulty. The most striking result from our analysis was the extent to which tasks marked as difficult had a negative effect on students&#x2019; self-efficacy. We found that students reported a significantly lower level of self-efficacy when encountering tasks marked as difficult compared to when they encountered the same task without level marking. Further, the difference in students&#x2019; self-efficacy when solving tasks with and without level marking became larger when the markings denoted increasing difficulty levels. Here, we discuss what this finding means for students in terms of their mathematics learning and what it means for mathematics teachers&#x2019; differentiation initiatives and for future mathematics textbooks.</p>
<p>Whether a student perceives a given task as being easy or difficult is a matter of personal opinion. This affects the student&#x2019;s level of self-efficacy, which in turn influences the strength of their self-efficacy (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>). The negative effect of difficult-level markings on students&#x2019; self-efficacy highlights that even when all students receive the same task, the expectation of mastery becomes lower when a task is marked as difficult. This is consistent with <xref ref-type="bibr" rid="ref43">Street et al.&#x2019;s (2017)</xref> finding that students&#x2019; perceptions of difficulty could be different from the actual difficulty level. When tasks were marked as easy, this did not affect students&#x2019; self-efficacy, which suggests that the students did not perceive the tasks to be any easier than when no level markings were given. Keeping in mind that the first author designed the study using easy tasks&#x2014;at mastery level 2 of 5 (<xref ref-type="bibr" rid="ref3">Bj&#x00F6;rnsson, 2016</xref>)&#x2014;an effect for easy marking might have arisen if the focus had been on tasks with a higher difficulty level. More research is required to determine how level markings affect different levels of actual difficulty.</p>
<p>Although the sources of self-efficacy were not directly measured in this study, the results of our study apply to this body of research. As reported in previous research, mastery experience is the most powerful source of self-efficacy (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>; <xref ref-type="bibr" rid="ref42">Stevens et al., 2006</xref>; <xref ref-type="bibr" rid="ref48">Usher and Pajares, 2009</xref>; <xref ref-type="bibr" rid="ref22">Jo&#x00EB;t et al., 2011</xref>; <xref ref-type="bibr" rid="ref5">Butz and Usher, 2015</xref>), and this is a good reason for believing that some of the students&#x2019; previous mastery experiences with difficult-marked tasks had affected their self-efficacy negatively. This is in line with <xref ref-type="bibr" rid="ref40">Skaalvik and Skaalvik (2018</xref>, p. 197), who claimed that mastery experiences are necessary for students to develop and preserve expectations of mastery. A possible interpretation of this finding is that level marking affects students&#x2019; perceptions of the level of difficulty, and if their mastery experience has previously been low when solving tasks marked as difficult, their level of self-efficacy may decrease. This resonates with <xref ref-type="bibr" rid="ref1">Bandura (1997)</xref>, <xref ref-type="bibr" rid="ref6">Chen and Zimmerman (2007)</xref>, and <xref ref-type="bibr" rid="ref45">Street et al. (2022b)</xref>, who suggested that students&#x2019; opinions about whether tasks are easy or difficult affect their self-efficacy.</p>
<p>Our results can also be attributed to students&#x2019; physiological and affective states, in that their self-efficacy beliefs are informed by anxiety, mood, stress and fatigue (<xref ref-type="bibr" rid="ref1">Bandura, 1997</xref>). When told that a task is difficult, some draw on this comparatively weak source of self-efficacy, with detrimental results. This could explain some of the negative effects we found. No positive effects of level marking were found, which seems to indicate that level marking does not improve students&#x2019; physical or emotional well-being. However, previous studies are inconsistent in their conclusions on this point; for example, <xref ref-type="bibr" rid="ref42">Stevens et al. (2006)</xref> and <xref ref-type="bibr" rid="ref48">Usher and Pajares (2009)</xref> found significant correlations between self-efficacy and physiological and affective states, while <xref ref-type="bibr" rid="ref22">Jo&#x00EB;t et al. (2011)</xref> did not. In terms of the last two sources of self-efficacy&#x2014;social persuasion and vicarious experiences&#x2014;we could only speculate about how they may have affected our results. Qualitative research is required to investigate this in greater detail.</p>
<p>Surprisingly, no positive effect of level marking on students&#x2019; self-efficacy was found. However, the present study did not investigate how level marking affects students with different self-efficacy strengths. It is likely that the effect of level marking is different for groups of students with high and low self-efficacy. This was supported by <xref ref-type="bibr" rid="ref37">Schunk (1991)</xref>, who claimed that a person with a high sense of self-efficacy would be more motivated, persist longer and be willing to expend a higher degree of effort. Further research is required to determine exactly how the effect of level marking on students&#x2019; self-efficacy varies by strength of self-efficacy, as well as how the effect of level marking varies between groups of students (e.g., according to gender, grade, motivational factors and mastery experiences).</p>
<p>We are aware that our research may have some limitations related to the voluntary nature of participation in the survey, sample size and data collection taking place <italic>via</italic> the schools&#x2019; principals. We attempted to select schools randomly, but because several schools withdrew due to COVID-19, we had to choose several schools in one district to obtain sufficient data. Moreover, in Norway, there are ~113,700 students in grades 8 and 9, and our data collection consists of <italic>n</italic>&#x2009;=&#x2009;436 students. On the one hand, according to the sample size table (<xref ref-type="bibr" rid="ref8">Cohen et al., 2018</xref>, p. 207), a sample of 383 students is recommended, which is lower than the number of participants in our study (<italic>n</italic>&#x2009;=&#x2009;436). On the other hand, the participants were divided into different groups and there were missing data, so the sample size might be a limitation. Moreover, due to COVID, surveys were distributed to students by their teacher, which limited our opportunities to ensure sufficiently good and purposeful data collection. These limitations highlight the difficulty of collecting data, especially during COVID.</p>
<p>In reviewing the literature, we found that some countries, such as England and Germany, utilise mathematics textbooks that are adapted to different levels of ability (<xref ref-type="bibr" rid="ref33">Pepin and Haggerty, 2003</xref>), indicating that level-marked tasks may not appear consistently in English and German mathematics classrooms. However, the use of level-marked tasks is extensive in Norwegian mathematics textbooks (<xref ref-type="bibr" rid="ref15">Grave and Pepin, 2015</xref>) and classrooms (<xref ref-type="bibr" rid="ref11">Eriksen et al., 2022</xref>). Although <xref ref-type="bibr" rid="ref39">Skaalvik and Fossen (1995)</xref> claimed that textbooks require differentiation, our finding that the level marking of tasks negatively affects students&#x2019; self-efficacy suggests that there is a need to investigate this in more detail. <xref ref-type="bibr" rid="ref4">Br&#x00E4;ndstr&#x00F6;m (2005)</xref> raised questions regarding the level marking of tasks in Swedish mathematics textbooks nearly two decades ago, and to our knowledge, nothing has changed since then.</p>
<p><xref ref-type="bibr" rid="ref14">Glasnovi&#x0107; Gracin (2014)</xref> highlighted the need for research on the content and structure of textbooks. We add to this call by pointing the research path in the direction of level-marked tasks, specifically in terms of the number of such tasks in textbooks, their stated purpose as specified by textbook authors and how they are intended to contribute to better learning. The findings of the current study show that the level marking of tasks appears to have a detrimental effect on students&#x2019; beliefs in their own ability to accomplish the tasks. Finding that difficult-level-marked mathematics tasks may result in reduced self-efficacy in students may indicate that marking tasks as difficult has consequences for students&#x2019; learning. The level marking of tasks may result in students&#x2019; avoidance of difficult tasks and lead to low and inaccurate self-efficacy judgements, which can in turn affect their achievement. This negative effect on students&#x2019; self-efficacy is the opposite of what level marking is intended to achieve.</p>
<p>Our results contribute to a new understanding of level-marked tasks in mathematics textbooks as a differentiation initiative. The results indicate that level marking does not improve self-efficacy, which contradicts the purpose of differentiation (<xref ref-type="bibr" rid="ref24">Mathiassen, 2009</xref>; <xref ref-type="bibr" rid="ref28">NOU, 2016</xref>, p. 62). The finding that difficult-level marking of tasks reduces students&#x2019; self-efficacy has implications for mathematics teachers in terms of their choice of differentiation initiatives. This study adds new insights to the body of research reporting on how self-efficacy affects task choice, effort, persistence, self-evaluation, resilience and achievement (<xref ref-type="bibr" rid="ref51">Zimmerman and Martinez-Pons, 1990</xref>; <xref ref-type="bibr" rid="ref32">Pajares and Miller, 1995</xref>; <xref ref-type="bibr" rid="ref30">Pajares, 1996</xref>; <xref ref-type="bibr" rid="ref35">Ramdass and Zimmerman, 2008</xref>; <xref ref-type="bibr" rid="ref38">Schunk and Mullen, 2012</xref>; <xref ref-type="bibr" rid="ref50">Zakariya, 2021</xref>), and may have implications for how teachers use level-marked tasks in the classroom. If teachers allow students to choose between level-marked tasks, a negative consequence might be that some students avoid tasks marked as difficult. However, considering that the present study investigated only three tasks, more research is required to determine how level-marked tasks affect students&#x2019; cognitive, affective, selective and motivational processes. In addition, we recommend that future research include more than one mathematics task per level to measure the internal consistency of students&#x2019; self-efficacy. We are currently in the process of investigating the effect of level-marked tasks on students&#x2019; performance, persistence and choice of tasks, for a future examination of how level-marked tasks affect students&#x2019; learning of mathematics.</p>
</sec>
<sec id="sec11" sec-type="data-availability">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec id="sec12">
<title>Ethics statement</title>
<p>The studies involving human participants were reviewed and approved by NSD &#x2013; Norwegian centre for research data. Written informed consent to participate in this study was provided by the participants&#x2019; legal guardian/next of kin.</p>
</sec>
<sec id="sec13">
<title>Author contributions</title>
<p>All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.</p>
</sec>
<sec id="conf1" sec-type="COI-statement">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec100" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bandura</surname> <given-names>A.</given-names></name></person-group> (<year>1997</year>). <source>Self-efficacy: the exercise of control</source>, <publisher-loc>New York</publisher-loc>: <publisher-name>W.H. Freeman</publisher-name>.</citation></ref>
<ref id="ref2"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bandura</surname> <given-names>A.</given-names></name></person-group> (<year>2006</year>). &#x201C;<article-title>Guide to the construction of self-efficacy scales</article-title>&#x201D; in <source>Self-efficacy beliefs of adolescents</source>. eds. <person-group person-group-type="editor"><name><surname>Pajares</surname> <given-names>F.</given-names></name> <name><surname>Urdan</surname> <given-names>T.</given-names></name></person-group>, vol. <volume>5</volume> (<publisher-loc>Greenwich</publisher-loc>: <publisher-name>Information Age</publisher-name>), <fpage>307</fpage>&#x2013;<lpage>337</lpage>.</citation></ref>
<ref id="ref3"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Bj&#x00F6;rnsson</surname> <given-names>J. K.</given-names></name></person-group> (<year>2016</year>). Metodegrunnlag for nasjonale pr&#x00F8;ver [methods choices for national tests]. Utdanningsdirektoratet. Available at: <ext-link xlink:href="https://www.udir.no/globalassets/filer/vurdering/nasjonaleprover/metodegrunnlag-fornasjonale-prover-august-2018.pdf" ext-link-type="uri">https://www.udir.no/globalassets/filer/vurdering/nasjonaleprover/metodegrunnlag-fornasjonale-prover-august-2018.pdf</ext-link></citation></ref>
<ref id="ref4"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Br&#x00E4;ndstr&#x00F6;m</surname> <given-names>A.</given-names></name></person-group> (<year>2005</year>). <source>Differentiated tasks in mathematics textbooks. An analysis of the levels of difficulty. Licentiate thesis</source>, <publisher-loc>Lule&#x00E5;</publisher-loc>: <publisher-name>Lule&#x00E5; University of Technology</publisher-name>.</citation></ref>
<ref id="ref5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Butz</surname> <given-names>A. R.</given-names></name> <name><surname>Usher</surname> <given-names>E. L.</given-names></name></person-group> (<year>2015</year>). <article-title>Salient sources of early adolescents&#x2019; self-efficacy in two domains</article-title>. <source>Contemp. Educ. Psychol.</source> <volume>42</volume>, <fpage>49</fpage>&#x2013;<lpage>61</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.cedpsych.2015.04.001</pub-id></citation></ref>
<ref id="ref6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>P.</given-names></name> <name><surname>Zimmerman</surname> <given-names>B.</given-names></name></person-group> (<year>2007</year>). <article-title>A cross-national comparison study on the accuracy of self-efficacy beliefs of middle-school mathematics students</article-title>. <source>J. Exp. Educ.</source> <volume>75</volume>, <fpage>221</fpage>&#x2013;<lpage>244</lpage>. doi: <pub-id pub-id-type="doi">10.3200/JEXE.75.3.221-244</pub-id></citation></ref>
<ref id="ref7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Christenson</surname> <given-names>B.</given-names></name> <name><surname>Wager</surname> <given-names>A. A.</given-names></name></person-group> (<year>2012</year>). <article-title>Increasing participation through differentiation</article-title>. <source>Teach. Child. Math.</source> <volume>19</volume>, <fpage>194</fpage>&#x2013;<lpage>200</lpage>. doi: <pub-id pub-id-type="doi">10.5951/teacchilmath.19.3.0194</pub-id></citation></ref>
<ref id="ref8"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>L.</given-names></name> <name><surname>Manion</surname> <given-names>L.</given-names></name> <name><surname>Morrison</surname> <given-names>A. K.</given-names></name></person-group> (<year>2018</year>). <source>Research methods in education</source> <publisher-loc>London</publisher-loc>: <publisher-name>Routledge</publisher-name>.</citation></ref>
<ref id="ref9"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Czegl&#x00E9;dy</surname> <given-names>I.</given-names></name> <name><surname>Sz&#x00E1;sz</surname> <given-names>R.</given-names></name></person-group> (<year>2005</year>). <article-title>The mathematics textbook as an aid to differentiation: a first Hungarian example</article-title>. <source>Teach. Math. Comput. Sci.</source> <volume>3</volume>, <fpage>35</fpage>&#x2013;<lpage>53</lpage>. doi: <pub-id pub-id-type="doi">10.5485/TMCS.2005.0076</pub-id></citation></ref>
<ref id="ref10"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Dolonen</surname> <given-names>J. A.</given-names></name> <name><surname>Furberg</surname> <given-names>A.</given-names></name> <name><surname>Gilje</surname> <given-names>O.</given-names></name> <name><surname>Ingulfsen</surname> <given-names>L.</given-names></name> <name><surname>Kluge</surname> <given-names>A.</given-names></name> <name><surname>Knain</surname> <given-names>E.</given-names></name> <etal/></person-group>. (<year>2016</year>). <source>Med ARK &#x0026; APP. Bruk avl&#x00E6;remidler og ressurser for l&#x00E6;ring p&#x00E5; tvers av arbeidsformer</source> <publisher-name>University of Oslo</publisher-name>.</citation></ref>
<ref id="ref11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eriksen</surname> <given-names>E.</given-names></name> <name><surname>Solomon</surname> <given-names>Y.</given-names></name> <name><surname>Bjerke</surname> <given-names>A. H.</given-names></name> <name><surname>Gray</surname> <given-names>J.</given-names></name> <name><surname>Kleve</surname> <given-names>B.</given-names></name></person-group> (<year>2022</year>). <article-title>Making decisions about attainment grouping in mathematics: teacher agency and autonomy in Norway</article-title>. <source>Res. Pap. Educ.</source>, <fpage>1</fpage>&#x2013;<lpage>21</lpage>. doi: <pub-id pub-id-type="doi">10.1080/02671522.2022.2135014</pub-id></citation></ref>
<ref id="ref12"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Everitt</surname> <given-names>B. S.</given-names></name></person-group> (<year>2002</year>). <source>The Cambridge dictionary of statistics</source> <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation></ref>
<ref id="ref13"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Glasnovic Gracin</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <source>Requirements in mathematics textbooks and PISA assessment. Dissertation</source> <publisher-loc>Klagenfurt</publisher-loc>: <publisher-name>Alpen-Adria-Universit&#x00E4;t Klagenfurt</publisher-name>.</citation></ref>
<ref id="ref14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Glasnovi&#x0107; Gracin</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Mathematics textbook as an object of research/Matemati&#x010D;ki ud&#x017E;benik kao predmet istra&#x017E;ivanja</article-title>. <source>Croatian J. Educ.</source> <volume>16</volume>, <fpage>211</fpage>&#x2013;<lpage>237</lpage>. doi: <pub-id pub-id-type="doi">10.15516/cje.v16i0.721</pub-id></citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grave</surname> <given-names>I.</given-names></name> <name><surname>Pepin</surname> <given-names>B.</given-names></name></person-group> (<year>2015</year>). <article-title>Teachers&#x2019; use of resources in and for mathematics teaching</article-title>. <source>Nordic Stud. Math. Educ.</source> <volume>20</volume>, <fpage>199</fpage>&#x2013;<lpage>222</lpage>.</citation></ref>
<ref id="ref16"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Herset</surname> <given-names>M. K.</given-names></name></person-group> (<year>2014</year>). <source>Niv&#x00E5;differensierte oppgaver og mestringsforventning i matematikkfaget &#x2013; En studie av elever p&#x00E5; 9. trinn i m&#x00F8;te med niv&#x00E5;markerte oppgaver. Master&#x2019;s thesis</source>, <publisher-loc>Oslo</publisher-loc>: <publisher-name>Department of Teacher Education and School Research Faculty of Education, University of Oslo</publisher-name> Available at: <ext-link xlink:href="https://www.duo.uio.no/bitstream/handle/10852/41231/HersetMaster.pdf?sequence=1&#x0026;isAllowed=y" ext-link-type="uri">https://www.duo.uio.no/bitstream/handle/10852/41231/HersetMaster.pdf?sequence=1&#x0026;isAllowed=y</ext-link>.</citation></ref>
<ref id="ref17"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Herset</surname> <given-names>M.</given-names></name> <name><surname>El Ghami</surname> <given-names>M.</given-names></name></person-group> (<year>2022</year>). &#x201C;The effect of level-marking mathematical tasks on students&#x2019; time spent on such tasks and correct solutions: an experimental study,&#x201D; in <italic>Twelfth congress of the European Society for Research in mathematics education (CERME12)</italic>, Feb 2022, Bozen-Bol, Italy.</citation></ref>
<ref id="ref18"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Howson</surname> <given-names>G.</given-names></name></person-group> (<year>1995</year>). <source>Mathematics textbooks: a comparative study of grade 8 texts</source>, vol. <volume>3</volume>, <publisher-loc>Vancouver</publisher-loc>: <publisher-name>Pacific Educational Press</publisher-name>.</citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Howson</surname> <given-names>G.</given-names></name></person-group> (<year>2013</year>). <article-title>The development of mathematics textbooks: historical reflections from a personal perspective</article-title>. <source>ZDM</source> <volume>45</volume>, <fpage>647</fpage>&#x2013;<lpage>658</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s11858-013-0511-9</pub-id></citation></ref>
<ref id="ref20"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Imsen</surname> <given-names>G.</given-names></name></person-group> (<year>2020</year>). <source>L&#x00E6;rerens verden &#x2013; Innf&#x00F8;ring i generell didaktikk</source>. <edition>6th</edition> Edn <publisher-loc>Oslo</publisher-loc>: <publisher-name>Universitetsforlaget</publisher-name>.</citation></ref>
<ref id="ref21"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Jablonka</surname> <given-names>E.</given-names></name> <name><surname>Johansson</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). &#x201C;<article-title>Using texts and tasks</article-title>&#x201D; in <source>The first sourcebook on Nordic research in mathematics education</source>. eds. <person-group person-group-type="editor"><name><surname>Sriraman</surname> <given-names>B.</given-names></name> <name><surname>Bergsten</surname> <given-names>C.</given-names></name> <name><surname>Goodchild</surname> <given-names>S.</given-names></name> <name><surname>Palsdottir</surname> <given-names>G.</given-names></name> <name><surname>Dahl</surname> <given-names>B.</given-names></name> <name><surname>Haapasalo</surname> <given-names>L.</given-names></name></person-group> (<publisher-loc>Charlotte</publisher-loc>: <publisher-name>Information Age Publishing</publisher-name>), <fpage>363</fpage>&#x2013;<lpage>372</lpage>.</citation></ref>
<ref id="ref22"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jo&#x00EB;t</surname> <given-names>G.</given-names></name> <name><surname>Usher</surname> <given-names>E. L.</given-names></name> <name><surname>Bressoux</surname> <given-names>P.</given-names></name></person-group> (<year>2011</year>). <article-title>Sources of self-efficacy: an investigation of elementary school students in France</article-title>. <source>J. Educ. Psychol.</source> <volume>103</volume>, <fpage>649</fpage>&#x2013;<lpage>663</lpage>. doi: <pub-id pub-id-type="doi">10.1037/a0024048</pub-id></citation></ref>
<ref id="ref23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lepik</surname> <given-names>M.</given-names></name> <name><surname>Grevholm</surname> <given-names>B.</given-names></name> <name><surname>Viholainen</surname> <given-names>A.</given-names></name></person-group> (<year>2015</year>). <article-title>Using textbooks in the mathematics classroom&#x2013;the teachers&#x2019; view</article-title>. <source>Nordic Stud. Math. Educ.</source> <volume>20</volume>, <fpage>129</fpage>&#x2013;<lpage>156</lpage>.</citation></ref>
<ref id="ref24"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Mathiassen</surname> <given-names>K.</given-names></name></person-group> (<year>2009</year>). &#x201C;<article-title>Lektor &#x2013; adjunkt &#x2013; l&#x00E6;rer: Artikler for studiet i praktisk-pedagogisk utdanning</article-title>&#x201D; in <source>Differensiert undervisning</source>. eds. <person-group person-group-type="editor"><name><surname>Mikkelsen</surname> <given-names>I. R.</given-names></name> <name><surname>Flademoe</surname> <given-names>H.</given-names></name></person-group> (<publisher-loc>Oslo</publisher-loc>: <publisher-name>Universitetsforlaget</publisher-name>), <fpage>123</fpage>&#x2013;<lpage>136</lpage>.</citation></ref>
<ref id="ref25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Matic</surname> <given-names>L. J.</given-names></name> <name><surname>Glasnovic Gracin</surname> <given-names>D.</given-names></name></person-group> (<year>2016</year>). <article-title>The use of the textbook as an artefact in the classroom: a case study in the light of a socio-didactical tetrahedron</article-title>. <source>J. Math. Didakt</source> <volume>37</volume>, <fpage>349</fpage>&#x2013;<lpage>374</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s13138-016-0091-7</pub-id></citation></ref>
<ref id="ref26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>McNeill</surname> <given-names>H.</given-names></name> <name><surname>Polly</surname> <given-names>D.</given-names></name></person-group> (<year>2023</year>). <article-title>Exploring primary grades teachers&#x2019; perceptions of their Students&#x2019; mathematics self-Efficacy and how they differentiate instruction</article-title>. <source>Early Childhood Educ. J.</source> <volume>51</volume>, <fpage>79</fpage>&#x2013;<lpage>88</lpage>.</citation></ref>
<ref id="ref27"><citation citation-type="other"><person-group person-group-type="author"><collab id="coll1">Norwegian Directorate for Education and Training</collab></person-group> (<year>n.d.</year>). Eksempeloppgaver og tidligere nasjonale pr&#x00F8;ver. Available at: <ext-link xlink:href="https://www.udir.no/eksamen-og-prover/prover/eksempeloppgaver-tidligere-nasjonale-prover/8-9-trinn/regning/?path=cefglhhcefglif" ext-link-type="uri">https://www.udir.no/eksamen-og-prover/prover/eksempeloppgaver-tidligere-nasjonale-prover/8-9-trinn/regning/?path=cefglhhcefglif</ext-link></citation></ref>
<ref id="ref28"><citation citation-type="journal"><person-group person-group-type="author"><collab id="coll2">NOU</collab></person-group> (<year>2016</year>). <article-title>Mer &#x00E5; hente: Bedre l&#x00E6;ring for elever med stort l&#x00E6;ringspotensial [more to gain &#x2014; better learning for students with higher learning potential]</article-title>. <source>Ministry Educ. Res.</source> Available at: <ext-link xlink:href="https://www.regjeringen.no/contentassets/15542e6ffc5f4159ac5e47b91db91bc0/no/pdfs/nou201620160014000dddpdfs.pdf" ext-link-type="uri">https://www.regjeringen.no/contentassets/15542e6ffc5f4159ac5e47b91db91bc0/no/pdfs/nou201620160014000dddpdfs.pdf</ext-link></citation></ref>
<ref id="ref29"><citation citation-type="other"><person-group person-group-type="author"><name><surname>Omtvei</surname> <given-names>T.</given-names></name></person-group> (<year>n.d.</year>). Hefte med probleml&#x00F8;sningsoppgaver: Ukas n&#x00F8;tt 2008/2009. <ext-link xlink:href="http://Matematikk.org" ext-link-type="uri">Matematikk.org</ext-link>. Available at: <ext-link xlink:href="https://www.matematikk.org/binfil/download2.php?tid=83942&#x0026;h=c8ec261d0ffa8ace1be7aee45247b363" ext-link-type="uri">https://www.matematikk.org/binfil/download2.php?tid=83942&#x0026;h=c8ec261d0ffa8ace1be7aee45247b363</ext-link></citation></ref>
<ref id="ref30"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pajares</surname> <given-names>F.</given-names></name></person-group> (<year>1996</year>). <article-title>Self-efficacy beliefs in academic settings</article-title>. <source>Rev. Educ. Res.</source> <volume>66</volume>, <fpage>543</fpage>&#x2013;<lpage>578</lpage>. doi: <pub-id pub-id-type="doi">10.2307/1170653</pub-id></citation></ref>
<ref id="ref31"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pajares</surname> <given-names>F.</given-names></name> <name><surname>Hartley</surname> <given-names>J.</given-names></name> <name><surname>Valiante</surname> <given-names>G.</given-names></name></person-group> (<year>2001</year>). <article-title>Response format in writing self-efficacy assessment: greater discrimination increases prediction</article-title>. <source>Meas. Eval. Couns. Dev.</source> <volume>33</volume>, <fpage>214</fpage>&#x2013;<lpage>221</lpage>. doi: <pub-id pub-id-type="doi">10.1080/07481756.2001.12069012</pub-id></citation></ref>
<ref id="ref32"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pajares</surname> <given-names>F.</given-names></name> <name><surname>Miller</surname> <given-names>M. D.</given-names></name></person-group> (<year>1995</year>). <article-title>Mathematics self-efficacy and mathematics performances: the need for specificity of assessment</article-title>. <source>J. Couns. Psychol.</source> <volume>42</volume>, <fpage>190</fpage>&#x2013;<lpage>198</lpage>. doi: <pub-id pub-id-type="doi">10.1037/0022-0167.42.2.190</pub-id></citation></ref>
<ref id="ref33"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Pepin</surname> <given-names>B.</given-names></name> <name><surname>Haggerty</surname> <given-names>L.</given-names></name></person-group> (<year>2003</year>). &#x201C;<article-title>Mathematics textbooks and their use by teachers: a window into the education world of particular countries</article-title>&#x201D; in <source>Curriculum landscapes and trends</source>. eds. <person-group person-group-type="editor"><name><surname>Akker</surname> <given-names>J.</given-names></name> <name><surname>Kuiper</surname> <given-names>W.</given-names></name> <name><surname>Hameyer</surname> <given-names>U.</given-names></name></person-group> (<publisher-loc>Dordrecht</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>73</fpage>&#x2013;<lpage>100</lpage>.</citation></ref>
<ref id="ref34"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pierce</surname> <given-names>R. L.</given-names></name> <name><surname>Adams</surname> <given-names>C. M.</given-names></name></person-group> (<year>2005</year>). <article-title>Using tiered lessons in mathematics</article-title>. <source>Math. Teach. Middle School</source> <volume>11</volume>, <fpage>144</fpage>&#x2013;<lpage>149</lpage>. doi: <pub-id pub-id-type="doi">10.5951/MTMS.11.3.0144</pub-id></citation></ref>
<ref id="ref35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ramdass</surname> <given-names>D.</given-names></name> <name><surname>Zimmerman</surname> <given-names>B. J.</given-names></name></person-group> (<year>2008</year>). <article-title>Effects of self-correction strategy training on middle school students&#x2019; self-efficacy, self-evaluation, and mathematics division learning</article-title>. <source>J. Adv. Acad.</source> <volume>20</volume>, <fpage>18</fpage>&#x2013;<lpage>41</lpage>. doi: <pub-id pub-id-type="doi">10.4219/jaa-2008-869</pub-id></citation></ref>
<ref id="ref36"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Robitaille</surname> <given-names>D. F.</given-names></name> <name><surname>Travers</surname> <given-names>K. J.</given-names></name></person-group> (<year>1992</year>). &#x201C;<article-title>International studies of achievement in mathematics</article-title>&#x201D; in <source>Handbook of research on mathematics teaching and learning</source>. ed. <person-group person-group-type="editor"><name><surname>Grouws</surname> <given-names>D. A.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Macmillan</publisher-name>), <fpage>687</fpage>&#x2013;<lpage>709</lpage>.</citation></ref>
<ref id="ref37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schunk</surname> <given-names>D. H.</given-names></name></person-group> (<year>1991</year>). <article-title>Self-efficacy and academic motivation</article-title>. <source>Educ. Psychol.</source> <volume>26</volume>, <fpage>207</fpage>&#x2013;<lpage>231</lpage>. doi: <pub-id pub-id-type="doi">10.1080/00461520.1991.9653133</pub-id></citation></ref>
<ref id="ref38"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Schunk</surname> <given-names>D. H.</given-names></name> <name><surname>Mullen</surname> <given-names>C. A.</given-names></name></person-group> (<year>2012</year>). &#x201C;<article-title>Self-efficacy as an engaged learner</article-title>&#x201D; in <source>Handbook of research on student engagement</source>. eds. <person-group person-group-type="editor"><name><surname>Christenson</surname> <given-names>S. L.</given-names></name> <name><surname>Reschly</surname> <given-names>A. L.</given-names></name> <name><surname>Wylie</surname> <given-names>C.</given-names></name></person-group> (<publisher-loc>New York</publisher-loc>: <publisher-name>Springer Science &#x0026; Business Media</publisher-name>), <fpage>219</fpage>&#x2013;<lpage>235</lpage>.</citation></ref>
<ref id="ref39"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Skaalvik</surname> <given-names>E. M.</given-names></name> <name><surname>Fossen</surname> <given-names>I.</given-names></name></person-group> (<year>1995</year>). <source>Tilpassing og Differensiering: Idealer og Realiteter i Norsk Grunnskole</source>, <publisher-loc>Trondheim</publisher-loc>: <publisher-name>Tapir</publisher-name>.</citation></ref>
<ref id="ref40"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Skaalvik</surname> <given-names>E. M.</given-names></name> <name><surname>Skaalvik</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). &#x201C;<article-title>Skolen som L&#x00E6;ringsarena</article-title>&#x201D; in <source>Selvoppfatning, Motivasjon og L&#x00E6;ring</source>. <edition>3rd</edition> ed (<publisher-loc>Oslo</publisher-loc>: <publisher-name>Universitetsforlaget</publisher-name>)</citation></ref>
<ref id="ref41"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Stein</surname> <given-names>M. K.</given-names></name> <name><surname>Remillard</surname> <given-names>J. T.</given-names></name> <name><surname>Smith</surname> <given-names>M. S.</given-names></name></person-group> (<year>2007</year>). &#x201C;<article-title>How curriculum influences student learning</article-title>&#x201D; in <source>Second handbook of research on mathematics teaching and learning: A project of the National Council of teachers of mathematics</source>. ed. <person-group person-group-type="editor"><name><surname>Lester</surname> <given-names>F.</given-names> <suffix>Jr.</suffix></name></person-group> (<publisher-loc>Charlotte</publisher-loc>: <publisher-name>Information Age Publishing</publisher-name>), <fpage>319</fpage>&#x2013;<lpage>369</lpage>.</citation></ref>
<ref id="ref42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stevens</surname> <given-names>T.</given-names></name> <name><surname>Oliv&#x00E1;rez</surname> <given-names>A.</given-names> <suffix>Jr.</suffix></name> <name><surname>Hamman</surname> <given-names>D.</given-names></name></person-group> (<year>2006</year>). <article-title>The role of cognition, motivation, and emotion in explaining the mathematics achievement gap between Hispanic and white students</article-title>. <source>Hisp. J. Behav. Sci.</source> <volume>28</volume>, <fpage>161</fpage>&#x2013;<lpage>186</lpage>. doi: <pub-id pub-id-type="doi">10.1177/0739986305286103</pub-id></citation></ref>
<ref id="ref43"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Street</surname> <given-names>E. K.</given-names></name> <name><surname>Malmberg</surname> <given-names>L.</given-names></name> <name><surname>Stylianides</surname> <given-names>J. G.</given-names></name></person-group> (<year>2017</year>). <article-title>Level, strength and facet-specific self-efficacy in mathematics test performance</article-title>. <source>ZDM Math. Educ</source> <volume>49</volume>, <fpage>379</fpage>&#x2013;<lpage>395</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s11858-017-0833-0</pub-id></citation></ref>
<ref id="ref44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Street</surname> <given-names>K. E.</given-names></name> <name><surname>Malmberg</surname> <given-names>L. E.</given-names></name> <name><surname>Stylianides</surname> <given-names>G. J.</given-names></name></person-group> (<year>2022a</year>). <article-title>Changes in students&#x2019; self-efficacy when learning a new topic in mathematics: a micro-longitudinal study</article-title>. <source>Educ. Stud. Math.</source> <volume>111</volume>, <fpage>515</fpage>&#x2013;<lpage>541</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s10649-022-10165-1</pub-id></citation></ref>
<ref id="ref45"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Street</surname> <given-names>K. E.</given-names></name> <name><surname>Stylianides</surname> <given-names>G. J.</given-names></name> <name><surname>Malmberg</surname> <given-names>L. E.</given-names></name></person-group> (<year>2022b</year>). <article-title>Differential relationships between mathematics self-efficacy and national test performance according to perceived task difficulty</article-title>. <source>Assess. Educ.: Principles Policy Pract.</source> <volume>29</volume>, <fpage>288</fpage>&#x2013;<lpage>309</lpage>. doi: <pub-id pub-id-type="doi">10.1080/0969594X.2022.2095980</pub-id></citation></ref>
<ref id="ref46"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tesfamicael</surname> <given-names>S. A.</given-names></name> <name><surname>Lundeby</surname> <given-names>&#x00D8;. A.</given-names></name></person-group> (<year>2019</year>). <article-title>A comparative study of Norwegian and Ethiopian textbooks: the case of relations and functions using anthropological theory of didactics (ATD)</article-title>. <source>Univ. J. Educ. Res.</source> <volume>7</volume>, <fpage>754</fpage>&#x2013;<lpage>765</lpage>. doi: <pub-id pub-id-type="doi">10.13189/ujer.2019.070315</pub-id></citation></ref>
<ref id="ref47"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Tomlinson</surname> <given-names>C. A.</given-names></name></person-group> (<year>2001</year>). <source>How to differentiate in Mxed-ability classrooms</source>. <edition>2nd</edition> Edn <publisher-loc>Alexandria</publisher-loc>: <publisher-name>ASCD</publisher-name>.</citation></ref>
<ref id="ref48"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Usher</surname> <given-names>E. L.</given-names></name> <name><surname>Pajares</surname> <given-names>F.</given-names></name></person-group> (<year>2009</year>). <article-title>Sources of self-efficacy in mathematics: a validation study</article-title>. <source>Contemp. Educ. Psychol.</source> <volume>34</volume>, <fpage>89</fpage>&#x2013;<lpage>101</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.cedpsych.2008.09.002</pub-id></citation></ref>
<ref id="ref49"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zakariya</surname> <given-names>Y. F.</given-names></name></person-group> (<year>2019</year>). <article-title>Study approaches in higher education mathematics: investigating the statistical behavior of an instrument translated into Norwegian</article-title>. <source>Educ. Sci.</source> <volume>9</volume>:<fpage>191</fpage>. doi: <pub-id pub-id-type="doi">10.3390/educsci9030191</pub-id></citation></ref>
<ref id="ref50"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zakariya</surname> <given-names>Y. F.</given-names></name></person-group> (<year>2021</year>). <article-title>Self-efficacy between previous and current mathematics performance of undergraduate students: an instrumental variable approach to exposing a causalrelationship</article-title>. <source>Front. Psychol.</source> <volume>11</volume>:<fpage>556607</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fpsyg.2020.556607</pub-id>, PMID: <pub-id pub-id-type="pmid">33536959</pub-id></citation></ref>
<ref id="ref51"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zimmerman</surname> <given-names>B. J.</given-names></name> <name><surname>Martinez-Pons</surname> <given-names>M.</given-names></name></person-group> (<year>1990</year>). <article-title>Student differences in self-regulated learning: relating grade, sex, and giftedness to self-efficacy and strategy use</article-title>. <source>J. Educ. Psychol.</source> <volume>82</volume>, <fpage>51</fpage>&#x2013;<lpage>59</lpage>. doi: <pub-id pub-id-type="doi">10.1037/0022-0663.82.1.51</pub-id></citation></ref>
</ref-list>
</back>
</article>