<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="editorial">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2023.1132185</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Editorial</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Editorial: Persistence of measurement problems in psychological research</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Meier</surname> <given-names>Scott T.</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/772383/overview"/>
</contrib>
</contrib-group>
<aff><institution>Department of Counseling, School, and Educational Psychology, University at Buffalo</institution>, <addr-line>Buffalo, NY</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited and reviewed by: Gene Michael Alarcon, Air Force Research Laboratory, United States</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Scott T. Meier &#x02709; <email>stmeier&#x00040;buffalo.edu</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Quantitative Psychology and Measurement, a section of the journal Frontiers in Psychology</p></fn></author-notes>
<pub-date pub-type="epub">
<day>23</day>
<month>01</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>14</volume>
<elocation-id>1132185</elocation-id>
<history>
<date date-type="received">
<day>27</day>
<month>12</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>11</day>
<month>01</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2023 Meier.</copyright-statement>
<copyright-year>2023</copyright-year>
<copyright-holder>Meier</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license> </permissions>
<related-article id="RA1" related-article-type="commentary-article" xlink:href="https://www.frontiersin.org/research-topics/31595/persistence-of-measurement-problems-in-psychological-research" ext-link-type="uri">Editorial on the Research Topic <article-title>Persistence of measurement problems in psychological research</article-title></related-article>
<kwd-group>
<kwd>measurement</kwd>
<kwd>psychology</kwd>
<kwd>replication</kwd>
<kwd>scientific crises</kwd>
<kwd>testing</kwd>
</kwd-group>
<counts>
<fig-count count="0"/>
<table-count count="0"/>
<equation-count count="0"/>
<ref-count count="37"/>
<page-count count="5"/>
<word-count count="3592"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>As we observed in the announcement of this Research Topic, reviews of the history of science suggest that new measurement approaches drive scientific development (Kuhn, <xref ref-type="bibr" rid="B19">1970</xref>; Cone and Foster, <xref ref-type="bibr" rid="B6">1991</xref>; Tryon, <xref ref-type="bibr" rid="B35">1991</xref>; Meier, <xref ref-type="bibr" rid="B27">1994</xref>, <xref ref-type="bibr" rid="B28">2008</xref>). Tryon (<xref ref-type="bibr" rid="B35">1991</xref>) wrote that scientific progress results from a measurement method&#x00027;s capacity to correct and extend human senses into new domains and provide new data that transforms theory. Small improvements in measurement can accumulate and result in significant observational advances (Meier, <xref ref-type="bibr" rid="B27">1994</xref>, <xref ref-type="bibr" rid="B28">2008</xref>). Despite decades of recognition of measurement problems in psychology and related fields, contemporary scholars continue to raise alarms about the state of psychological measurement. Longstanding problems such as the validity of self-reports and inconsistencies in test scores across sources (e.g., ratings of children by parents, teachers, and children) will recur until the field finds effective solutions (Benjamin and Baker, <xref ref-type="bibr" rid="B1">2009</xref>; Lilienfeld, <xref ref-type="bibr" rid="B21">2017</xref>). Historical references provide evidence that psychology&#x00027;s measurement problems are not random occurrences, but periodic problems that appear, fade away without resolution, and are later rediscovered (Meier, <xref ref-type="bibr" rid="B27">1994</xref>, <xref ref-type="bibr" rid="B28">2008</xref>; Lilienfeld, <xref ref-type="bibr" rid="B21">2017</xref>).</p>
</sec>
<sec id="s2">
<title>Critique of current methods</title>
<p>Lilienfeld and Strother (<xref ref-type="bibr" rid="B22">2020</xref>, p. 281) wrote that &#x0201C;many researchers pay little heed to the psychometric properties of their measures, cavalierly neglecting them, or taking them for granted.&#x0201D; They provided examples of four irrational beliefs about measurement that contribute to the replication crisis in psychology: (a) the name of a measure reflects its content, (b) reliability is not a major concern for laboratory measures, (c) large sample sizes are unnecessary when data are difficult to collect, and (d) construct validity can be adequately assessed by estimates of convergent validity alone. Similarly, Flake and Fried (<xref ref-type="bibr" rid="B12">2020</xref>) placed the key source of problems as what they termed a <italic>measurement schmeasurement</italic> attitude whereby researchers and other users of psychological tests sidestep measurement problems by ignoring them.</p>
<p>Another indicator of the neglect of measurement is a tendency for researchers to operationalize a construct through a single test rather than expend the resources necessary to conduct a thorough construct explication (cf. Scheel et al., <xref ref-type="bibr" rid="B32">2021</xref>). Edison&#x00027;s attempt to find a proper filament for the light bulb offers an appropriate metaphor here. When Edison invented the light bulb, he reportedly tested 3,000 of types of materials to identify filaments that generated light but minimized heat (Palermo, <xref ref-type="bibr" rid="B31">2017</xref>). In contrast, to believe that a single iteration of a psychological test, often developed in the early stages of research in a domain, represents the best explication of any single construct appears highly unlikely.</p>
<p>Nevertheless, examples abound of psychological tests and operations that have been adopted as the standard, default method in many areas of research with minimal discussion of potential alternatives. Bianchi et al. (<xref ref-type="bibr" rid="B2">2015</xref>) found that in a review of measures employed in research on occupational stress and burnout, the self-report Maslach Burnout Inventory (MBI; Maslach and Jackson, <xref ref-type="bibr" rid="B25">1981</xref>) was employed in &#x0007E;80% of review studies. Similarly, in the domain of working alliance research in psychotherapy, the Working Alliance Inventory (WAI; Horvath and Greenberg, <xref ref-type="bibr" rid="B15">1989</xref>) has been employed in &#x0007E;70% of studies (Fl&#x000FC;ckiger et al., <xref ref-type="bibr" rid="B13">2018</xref>; Meier and Feeley, <xref ref-type="bibr" rid="B29">2021</xref>). This constitutes both a mono-operation bias and mono-method bias in that research findings will be influenced by use of a single test employing a single method (Campbell and Fiske, <xref ref-type="bibr" rid="B4">1959</xref>).</p>
<p>Finally, in the quest to find statistical significance (Nosek et al., <xref ref-type="bibr" rid="B30">2013</xref>; Ledgerwood, <xref ref-type="bibr" rid="B20">2014</xref>), quantitative researchers typically conduct a power analysis focused on sample size and expected effect size, and then adjust the former to increase power to detect an effect (Cohen, <xref ref-type="bibr" rid="B5">1992</xref>; Houle et al., <xref ref-type="bibr" rid="B16">2005</xref>; Drummond and Vowler, <xref ref-type="bibr" rid="B11">2012</xref>). The strategy to increase sample size increases the likelihood of finding a statistically significant finding, which increases the odds of publication, but often results in the detection of a small to moderate effects (Lipsey, <xref ref-type="bibr" rid="B23">1990</xref>) that are difficult to replicate. At the analysis end of a study, researchers employ advanced statistical methods such as structural equation modeling and item response theory that provide the veneer of scientific sophistication but whose results largely depend upon the quality of the data produced during measurement (Cone and Foster, <xref ref-type="bibr" rid="B6">1991</xref>).</p>
<p>A parallel line of thinking is evident in attempts to identify questionable research practices (QRPs) related to statistical procedures that may potentially create problems for subsequent attempts at replication. Discussing QRPs, Ulrich and Miller (<xref ref-type="bibr" rid="B36">2020</xref>) proposed that the base rate of true effects strongly influences replication rate in scientific results. Their central thesis is that &#x0201C;low power within a research area reduces replicability for purely statistical reasons, because it reduces the ratio of true positives to false positives&#x0201D; (p. 2). From this perspective, strategies such as data peeking and selective reporting have little effect on replication rate. They conclude that &#x0201C;low base rates of true effects&#x02014;not too-large <italic>a</italic> levels, too-low power, or p-hacking&#x02014;are most likely to be the major causes of poor replicability, so researchers concerned about replicability should pay special attention to the issue of base rates&#x0201D; (p. 18). If studies of psychological effects evidence low base rates, then careful development of psychological tests able to detect small to moderate effects would seem to be of paramount importance.</p>
</sec>
<sec id="s3">
<title>Alternatives to current practices</title>
<p>Method effects (MEs) refer to the observation that scores on every quantitative variable, index, and measure at least partially reflect the methodology employed to collect data. Cote and Buckley&#x00027;s (<xref ref-type="bibr" rid="B7">1987</xref>) research (Williams et al., <xref ref-type="bibr" rid="B37">1989</xref>) concluded that about 25% of variance in scores on a typical measure results from sample and measurement characteristics. Even seemingly minor methodological conditions can influence results. Studies have found, for example, that (a) the gender of a researcher present in an experimental setting could influence the behavioral performance of rats and mice and (b) a priming study&#x00027;s results were unintentionally influenced by the fact that the researcher who packaged materials for the priming or control groups was also the individual who handed the materials to all participants (Brown et al., <xref ref-type="bibr" rid="B3">2014</xref>).</p>
<p>Historically, one of the goals of the test development process was to reduce or eliminate MEs in psychological measurement. A ceiling effect in a set of test scores, for example, should not be present because items or scales with skewed scores are typically identified and eliminated during the item analysis procedure. MEs can provide clues about where test developers should re-examine construct explication, the process of connecting theoretical constructs to observed behaviors (Torgerson, <xref ref-type="bibr" rid="B34">1958</xref>). When MEs are present, a problem has occurred in explication, and exploration of the problem provides an opportunity to deepen substantive knowledge and improve the power of measurement devices. Construct explication consists of four resource-intensive steps.</p>
<list list-type="simple">
<list-item><p>1. Review and/or develop substantive theory related to the construct(s).</p></list-item>
<list-item><p>2. Review and/or develop methodological theory related to the construct(s).</p></list-item>
<list-item><p>3. Employ the results of one and two to create appropriate measure(s).</p></list-item>
<list-item><p>4. Repeat the process in a program of research to improve the power of developed measures to detect effects of interest.</p></list-item>
</list>
<p>Step 2 is often problematic in contemporary psychological study: Researchers may minimize methodological considerations, hence, <italic>measurement schmeasurement</italic> (Flake and Fried, <xref ref-type="bibr" rid="B12">2020</xref>). In any study, methodological decisions must be made regarding who will be measured (sampling), how the data should be observed (test characteristics), and how the data will be employed (test purpose). In much contemporary research, however, methodology has become detached from theories about the construct, with (a) convenience sampling, (b) self-report as the default method of data collection, and (c) coefficient alpha, factor analysis, and correlational procedures as the default analyses for evaluating the quality of item response data (Maul, <xref ref-type="bibr" rid="B26">2017</xref>).</p>
<p>The major paradigm for psychological testing historically has been to select persons for entrance to educational, business, and military settings on the basis of individuals&#x00027; measured traits. Consequently, test developers have favored trait-based items and tasks designed to discriminate among individuals and predict future performance (Dawis, <xref ref-type="bibr" rid="B9">1987</xref>; Danziger, <xref ref-type="bibr" rid="B8">1990</xref>) and have sought items that maximize stability over time and detection of individual differences. For other testing purposes, however, this paradigm can reduce power.</p>
<p>Even when the goal of a test is to detect intervention effects (Lipsey, <xref ref-type="bibr" rid="B24">1983</xref>, <xref ref-type="bibr" rid="B23">1990</xref>; Tryon, <xref ref-type="bibr" rid="B35">1991</xref>; Meier, <xref ref-type="bibr" rid="B27">1994</xref>) test developers and users may default to selection-based testing procedures. Stinchfield et al. (<xref ref-type="bibr" rid="B33">2007</xref>) created the Gambling Treatment Outcome Monitoring System (GAMTOMS), a measure intended to assess changes in gambling behaviors following treatment. Using 286 participants (including 237 gambling treatment clients) in 2 studies, Stinchfield et al. provided evidence for the GAMTOMS&#x00027; internal consistency, 1-week test-retest reliability, content validity, convergent validity, discriminant validity, predictive validity, and construct validity. Change-sensitivity would appear to be a critical criterion for evaluating the GAMTOMS&#x00027; intended purpose, that is, the power to detect change in gambling behaviors following intervention. With the exception of a single item examining stages of change, no analyses evaluated whether GAMTOMS&#x00027; items or scores could detect change over time or in response to an intervention.</p>
</sec>
<sec sec-type="conclusions" id="s4">
<title>Conclusion</title>
<p>Hirsch (<xref ref-type="bibr" rid="B14">2009</xref>) provided a historical perspective regarding how scientists in any scientific domain make progress in measurement.</p>
<disp-quote><p>A young discipline is bound to move first through the data it can gather most easily. And as it does, it also defines more exactly what it must measure to test its theories. As the low-hanging fruit vanish, and the most precious of fruits are spotted high above, bigger investments in harvesting equipment become necessary.</p></disp-quote>
<p>Psychology has harvested its low-hanging fruit, primarily through self-report, interview, and experimental methodologies that simply operationalize (rather than evaluate) constructs. We challenge researchers and test developers to (a) evaluate a measurement issue in every study you conduct, (b) build a knowledge base and accompanying questions about how method affects findings with the constructs you research, and (c) implement a new measurement procedure during pilot studies. Noteworthy examples in this regard can found in (a) <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.911629">Charamut&#x00027;s et al.</ext-link> description of the trait, context, and source effects in measurement of youth mental health, (b) Dohrenwend&#x00027;s (<xref ref-type="bibr" rid="B10">2006</xref>) discussion of intracategory variability on stress self-report measures and a possible solution with narrative rating scales, and (c) Tryon&#x00027;s (<xref ref-type="bibr" rid="B35">1991</xref>) analysis of how trait and state effects can be separated and detected in a single dataset.</p>
<p>As a field we must systematically step back and think more deeply about how to measure and better detect the effects of psychological phenomena of interest. Failure to pursue new directions means that research crises such as the replication problem will recur. The studies in this Research Topic, summarized below, offer examples of innovative possibilities in psychological measurement.</p>
</sec>
<sec id="s5">
<title>Summary of Research Topic manuscripts</title>
<sec>
<title>Multi-informant reports</title>
<p><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.911629">Charamut et al.</ext-link> observed that assessment of youth mental health problems typically involves data collection from multiple informants that can vary substantially. One explanation is situational specificity: Children and adolescents vary in the situations where they display problem behaviors, and observers such as teachers and parents vary in where they observe these behaviors. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.911629">Charamut et al.</ext-link> presents a sophisticated evaluation of Kraemer&#x00027;s et al. (2003) Satellite Model that consists of the context in which an informant observes the youth undergoing evaluation as well as the source of data (e.g., self vs. other). Users of this approach select informants who vary in their contexts and perspectives, thus allowing for a third component (i.e., trait) to reflect common variance, aspects that generalize across informants&#x00027; contexts and perspectives. Thus, the Satellite Model examines both common variance (i.e., trait) and domain-relevant unique variance (i.e., context and perspectives).</p>
<p><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.911629">Charamut&#x00027;s et al.</ext-link> research employed 134 clinical and community adolescents ages 14&#x02013;15 and their parents who completed six parallel measures of adolescent mental health. The measures assessed social anxiety, social phobia, fear of evaluation, work and social adjustments, and depression. Adolescents also participated in a simulated social interaction observed by a third, untrained informant who completed the same six measures. This design was based on research showing discrepancies between parent and adolescent reports of adolescent social interaction and allowed the researchers to make predictions about specific results that should and should not occur. Using Principal Components Analysis, they found that &#x0201C;all informants&#x00027; reports loaded positively onto the trait component, informants&#x00027; reports from different contexts (i.e., parent vs. UUO) loaded onto the context component in opposite directions, and adolescent self-reports loaded onto the perspective component in a direction opposite of the loadings observed from the two observer informants (i.e., parent and UUO).&#x0201D; Interestingly, patterns of reports by source tended to evidence similar ranks across domains (e.g., parent &#x0003E; teacher; youth &#x0003C; parent).</p>
</sec>
<sec>
<title>Measurement invariance</title>
<p><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.931296">De Los Reyes et al.</ext-link> noted that studies of measurement invariance attempt to determine whether irrelevant conditions influence the function of measurement devices. These irrelevant conditions should not contain variance related to understanding measurement in the domain is being measured; the author&#x00027; example of irrelevant conditions was cultural/racial background during the measurement of intelligence. <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.931296">De Los Reyes et al.</ext-link> apply this reasoning in the area of youth mental health and the well-known problems of informant discrepancies where reports about a child&#x00027;s social, emotional, and behavioral problems often evidence differences as assessed by the child, parent, teacher, and other professionals. In the authors&#x00027; view, the key is to identify sources of both common and unique variance in informants&#x00027; reports, and they illustrate both problems and opportunities to improve youth measurement using this approach. Their key takeaway is that &#x0201C;Efforts to distinguish between domain-relevant and domain-irrelevant measurement conditions should precede use of measurement invariance techniques.&#x0201D;</p>
</sec>
<sec>
<title>Working Alliance Inventory psychometric properties</title>
<p><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.945294">Paap et al.</ext-link> examined the psychometric properties of the Working Alliance Inventory (WAI) <italic>via</italic> a review of 66 studies published during 1989&#x02013;2021. The WAI is the most frequently employed measure for studying the working alliance, the connection between client and therapist that has been empirically demonstrated to be related to therapy outcomes. Sample sizes of review studies ranged from 8 to 1,786 participants; mean age ranged from 6 to 98 years; and WAI studies were conducted in 23 countries and 16 languages. Using COSMIN criteria, they found that evidence for measurement properties was lacking in most studies. This includes a lack of evidence for content validity, factor structure, and reliability estimates; <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.945294">Paap et al.</ext-link> also reported conflicting evidence for divergent (discriminant) validity. The authors concluded that further research is needed regarding the theoretical framework underlying the measurement of the working alliance.</p>
</sec>
<sec>
<title>MIMIC model for cognitive neuroscience</title>
<p><ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.943613">Rosen et al.</ext-link> noted that while cognitive neuroscience has provided methods that enhance detection of signal-to-noise ratio from neuroimaging data, problems remain in summarizing behavioral data using aggregated scores, and item response theory (IRT). <ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fpsyg.2022.943613">Rosen et al.</ext-link> also observed that differential item functioning (DIF) can be present with cognitive neuroscience data and that techniques such as the Multiple Indicator Multiple Cause (MIMIC) model can identify and cope with these issues. Previous research has applied the MIMIC model to explore brain-behavior relationships (Kievit et al., <xref ref-type="bibr" rid="B17">2011</xref>, <xref ref-type="bibr" rid="B18">2012</xref>), allowing researchers to model an individual&#x00027;s cognitive ability onto their brain volume. Similarly, this research, using simulations and an empirical study, demonstrated how measurement techniques used to describe brain-behavior relationships can improve statistical power.</p>
</sec>
</sec>
<sec sec-type="author-contributions" id="s6">
<title>Author contributions</title>
<p>The author confirms being the sole contributor of this work and has approved it for publication.</p>
</sec>
</body>
<back>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s7">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Benjamin</surname> <given-names>L. T.</given-names></name> <name><surname>Baker</surname> <given-names>D. B.</given-names></name></person-group> (<year>2009</year>). <article-title>Recapturing a context for psychology: the role of history</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>4</volume>, <fpage>97</fpage>&#x02013;<lpage>98</lpage>. <pub-id pub-id-type="doi">10.1111/j.1745-6924.2009.01097.x</pub-id><pub-id pub-id-type="pmid">26158840</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bianchi</surname> <given-names>R.</given-names></name> <name><surname>Schonfeld</surname> <given-names>I. S.</given-names></name> <name><surname>Larent</surname> <given-names>E.</given-names></name></person-group> (<year>2015</year>). <article-title>Burnout-depression overlap: a review</article-title>. <source>Clin. Psychol. Rev.</source> <volume>36</volume>, <fpage>28</fpage>&#x02013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1016/j.cpr.2015.01.004</pub-id><pub-id pub-id-type="pmid">25638755</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brown</surname> <given-names>S. D.</given-names></name> <name><surname>Furrow</surname> <given-names>D.</given-names></name> <name><surname>Hill</surname> <given-names>D. F.</given-names></name> <name><surname>Gable</surname> <given-names>J. C.</given-names></name> <name><surname>Porter</surname> <given-names>L. P.</given-names></name> <name><surname>Jacobs</surname> <given-names>W. J.</given-names></name></person-group> (<year>2014</year>). <article-title>A duty to describe: better the devil you know than the devil you don&#x00027;t</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>9</volume>, <fpage>626</fpage>&#x02013;<lpage>640</lpage>. <pub-id pub-id-type="doi">10.1177/1745691614551749</pub-id><pub-id pub-id-type="pmid">26186113</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Campbell</surname> <given-names>D. T.</given-names></name> <name><surname>Fiske</surname> <given-names>D. W.</given-names></name></person-group> (<year>1959</year>). <article-title>Convergent and discriminant validation by the multitrait-multimethod matrix</article-title>. <source>Psychol. Bull.</source> <volume>56</volume>, <fpage>81</fpage>&#x02013;<lpage>105</lpage>. <pub-id pub-id-type="doi">10.1037/h0046016</pub-id><pub-id pub-id-type="pmid">13634291</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>J.</given-names></name></person-group> (<year>1992</year>). <article-title>A power primer</article-title>. <source>Psychol. Bull.</source> <volume>112</volume>, <fpage>155</fpage>&#x02013;<lpage>159</lpage>. <pub-id pub-id-type="doi">10.1037/0033-2909.112.1.155</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cone</surname> <given-names>J. D.</given-names></name> <name><surname>Foster</surname> <given-names>S. L.</given-names></name></person-group> (<year>1991</year>). <article-title>Training in measurement: always the bridesmaid</article-title>. <source>Am. Psychol.</source> <volume>46</volume>, <fpage>653</fpage>&#x02013;<lpage>654</lpage>. <pub-id pub-id-type="doi">10.1037/0003-066X.46.6.653</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cote</surname> <given-names>J. A.</given-names></name> <name><surname>Buckley</surname> <given-names>R.</given-names></name></person-group> (<year>1987</year>). <article-title>Estimating trait, method, and error variance: generalizing across 70 construct validation studies</article-title>. <source>J. Market. Res.</source> <volume>24</volume>, <fpage>315</fpage>&#x02013;<lpage>318</lpage>. <pub-id pub-id-type="doi">10.1177/002224378702400308</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Danziger</surname> <given-names>K.</given-names></name></person-group> (<year>1990</year>). <source>Constructing the Subject: Historical Origins of Psychological Research</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press.</publisher-name> <pub-id pub-id-type="doi">10.1017/CBO9780511524059</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dawis</surname> <given-names>R. V.</given-names></name></person-group> (<year>1987</year>). <article-title>Scale construction</article-title>. <source>J. Counsel. Psychol.</source> <volume>34</volume>, <fpage>481</fpage>&#x02013;<lpage>489</lpage>. <pub-id pub-id-type="doi">10.1037/0022-0167.34.4.481</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dohrenwend</surname> <given-names>B. P.</given-names></name></person-group> (<year>2006</year>). <article-title>Inventorying stressful life vents as risk factors for psychopathology: toward resolution of the problem of intracategory variability</article-title>. <source>Psychol. Bull.</source> <volume>132</volume>, <fpage>477</fpage>&#x02013;<lpage>495</lpage>. <pub-id pub-id-type="doi">10.1037/0033-2909.132.3.477</pub-id><pub-id pub-id-type="pmid">16719570</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Drummond</surname> <given-names>G. B.</given-names></name> <name><surname>Vowler</surname> <given-names>S. L.</given-names></name></person-group> (<year>2012</year>). <article-title>Not different is not the same as the same: how can we tell?</article-title> <source>J. Physiol.</source> <volume>590</volume>, <fpage>5257</fpage>&#x02013;<lpage>5260</lpage>. <pub-id pub-id-type="doi">10.1113/jphysiol.2012.244442</pub-id><pub-id pub-id-type="pmid">23118061</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Flake</surname> <given-names>J. K.</given-names></name> <name><surname>Fried</surname> <given-names>E. I.</given-names></name></person-group> (<year>2020</year>). <article-title>Measurement schmeasurement: questionable measure practices and how to avoid them</article-title>. <source>Adv. Methods Pract. Psychol. Sci.</source> <volume>3</volume>, <fpage>456</fpage>&#x02013;<lpage>465</lpage>. <pub-id pub-id-type="doi">10.1177/2515245920952393</pub-id></citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fl&#x000FC;ckiger</surname> <given-names>C.</given-names></name> <name><surname>Del Re</surname> <given-names>A. C.</given-names></name> <name><surname>Wampold</surname> <given-names>B. E.</given-names></name> <name><surname>Horvath</surname> <given-names>A. O.</given-names></name></person-group> (<year>2018</year>). <article-title>The alliance in adult psychotherapy: a meta-analytic synthesis</article-title>. <source>Psychother. Theory Res. Pract.</source> <volume>55</volume>, <fpage>316</fpage>&#x02013;<lpage>340</lpage>. <pub-id pub-id-type="doi">10.1037/pst0000172</pub-id><pub-id pub-id-type="pmid">29792475</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Hirsch</surname> <given-names>A. E.</given-names></name></person-group> (<year>2009</year>). <source>A New Kind of Big Science.</source> Retrieved from: <ext-link ext-link-type="uri" xlink:href="https://archive.nytimes.com/opinionator.blogs.nytimes.com/2009/01/13/guest-column-a-new-kind-of-big-science/">https://archive.nytimes.com/opinionator.blogs.nytimes.com/2009/01/13/guest-column-a-new-kind-of-big-science/</ext-link> (accessed December 26, 2022).</citation>
</ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Horvath</surname> <given-names>A. O.</given-names></name> <name><surname>Greenberg</surname> <given-names>L. S.</given-names></name></person-group> (<year>1989</year>). <article-title>Development and validation of the working alliance inventory</article-title>. <source>J. Counsel. Psychol.</source> <volume>36</volume>, <fpage>223</fpage>&#x02013;<lpage>233</lpage>. <pub-id pub-id-type="doi">10.1037/0022-0167.36.2.223</pub-id><pub-id pub-id-type="pmid">29733745</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Houle</surname> <given-names>T. T.</given-names></name> <name><surname>Donald</surname> <given-names>B.</given-names></name> <name><surname>Penzien</surname> <given-names>D. B.</given-names></name> <name><surname>Chris</surname> <given-names>K.</given-names></name> <name><surname>Houle</surname> <given-names>C. K.</given-names></name></person-group> (<year>2005</year>). <article-title>Statistical power and sample size estimation for headache research: an overview and power calculation tools</article-title>. <source>Headache</source> <volume>45</volume>, <fpage>414</fpage>&#x02013;<lpage>418</lpage>. <pub-id pub-id-type="doi">10.1111/j.1526-4610.2005.05092.x</pub-id><pub-id pub-id-type="pmid">15953257</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kievit</surname> <given-names>R. A.</given-names></name> <name><surname>Romeijn</surname> <given-names>J.-W.</given-names></name> <name><surname>Waldorp</surname> <given-names>L. J.</given-names></name> <name><surname>Wicherts</surname> <given-names>J. M.</given-names></name> <name><surname>Scholte</surname> <given-names>H. S.</given-names></name> <name><surname>Borsboom</surname> <given-names>D.</given-names></name></person-group> (<year>2011</year>). <article-title>Modeling mind and matter: reductionism and psychological measurement in cognitive neuroscience</article-title>. <source>Psychol. Inq.</source> <volume>22</volume>, <fpage>139</fpage>&#x02013;<lpage>157</lpage>. <pub-id pub-id-type="doi">10.1080/1047840X.2011.567962</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kievit</surname> <given-names>R. A.</given-names></name> <name><surname>Rooijen</surname> <given-names>H.</given-names></name> <name><surname>van Wicherts</surname> <given-names>J. M.</given-names></name> <name><surname>Waldorp</surname> <given-names>L. J.</given-names></name> <name><surname>Kan</surname> <given-names>K.-J.</given-names></name> <name><surname>Scholte</surname> <given-names>H. S.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Intelligence and the brain: A model-based approach</article-title>. <source>Cogn. Neurosci.</source> <volume>3</volume>, <fpage>89</fpage>&#x02013;<lpage>97</lpage>. <pub-id pub-id-type="doi">10.1080/17588928.2011.628383</pub-id><pub-id pub-id-type="pmid">24168689</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kuhn</surname> <given-names>T. S.</given-names></name></person-group> (<year>1970</year>). <source>The Structure of Scientific Revolutions.</source> <publisher-loc>Chicago, IL</publisher-loc>: <publisher-name>University of Chicago Press</publisher-name>.</citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ledgerwood</surname> <given-names>A.</given-names></name></person-group> (<year>2014</year>). <article-title>Introduction to the special section on advancing our methods and practices</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>9</volume>, <fpage>275</fpage>&#x02013;<lpage>277</lpage>. <pub-id pub-id-type="doi">10.1177/1745691614529448</pub-id><pub-id pub-id-type="pmid">26173263</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lilienfeld</surname> <given-names>S. O.</given-names></name></person-group> (<year>2017</year>). <article-title>Clinical psychological science: then and now</article-title>. <source>Clinical Psychological Science</source> <volume>5</volume>, <fpage>3</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1177/2167702616673363</pub-id></citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lilienfeld</surname> <given-names>S. O.</given-names></name> <name><surname>Strother</surname> <given-names>A. N.</given-names></name></person-group> (<year>2020</year>). <article-title>Psychological measurement and the replication crisis: four sacred cows</article-title>. <source>Can. Psychol.</source> <volume>61</volume>, <fpage>281</fpage>&#x02013;<lpage>288</lpage>. <pub-id pub-id-type="doi">10.1037/cap0000236</pub-id></citation>
</ref>
<ref id="B23">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lipsey</surname> <given-names>M.</given-names></name></person-group> (<year>1990</year>). <source>Design Sensitivity</source>. <publisher-loc>Newbury Park, CA</publisher-loc>: <publisher-name>Sage</publisher-name>.</citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lipsey</surname> <given-names>M. W.</given-names></name></person-group> (<year>1983</year>). <article-title>A scheme for assessing measurement sensitivity in program evaluation and other applied research</article-title>. <source>Psychol. Bull.</source> <volume>94</volume>, <fpage>152</fpage>&#x02013;<lpage>165</lpage>. <pub-id pub-id-type="doi">10.1037/0033-2909.94.1.152</pub-id><pub-id pub-id-type="pmid">6622618</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maslach</surname> <given-names>C.</given-names></name> <name><surname>Jackson</surname> <given-names>S. E.</given-names></name></person-group> (<year>1981</year>). <article-title>The measurement of experienced burnout</article-title>. <source>J. Organ. Behav.</source> <volume>2</volume>, <fpage>99</fpage>&#x02013;<lpage>113</lpage>. <pub-id pub-id-type="doi">10.1002/job.4030020205</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maul</surname> <given-names>A.</given-names></name></person-group> (<year>2017</year>). <article-title>Rethinking traditional methods of survey validation</article-title>. <source>Measure. Interdiscipl. Res. Perspect.</source> <volume>15</volume>, <fpage>51</fpage>&#x02013;<lpage>69</lpage>. <pub-id pub-id-type="doi">10.1080/15366367.2017.1348108</pub-id></citation>
</ref>
<ref id="B27">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Meier</surname> <given-names>S. T.</given-names></name></person-group> (<year>1994</year>). <source>The Chronic Crisis in Psychological Measurement and Assessment</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Academic Press</publisher-name>.</citation>
</ref>
<ref id="B28">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Meier</surname> <given-names>S. T.</given-names></name></person-group> (<year>2008</year>). <source>Measuring Change in Counseling and Psychotherapy</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Guilford Press</publisher-name>.</citation>
</ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meier</surname> <given-names>S. T.</given-names></name> <name><surname>Feeley</surname> <given-names>T. H.</given-names></name></person-group> (<year>2021</year>). <article-title>Ceiling effects suggest a threshold structure for working alliance</article-title>. <source>J. Counsel. Psychol</source>. <volume>69</volume>, <fpage>235</fpage>&#x02013;<lpage>245</lpage>. <pub-id pub-id-type="doi">10.1037/cou0000564</pub-id><pub-id pub-id-type="pmid">34292029</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nosek</surname> <given-names>B. A.</given-names></name> <name><surname>Spies</surname> <given-names>J. R.</given-names></name> <name><surname>Motyl</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>Scientific utopia: II. Restructuring incentives and practices to promote truth over publishability</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>7</volume>, <fpage>615</fpage>&#x02013;<lpage>631</lpage>. <pub-id pub-id-type="doi">10.1177/1745691612459058</pub-id><pub-id pub-id-type="pmid">26168121</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Palermo</surname> <given-names>E.</given-names></name></person-group> (<year>2017</year>). <source>Who Invented the Light Bulb?</source> Retrieved from: <ext-link ext-link-type="uri" xlink:href="https://www.livescience.com/43424-who-invented-the-light-bulb.html">www.livescience.com/43424-who-invented-the-light-bulb.html</ext-link> (accessed December 26, 2022).</citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scheel</surname> <given-names>A. M.</given-names></name> <name><surname>Tiokhin</surname> <given-names>L.</given-names></name> <name><surname>Isaager</surname> <given-names>P. M.</given-names></name> <name><surname>Lakens</surname> <given-names>D.</given-names></name></person-group> (<year>2021</year>). <article-title>Why hypothesis testers should spend less time testing hypotheses</article-title>. <source>Perspect. Psychol. Sci.</source> <volume>16</volume>, <fpage>744</fpage>&#x02013;<lpage>755</lpage>. <pub-id pub-id-type="doi">10.1177/1745691620966795</pub-id><pub-id pub-id-type="pmid">33326363</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stinchfield</surname> <given-names>R.</given-names></name> <name><surname>Winters</surname> <given-names>K. C.</given-names></name> <name><surname>Botzet</surname> <given-names>A.</given-names></name> <name><surname>Jerstad</surname> <given-names>S.</given-names></name> <name><surname>Breyer</surname> <given-names>J.</given-names></name></person-group> (<year>2007</year>). <article-title>Development and psychometric evaluation of the gambling treatment outcome monitoring system (GAMTOMS)</article-title>. <source>Psychol. Addict. Behav.</source> <volume>21</volume>, <fpage>174</fpage>&#x02013;<lpage>184</lpage>. <pub-id pub-id-type="doi">10.1037/0893-164X.21.2.174</pub-id><pub-id pub-id-type="pmid">17563137</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Torgerson</surname> <given-names>W. S.</given-names></name></person-group> (<year>1958</year>). <source>Theory and Methods of Scaling.</source> <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Wiley</publisher-name>.</citation>
</ref>
<ref id="B35">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Tryon</surname> <given-names>W. W.</given-names></name></person-group> (<year>1991</year>). <source>Activity Measurement in Psychology and Medicine</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Plenum</publisher-name>. <pub-id pub-id-type="doi">10.1007/978-1-4757-9003-0</pub-id></citation>
</ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ulrich</surname> <given-names>R.</given-names></name> <name><surname>Miller</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). <article-title>Questionable research practices may have little effect on replicability</article-title>. <source>eLife</source> <volume>9</volume>, <fpage>e58237</fpage>. <pub-id pub-id-type="doi">10.7554/eLife.58237.sa2</pub-id><pub-id pub-id-type="pmid">32930092</pub-id></citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Williams</surname> <given-names>L. J.</given-names></name> <name><surname>Cote</surname> <given-names>J. A.</given-names></name> <name><surname>Buckley</surname> <given-names>M. R.</given-names></name></person-group> (<year>1989</year>). <article-title>Lack of method variance in self-reported affect and perceptions at work: reality or artifact?</article-title> <source>J. Appl. Psychol.</source> <volume>74</volume>, <fpage>462</fpage>&#x02013;<lpage>468</lpage>. <pub-id pub-id-type="doi">10.1037/0021-9010.74.3.462</pub-id></citation>
</ref>
</ref-list> 
</back>
</article> 