<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Psychol.</journal-id>
<journal-title>Frontiers in Psychology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Psychol.</abbrev-journal-title>
<issn pub-type="epub">1664-1078</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fpsyg.2025.1605110</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Psychology</subject>
<subj-group>
<subject>Review</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Augmenting art crossmodally: possibilities and pitfalls</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Spence</surname> <given-names>Charles</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1194/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Di Stefano</surname> <given-names>Nicola</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/394017/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Crossmodal Research Laboratory, Department of Experimental Psychology, Oxford University</institution>, <addr-line>Oxford</addr-line>, <country>United Kingdom</country></aff>
<aff id="aff2"><sup>2</sup><institution>Institute of Cognitive Sciences and Technologies, National Research Council</institution>, <addr-line>Rome</addr-line>, <country>Italy</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Steven Brown, McMaster University, Canada</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Brigitte R&#x000F6;der, University of Hamburg, Germany</p>
<p>David Howes, Concordia University, Canada</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Charles Spence <email>charles.spence&#x00040;psy.ox.ac.uk</email></corresp>
<fn fn-type="other" id="fn001"><p>&#x02020;ORCID: Charles Spence <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0003-2111-072X">orcid.org/0000-0003-2111-072X</ext-link></p></fn>
<fn fn-type="other" id="fn002"><p>Nicola Di Stefano <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0002-9286-0395">orcid.org/0000-0002-9286-0395</ext-link></p></fn></author-notes>
<pub-date pub-type="epub">
<day>14</day>
<month>07</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<volume>16</volume>
<elocation-id>1605110</elocation-id>
<history>
<date date-type="received">
<day>02</day>
<month>04</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>05</day>
<month>06</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2025 Spence and Di Stefano.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Spence and Di Stefano</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>In this narrative historical review, we take a closer look at the question of whether it is possible to augment works of art through crossmodal (specifically audiovisual) means. We start by highlighting an important distinction between three classes of audiovisual crossmodal correspondence: Namely those operating on individual sensory stimuli (so-called basic correspondences), those operating on dynamically-changing stimuli, or else on combinations of unisensory stimuli (so-called mid-level correspondences), and those operating on complex and often aesthetically-meaningful stimuli, such as music and paintings. We also highlight another important distinction between the literature on crossmodal matching and that dedicated to demonstrating crossmodal effects. The latter distinction aligns, in some sense, onto the distinction between crossmodal mapping and crossmodal effects. Although it may not be possible, in any meaningful sense, to translate works of art from one modality into another, that does not deny the possibility of augmenting a work of art by the deliberate addition of stimulation presented to another sensory modality. The aims and objectives of those who have attempted to augment works of art by introducing additional sensory stimulation are discussed. We also draw attention to a number of challenges and/or pitfalls (such as the distraction offered by recourse to the phenomenon of synaesthesia) for those interested in augmenting auditory/visual art crossmodally.</p></abstract>
<kwd-group>
<kwd>art</kwd>
<kwd>audiovisual</kwd>
<kwd>crossmodal</kwd>
<kwd>Gestalt</kwd>
<kwd>intersensoriality</kwd>
<kwd>multisensory</kwd>
<kwd>sensory translation</kwd>
<kwd>sensory augmentation</kwd>
</kwd-group>
<contract-num rid="cn001">AH/L007053/1</contract-num>
<contract-sponsor id="cn001">Arts and Humanities Research Council<named-content content-type="fundref-id">https://doi.org/10.13039/501100000267</named-content></contract-sponsor>
<counts>
<fig-count count="0"/>
<table-count count="4"/>
<equation-count count="0"/>
<ref-count count="345"/>
<page-count count="21"/>
<word-count count="20792"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Performance Science</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1 Introduction</title>
<p>In recent decades, there has been a growing awareness within the cognitive neurosciences that the senses do not operate in isolation (Barlow and Mollon, <xref ref-type="bibr" rid="B17">1982</xref>), and that crossmodal interactions are the norm, not the exception (see Calvert et al., <xref ref-type="bibr" rid="B49">2004</xref>).<xref ref-type="fn" rid="fn0001"><sup>1</sup></xref> Crossmodality refers to the way in which the stimuli presented in one sense may influence people&#x00027;s perception of the stimuli that happen to be presented in another sense. Crossmodality is different from the term multisensory, with the latter referring to those situations when the senses come together to deliver a multisensory representation of an object or event (Stein et al., <xref ref-type="bibr" rid="B297">2010</xref>). Classic examples of multisensory perception include everything from audiovisual speech perception to multisensory flavor perception. Crossmodal effects, by contrast, are more often observed when seemingly unrelated unisensory stimuli are presented at around the same time, such as when the presentation of a bright light makes a simultaneously-presented sound appear louder than it otherwise would (see Spence, <xref ref-type="bibr" rid="B281">2018</xref>, for a review).</p>
<p>One of the most exciting recent areas in crossmodal research concerns the crossmodal correspondences. This is the name given to the almost synaesthesia-like surprising connections between features, attributes, or dimensions of experience presented in different senses. Often confused with synaesthesia (in that both phenomena are surprising when first you hear about them), they are fundamentally different (see Deroy and Spence, <xref ref-type="bibr" rid="B80">2013</xref>). For instance, there is typically a crossmodal association but no inducer-concurrent mapping in the case of crossmodal correspondences. Furthermore, crossmodal correspondences tend to be consensually shared across groups of people, whereas the inducer-concurrent mapping in the context of synaesthesia is, by definition, idiosyncratic between one synaesthete and the next. It is the crossmodal correspondences that we are interested.</p>
<p>In this narrative historical review, we first introduce an important distinction between three classes of audiovisual crossmodal correspondence: Namely those operating on individual sensory stimuli (so-called basic correspondences), those operating on dynamically-changing stimuli, or else on combinations of unisensory stimuli (so-called mid-level correspondences), and those operating on complex and often aesthetically-meaningful stimuli, such as music and paintings (Section 1.1). We also highlight an important distinction between the literature on crossmodal matching and that dedicated to demonstrating crossmodal effects. The latter distinction aligns, in some sense, with the distinction between crossmodal mapping and crossmodal effects. In Section 1.2, we briefly consider the kinds of explanations that have been put forward to account for crossmodal correspondences to date. Thereafter, in Sections 2&#x02013;4, we review the evidence concerning audiovisual crossmodal matching and crossmodal effects operating at the level of basic, mid-level, and complex crossmodal correspondences, respectively.</p>
<p>In Section 5, we build on this groundwork by reflecting on the putative existence of crossmodal art. We take a closer look at the question of whether it is possible to augment works of art through crossmodal (specifically audiovisual) means. We show how the phenomenon of synaesthesia, at least as conceptualized by cognitive neuroscientists fails to enlighten theorizing when it comes to thinking about the social/structural construction of meaning in the context of the crossmodal augmentation of art (Dimova, <xref ref-type="bibr" rid="B89">2024</xref>; L&#x000E9;vi-Strauss, <xref ref-type="bibr" rid="B165">1997</xref>; Spence and Deroy, <xref ref-type="bibr" rid="B286">2013</xref>). We discuss how although it may not be possible, in any meaningful sense, to translate works of art from one modality into another (see Spence and Di Stefano, <xref ref-type="bibr" rid="B289">2024a</xref>), that does not deny the possibility of augmenting a work of art by the deliberate addition of stimulation presented to another sensory modality. Finally, we discuss the aims and objectives of those who have attempted to augment works of art by introducing additional sensory stimulation. The intriguing notion of intersensoriality is also outlined. This term has been introduced by Howes (<xref ref-type="bibr" rid="B135">2025</xref>) to refer to art that crosses sensory borders (Classen, <xref ref-type="bibr" rid="B65">1998</xref>). We draw attention to a number of the challenges and/or pitfalls for those interested in wanting to augment auditory/visual art crossmodally (Section 6). We conclude, in Section 7, by challenging the notion, implicit in much creative practice, that elemental correspondences (that is, crossmodal correspondences based on simple stimulus properties), provide the most relevant insights when it comes to augmenting art crossmodally, given that works of art are mostly complex, multi-elemental, and rich in semantic and emotional meaning.</p>
<sec>
<title>1.1 Basic, mid-level, and complex crossmodal correspondences: a novel typology</title>
<p>Audiovisual crossmodal correspondences can be grouped into different classes based on the (perceived) complexity of the stimuli involved. In particular, while some researchers have chosen to study crossmodal correspondences between basic (i.e., individual) sensory stimuli (i.e., specific features or dimensions), others have chosen to focus on crossmodal correspondences (and Gestalt grouping) in the case of what might be called mid-level correspondences instead (i.e., those correspondences that are experienced between combinations of unisensory stimuli, such as an auditory melody and a sequence of visual stimuli).<xref ref-type="fn" rid="fn0002"><sup>2</sup></xref> A third group of studies, meanwhile, has focused on crossmodal correspondences involving more complex and often semantically-meaningful stimuli, such as, for example, works of art (primarily paintings and classical music) and other popular entertainment media (such as film clips, TV adverts, and popular music selections). While the majority of the research that has been published to date has tended to focus on those correspondences that can be demonstrated between stimuli at the same level of complexity, a number of studies have also investigated crossmodal correspondences operating across levels (as when color properties are associated with pieces of music).</p>
<p>Basic audiovisual crossmodal correspondences operate between discrete unimodal sensory stimuli, such as, for example, pure tones, color patches, and sounds and lights varying in terms of their intensity (i.e., loudness and brightness). It has sometimes been suggested that certain of these basic crossmodal correspondences may reflect the existence of amodal sensory dimensions, such as intensity, shape, texture, brightness, etc. (e.g., Bond and Stevens, <xref ref-type="bibr" rid="B35">1969</xref>; Ellermeier et al., <xref ref-type="bibr" rid="B96">2021</xref>; Lewkowicz and Turkewitz, <xref ref-type="bibr" rid="B166">1980</xref>; Marks et al., <xref ref-type="bibr" rid="B190">1987</xref>; Root and Ross, <xref ref-type="bibr" rid="B252">1965</xref>). However, that possibility has been rejected recently, at least in the case of sensory correspondences (see Spence and Di Stefano, <xref ref-type="bibr" rid="B290">2024b</xref>, for a review). Rather, at least according to Spence and Di Stefano, putatively amodal dimensions of experience, such as for sensory intensity, should be reconceptualized in terms of a crossmodal correspondence instead (e.g., between auditory intensity and visually-perceived brightness, say). Similarly, Spence&#x00027;s (<xref ref-type="bibr" rid="B280">2015</xref>) suggestion from a decade ago that the crossmodal correspondences between individual sensory attributes might operate on the basis of perceived similarity has also recently been discounted, at least in the case of audition and vision (Di Stefano and Spence, <xref ref-type="bibr" rid="B85">2024a</xref>).<xref ref-type="fn" rid="fn0003"><sup>3</sup></xref></p>
<p>Mid-level correspondences typically operate between structured combinations of unisensory stimuli, such as short sequences of sounds (that may make up a melody or a spatiotemporally arranged pattern of visual stimuli). Numerous studies, often in the Gestalt tradition, have demonstrated how the grouping of sequences of stimuli presented in one sensory modality may influence the presentation of stimuli presented in the other modality, as for example, in the context of the so-called &#x0201C;crossmodal dynamic capture effect&#x0201D; (Soto-Faraco et al., <xref ref-type="bibr" rid="B275">2003</xref>; see Spence et al., <xref ref-type="bibr" rid="B293">2007</xref>, for a review). Other research, meanwhile, has demonstrated the existence of crossmodal effects in a range of other experimental situations (O&#x00027;Leary and Rhodes, <xref ref-type="bibr" rid="B215">1984</xref>; see also Huang et al., <xref ref-type="bibr" rid="B138">2012</xref>; Rahne et al., <xref ref-type="bibr" rid="B240">2008</xref>; Schwartz et al., <xref ref-type="bibr" rid="B262">2012</xref>). So, for example, crossmodal effects of auditory perceptual grouping on visual perceptual grouping, and <italic>vice versa</italic>, have been reported. At the same time, however, a number of studies have also demonstrated mid-level crossmodal correspondences that are seemingly more metaphorical or analogical in nature (e.g., Ravignani and Sonnweber, <xref ref-type="bibr" rid="B242">2017</xref>; Wagner et al., <xref ref-type="bibr" rid="B317">1981</xref>; though see also Jewanski, <xref ref-type="bibr" rid="B147">2010</xref>; Lindholm, <xref ref-type="bibr" rid="B170">2022</xref>). A separate literature has demonstrated how temporal synchrony, and the presentation of correlated inputs to different senses, can also facilitate crossmodal binding (e.g., Lin et al., <xref ref-type="bibr" rid="B168">2022</xref>; Parise et al., <xref ref-type="bibr" rid="B223">2013</xref>, <xref ref-type="bibr" rid="B225">2012</xref>).</p>
<p>In parallel, various researchers have explored audiovisual correspondences using more complex and semantically-rich stimuli. In such cases, the complexity of the stimuli involved means that it is harder to explain any crossmodal correspondences that are observed based on specific individual auditory/visual physical stimulus attributes/dimensions (e.g., such as frequency and hue; see Duthie, <xref ref-type="bibr" rid="B91">2013</xref>; Duthie and Duthie, <xref ref-type="bibr" rid="B92">2015</xref>), thus leading researchers to suggest the existence of features that may be capable of mediating those associations at more of a cognitive level. So, for instance, Albertazzi et al. (<xref ref-type="bibr" rid="B5">2015</xref>) demonstrated the existence of consistent audiovisual associations between highly-complex stimuli consisting of a series of 15 paintings by an Italian artist and 15 music excerpts from the classic guitar repertoire (e.g., Villas Lobos, Albeniz). These associations were explained using the semantic differential technique based on perceptual and emotional associations (e.g., bright and calm, respectively) (see also Cowles, <xref ref-type="bibr" rid="B74">1935</xref>, for an early study, Spence, <xref ref-type="bibr" rid="B283">2020a</xref>, for a review, and Iosifyan et al., <xref ref-type="bibr" rid="B143">2022</xref>, for a more recent study documenting crossmodal associations between paintings and sounds). According to the &#x0201C;emotional mediation hypothesis&#x0201D;, those stimuli that are presented in different sensory domains are more likely to be matched if they share similar affective meanings (see Spence, <xref ref-type="bibr" rid="B283">2020a</xref>, for a review). This has emerged as one of the most powerful accounts of this class of complex crossmodal correspondences, and is especially powerful in helping to explain audiovisual associations (e.g., Di Stefano et al., <xref ref-type="bibr" rid="B83">2024</xref>, for cross-cultural evidence), though the approach has also been applied to help explain crossmodal associations involving olfactory stimuli (e.g., Di Stefano et al., <xref ref-type="bibr" rid="B88">2022</xref>; Schifferstein and Tanudjaja, <xref ref-type="bibr" rid="B259">2004</xref>; Spence, <xref ref-type="bibr" rid="B284">2020b</xref>) and even taste stimuli as well (Spence and Levitan, <xref ref-type="bibr" rid="B292">2021</xref>).</p>
<p>At the same time, however, a separate distinction can be made between those studies that have merely sought to establish the existence of crossmodal correspondences between the stimuli presented at a particular level (i.e., basic, mid-level, or complex)&#x02014;crossmodal matching studies&#x02014;and those studies that have looked to determine whether there are any crossmodal effects, evidenced by the presence of stimuli in one modality having an impact on those presented in the other modality (see <xref ref-type="table" rid="T1">Table 1</xref>). The general belief amongst researchers would appear to be that there should be an increased probability of crossmodal binding or multisensory integration (occasionally resulting in the perceived unification of the component sensory stimuli; Eisenstein, <xref ref-type="bibr" rid="B95">1947</xref>; Wells, <xref ref-type="bibr" rid="B333">1980</xref>) when stimuli are crossmodally congruent as compared to when presented with crossmodally incongruent pairings instead (Iwamiya, <xref ref-type="bibr" rid="B146">2013</xref>).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Broad categorization of different types of crossmodal correspondence.</p></caption>
<table frame="box" rules="all">
<thead>
<tr style="background-color:#8f9496;color:#ffffff">
<th valign="top" align="left"><bold>Type of correspondence</bold></th>
<th valign="top" align="left"><bold>Stimuli involved</bold></th>
<th valign="top" align="left"><bold>Crossmodal matching (CM)</bold></th>
<th valign="top" align="left"><bold>Crossmodal effect (CE) or multisensory interaction</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Basic (or low-level) correspondence (more sensory than emotional)</td>
<td valign="top" align="left">Pure tone, loudness, pitch; colour patch, hue, brightness.</td>
<td valign="top" align="left">Multiple crossmodal correspondences documented, though disagreement about the pitch-hue mapping.</td>
<td valign="top" align="left">Coupling priors, unity effect (Parise and Spence, <xref ref-type="bibr" rid="B222">2009</xref>)</td>
</tr> <tr>
<td valign="top" align="left">Mid-level correspondence</td>
<td valign="top" align="left">Rhythm, musical mode; colour harmony, spatiotemporal patterns</td>
<td valign="top" align="left">Metaphoric matching, structural similarity; numerosity correspondence, analogical mapping.</td>
<td valign="top" align="left">Gestalt grouping (O&#x00027;Leary and Rhodes, <xref ref-type="bibr" rid="B215">1984</xref>); Modulation of crossmodal dynamic capture effect (Spence, <xref ref-type="bibr" rid="B280">2015</xref>; Spence et al., <xref ref-type="bibr" rid="B293">2007</xref>)</td>
</tr> <tr>
<td valign="top" align="left">Complex (or high- level) more emotional than sensory; Stylistic correspondences (Hasenfus et al., <xref ref-type="bibr" rid="B127">1983</xref>; Siefkes and Arielli, <xref ref-type="bibr" rid="B269">2015</xref>)</td>
<td valign="top" align="left">Painting, piece of music, film clips, TV adverts; ballet/dance (Opera excluded as it is intrinsically audiovisual)</td>
<td valign="top" align="left">Music matched to painting; Painting chosen to match music. Visuals matched to music; movements/body shapes or gesture and music? Light show during concert?</td>
<td valign="top" align="left">Does music the change viewer&#x00027;s interpretation of painting or film? What is the impact of presenting incongruent music? Occasional reports of unification of inputs (e.g., in Scriabin&#x00027;s Prometheus, Wells, <xref ref-type="bibr" rid="B333">1980</xref>; Video jockeys matching visuals to music; e.g., Jean-Michel Jarr&#x000E9;)</td>
</tr></tbody>
</table>
</table-wrap>
<p>While crossmodal correspondences operate between sensory stimuli presented at different levels of complexity (e.g., between individual auditory stimuli and complex visual stimuli), the majority of research that has been published to date has tended to present stimuli at the same level of complexity in both the auditory and visual modalities. Nevertheless, a few studies have been published documenting crossmodal correspondences between stimuli at different levels, such as, for example, demonstrating crossmodal correspondences between short snippets of music and isolated color patches (e.g., Palmer et al., <xref ref-type="bibr" rid="B221">2013</xref>; Whiteford et al., <xref ref-type="bibr" rid="B334">2018</xref>).</p>
<p>There are two possibly controversial claims (or observations) that we wish to make in this review: The first is that the mere existence of a crossmodal correspondence, as revealed by research on crossmodal matching, has no necessary implications for the existence of a crossmodal effect; The second is that the existence of crossmodal correspondences operating between individual sensory stimuli has no necessary implications as far as the existence of crossmodal effects operating between complex stimuli are concerned. That is, many crossmodal effects have been demonstrated in the literature without any consideration of the crossmodal congruency, or otherwise, of the component unimodal stimuli (see London, <xref ref-type="bibr" rid="B180">1954</xref>; Welch and Warren, <xref ref-type="bibr" rid="B331">1980</xref>, <xref ref-type="bibr" rid="B332">1986</xref>, for reviews). So, for example, the presentation of a tone has been reported to enhance the rated brightness of a simultaneously-presented light (Stein et al., <xref ref-type="bibr" rid="B298">1996</xref>; though see Odgaard et al., <xref ref-type="bibr" rid="B213">2003</xref>), or vice versa (Odgaard et al., <xref ref-type="bibr" rid="B214">2004</xref>). In this case, there was absolutely no consideration of whether the frequency of the sound was in any sense congruent (or incongruent) with the color of the light. Meanwhile, many studies have demonstrated how crossmodal effects (albeit different ones) occur in the case of both congruent and incongruent combinations of sensory stimuli (Chen et al., <xref ref-type="bibr" rid="B57">2025</xref>). So, for example, Chen and colleagues reported that auditory stimuli influenced people&#x00027;s aesthetic perception of pictures regardless of the semantic or spatial congruency of the component stimuli.</p>
<p>It sometimes appears to be implicitly assumed by researchers working in the area (i.e., in crossmodal correspondences research) that the existence of a crossmodal effect can be taken as evidence of the existence of a crossmodal correspondence between one or several of the sensory features or dimensions involved. As such, the view that we wish to advocate here is that the existence of a crossmodal correspondence can only be convincingly demonstrated by the results of crossmodal matching studies (that is, in studies where the participants are explicitly asked to pick the best match of the available options that have been provided to them<xref ref-type="fn" rid="fn0004"><sup>4</sup></xref> or, in more refined/informative protocols, to rate the extent to which all of the presented stimuli from different sensory domains match one another). While it would seem likely that crossmodal correspondences, as demonstrated in so many crossmodal matching studies, will likely give rise to measurable differences when a participant&#x00027;s responses to congruent vs. incongruent combinations of crossmodal stimuli are compared (Parise and Spence, <xref ref-type="bibr" rid="B222">2009</xref>, <xref ref-type="bibr" rid="B224">2012</xref>), it is important to remember that many (perhaps even the majority of) crossmodal effects may actually have little to do with crossmodal correspondences.</p>
</sec>
<sec>
<title>1.2 Explaining crossmodal correspondences</title>
<p>Over the years, a number of different mechanisms have been put forward to try to explain crossmodal correspondences (Motoki et al., <xref ref-type="bibr" rid="B201">2023</xref>; Spence, <xref ref-type="bibr" rid="B278">2011</xref>; Spence and Di Stefano, <xref ref-type="bibr" rid="B289">2024a</xref>). These include statistical, structural (perhaps better referred to as physiological correspondences; see Spence and Di Stefano, <xref ref-type="bibr" rid="B289">2024a</xref>), semantic (though again possibly better referred to as &#x0201C;lexical&#x0201D;, see Walker, <xref ref-type="bibr" rid="B319">2012</xref>) as well as the emotional account (Spence, <xref ref-type="bibr" rid="B283">2020a</xref>) (see <xref ref-type="table" rid="T2">Table 2</xref>). It is important to recognize that the most appropriate explanation for a given crossmodal correspondence is to some extent independent of the complexity of the component unisensory stimuli that are involved. That said, claims regarding the existence of amodal sensory and similarity-based crossmodal correspondences have typically been made in relation to individual sensory stimuli. Meanwhile, Gestalt grouping, as well as spatiotemporal synchrony, is more often mentioned as the most probable explanation for any crossmodal effects operating at the level of mid-level correspondences (Spence, <xref ref-type="bibr" rid="B280">2015</xref>). Emotional mediation tends to be the most frequently mentioned explanation of audiovisual correspondences between complex and aesthetically-meaningful stimuli, such as, for example, music and painting/images (e.g., Di Stefano et al., <xref ref-type="bibr" rid="B83">2024</xref>). That said, emotional mediation is occasionally raised as a possible explanation in the case of crossmodal correspondences between individual perceptual stimuli, while basic sensory correspondences (e.g., between pitch and size) could potentially also contribute to what might, at first, be taken as stylistic correspondences (see Duthie, <xref ref-type="bibr" rid="B91">2013</xref>; Duthie and Duthie, <xref ref-type="bibr" rid="B92">2015</xref>; Siefkes and Arielli, <xref ref-type="bibr" rid="B269">2015</xref>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Summary of the various different types of crossmodal correspondence that have been proposed that have to connect auditory and visual stimuli and selected literature sources suggested and/or supported them.</p></caption>
<table frame="box" rules="all">
<thead>
<tr style="background-color:#8f9496;color:#ffffff">
<th valign="top" align="left"><bold>Type of correspondence</bold></th>
<th valign="top" align="left"><bold>Description/explanation</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Structural</td>
<td valign="top" align="left">Correspondence based on structural alignment of stimulus dimensions (e.g., both pitch and hue being circular dimensions)</td>
</tr> <tr>
<td valign="top" align="left">Physical</td>
<td valign="top" align="left">Correspondence based on fact that stimuli themselves are of a similar type (e.g., both sound and light being considered as waves)</td>
</tr> <tr>
<td valign="top" align="left">Physiological<sup>&#x0002A;</sup></td>
<td valign="top" align="left">Correspondence based on similar neural hypothetical similar neural encoding principle (e.g., increases in intensity being represented by increased neural firing)</td>
</tr> <tr>
<td valign="top" align="left">Psychological -perceptual (-Amodal)</td>
<td valign="top" align="left">Correspondence based on perceptual similarity between the component stimuli (sometimes referred to as intersensory equivalence)</td>
</tr> <tr>
<td valign="top" align="left">Mood-based/affective/emotional-mediation</td>
<td valign="top" align="left">Correspondence based on mood/emotion associated with each of the component stimuli being similar</td>
</tr> <tr>
<td valign="top" align="left">Associative/statistical</td>
<td valign="top" align="left">Correspondence based on associative learning/internalization of crossmodal statistical regularities in environment</td>
</tr></tbody>
</table>
<table-wrap-foot>
<p><sup>&#x0002A;</sup>Note that Spence (<xref ref-type="bibr" rid="B278">2011</xref>) originally labeled this category &#x02018;Structural&#x00027;. However, given that this label had already been introduced 20 years earlier by Sebba (<xref ref-type="bibr" rid="B264">1991</xref>) to describe dimensional alignment of perceptual space, it would seem more appropriate to call this category &#x0201C;Physiological&#x0201D;, given that that is the putative cause/source of the correspondence.</p>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec id="s2">
<title>2 Crossmodal correspondences between simple sensory stimuli</title>
<p>One of the most extensively studied areas in research on audiovisual correspondences concerns the associations that have been demonstrated between individual sensory stimuli (or attributes), such as pitch, brightness, or loudness. Over the last 50 years or so, a very wide array of crossmodal matching studies has been published. More recently, a number of studies demonstrating crossmodal effects on perceptual binding as well as in a variety of speeded response tasks that are modulated by the crossmodal correspondences that exist between individual sensory stimuli (attributes or sensory dimensions) have also appeared in the literature; it is to this research that we turn next.</p>
<sec>
<title>2.1 Crossmodal matching</title>
<p>Many published studies have documented crossmodal correspondences between simple stimuli, or stimulus dimensions (see Spence, <xref ref-type="bibr" rid="B278">2011</xref>; Spence and Sathian, <xref ref-type="bibr" rid="B294">2020</xref>, for reviews of the literature on basic audiovisual correspondences). In fact, the literature reveals a multitude of crossmodal correspondences between, for instance, auditory pitch and a variety of visually-presented dimensions: These include size (e.g., Anikin and Johansson, <xref ref-type="bibr" rid="B10">2019</xref>; Bonetti and Costa, <xref ref-type="bibr" rid="B36">2018</xref>; Evans and Treisman, <xref ref-type="bibr" rid="B98">2010</xref>; Gallace and Spence, <xref ref-type="bibr" rid="B108">2006</xref>; Mondloch and Maurer, <xref ref-type="bibr" rid="B199">2004</xref>), shape/angularity (Marks, <xref ref-type="bibr" rid="B187">1987</xref>; Murari et al., <xref ref-type="bibr" rid="B205">2020</xref>; Parise and Spence, <xref ref-type="bibr" rid="B224">2012</xref>), lightness, brightness (e.g., Brunel et al., <xref ref-type="bibr" rid="B45">2015</xref>; Hubbard, <xref ref-type="bibr" rid="B139">1996</xref>; Klapetek et al., <xref ref-type="bibr" rid="B158">2012</xref>; Marks, <xref ref-type="bibr" rid="B186">1974</xref>, <xref ref-type="bibr" rid="B187">1987</xref>, <xref ref-type="bibr" rid="B188">1989</xref>), hue (e.g., Melara, <xref ref-type="bibr" rid="B194">1989</xref>; Spence and Di Stefano, <xref ref-type="bibr" rid="B287">2022a</xref>, for a review) and elevation (Bonetti and Costa, <xref ref-type="bibr" rid="B37">2019</xref>; Wrembel, <xref ref-type="bibr" rid="B339">2009</xref>; Chiou and Rich, <xref ref-type="bibr" rid="B61">2012</xref>), etc. (see also Sun et al., <xref ref-type="bibr" rid="B300">2018</xref>). The existence of such correspondences has often been explained in terms of the statistical or structural/physiological account (Spence, <xref ref-type="bibr" rid="B278">2011</xref>).</p>
<p>There has, however, been widespread disagreement in the case of a putative crossmodal correspondence between auditory pitch and visual hue. Some commentators have been convinced that such a correspondence must exist based on the structural similarity of the underlying sensory dimensions (Gombrich, <xref ref-type="bibr" rid="B116">1960</xref>; Sebba, <xref ref-type="bibr" rid="B264">1991</xref>) while others deny the existence of such a correspondence, or else disagree concerning what the most appropriate mapping would be (Bernstein et al., <xref ref-type="bibr" rid="B26">1971</xref>; Caivano, <xref ref-type="bibr" rid="B48">1994</xref>; Davis, <xref ref-type="bibr" rid="B79">1979</xref>; Garner, <xref ref-type="bibr" rid="B110">1978</xref>; Newton, <xref ref-type="bibr" rid="B209">1704</xref>). At the same time, however, other researchers have argued that methodological issues (i.e., confounding hue and brightness) may have compromised the interpretation of several earlier studies (O&#x00027;Malley, <xref ref-type="bibr" rid="B216">1957</xref>; Simpson et al., <xref ref-type="bibr" rid="B272">1956</xref>; Wicker, <xref ref-type="bibr" rid="B337">1968</xref>). Despite the structural features that might be shared by the visual dimension of hue and the auditory dimension of pitch (e.g., categorical perception in the continuous nature of wavelengths/frequency range scales), as highlighted in the reviews from Spence and Di Stefano (<xref ref-type="bibr" rid="B287">2022a</xref>, <xref ref-type="bibr" rid="B289">2024a</xref>), there would appear to be little agreement as far as which hue should, in fact, be matched with which musical note. What is more, structural issues soon emerge when it comes to explaining color-sound correspondences based on any perceptual, psychophysical, or physical similarity between the two component stimuli. For instance, when considering the color spectrum and audible sound range, the former fundamentally differ in terms of the non-linear distribution of tones along the audible frequency spectrum as compared to colors in the visible range. Moreover, a fundamental property of auditory perception, namely octave periodicity, seems to have no parallel in vision, apart from some metaphorical and indirect usages of the same terms for referring to colors by some commentators (e.g., Pridmore, <xref ref-type="bibr" rid="B237">1992</xref>).</p>
<p>More recent research has drawn attention to the existence of a number of robust crossmodal correspondences between visual features (i.e., surface textures) and auditory timbre, due to the allegedly intrinsic multisensoriality of timbre characterization/semantics (e.g., Reuter et al., <xref ref-type="bibr" rid="B245">2018</xref>; Reymore and Lindsey, <xref ref-type="bibr" rid="B246">2025</xref>; Saitis et al., <xref ref-type="bibr" rid="B258">2020</xref>; Wallmark, <xref ref-type="bibr" rid="B322">2019</xref>).<xref ref-type="fn" rid="fn0005"><sup>5</sup></xref> For instance, Reymore et al. (<xref ref-type="bibr" rid="B247">2023</xref>) demonstrated that timbre semantic associations mostly refer to the visual appearance of objects, such as the material or haptic features of surface textures (e.g., raspy, smooth, brilliant). Several other recent studies have, though, deepened our understanding of the relationship between auditory roughness and shapes, by demonstrating that listeners tend to match more dissonant sounds to spikier and rougher objects/shapes (see also Murari et al., <xref ref-type="bibr" rid="B206">2015</xref>). Giannos et al. (<xref ref-type="bibr" rid="B113">2021</xref>) assessed whether non-tonal and highly dissonant harmonic stimuli are associated with rough images, while more consonant stimuli are associated with the images of low visual roughness. To test this particular hypothesis, the researchers harmonized a fixed melody in seven different styles, including highly tonal, non-tonal, and random variations. The participants in this study were asked to match the melodies to images of variable roughness (i.e., black and white 2D and 3D images that represented surfaces with different degrees smoothness/roughness). In this case, the results confirmed that auditory dissonance was indeed highly correlated with visual roughness (see also Di Stefano and Spence, <xref ref-type="bibr" rid="B84">2022</xref>, for a review).</p>
<p>Other researchers have demonstrated the existence of audiovisual crossmodal correspondences between musical timbre and visual shapes (Adeli et al., <xref ref-type="bibr" rid="B3">2014</xref>; Gurman et al., <xref ref-type="bibr" rid="B120">2021</xref>). For instance, the participants in a study by Adeli et al. (<xref ref-type="bibr" rid="B3">2014</xref>), were asked to select the visual equivalent of a given sound, i.e., its shape, color (or grayscale) and vertical position. A strong association between timbre and visual shapes was observed, with soft timbres being more frequently associated with blue, green or light gray rounded shapes, while harsh timbres were associated with red, yellow or dark gray sharp angular shapes. Importantly, these findings were subsequently replicated by Gurman et al. (<xref ref-type="bibr" rid="B120">2021</xref>).</p>
</sec>
<sec>
<title>2.2 Crossmodal effects</title>
<p>A wide range of crossmodal correspondence effects have been demonstrated across a wide range of experimental paradigms (e.g., Evans and Treisman, <xref ref-type="bibr" rid="B98">2010</xref>; Gallace and Spence, <xref ref-type="bibr" rid="B108">2006</xref>; Parise and Spence, <xref ref-type="bibr" rid="B224">2012</xref>; and see Spence, <xref ref-type="bibr" rid="B278">2011</xref>; Spence and Sathian, <xref ref-type="bibr" rid="B294">2020</xref>, for reviews). So, for example, Parise and Spence (<xref ref-type="bibr" rid="B222">2009</xref>) published a series of psychophysical studies demonstrating increased spatiotemporal integration for crossmodally congruent combinations of auditory and visual stimuli (e.g., relative high-pitched sound with a relatively large visual stimulus) as compared to incongruent combinations (e.g., relatively high pitch and small visual size in an unspeeded psychophysical task).<xref ref-type="fn" rid="fn0006"><sup>6</sup></xref> Sound frequency has also been reported to influence people&#x00027;s responses to color lightness (Hagtvedt and Brasel, <xref ref-type="bibr" rid="B122">2016</xref>).</p>
<p>Other researchers have started to address the question of whether timbre modulates visual perception. So, for example, in a series of two experiments, Wallmark et al. (<xref ref-type="bibr" rid="B324">2021</xref>) investigated whether the timbre of a musical note (an acoustic prime) would affect the subsequent perception of, in the first experiment, brightness (dark&#x02013;bright dimension) and, in a second study, both brightness and spatial texture (smooth&#x02013;rough dimension). The participants had to identify a shift in roughness/brightness between two consecutively-presented target squares of subtly contrasting levels (rougher/brighter, smoother/darker, or the same) in a speeded-response paradigm. In a second experiment, prior to the presentation of the target square, the participants could be exposed to sounds that varied in terms of their roughness (smooth/rough). For visual stimuli, photos of sandpaper patches of contrasting grit sizes were used: baseline, medium roughness (100-grit); low roughness (150 grit); and high roughness (50 grit). A sine wave and a sawtooth wave they used as auditory stimuli. Modest evidence was found that timbres increase response bias in a semantically congruent manner when participants identify visual stimuli (e.g., when a &#x0201C;rough&#x0201D; saw-tooth wave accompanies the second of two identical spatial textures, the &#x0201C;rough&#x0201D; sound increased the probability of participants judging the second texture as rougher), thus suggesting that rough sounds may increase of the perceived roughness of the visual stimuli (see also Wallmark and Allen, <xref ref-type="bibr" rid="B323">2020</xref>). Other researchers, meanwhile, have chosen to investigate timbral brightness by means of multisensory interference phenomena (Saitis and Wallmark, <xref ref-type="bibr" rid="B257">2024</xref>).</p>
<p>Hence, in summary, there is currently robust experimental evidence for the existence of a wide range of audiovisual crossmodal correspondences involving pitch, such as pitch-size, pitch-brightness, pitch-height, with the exception of pitch-hue correspondences. In the latter case, despite the fact that many commentators are convinced of what the most appropriate crossmodal mapping is (see Spence and Di Stefano, <xref ref-type="bibr" rid="B289">2024a</xref>, for a review), there would appear to be little agreement except in odd cases (such as the color scarlet supposedly matching the sound of the trumpet; e.g., Kandinsky, <xref ref-type="bibr" rid="B153">1977</xref> (originally published in 1911); Leibniz, <xref ref-type="bibr" rid="B164">1896</xref> (originally published 1704); Locke, <xref ref-type="bibr" rid="B178">1690</xref>;<xref ref-type="fn" rid="fn0007"><sup>7</sup></xref> Ortmann, <xref ref-type="bibr" rid="B217">1933</xref>; see also Donnell-Kotrozo, <xref ref-type="bibr" rid="B90">1978</xref>). However, the fact that pitch-based crossmodal correspondences are relative in nature for normal listeners (Spence, <xref ref-type="bibr" rid="B282">2019</xref>),<xref ref-type="fn" rid="fn0008"><sup>8</sup></xref> means the consequences for those wanting to art crossmodally are unclear, though the question remains open for those with absolute pitch (see Di Stefano and Spence, <xref ref-type="bibr" rid="B86">2024b</xref>).<xref ref-type="fn" rid="fn0009"><sup>9</sup></xref> What is more, the sheer range of crossmodal correspondences that might be primed by a given auditory pitch means that it is difficult to see how a unique mapping could be established in (e.g.,) a sensory-substitution setting (Cavazos Quero et al., <xref ref-type="bibr" rid="B53">2021</xref>), or else when considering the question of sensory translation between one sense and another (Spence and Di Stefano, <xref ref-type="bibr" rid="B290">2024b</xref>). This challenge presumably also applies for those wanting to translate works of art from one sensory modality into another (e.g., see M&#x000FC;ller-Eberstein and van Noord, <xref ref-type="bibr" rid="B204">2019</xref>).</p>
</sec>
</sec>
<sec id="s3">
<title>3 Structural correspondences</title>
<p>Mid-level audiovisual crossmodal correspondences operate on dynamically-changing stimuli and/or on combinations of unisensory stimuli (attributes or dimensions; Guo et al., <xref ref-type="bibr" rid="B119">2023</xref>). They sit between the correspondences that have been observed with individual stimuli and those that operate at the level of more aesthetically-meaningful complex sets of stimuli.<xref ref-type="fn" rid="fn0010"><sup>10</sup></xref> Mid-level crossmodal matching is sometimes based on the common numerosity, or spatiotemporal organization, of the individual elements that happen to be presented in each sensory modality (see Spence, <xref ref-type="bibr" rid="B280">2015</xref>, for a review), while at other times it may be based on correlated unisensory input (Parise et al., <xref ref-type="bibr" rid="B225">2012</xref>, <xref ref-type="bibr" rid="B223">2013</xref>). One might also think of crossmodal correspondence that are based on the harmony of the stimuli presented in each modality (Alves, <xref ref-type="bibr" rid="B8">2005</xref>; Spence and Di Stefano, <xref ref-type="bibr" rid="B288">2022b</xref>; Wells, <xref ref-type="bibr" rid="B333">1980</xref>; cf. Pimentel et al., <xref ref-type="bibr" rid="B234">2025</xref>). At the same time, however, metaphorical (e.g., Ravignani and Sonnweber, <xref ref-type="bibr" rid="B242">2017</xref>; Wagner et al., <xref ref-type="bibr" rid="B317">1981</xref>) and emotional mediation has also been reported in at least some cases.</p>
<sec>
<title>3.1 Crossmodal matching</title>
<p>To date, far fewer crossmodal matching studies have been conducted at this level. What is more, the majority of research that has been published to date would appear to involve cross-level correspondences, such as music-to-color associations for single-line piano melodies (Lindborg and Friberg, <xref ref-type="bibr" rid="B169">2015</xref>; Palmer et al., <xref ref-type="bibr" rid="B220">2016</xref>). Researchers have assessed crossmodal correspondences between colors and music clips (Chen, <xref ref-type="bibr" rid="B54">2013</xref>), musical genres (Holm et al., <xref ref-type="bibr" rid="B133">2009</xref>; see also Bresin, <xref ref-type="bibr" rid="B44">2005</xref>), music tempo (Guo and Jiang, <xref ref-type="bibr" rid="B118">2023</xref>; see also Timmers, <xref ref-type="bibr" rid="B308">2022</xref>), melody (Cuddy, <xref ref-type="bibr" rid="B76">1985</xref>), and musical intervals Di Stefano et al. (submitted)<xref ref-type="fn" rid="fn0011"><sup>11</sup></xref>. Gaiger (<xref ref-type="bibr" rid="B104">2018</xref>) has even suggested that paintings can have rhythm. It is also easy to see how the emotional associations listeners have with major vs. minor chords could also be linked to colors (Parncutt, <xref ref-type="bibr" rid="B227">2014</xref>, <xref ref-type="bibr" rid="B228">2024</xref>; see also Carraturo et al., <xref ref-type="bibr" rid="B52">2025</xref>; Wanke et al., <xref ref-type="bibr" rid="B325">2025</xref>).</p>
</sec>
<sec>
<title>3.2 Crossmodal effects</title>
<p>When, for instance, the rate of repetitive auditory stimuli is increased or decreased, while a simultaneously-presented visual flicker remains constant, the latter appears to change accordingly with the auditory stimulus, an effect known as &#x0201C;auditory driving&#x0201D; (Gebhard and Mowbray, <xref ref-type="bibr" rid="B112">1959</xref>; Recanzone, <xref ref-type="bibr" rid="B243">2003</xref>, <xref ref-type="bibr" rid="B244">2009</xref>; Shipley, <xref ref-type="bibr" rid="B268">1964</xref>; Welch et al., <xref ref-type="bibr" rid="B330">1986</xref>; see also Boltz, <xref ref-type="bibr" rid="B33">2017</xref>, <xref ref-type="bibr" rid="B34">2018</xref>; Occelli et al., <xref ref-type="bibr" rid="B212">2011</xref>; Shams et al., <xref ref-type="bibr" rid="B265">2000</xref>). By contrast, changes in the rate of visual flicker do not appear to change the perceived rate of auditory flutter to anything like the same extent. This, then, is an example of auditory dominance (see Di Stefano and Spence, <xref ref-type="bibr" rid="B87">2025</xref>, for a number of other crossmodal influences in the perception of temporal structure).</p>
<p>O&#x00027;Leary and Rhodes (<xref ref-type="bibr" rid="B215">1984</xref>) conducted a study demonstrating a bidirectional influence between vision and audition in a task involving perceptual organization. Multiple studies have demonstrated how intramodal perceptual grouping precedes crossmodal influences in the context of the crossmodal dynamic capture effect (Soto-Faraco et al., <xref ref-type="bibr" rid="B275">2003</xref>; Spence et al., <xref ref-type="bibr" rid="B293">2007</xref>; see also Cook and Van Valkenburg, <xref ref-type="bibr" rid="B73">2009</xref>; Gilbert, <xref ref-type="bibr" rid="B115">1938</xref>; Maeda et al., <xref ref-type="bibr" rid="B183">2004</xref>; Takeshima and Gyoba, <xref ref-type="bibr" rid="B302">2013</xref>; Watanabe and Shimojo, <xref ref-type="bibr" rid="B326">2001</xref>).<xref ref-type="fn" rid="fn0012"><sup>12</sup></xref> Crossmodal congruency effects have also been documented based on stimulus identity, in this case reflecting a short temporal pattern (Frings and Spence, <xref ref-type="bibr" rid="B103">2010</xref>; cf. Huddleston et al., <xref ref-type="bibr" rid="B140">2008</xref>; Julesz and Hirsh, <xref ref-type="bibr" rid="B150">1972</xref>).</p>
<p>Consider here only how the affective meaning of musical scales depends on their direction&#x02014;namely, whether they are ascending or descending (Collier and Hubbard, <xref ref-type="bibr" rid="B70">2001</xref>). On the one hand, this association hints at the importance of the structural organization of the individual stimuli presented in a given modality, over-and-above whatever sensory qualities any of the individual elements may have when presented in isolation. This observation may also be linked to the relative nature of crossmodal correspondences, especially those involving auditory pitch (Spence, <xref ref-type="bibr" rid="B282">2019</xref>; see also Eysenck, <xref ref-type="bibr" rid="B99">1940</xref>). More broadly, the way in which individual elements are structured within a given sensory modality&#x02014;and how they relate across the senses&#x02014;brings us to the concept of harmony, a principle that has long been central to both music and visual aesthetics. In the Greek tradition, the term harmony originally related to the physical unification of different elements (Lippman, <xref ref-type="bibr" rid="B172">1963</xref>). As such, the concept of harmony found a natural application in the domain of music, conceived of as an artistic practice based on the juxtaposition of different auditory elements (i.e., sounds) fitted together to generate pleasant effects. In the field of vision science, a number of researchers have investigated the concept of harmony between color pairs (e.g., Allen and Guilford, <xref ref-type="bibr" rid="B7">1936</xref>; Burchett, <xref ref-type="bibr" rid="B47">2002</xref>; Field, <xref ref-type="bibr" rid="B101">1835</xref>; Palmer and Griscom, <xref ref-type="bibr" rid="B219">2013</xref>; Schloss and Palmer, <xref ref-type="bibr" rid="B260">2011</xref>). McNeill Whistler (<xref ref-type="bibr" rid="B192">1978</xref>) talked of paintings as harmonies (though see Kargon, <xref ref-type="bibr" rid="B154">2011</xref>). Some suggest that music and painting may be combined harmoniously (e.g., consider only the literature on &#x0201C;color music&#x0201D;; Arg&#x000FC;elles, <xref ref-type="bibr" rid="B12">1972</xref>; Plummer, <xref ref-type="bibr" rid="B235">1915</xref>; Rimington, <xref ref-type="bibr" rid="B250">1915</xref>; Sullivan, <xref ref-type="bibr" rid="B299">1914</xref>; Watkins, <xref ref-type="bibr" rid="B328">2018</xref>; Zilczer, <xref ref-type="bibr" rid="B344">1987</xref>). Given such considerations, it becomes natural to consider whether the concept of crossmodal harmony makes any sense (Kargon, <xref ref-type="bibr" rid="B154">2011</xref>; see Spence and Di Stefano, <xref ref-type="bibr" rid="B288">2022b</xref>, for a review).</p>
</sec>
</sec>
<sec id="s4">
<title>4 Crossmodal correspondences between complex auditory and visual stimuli</title>
<p>Experimental psychologists have long been interested in the crossmodal matching of complex auditory and visual stimuli, as well as on the crossmodal effects of one on the other (see Spence and Di Stefano, <xref ref-type="bibr" rid="B291">2025</xref>, for a review). For whatever reason, the majority of studies that have been published to date in this area would appear to have been conducted with images of paintings and short classical music selections (see Spence, <xref ref-type="bibr" rid="B283">2020a</xref>, for a review).</p>
<sec>
<title>4.1 Crossmodal matching</title>
<p>A number of studies have presented a small selection of stimuli in one sensory modality, with participants being required to match from a pre-selected set of stimuli presented in the other modality (see <xref ref-type="table" rid="T3">Table 3</xref>). It should, though, be noted that many of the early studies would appear to have been statistically underpowered by contemporary standards. Moreover, there have been relatively few attempts to replicate previous studies, as has become an increasingly common practice given the replication crisis in psychological science. That said, the majority of the research that has been published to date appears to demonstrate a consensual (i.e., a significantly non-random) tendency (though, of course, one should always be mindful of the so-called &#x0201C;file drawer problem&#x0201D;; see Rosenthal, <xref ref-type="bibr" rid="B253">1979</xref>).</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Broad categorization of different types of audiovisual crossmodal matching and crossmodal effects involving complex multi-element aesthetically-meaningful sensory stimuli (see Spence and Di Stefano, <xref ref-type="bibr" rid="B291">2025</xref>).</p></caption>
<table frame="box" rules="all">
<thead>
<tr style="background-color:#8f9496;color:#ffffff">
<th valign="top" align="left"><bold>Type of correspondence</bold></th>
<th valign="top" align="left"><bold>Stimuli involved</bold></th>
<th valign="top" align="left"><bold>Crossmodal matching (CM)</bold></th>
<th valign="top" align="left"><bold>Crossmodal effect (CE)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Complex (or high-level)</td>
<td valign="top" align="left">Paintings and pieces of music</td>
<td valign="top" align="left">Walker, <xref ref-type="bibr" rid="B318">1927</xref>; Cowles, <xref ref-type="bibr" rid="B74">1935</xref>; Wehner, <xref ref-type="bibr" rid="B329">1966</xref>; Peretti, <xref ref-type="bibr" rid="B232">1972</xref>; Minnigerode et al., <xref ref-type="bibr" rid="B197">1976</xref>; Parrott, <xref ref-type="bibr" rid="B229">1982</xref>; Albertazzi et al., <xref ref-type="bibr" rid="B5">2015</xref>; Ran&#x0010D;i&#x00107; and Markovi&#x00107;, <xref ref-type="bibr" rid="B241">2019</xref>; Iosifyan et al., <xref ref-type="bibr" rid="B143">2022</xref></td>
<td valign="top" align="left">Lindner and Hynan, <xref ref-type="bibr" rid="B171">1987</xref>; Limbert and Polzella, <xref ref-type="bibr" rid="B167">1998</xref>; Actis-Grosso et al., <xref ref-type="bibr" rid="B1">2017</xref>; Ran&#x0010D;i&#x00107; and Markovi&#x00107;, <xref ref-type="bibr" rid="B241">2019</xref>; Braun Janzen et al., <xref ref-type="bibr" rid="B41">2023</xref>; Fekete et al., <xref ref-type="bibr" rid="B100">2023</xref>; Fink et al., <xref ref-type="bibr" rid="B102">2024</xref></td>
</tr></tbody>
</table>
</table-wrap>
<p>The majority of research that has been published to date would appear to be consistent with the claim that complex crossmodal associations between music and paintings are primarily mediated by emotion (see also Hung, <xref ref-type="bibr" rid="B142">2000</xref>).<xref ref-type="fn" rid="fn0013"><sup>13</sup></xref> More refined protocols typically present the auditory stimulus&#x02014;specifically, a musical excerpt&#x02014;while having the participants rate the extent to which it matched each of the pre-selected visual stimuli. This kind of experimental design allows researchers to better understand association trends (e.g., Di Stefano et al., <xref ref-type="bibr" rid="B83">2024</xref>, <xref ref-type="bibr" rid="B82">2025</xref>) while, at the same time, avoiding the risk of forcing a &#x0201C;best&#x0201D; choice response if none of the presented stimuli happen to clearly match the auditory excerpt. There have also been a number of studies looking at cross-level correspondences involving complex stimuli as but one component. For instance, Isbilen and Krumhansl (<xref ref-type="bibr" rid="B144">2016</xref>) conducted research highlighting the existence of emotion-mediated color associations to Bach&#x00027;s Well-Tempered Clavier. Furthermore, and similar to what has just been reported, the suggestion is that crossmodal correspondences between music and ambient color are mediated by emotion (Hauck et al., <xref ref-type="bibr" rid="B128">2022</xref>; though see Lipscomb and Kim, <xref ref-type="bibr" rid="B175">2004</xref>).</p>
</sec>
<sec>
<title>4.2 Crossmodal effects</title>
<p>Researchers have demonstrated that music (or musical emotion) can crossmodally influence the perceived (or at least rated) brightness of paintings (Bhattacharya and Lindsen, <xref ref-type="bibr" rid="B27">2016</xref>; see also Logeswaran and Bhattacharya, <xref ref-type="bibr" rid="B179">2009</xref>). For example, the participants in one study reported by Hong et al. (<xref ref-type="bibr" rid="B134">2024</xref>) were presented with a wide range of emotional music pieces alongside various visual stimuli. The results revealed that lower-pitched music tended to result in darker judgments of visual objects than when higher-pitched music was presented instead (again see <xref ref-type="table" rid="T3">Table 3</xref>).<xref ref-type="fn" rid="fn0014"><sup>14</sup></xref> Meanwhile, a study by Albertazzi et al. (<xref ref-type="bibr" rid="B6">2020</xref>) investigated the existence of crossmodal correspondences between contemporary art (paintings by Kandinsky) and music (excerpts from Sch&#x000F6;nberg). The experiment was conducted in two phases. In the first phase, the participants evaluated the perceptual characteristics first of visual stimuli (some pictures of Kandinsky&#x00027;s paintings, with varying perceptual characteristics and contents) and then of auditory stimuli (musical excerpts taken from the repertoire of Sch&#x000F6;nberg&#x00027;s piano works) relative to 11 pairs of adjectives tested on a continuous bipolar scale (i.e., based on Osgood&#x00027;s semantic differential technique; Osgood et al., <xref ref-type="bibr" rid="B218">1957</xref>). In the second phase, participants were required to associate pictures and musical excerpts. The results of the semantic differential demonstrated that certain paintings and musical excerpts were evaluated as semantically more similar, while others were evaluated as semantically more different. At the same time, the results of the direct association between musical excerpts and paintings showed both attractions and repulsions among the stimuli. The overall results provide significant insights into the relationship between concrete and abstract concepts and into the process of perceptual grouping in cross-modal phenomena (see also Ansani et al., <xref ref-type="bibr" rid="B11">2020</xref>; Boswell, <xref ref-type="bibr" rid="B38">2012</xref>; Ellis and Simons, <xref ref-type="bibr" rid="B97">2005</xref>; Iwamiya, <xref ref-type="bibr" rid="B145">1994</xref>).</p>
</sec>
<sec>
<title>4.3 Interim summary</title>
<p>Given the research that has been reviewed so far, it would appear clear that crossmodal matching occurs between all possible combinations of individual, organized, and more complex, semantically-meaningful stimuli. Having demonstrated such crossmodal correspondences the next question is how such findings inform attempts at sensory translation or sensory augmentation. At the outset here, one might consider whether crossmodal matching is perhaps more relevant when considering sensory translation, whereas the literature on crossmodal effects may be more pertinent for those wanting to augment art crossmodally. Returning to the two important claims that we put forward earlier, we might say that the mere existence of a crossmodal correspondence (as revealed by research on crossmodal matching), has no necessary implications for the existence of a crossmodal effect. A second claim is that the existence of crossmodal correspondences operating between individual sensory stimuli has no necessary implications as far as the existence of crossmodal effects operating between complex stimuli are concerned. Furthermore, it can also be argued that the absence of a demonstrable crossmodal effect cannot be taken as evidence of the absence of a crossmodal correspondence either. That is, it is at least theoretically plausible, that people might consensually rate certain combinations of stimuli as matching vs. mismatching, without there being any crossmodal effects on performance (e.g., in a speeded classification task). All three of these claims would appear consistent with the various sources of evidence that have been reviewed here.</p>
</sec>
</sec>
<sec id="s5">
<title>5 Attempts to augment auditory/visual art and entertainment crossmodally</title>
<p>There is an extensive literature on &#x0201C;color music&#x0201D;, sometimes called &#x0201C;visual music&#x0201D; (e.g., Adams, <xref ref-type="bibr" rid="B2">1995</xref>; Baker, <xref ref-type="bibr" rid="B15">2002</xref>; Dimova, <xref ref-type="bibr" rid="B89">2024</xref>; Galeyev, <xref ref-type="bibr" rid="B106">2003</xref>; Klein, <xref ref-type="bibr" rid="B159">1937</xref>; Liu, <xref ref-type="bibr" rid="B177">2022</xref>; Lupton, <xref ref-type="bibr" rid="B182">2018</xref>; Peacock, <xref ref-type="bibr" rid="B231">1988</xref>; Poast, <xref ref-type="bibr" rid="B236">2000</xref>; Sabaneev and Pring, <xref ref-type="bibr" rid="B255">1929</xref>; Sabaneyev, <xref ref-type="bibr" rid="B256">1911</xref>; Scholes, <xref ref-type="bibr" rid="B261">1970</xref>; Shaw Miller, <xref ref-type="bibr" rid="B266">2006</xref>; South, <xref ref-type="bibr" rid="B277">2001</xref>; Vergo, <xref ref-type="bibr" rid="B314">2012</xref>; Zika, <xref ref-type="bibr" rid="B342">2013</xref>, <xref ref-type="bibr" rid="B343">2018</xref>; Zilczer, <xref ref-type="bibr" rid="B344">1987</xref>, <xref ref-type="bibr" rid="B345">2016</xref>). These phrases refer to artistic interest in both translating music into color but also in augmenting one art form by additional stimulation in the other modality. When translation is attempted in the opposite direction it is sometimes referred to as &#x0201C;graphical sound&#x0201D; instead (Ben&#x0010D;i&#x00107;, <xref ref-type="bibr" rid="B23">2021</xref>). While it has been argued at length elsewhere that literal sensory translation may not be possible (Spence and Di Stefano, <xref ref-type="bibr" rid="B289">2024a</xref>; see also Zika, <xref ref-type="bibr" rid="B342">2013</xref>, <xref ref-type="bibr" rid="B343">2018</xref>), there is an interesting body of artistic work that probes the possibility of augmenting art crossmodally (see Spence and Di Stefano, <xref ref-type="bibr" rid="B291">2025</xref>).</p>
<p>Many of those interested in this area invoke the notion of synaesthesia when thinking about engaging multiple senses simultaneously (e.g., see Dimova, <xref ref-type="bibr" rid="B89">2024</xref>; Karwoski et al., <xref ref-type="bibr" rid="B155">1942</xref>; Spence and Di Stefano, <xref ref-type="bibr" rid="B291">2025</xref>; Waterworth, <xref ref-type="bibr" rid="B327">1997</xref>). However, as Deroy and Spence (<xref ref-type="bibr" rid="B80">2013</xref>) have previously argued at length, such an appeal (to synaesthesia) is fundamentally misguided, and that what may be much more important is to build on the crossmodal correspondences that are consensually shared by the majority of people if one wants to augment art crossmodally in a way that communicates meaningfully with the majority (cf. Spence, <xref ref-type="bibr" rid="B279">2012</xref>). Polina Dimova uses the term &#x0201C;synaesthetic metaphor&#x0201D; as a heuristic device (Dimova, <xref ref-type="bibr" rid="B89">2024</xref>), grouping congenital synaesthesia, literary synaesthesia (aka pseudo-synaesthesia), intermediality in the arts, and cultural synaesthesia together. As Howes (<xref ref-type="bibr" rid="B135">2025</xref>, p. 147) puts it, she: &#x0201C;engages with them contrapuntally, in the manner of a fugue. The right-thinking neuroscientist would be aghast at such heterodoxy and abominate it&#x0201D;. The reason being the Criterion of Consistency test that is now used as a gold standard to determine who is a &#x0201C;genuine synaesthete&#x0201D; according to neuroscience (Baron-Cohen et al., <xref ref-type="bibr" rid="B18">1993</xref>; Johnson et al., <xref ref-type="bibr" rid="B148">2013</xref>; Root et al., <xref ref-type="bibr" rid="B251">2025</xref>; Svartdal and Iversen, <xref ref-type="bibr" rid="B301">1989</xref>; though see also Harrison and Baron-Cohen, <xref ref-type="bibr" rid="B125">1996</xref>). Such a view, although aligning with the &#x0201C;neuromania&#x0201D; (Legrenzi and Umilt&#x000E0;, <xref ref-type="bibr" rid="B163">2011</xref>) or &#x0201C;cerebrocentrism&#x0201D; (Howes&#x00027; terminology) of so much contemporary cognitive neuroscience research, can seem overly constraining when considering the crossmodal augmentation of art (e.g., see Howes and Classen, <xref ref-type="bibr" rid="B136">2014</xref>; Young, <xref ref-type="bibr" rid="B340">2005</xref>). What is needed is rather more of a social/structuralist account of synaesthesia, consistent with the views put forward by L&#x000E9;vi-Strauss (<xref ref-type="bibr" rid="B165">1997</xref>). Echoing much the same point, half a century earlier, Vanechkina (<xref ref-type="bibr" rid="B312">1973</xref>) noted how almost all of those he spoke to specially emphasized the associative, metaphoric nature of &#x0201C;color hearing&#x0201D; in music and excluded from the artistic sphere any clinical cases of color hearing.</p>
<p>Separate from any attempt to use synaesthesia in crossmodal compositions, it should be recognized that certain art forms, such as live opera, inherently stimulate multiple senses; Live music performance too, though this varies all the way from simple inevitability of seeing musicians play in a classic music concert through to the visuals that augment more experimental music (see also McDonald et al., <xref ref-type="bibr" rid="B191">2022</xref>). In the latter cases, though, it is by no means always clear which is the dominant/primary modality. However, separate from these issues, there are genuinely unisensory experiences of art, painting, sculpture, at least as it is displayed in contemporary galleries.<xref ref-type="fn" rid="fn0015"><sup>15</sup></xref> At the other extreme, there may be accidental/incidental background music presented in a gallery environment (see <xref ref-type="table" rid="T4">Table 4</xref>). In such cases, there is unlikely to be any intentional connection, or expectation of relevance or resonance (Loureiro et al., <xref ref-type="bibr" rid="B181">2019</xref>).</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>Summary of various kinds of audiovisual artistic/entertainment experience, where music complements/augments the visuals, or vice versa.</p></caption>
<table frame="box" rules="all">
<thead>
<tr style="background-color:#8f9496;color:#ffffff">
<th valign="top" align="left"><bold>Artistic/entertainment experience w. music</bold></th>
<th valign="top" align="left"><bold>Relation between auditory and visual elements</bold></th>
<th valign="top" align="left"><bold>Example (relevant literature)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Attending a classical music concert in person</td>
<td valign="top" align="left">The seen gestures of the musicians and vocalists are the cause of the sounds that are heard</td>
<td valign="top" align="left">Vuoskowski et al., <xref ref-type="bibr" rid="B315">2014</xref>, <xref ref-type="bibr" rid="B316">2016</xref></td>
</tr> <tr>
<td valign="top" align="left">Dance/ballet performance, or a rhythmic gymnastics routine</td>
<td valign="top" align="left">Seen dancer&#x00027;s movements typically a response to the auditory score (e.g., The Nutcracker; or Swan Lake)</td>
<td valign="top" align="left">Chiat et al., <xref ref-type="bibr" rid="B58">2013</xref></td>
</tr> <tr>
<td valign="top" align="left">Going to the opera</td>
<td valign="top" align="left">The background music is designed to set a mood for the on-stage action</td>
<td valign="top" align="left">Though see Kierkegaard, <xref ref-type="bibr" rid="B157">1988</xref>, on Mozart&#x00027;s opera, Don Giovanni</td>
</tr> <tr>
<td valign="top" align="left">Attending a performance of Scriabin&#x00027;s Poem of Fire with <italic>luce</italic> accompaniment</td>
<td valign="top" align="left">Music composed first and luce designed to enhance the experience</td>
<td valign="top" align="left">Scriabin, <xref ref-type="bibr" rid="B263">1910</xref>; Gawboy and Townsend, <xref ref-type="bibr" rid="B111">2012</xref></td>
</tr> <tr>
<td valign="top" align="left">Watching an audiovisual Film or TV advert</td>
<td valign="top" align="left">Music normally added post-production in order to convey a particular mood, emotion, or atmosphere</td>
<td/>
</tr> <tr>
<td valign="top" align="left">Video-jockey (VJ); Jean-Michel Jarr&#x000E9; concert (Boltz, <xref ref-type="bibr" rid="B29">2013</xref>)</td>
<td valign="top" align="left">VJ mixes a visual accompaniment in response to pre-existing music</td>
<td valign="top" align="left">Synaesthetic visual augmentation of Mahler&#x00027;s 2nd symphony (Deutsch, <xref ref-type="bibr" rid="B81">2012</xref>; cf. Haverkamp, <xref ref-type="bibr" rid="B129">2020</xref>)</td>
</tr> <tr>
<td valign="top" align="left">Multimedia artworks composed to engage both eye and ear</td>
<td valign="top" align="left">Audiovisual stimuli may be synchronized, and/or may share a common, or intentionally conflicting emotional message</td>
<td valign="top" align="left">Basquin, <xref ref-type="bibr" rid="B19">2020</xref>; Ben&#x0010D;i&#x00107;, <xref ref-type="bibr" rid="B23">2021</xref>; Whitney, <xref ref-type="bibr" rid="B336">1980</xref></td>
</tr> <tr>
<td valign="top" align="left">Experiencing architecture w. musical accompaniment</td>
<td valign="top" align="left">Given stylistic similarities, music sometimes used to augment videos of architecture</td>
<td valign="top" align="left">Muecke and Zach, <xref ref-type="bibr" rid="B202">2007</xref>; Siefkes and Arielli, <xref ref-type="bibr" rid="B269">2015</xref></td>
</tr> <tr>
<td valign="top" align="left">Visual artwork augmented by auditory (or other sensory elements)</td>
<td valign="top" align="left">The auditory augmentation is designed to modify the viewer&#x00027;s experience of the artwork; Some examples of &#x0201C;color music&#x0201D; or &#x0201C;visual music&#x0201D; fit here</td>
<td valign="top" align="left">Tate Sensorium (Pursey and Lomas, <xref ref-type="bibr" rid="B238">2018</xref>)</td>
</tr></tbody>
</table>
<table-wrap-foot>
<p>Notice how the synchrony between the auditory and visual elements may sometimes exogenously encourage the integration of the audiovisual inputs (e.g., when attending a classical music concert; see also Muller, <xref ref-type="bibr" rid="B203">2010</xref>), while in other cases, the context in which the two channels of communication are experienced may encourage more of an endogenous, or top-down, attempt by the viewer (or audience) to combine the multisensory inputs.</p>
</table-wrap-foot>
</table-wrap>
<p>Iwamiya (<xref ref-type="bibr" rid="B146">2013</xref>) investigated the interaction between auditory and visual processing when listening to music in an audiovisual context. C. D. Friedrich, for example, exhibited pictures in low light accompanied by music (Siegel, <xref ref-type="bibr" rid="B270">1974</xref>). P. O. Runge exhibited the <italic>Times of the Day</italic> painting series in J. W. von Goethe music room and by doing so, the suggestion is that the aesthetic experience was escalated into a full multisensory experience (The Getty Research Institute, <xref ref-type="bibr" rid="B307">2013</xref>).</p>
<sec>
<title>5.1 Sensory translation: turning music into light</title>
<p>There has long been artistic interest in the possibility of sensory translation (Kargon, <xref ref-type="bibr" rid="B154">2011</xref>; see Spence and Di Stefano, <xref ref-type="bibr" rid="B287">2022a</xref>, for a review). Relevant here, Stechow (<xref ref-type="bibr" rid="B295">1953</xref>) highlighted an important distinction between: &#x0201C;translations from the visual arts into music and parallelisms between the visual arts and music&#x0201D; (p. 324, italics in original). Later, he observed that &#x0201C;it would seem to me that comparability of structure reveals a more &#x02018;real&#x00027; relationship between such works of art than a mere affinity of &#x02018;mood&#x00027; or &#x02018;texture&#x00027; could suggest&#x0201D; (Stechow, <xref ref-type="bibr" rid="B295">1953</xref>, p. 325); Notice how the latter comment presumably emphasizes the structural rather than the affective nature of underlying correspondences (though see Kargon, <xref ref-type="bibr" rid="B154">2011</xref>, on the contrary position). Just consider Alexander B. Rimington (1854&#x02013;1918) of the Royal College of Arts in London, who first presented his color organ to the public in 1895, and which he performed with up until 1911 in London (Rimington, <xref ref-type="bibr" rid="B248">1895</xref>, <xref ref-type="bibr" rid="B249">1911</xref>, <xref ref-type="bibr" rid="B250">1915</xref>). Father Castel, the French priest&#x00027;s earlier suggestion that every home would one day have an ocular harpsicord never came to pass (see Conrad, <xref ref-type="bibr" rid="B72">1999</xref>; Moritz, <xref ref-type="bibr" rid="B200">1997</xref>). The project was likely doomed to failure (Hankins, <xref ref-type="bibr" rid="B123">1994</xref>).<xref ref-type="fn" rid="fn0016"><sup>16</sup></xref> That is, the best that you can hope to do is to convey the emotional tone of a specific work (again, see Spence and Di Stefano, <xref ref-type="bibr" rid="B287">2022a</xref>, <xref ref-type="bibr" rid="B289">2024a</xref>, for reviews). Furthermore, sensory translation has typically been considered in terms of the element-by-element translation of individual sensory stimuli (Gibson, <xref ref-type="bibr" rid="B114">2023</xref>; Spence and Di Stefano, <xref ref-type="bibr" rid="B289">2024a</xref>). However, as we have just seen, mid-level and complex correspondences emerge as a result of intramodal perceptual grouping, and hence what might be more appropriate to translate is the unimodal Gestalt, in the augmentation of art and entertainment crossmodally (Bragdon, <xref ref-type="bibr" rid="B39">1916</xref>, <xref ref-type="bibr" rid="B40">1918</xref>).</p>
</sec>
<sec>
<title>5.2 Sensory augmentation: Scriabin&#x00027;s Poem of Fire</title>
<p>A somewhat distinct literature has emerged on the question of the crossmodal sensory augmentation of art. Here, one might consider Scriabin&#x00027;s (<xref ref-type="bibr" rid="B263">1910</xref>) &#x0201C;Prometheus: Poem of Fire&#x0201D; as a paradigmatic example (Hull, <xref ref-type="bibr" rid="B141">1927</xref>). This musical score is unusual in that it came with an accompanying score for the <italic>luce</italic> (Brent-Smith, <xref ref-type="bibr" rid="B42">1926a</xref>,<xref ref-type="bibr" rid="B43">b</xref>; Gawboy and Townsend, <xref ref-type="bibr" rid="B111">2012</xref>).<xref ref-type="fn" rid="fn0017"><sup>17</sup></xref> Although not performed in Scriabin&#x00027;s lifetime, the code for the light score (which had been thought lost) was discovered some decades later (Galeyev and Vanechkina, <xref ref-type="bibr" rid="B107">2001</xref>). Unfortunately, however, Scriabin died before the first performance of his work with <italic>luce</italic> accompaniment took place (in New York City in 1915; Baker, <xref ref-type="bibr" rid="B15">2002</xref>). Nevertheless contemporary performances of this multisensory artwork have sometimes incorporated both auditory and visual elements (Galeyev, <xref ref-type="bibr" rid="B105">1988</xref>; see Spence and Di Stefano, <xref ref-type="bibr" rid="B287">2022a</xref>, for a review). One can think of this as an attempt to augment (i.e., rather than to translate) music with a color organ. That said, Scriabin&#x00027;s intention for the <italic>luce</italic> remains shrouded in mystery. Confusing matters somewhat, Scriabin himself was a synaesthete (including color music; Triarhou, <xref ref-type="bibr" rid="B309">2016</xref>; Witztum and Lerner, <xref ref-type="bibr" rid="B338">2016</xref>).<xref ref-type="fn" rid="fn0018"><sup>18</sup></xref> Some commentators have suggested that the introduction of the <italic>luce</italic> may have been intended to help disambiguate elements in the musical score (Gawboy and Townsend, <xref ref-type="bibr" rid="B111">2012</xref>). However, this remains a question of ongoing theoretical debate (see also Howes, <xref ref-type="bibr" rid="B135">2025</xref>).</p>
<p>Intriguingly, although formal evaluation has not yet been conducted, those commentators who have experienced such intentional multisensory, or crossmodal, artistic presentations have, on occasion, hinted at an emergent experience, one that supports the role of Gestalt perceptual grouping in a crossmodal artistic context (though note that the rich notion of multisensory emergence likely reaches beyond the insights offered by the Gestalt approach). Just take the following quote from Wells (<xref ref-type="bibr" rid="B333">1980</xref>, p. 101&#x02013;105): &#x0201C;One has only to trace the remarkable unfolding of harmonic color in Scriabin&#x00027;s later works to realize that it was harmony above all that prompted him to seek a visual counterpart to his music. An equivalence of aural harmony and visual is so strongly implied in Scriabin&#x00027;s later works that for me they seem to emanate from a single source&#x02026;The author tells of events leading up to finding a possible link between musical harmony and visual color based on complementarity&#x02026;The parallelism of musical harmony with visual color departs radically from current hypotheses of tonal organization.&#x0201D; (cf. Spence and Di Stefano (submitted)<xref ref-type="fn" rid="fn0019"><sup>19</sup></xref>, on the notion of crossmodal counterpoint). Note here, though, that Wells is not a disinterested observer inasmuch as in his article he goes on to propose a crossmodal pitch-hue mapping of his own.</p>
<p>In principle, multisensory emergence might be just one example of an extraordinary multisensory experience (e.g., Critchley, <xref ref-type="bibr" rid="B75">1977</xref>; Ehrenzweig, <xref ref-type="bibr" rid="B93">1965</xref>). Sirius and Clarke (<xref ref-type="bibr" rid="B273">1994</xref>, p. 119) similarly write of how: &#x0201C;There is a long history of the association of music with visual spectacles of one sort or another (opera, theater, and in this century film, television and video), and a substantial amount of anecdotal evidence that the two media can interact in powerful and effective ways.&#x0201D; (see also Anderson, <xref ref-type="bibr" rid="B9">1993</xref>). The unification of the unimodal elements (as hinted at by Wells, <xref ref-type="bibr" rid="B333">1980</xref>) is another possible outcome that is sometimes mentioned in the literature albeit infrequently, as is the notion of resonance (perhaps a kind of crossmodal modulation) is also mentioned in more artistic contexts (e.g., Muecke and Zach, <xref ref-type="bibr" rid="B202">2007</xref>; cf. McQuarrie and Mick, <xref ref-type="bibr" rid="B193">1992</xref>). Researchers have demonstrated that visual cues can modulate the integration and segregation of objects in auditory scene analysis (Rahne et al., <xref ref-type="bibr" rid="B239">2007</xref>). Audiovisual Gestalten likely rely as much on temporal synchrony as much as any crossmodal correspondence at the level of individual sensory elements, or even the Gestalt impression (see also Alves, <xref ref-type="bibr" rid="B8">2005</xref>; Battey and Fischman, <xref ref-type="bibr" rid="B20">2016</xref>; HaCohen, <xref ref-type="bibr" rid="B121">2016</xref>; Haverkamp, <xref ref-type="bibr" rid="B129">2020</xref>; Kaduri, <xref ref-type="bibr" rid="B151">2016</xref>; Muller, <xref ref-type="bibr" rid="B203">2010</xref>; Russet and Starr, <xref ref-type="bibr" rid="B254">1988</xref>; Whitney, <xref ref-type="bibr" rid="B336">1980</xref>). At the same time, however, it has also become increasingly clear that there is no straightforward structural or psychological mapping between audition and vision.</p>
<p>While the term multimedia is broad, and covers multiple forms, there is a sense that some multimedia art works fit the description of crossmodal art in that the stimuli presented to eye and ear are in some sense independent, yet the case is different from the addition of music to film, where the music is seemingly always added as an afterthought (Lipscomb, <xref ref-type="bibr" rid="B173">2013</xref>; Tan et al., <xref ref-type="bibr" rid="B303">2013</xref>; Truckenbrod, <xref ref-type="bibr" rid="B310">1992</xref>). At the same time, however, there are empirical phenomena that are undoubtedly worthy of some form of explanation. Consider here only the phenomenal success of the music of Jean-Michel Jarr&#x000E9;, with the largest live audience for an entertainment event ever recorded at the time (in excess of 750,000 people; cf. Boltz, <xref ref-type="bibr" rid="B29">2013</xref>; Tan et al., <xref ref-type="bibr" rid="B303">2013</xref>) when a light show would accompany his musical performances. It would seem hard to explain why such multimedia performances were (at least for a time) so successful, without having recourse to some kind of emergent experience that elevated the multisensory experience beyond what would have been experienced at a traditional music concert (or color organ show; see Spence and Di Stefano, <xref ref-type="bibr" rid="B287">2022a</xref>).<xref ref-type="fn" rid="fn0020"><sup>20</sup></xref> At the time, these were the largest live entertainment audiences ever recorded, and thus speak to the power of simultaneous audiovisual stimulation.<xref ref-type="fn" rid="fn0021"><sup>21</sup></xref> In this case, the artist was augmenting his music, which came first, with visuals (see also Collopy, <xref ref-type="bibr" rid="B71">2000</xref>, and Chmiel and Schubert, <xref ref-type="bibr" rid="B62">2019</xref>, for a history).</p>
</sec>
<sec>
<title>5.3 Intersensoriality</title>
<p>Howes (<xref ref-type="bibr" rid="B135">2025</xref>) has recently introduced the notion of intersensoriality to describe art forms that deliberately cross sensory borders (Classen, <xref ref-type="bibr" rid="B65">1998</xref>). The notion of intersensory, somewhat distinct from both multisensory integration and synaesthetic translation, emphasizes relationality, complementarity, and emergent meaning. Here, the senses are not unified in an essentialist manner, nor merely co-activated, but brought into dialogue in a way that preserves and indeed accentuates their difference. Howes (<xref ref-type="bibr" rid="B135">2025</xref>, p. 143) writes of how: &#x0201C;&#x0201C;the new intersensory music and art history&#x0201D; departs from the conventional distribution and hierarchization of the sensible by focusing on the exchanges between&#x02014;and intercalibration of&#x02014;the senses. This redirection of attention is manifest in such works as Classen&#x00027;s (<xref ref-type="bibr" rid="B65">1998</xref>) chapter on &#x0201C;Crossing sensory borders in the arts&#x0201D; in The Color of Angels, Shaw-Miller&#x00027;s (<xref ref-type="bibr" rid="B267">2010</xref>) Eye hEar: The Visual in Music, and Veldhorst&#x00027;s (<xref ref-type="bibr" rid="B313">2018</xref>) Van Gogh and Music: A Symphony in Blue and Yellow, among other works (see especially Lauwrens, <xref ref-type="bibr" rid="B162">2012</xref>).&#x0201D; Howes (<xref ref-type="bibr" rid="B135">2025</xref>, p. 147) concludes his piece as follows: &#x0201C;here was an exhibition entitled &#x0201C;The Romantic Eye&#x0201D; going on at the Swedish Nationalmuseum at the time of the Stockholm conference. It was a little too ocularcentric for my taste, but one writing on a wall caught my eye: &#x0201C;&#x02018;An artist is each and every one for whom the aim of life is to develop one&#x00027;s senses&#x00027;&#x02014;Friedrich Schlegel&#x0201D;. Fair enough, but for the avant-garde it&#x00027;s different: for them, making music or making art involves straining at the bounds of sense, or crossing over (Classen, <xref ref-type="bibr" rid="B65">1998</xref>). Contemporary historians of music and art need to follow suit if they are to make any headway in the understanding of the art-music nexus at the conjuncture of the nineteenth and twentieth centuries as at the conjuncture of the twentieth and twenty-first centuries (see Jones et al., <xref ref-type="bibr" rid="B149">2006</xref>; Lauwrens, <xref ref-type="bibr" rid="B162">2012</xref>). As we have seen, it is the renvoi, the interplay, the interrelationships of the senses that define the most lasting aesthetic expressions of the two periods&#x0201D;.</p>
<p>The notion of intersensoriality can thus help to promote a relational view of the senses, one that resonates with L&#x000E9;vi-Strauss&#x00027; (<xref ref-type="bibr" rid="B165">1997</xref>) idea, expressed in <italic>Look, Listen, Read</italic>, that systems of perception in different senses can be homologous without necessarily being identical. In particular, his commentary on Father Castel&#x00027;s color organ (see L&#x000E9;vi-Strauss, <xref ref-type="bibr" rid="B165">1997</xref>, p. 129 and ff) exemplifies this: the device did not conflate sound and color into a single unified percept, but rather highlighted their relational differences. This relational interpretation also aligns with Baumgarten (<xref ref-type="bibr" rid="B21">1750</xref>) interpretation of aesthetics as, in Leibnizian terms, the capacity to grasp &#x0201C;unity-in-multiplicity&#x0201D; among sensible qualities. Here, one might also consider Septimus Piesse&#x00027;s &#x0201C;Gamut of Odors&#x0201D; (Piesse, <xref ref-type="bibr" rid="B233">1867</xref>) in which he matched 24 musical notes to a range of scents. While a casual reading might suggest that Piesse is proposing a direct crossmodal mapping between specific musical notes and odors, a closer reading of this English chemist&#x00027;s work, suggests instead that he had more of a relational mapping in mind; meaning that successful combinations of odors can be predicted on the basis of pairing associated notes that go well together. In his treatise, Piesse explicitly noted how sounds and odors blend together similarly, producing different degrees of &#x0201C;a nearly similar impression&#x0201D; in the sensory nerves (Piesse, <xref ref-type="bibr" rid="B233">1867</xref>, p. 39). Piesse also writes about how the mixture needed to prepare the odors for the handkerchief evokes effects on the smelling nerve &#x0201C;similar to that which music or the mixture of harmonious sounds produces upon the nerve of hearing, that of pleasure&#x0201D; (Piesse, <xref ref-type="bibr" rid="B233">1867</xref>, p. 219). He also claims that creating a mixture of scents is like creating a mixture of sounds, i.e., chords. At one point, he even writes that: &#x0201C;We have citron, lemon, orange peel, and verbena, forming a higher octave of smells, which blend in a similar manner&#x0201D; (Piesse, <xref ref-type="bibr" rid="B233">1867</xref>, p. 39). Piesse was seemingly convinced that the pleasantness of musical harmony resembles that of perfumes (consisting of various base notes), and he presented a scale of correspondences between sounds and odors, as he believed that &#x0201C;there is, as it were, an octave of odors like an octave in music&#x0201D; (Piesse, <xref ref-type="bibr" rid="B233">1867</xref>, p. 38).</p>
<p>In this light, mid-level crossmodal correspondences deserve renewed attention&#x02014;not as mere by-products of low-level sensory matching or high-level conceptual metaphor, but as hybrid, emergent forms of intersensory pattern recognition. One key research question here thus becomes: What sensory feature, structural property, or dynamic pattern drives these mid-level correspondences? They often resist easy categorization because they operate at the cusp of the perceptual and the conceptual, exhibiting characteristics of both. One might refer here to the &#x0201C;collideroscopic sensorium&#x0201D; (Howes, <xref ref-type="bibr" rid="B137">2023</xref>) as a useful metaphor for understanding how such intersensory correspondences open novel aesthetic dimensions without collapsing sensory boundaries.</p>
</sec>
<sec>
<title>5.4 Augmenting cinematic entertainment</title>
<p>In their 1928 &#x0201C;Statement on Sound&#x0201D;, Eisenstein, Pudovkin, and Alexandrov advocated the role of sound in film and for &#x0201C;the creation of a new orchestral counterpoint of sight-images and sound-images&#x0201D; (see Eisenstein et al., <xref ref-type="bibr" rid="B94">1999</xref>). Still more radically, in an essay called &#x0201C;Synchronization of Senses&#x0201D;, Eisenstein describes how a &#x0201C;single, unifying sound-picture image&#x0201D; might be developed as a &#x0201C;polyphonic structure&#x0201D; that &#x0201C;achieves its total effect through the composite sensation of all the pieces as a whole&#x0201D; [Eisenstein, <xref ref-type="bibr" rid="B95">1947</xref>; see Spence and Di Stefano (submitted)<xref ref-type="fn" rid="fn0019"><sup>19</sup></xref>]. Pointing to examples of synesthetic art including Rimbaud&#x0201D;s &#x0201C;color sonnet&#x0201D; and James NcNeill Whistler&#x0201D;s &#x0201C;color symphonies&#x0201D;, Eisenstein proposed that such effects could also be achieved in the cinema. At one point Eisenstein writes that: &#x0201C;To remove the barriers between sight and sound, between the seen world and the heard world &#x02026; To bring about a unity and a harmonious relationship between these two opposite spheres. What an absorbing task! The Greeks and Diderot, Wagner and Scriabin-who has not dreamt of this ideal?&#x0201D; It is the ideal of the Gesamtkunstwerk, the total work of art that synthesizes multiple media: precisely the reverse of the Laoco&#x000F6;n argument (see also Cardullo, <xref ref-type="bibr" rid="B50">2017</xref>; Kandinsky, <xref ref-type="bibr" rid="B153">1977</xref>; Smith, <xref ref-type="bibr" rid="B274">2007</xref>). Here, note we see common themes emerging across different artistic/entertainment media.</p>
<p>Note that film and advertising are different inasmuch as people assume that the music and visuals have been deliberately paired in order to achieve some emotional effect or convey some form of semantic meaning (see Bolivar et al., <xref ref-type="bibr" rid="B28">1994</xref>; Boltz, <xref ref-type="bibr" rid="B31">2001</xref>, <xref ref-type="bibr" rid="B32">2004</xref>; Boltz et al., <xref ref-type="bibr" rid="B30">1991</xref>; Bullerjahn and G&#x000FC;ldenring, <xref ref-type="bibr" rid="B46">1994</xref>; Chion, <xref ref-type="bibr" rid="B59">1990</xref>, <xref ref-type="bibr" rid="B60">2000</xref>; Clemente et al., <xref ref-type="bibr" rid="B66">2023</xref>; Cohen, <xref ref-type="bibr" rid="B67">1990</xref>, <xref ref-type="bibr" rid="B68">2010</xref>; Herget and Albrecht, <xref ref-type="bibr" rid="B132">2022</xref>; Hung, <xref ref-type="bibr" rid="B142">2000</xref>; Klein et al., <xref ref-type="bibr" rid="B160">2021</xref>; Lipscomb and Kendall, <xref ref-type="bibr" rid="B174">1994</xref>; Lipscomb and Tolchinsky, <xref ref-type="bibr" rid="B176">2005</xref>; Millet et al., <xref ref-type="bibr" rid="B196">2021</xref>; Nosal et al., <xref ref-type="bibr" rid="B211">2016</xref>; Parke et al., <xref ref-type="bibr" rid="B226">2007</xref>; Silas et al., <xref ref-type="bibr" rid="B271">2024</xref>; Steffens, <xref ref-type="bibr" rid="B296">2020</xref>; Thayer and Levenson, <xref ref-type="bibr" rid="B306">1983</xref>; Tan et al., <xref ref-type="bibr" rid="B304">2017</xref>, <xref ref-type="bibr" rid="B305">2007</xref>; see also Carr and Rickard, <xref ref-type="bibr" rid="B51">2016</xref>; Marin et al., <xref ref-type="bibr" rid="B185">2012</xref>). For instance, Hung highlights how music in TV advertising can be analyzed in terms of its sensual properties, but also in terms of its emotional effect, and there again at the level of any semantic effects.<xref ref-type="fn" rid="fn0022"><sup>22</sup></xref> Meanwhile, Gorbman (<xref ref-type="bibr" rid="B117">1987</xref>) suggested that any music applied to a film segment will affect the spectator because the latter automatically imposes meaning on such combinations. As such, there is likely to be an important difference between (artistic) media. Nevertheless, it is still interesting to note how many of the same themes and experimental manipulations (e.g., relating to the impact of congruent vs. incongruent combinations of film content and music stimuli) crop up in the literature on the effect of music on visual perception. Unfortunately, however, there is simply no opportunity (nor space) to review this parallel literature here. All we can do here is to highlight the existence of this large and highly-relevant experimental literature.</p>
</sec>
</sec>
<sec id="s6">
<title>6 Challenges to the crossmodal augmentation of art</title>
<p>Before concluding this narrative historical review, it is worth noting how there are a number of challenges, theoretical, practical, attentional, and potentially also ethical when it comes to the crossmodal augmentation of art.</p>
<sec>
<title>6.1 Theoretical challenges</title>
<p>A clear distinction between crossmodal augmentation in art (as in Scriabin, <xref ref-type="bibr" rid="B263">1910</xref>, <italic>Poem of Fire)</italic> and apparently related attempts to translate auditory stimuli into visual ones, as in the case of Rimington&#x00027;s Color Organ (see also Kargon, <xref ref-type="bibr" rid="B154">2011</xref>), should be highlighted here. In the latter case, at least according to the creator&#x00027;s intention, we are dealing with the straightforward attempt as sensory translation between audition and vision (Spence and Di Stefano, <xref ref-type="bibr" rid="B290">2024b</xref>). In the case of Scriabin, there is a possible translation based on the composer&#x00027;s own color-music synaesthesia followed by a combination of the two stimuli, possibly in order to help disambiguate certain elements of the musical composition visually. It is, however, likely that some attempts at augmentation may result in nothing more that redundancy of information across sensory domains&#x02014;similar to both seeing a battle and reading a description of it. This is what Banes (<xref ref-type="bibr" rid="B16">2001</xref>) describes, in the case of olfactory stimuli being used to provide the smell of what can be seen on stage, as pleonastic. It can be argued, by contrast, that crossmodal augmentation should by definition involve, more than mere translation or redundancy; it requires the addition of non-redundant sensory inputs from a different domain, enriching (or possibly disambiguating) the original experience.</p>
<p>Relatedly, one might draw attention to issues related to the sensory nature of artworks by observing that to have either sensory translation or crossmodally augmented art, the mere simultaneous perception of two sensory inputs is insufficient. For example, hearing Beethoven while viewing a Munch exhibit does not constitute sensory translation or augmentation. And what if a contemporary artist <italic>deliberately</italic> broadcasts Beethoven during the exhibition, explicitly presenting it as multisensory art, a ticket paid for it, and people made aware of this? Would that turn the mere coexistence of music and paintings into an example of crossmodal art? Let us return, then, to Danto (<xref ref-type="bibr" rid="B78">1981</xref>) with his key questions on the ontology of art, such as, for example: What makes something an artwork? What distinguishes art from mere objects? How does context shape art&#x00027;s identity? Danto&#x00027;s reflections challenge the idea that art is defined by its material properties alone, emphasizing instead its conceptual and contextual dimensions.</p>
<p>The crossmodal augmentation of art can therefore be seen as requiring more than mere than the simultaneous presentation of the relevant auditory and visual stimuli. It involves a deliberate, non-redundant integration of both, framed within an artistic context that invites interpretation. Crucially, the observer&#x00027;s awareness of this intentional combination transforms the experience from mere coexistence into a meaningful, augmented artistic phenomenon.</p>
</sec>
<sec>
<title>6.2 Practical challenges</title>
<p>It is worth pausing here to consider Behne&#x00027;s (<xref ref-type="bibr" rid="B22">1999</xref>) &#x0201C;habituation hypothesis&#x0201D;, according to which the effect of background music has steadily decreased due to the growing omnipresence and availability of music in day to-day life. In a meta-analysis, Behne observed that the effect of music on emotions, perceptions, and behavior had decreased by roughly 8% per decade. Such habituation, were it to be a replicable phenomenon, would obviously raise a practical challenge for those wanting to augment visual art with music, say. That said, K&#x000E4;mpfe et al. (<xref ref-type="bibr" rid="B152">2011</xref>) were unable to replicate this general historical trend in a more recent and more methodically controlled meta-analysis.</p>
<p>Separately, should any putative crossmodal augmentation rely on precise crossmodal temporal synchronization then there may be challenges around ensuring that different sensory inputs/elements align perfectly in real time (e.g., matching sound with visual effects). This can be problematic: Interactive or AI-driven experiences sometimes suffer from technology-induced lag in one or more senses, thus reducing the immersiveness of the experience. One might be reminded here of Bruce Naumann&#x00027;s (1969) artwork &#x0201C;Lip Sync&#x0201D; (see <ext-link ext-link-type="uri" xlink:href="https://www.moma.org/collection/works/107669">https://www.moma.org/collection/works/107669</ext-link>). The work consists of a close-up of a male face presented upside down repeatedly say &#x0201C;lip-sync&#x0201D;. The key point is that the asynchrony between the sound of the voice and the associated lip-movements continuously change in terms of the asynchrony thus drawing the viewer is as they can never quite pin down the relationship between the auditory and visual channels. One might consider this a genuinely audiovisual work of art, and as such, the notion of crossmodal augmentation doesn&#x00027;t really have any relevance.</p>
</sec>
<sec>
<title>6.3 Attentional constraints and the danger of sensory overload</title>
<p>The Danish philosopher Kierkegaard (<xref ref-type="bibr" rid="B157">1988</xref>, originally published in 1843) once wrote that: &#x0201C;It is a common experience that to strain two senses at the same time is not pleasant, and thus it is often disruptive to have to use the eyes a great deal at the same time as the ears are being used. Therefore, one is inclined to shut the eyes when listening to music. This is more or less true of all music, and <italic>in sensu eminentiori</italic> [in an eminent sense] of [Mozart&#x00027;s opera] Don Giovanni. As soon as the eyes are involved, the impression is disrupted, for the dramatic unity that presents itself to the eye is altogether subordinate and deficient in comparison with the musical unity that is heard simultaneously. My own experience has convinced me of this&#x0201D;. One might think of a view of artistic experience as grounded in sensory fragmentation compared to the multisensory inputs we ordinarily have to manage in daily life.</p>
<p>One argument suggested here is that part of what makes art powerful is its ability to isolate, emphasize, or distill specific aspects of perception. Unlike daily life, where we are bombarded with multisensory input that we must constantly filter, artistic experiences often provide a kind of structured reduction&#x02014;a selective framing of reality that allows for deeper contemplation. For instance, a still life with fruits and bottles strips away movement, sound, odors, and so forces us to focus on composition, color, and form instead. This could explain why we value these aesthetic experiences, because they allow us to maximize sensory engagement. Reintegrating sensory modalities in aesthetic experience to augment them implies adding more sensory information (haptics, olfaction, spatial audio, interactive visuals, etc.). The danger is that this may merely increase cognitive load, making it harder to focus on any one aspect, thus undermining the essence of art and artistic contemplation. Is multisensoriality in art a form of redundancy&#x02014;where different sensory modalities repeat the same message&#x02014;or does it truly open new aesthetic dimensions?<xref ref-type="fn" rid="fn0023"><sup>23</sup></xref></p>
<p>The argument seemingly has some cognitive implications. Freeing the senses from the burden of managing excessive stimuli implies freeing intellectual resources for deeper reflection and cognitive engagement. A key argument in favor of art as sensory fragmentation is that it does that to free the mind, allowing for more profound conceptual engagement. By narrowing the range of sensory input, art can liberate cognitive capacity for interpretation, imagination, and emotional resonance. In minimalist art, the deliberate reduction of visual complexity forces the viewer into an active intellectual engagement with form and space. By contrast, technologically-augmented multisensory experiences might do the opposite. Consider only the use of virtual/augmented reality (VR/AR) to augment aesthetic experience (e.g., Cho et al., <xref ref-type="bibr" rid="B63">2020</xref>; He et al., <xref ref-type="bibr" rid="B130">2018</xref>). If the mind is busy processing an excess of sensory inputs, does it have the capacity left to engage deeply with the ideas behind the work? One conclusion that could be drawn here is that while more sensory input might feel like an enrichment, it can also lead to passivity if the experience becomes too immersive or overwhelming. Consider here only Aitamurto et al.&#x00027;s (<xref ref-type="bibr" rid="B4">2018</xref>) study in which the impact of a mobile video see-through AR tour guide on users&#x00027; engagement with art was assessed. Besides showing many positives related to the AR technology itself, the results highlighted the downsides of the same technological augmentation related to the users feeling distracted by the AR guide and worried about excessive screen time.</p>
<p>Certainly, not all theorists/critics have necessarily appreciated the desire to engage more senses in art: As Arnheim stated in an essay entitled, with clear reference to the distinction among the arts in Lessing&#x00027;s 18th-century aesthetics, A New Laoco&#x000F6;n: &#x0201C;in their attempts to attract the audience, two media are fighting each other instead of capturing it by a united effort&#x0201D; (Arnheim, <xref ref-type="bibr" rid="B13">1957</xref>, p. 199; Babbitt, <xref ref-type="bibr" rid="B14">1910</xref>)&#x02014;this sounds like a problem of divided attention, and indeed problems of working memory have been suggested as a limiting factor when auditory and visual inputs are somehow independent. So, while in the best-case scenario, some sort of multisensory emergence may be experienced in the context of crossmodal art, there is always the danger of sensory overload should the various unisensory inputs not be sufficiently congruent (Malhotra, <xref ref-type="bibr" rid="B184">1984</xref>). One other practical challenge that it is worth noting relates to the observation that deliberate attempts to enhance experience by deliberately introducing additional sensory inputs (e.g., in the field of sensory marketing) have, counterintuitively, often proved unsuccessful (Spence, <xref ref-type="bibr" rid="B285">2021</xref>).</p>
</sec>
<sec>
<title>6.4 Is it ethical to deliberately augment a work of art crossmodally?</title>
<p>Finally, it is worth briefly considering the ethical question that was raised by David Lomas (from the Art History department at the University of Manchester), in relation to Flying Object&#x00027;s crossmodal augmentation of four works of visual art from the permanent collection at Tate Britain through the use of scent, sound, virtual touch, and even chocolate in 2015 (see Pursey and Lomas, <xref ref-type="bibr" rid="B238">2018</xref>). Given that the artists of the original works were all dead by the time of the exhibition, one can ask the question of whether it is ethical to deliberately try to modify the meaning/viewer&#x00027;s experience of these important works of art by deliberately augmenting then through some form of crossmodal stimulation? Nevertheless, the situation feels importantly different from those cases where the original artist chooses to augment their own work (think Scriabin, and Jean-Michel Jarr&#x000E9;). At the same time, the augmentation is clearly differentiated from the original experience. Here, one might consider whether one arrives at a different conclusion when thinking about the crossmodally augmented art as a (partially) different artwork. Consider here Eug&#x000E8;ne Bataille&#x00027;s, <italic>La Joconde fumant la pipe</italic>, (1897) or Duchamp&#x00027;s ready-made depicting Leonardo&#x00027;s Mona Lisa with mustache (<italic>L.H.O.O.Q, 1919</italic>) or Dal&#x000EC;&#x00027;s later alteration (Self-portrait, 1954).</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s7">
<title>7 Conclusions</title>
<p>One of the key points to emerge from the present narrative historical review is that it may not be possible to use elemental correspondences (that is, crossmodal correspondences based on simple stimulus properties), given that works of art are mostly complex, multi-elemental, and semantically/emotionally meaningful. This conclusion seemingly contradicting Eisenstein&#x00027;s suggestion that there is no &#x0201C;pervading law of absolute meanings and correspondences between colors and sound.&#x0201D; (as quoted in Harrison, <xref ref-type="bibr" rid="B124">2001</xref>, p. 133&#x02013;134). One might also want to consider whether crossmodal effects are more likely to be experienced if the auditory and visual stimuli are clearly linked (i.e., rather than, say, when music is treated as no more than incidental background music; see Liu, <xref ref-type="bibr" rid="B177">2022</xref>). Returning to the main claims that were made in the Introduction: The division into basic, mid-level, and complex crossmodal correspondences certainly appears helpful when considering crossmodal correspondences/effects, especially in the context of translation/augmenting art experiences. At the same time, our sense is that more impressive outcomes are likely to be achieved should researchers and practitioners move away from the (idiosyncratic) synaesthesia mind-set, and rather focus instead on the consensually share crossmodal correspondences (see also Kargon, <xref ref-type="bibr" rid="B154">2011</xref>). It will undoubtedly be interesting to see how this &#x0201C;intersensoriality&#x0201D; in art develops now that theorizing has moved away from the distraction of synaesthesia that hindered the first wave of interest a little over a century ago (Dimova, <xref ref-type="bibr" rid="B89">2024</xref>; Howes, <xref ref-type="bibr" rid="B137">2023</xref>, <xref ref-type="bibr" rid="B135">2025</xref>).</p>
<p>One question for the future that we haven&#x00027;t had the opportunity to consider here is whether musical knowledge/expertise may affect audiovisual crossmodal correspondences and hence the way in which music and visuals interact in experience (Di Stefano et al., <xref ref-type="bibr" rid="B83">2024</xref>, <xref ref-type="bibr" rid="B82">2025</xref>; Di Stefano et al. (submitted)<xref ref-type="fn" rid="fn0011"><sup>11</sup></xref>; Guo et al., <xref ref-type="bibr" rid="B119">2023</xref>; Mok, <xref ref-type="bibr" rid="B198">2022</xref>; Vanechkina, <xref ref-type="bibr" rid="B311">1968</xref>; Walker, <xref ref-type="bibr" rid="B320">1987</xref>; see also Cutietta and Haggerty, <xref ref-type="bibr" rid="B77">1987</xref>). Ultimately, though, as we have just seen, there are a number of theoretical, practical, attentional, and ethical questions that may sometimes arise for those interested in augmenting works of auditory or visual art crossmodally.</p></sec>
</body>
<back>
<sec sec-type="author-contributions" id="s8">
<title>Author contributions</title>
<p>CS: Conceptualization, Writing &#x02013; original draft, Writing &#x02013; review &#x00026; editing. ND: Writing &#x02013; original draft, Writing &#x02013; review &#x00026; editing.</p>
</sec>
<sec sec-type="funding-information" id="s9">
<title>Funding</title>
<p>The author(s) declare that financial support was received for the research and/or publication of this article. AHRC grant helped fund this work.</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="s10">
<title>Generative AI statement</title>
<p>The author(s) declare that no Gen AI was used in the creation of this manuscript.</p></sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<fn-group>
<fn id="fn0001"><p><sup>1</sup>As Stein et al. (<xref ref-type="bibr" rid="B298">1996</xref>) note in one of their articles: &#x0201C;there is no animal in which there is known to be a complete segregation of sensory processing&#x0201D;.</p></fn>
<fn id="fn0002"><p><sup>2</sup>Note that while the descriptor &#x0201C;structural&#x0201D; might appear appropriate for many mid-level correspondences, the term &#x0201C;structural correspondences&#x0201D; has already been used in several other contexts previously (see Bernard, <xref ref-type="bibr" rid="B25">1986</xref>; Sebba, <xref ref-type="bibr" rid="B264">1991</xref>; Spence, <xref ref-type="bibr" rid="B278">2011</xref>). Hence, to avoid confusion, we will use the descriptor mid-level instead.</p></fn>
<fn id="fn0003"><p><sup>3</sup>Here supporting the early position adopted by Helmholtz (<xref ref-type="bibr" rid="B131">1878/1971</xref>, p. 77), and <italic>contra</italic> Marks (<xref ref-type="bibr" rid="B189">2011</xref>; cf. Hartshorne, <xref ref-type="bibr" rid="B126">1934</xref>). In brief, Helmholtz claimed that sensory modalities are fundamentally distinct, preventing direct transitions or comparisons between them. For instance, one cannot determine whether sweetness is more similar to red or blue. Comparisons are meaningful only within the same modality, such as identifying yellow as closer to orange than to blue.</p></fn>
<fn id="fn0004"><p><sup>4</sup>It is worth noting that, in similar protocols, the existence of a statistically significant crossmodal correspondence between stimuli in two arbitrary perceptual dimensions (such as pitch and hue; Walker-Andrews, <xref ref-type="bibr" rid="B321">1994</xref>) only demonstrates that the chosen pairing was the best of the options that happened to be available to participants at the time they were asked (see Spence and Levitan, <xref ref-type="bibr" rid="B292">2021</xref>).</p></fn>
<fn id="fn0005"><p><sup>5</sup>And see Menouti et al. (<xref ref-type="bibr" rid="B195">2015</xref>) for the synesthetic angle to this phenomenon.</p></fn>
<fn id="fn0006"><p><sup>6</sup>That said, Sourav et al. (<xref ref-type="bibr" rid="B276">2025</xref>) were unable to replicate at least one of these specific findings.</p></fn>
<fn id="fn0007"><p><sup>7</sup>Note though that Locke does not present this example in order to endorse its solution, but to comment ironically: &#x0201C;For the hope to produce an <italic>idea</italic> of light or color by a sound, however formed, is to expect that sounds should be visible or colors audible, and to make the ears do the office of all the other senses. Which is all one as to say that we might taste, smell, and see by the ears: a sort of philosophy worthy only of <italic>Sancho Pancha</italic>, who had the faculty to see <italic>Dulcinea</italic> by hearsay.&#x0201D; (Locke, <xref ref-type="bibr" rid="B178">1690</xref>, pp. 29&#x02013;30&#x02014;emphasis in the original).</p></fn>
<fn id="fn0008"><p><sup>8</sup>Note, though, that the relative nature of crossmodal correspondences might not be restricted to pitch (see Cohen, <xref ref-type="bibr" rid="B69">1934</xref>).</p></fn>
<fn id="fn0009"><p><sup>9</sup>Note though that the Pantone color of the year a few years ago was released along with a matching soundscape.</p></fn>
<fn id="fn0010"><p><sup>10</sup>One challenging case here is consonant and dissonant stimuli (Di Stefano et al., <xref ref-type="bibr" rid="B88">2022</xref>). Some consider dissonant stimuli unitary, others believe that people are aware of the component stimuli.</p></fn>
<fn id="fn0011"><p><sup>11</sup>Di Stefano, N., Ansani, A., Schiavio, A., Saarikallio, S., Toiviainen, P., Brattico, E., et al. (submitted). &#x0201C;Minor seconds are bitter and blueish while major sixths are sweet and yellowish: a cross-cultural study of multisensory mappings of musical intervals,&#x0201D; in <italic>Cognition</italic>.</p></fn>
<fn id="fn0012"><p><sup>12</sup>Of course, it is entirely possible that any Gestalt perceptual grouping might only occur unimodally in the auditory and/or visual modalities, and crossmodal art might emerge from the parallel unisensory Gestalt perceptual grouping observed in each of the component modalities (cf. Kubovy and Yu, <xref ref-type="bibr" rid="B161">2012</xref>)&#x02014;one might consider this a kind of crossmodal correspondence between unisensory Gestalt attributes (see also Bender et al., <xref ref-type="bibr" rid="B24">1954</xref>; Spence et al., <xref ref-type="bibr" rid="B293">2007</xref>).</p></fn>
<fn id="fn0013"><p><sup>13</sup>That said, spatial conceptual associations between music and pictures have been revealed by researchers looking at the N400 effect (Zhou et al., <xref ref-type="bibr" rid="B341">2014</xref>).</p></fn>
<fn id="fn0014"><p><sup>14</sup>It might be worth considering whether the concept of halo dumping (Clark and Lawless, <xref ref-type="bibr" rid="B64">1994</xref>) could be relevant in those studies where the participants are only given a single dimension on which to respond.</p></fn>
<fn id="fn0015"><p><sup>15</sup>Though it is perhaps worth noting here that some early sculptural pieces were explicitly designed to be felt not seen (see Gallace and Spence, <xref ref-type="bibr" rid="B109">2014</xref>). The shiny patina on many public brass sculptures clearly speaks to people&#x00027;s desire to haptically interact with many works of art when allowed.</p></fn>
<fn id="fn0016"><p><sup>16</sup>As was the <italic>Pyrophone</italic> from Kastner (1870) that incorporated 13 foil-covered gas nozzles, lighted by crystal tubes (Kastner, <xref ref-type="bibr" rid="B156">1875</xref>).</p></fn>
<fn id="fn0017"><p><sup>17</sup>The piece was initially conceptualized as a total work of art combining music, a light score or &#x0201C;luce&#x0201D;, and even fragrance (Dimova, <xref ref-type="bibr" rid="B89">2024</xref>; Peacock, <xref ref-type="bibr" rid="B230">1985</xref>; Plummer, <xref ref-type="bibr" rid="B235">1915</xref>; Scriabin, <xref ref-type="bibr" rid="B263">1910</xref>). Although what fragrance Scriabin intended to use was never specified by the composer.</p></fn>
<fn id="fn0018"><p><sup>18</sup>In fact, he was tested by C. S. Myers at the University of Cambridge, and the results written-up in a couple of academic papers (Myers, <xref ref-type="bibr" rid="B207">1911</xref>, <xref ref-type="bibr" rid="B208">1914</xref>).</p></fn>
<fn id="fn0019"><p><sup>19</sup>Spence, C., and Di Stefano, N. (submitted). Crossmodal counterpoint: From music to multimedia &#x02013; Incongruency, cognitive dissonance, irony, and surrealism. <italic>i-Perception</italic>.</p></fn>
<fn id="fn0020"><p><sup>20</sup>I.e., just as has been suggested in the case of Scriabin due to the crossmodal correspondence between light and sound in Poem of Fire (Wells, <xref ref-type="bibr" rid="B333">1980</xref>; see also Boltz, <xref ref-type="bibr" rid="B29">2013</xref>; see also Spence and Di Stefano, <xref ref-type="bibr" rid="B291">2025</xref>).</p></fn>
<fn id="fn0021"><p><sup>21</sup>Although such audiovisual entertainments seemingly lost some of their magic subsequently.</p></fn>
<fn id="fn0022"><p><sup>22</sup>Note that semantic correspondences have also been shown to influence perception crossmodally (see Chen and Spence, <xref ref-type="bibr" rid="B55">2010</xref>, <xref ref-type="bibr" rid="B56">2011</xref>).</p></fn>
<fn id="fn0023"><p><sup>23</sup>One might also be reminded here of Nordau&#x00027;s (<xref ref-type="bibr" rid="B210">1898</xref>) suggestion that synaesthetic art (as popularized in the contemporary context by the likes of Waterworth, <xref ref-type="bibr" rid="B327">1997</xref>; Whitelaw, <xref ref-type="bibr" rid="B335">2008</xref>) was somehow degenerate (Nordau, <xref ref-type="bibr" rid="B210">1898</xref>).</p></fn>
</fn-group>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Actis-Grosso</surname> <given-names>R.</given-names></name> <name><surname>Zavagno</surname> <given-names>D.</given-names></name> <name><surname>Lega</surname> <given-names>C.</given-names></name> <name><surname>Zani</surname> <given-names>A.</given-names></name> <name><surname>Daneyko</surname> <given-names>O.</given-names></name> <name><surname>Cattaneo</surname> <given-names>Z.</given-names></name></person-group> (<year>2017</year>). <article-title>Can music be figurative? Exploring the possibility of crossmodal similarities between music and visual arts</article-title>. <source>Psihologija</source> <volume>50</volume>, <fpage>285</fpage>&#x02013;<lpage>306</lpage>. <pub-id pub-id-type="doi">10.2298/PSI1703285A</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Adams</surname> <given-names>C. S.</given-names></name></person-group> (<year>1995</year>). <article-title>Artistic parallels between Arnold Schoenberg&#x00027;s music and painting (1908-1912)</article-title>. <source>College Music Sympos.</source> <volume>35</volume>, <fpage>5</fpage>&#x02013;<lpage>21</lpage>.</citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Adeli</surname> <given-names>M.</given-names></name> <name><surname>Rouat</surname> <given-names>J.</given-names></name> <name><surname>Molotchnikoff</surname> <given-names>S.</given-names></name></person-group> (<year>2014</year>). <article-title>Audiovisual correspondence between musical timbre and visual shapes</article-title>. <source>Front. Hum. Neurosci.</source> <volume>8</volume>:<fpage>352</fpage>. <pub-id pub-id-type="doi">10.3389/fnhum.2014.00352</pub-id><pub-id pub-id-type="pmid">24910604</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Aitamurto</surname> <given-names>T.</given-names></name> <name><surname>Boin</surname> <given-names>J. B.</given-names></name> <name><surname>Chen</surname> <given-names>K.</given-names></name> <name><surname>Cherif</surname> <given-names>A.</given-names></name> <name><surname>Shridhar</surname> <given-names>S.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;The impact of augmented reality on art engagement: Liking, impression of learning, and distraction,&#x0201D;</article-title> in <source>Virtual, Augmented and Mixed Reality: Applications in Health, Cultural Heritage, and Industry: 10</source><sup><italic>th</italic></sup> <italic>International Conference, VAMR 2018, Held as Part of HCI International 2018</italic> (<publisher-loc>Las Vegas, NV</publisher-loc>: <publisher-name>Springer International Publishing</publisher-name>), <fpage>153</fpage>&#x02013;<lpage>171</lpage>.</citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Albertazzi</surname> <given-names>L.</given-names></name> <name><surname>Canal</surname> <given-names>L.</given-names></name> <name><surname>Micciolo</surname> <given-names>R.</given-names></name></person-group> (<year>2015</year>). <article-title>Cross-modal association between materic painting and classical Spanish music</article-title>. <source>Front. Psychol.</source> <volume>6</volume>:<fpage>424</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2015.00424</pub-id><pub-id pub-id-type="pmid">25954217</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Albertazzi</surname> <given-names>L.</given-names></name> <name><surname>Canal</surname> <given-names>L.</given-names></name> <name><surname>Micciolo</surname> <given-names>R.</given-names></name> <name><surname>Hachen</surname> <given-names>I.</given-names></name></person-group> (<year>2020</year>). <article-title>Cross-modal perceptual organization in works of art</article-title>. <source>i-Perception.</source> <volume>11</volume>, <fpage>1</fpage>&#x02013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1177/2041669520950750</pub-id><pub-id pub-id-type="pmid">32922715</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Allen</surname> <given-names>E. C.</given-names></name> <name><surname>Guilford</surname> <given-names>J. P.</given-names></name></person-group> (<year>1936</year>). <article-title>Factors determining the affective values of color combinations</article-title>. <source>Am. J. Psychol.</source> <volume>48</volume>, <fpage>643</fpage>&#x02013;<lpage>648</lpage>. <pub-id pub-id-type="doi">10.2307/1416516</pub-id><pub-id pub-id-type="pmid">9075296</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Alves</surname> <given-names>B.</given-names></name></person-group> (<year>2005</year>). <article-title>Digital harmony of sound and light</article-title>. <source>Comp. Music J.</source> <volume>29</volume>, <fpage>45</fpage>&#x02013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1162/014892605775179982</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anderson</surname> <given-names>J. D.</given-names></name></person-group> (<year>1993</year>). <article-title>Sound and image together: Cross-modal confirmation</article-title>. <source>Wide Angle</source> <volume>15</volume>, <fpage>30</fpage>&#x02013;<lpage>43</lpage>.</citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anikin</surname> <given-names>A.</given-names></name> <name><surname>Johansson</surname> <given-names>N.</given-names></name></person-group> (<year>2019</year>). <article-title>Implicit associations between individual properties of color and sound</article-title>. <source>Attent. Percp. Psychophys.</source> <volume>81</volume>, <fpage>764</fpage>&#x02013;<lpage>777</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-018-01639-7</pub-id><pub-id pub-id-type="pmid">30547381</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ansani</surname> <given-names>A.</given-names></name> <name><surname>Marini</surname> <given-names>M.</given-names></name> <name><surname>D&#x00027;Errico</surname> <given-names>F.</given-names></name> <name><surname>Poggi</surname> <given-names>I.</given-names></name></person-group> (<year>2020</year>). <article-title>How soundtracks shape what we see: Analyzing the influence of music on visual scenes through self-assessment, eye tracking, and pupillometry</article-title>. <source>Front. Psychol.</source> <volume>11</volume>:<fpage>556697</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2020.02242</pub-id><pub-id pub-id-type="pmid">33117211</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Arg&#x000FC;elles</surname> <given-names>J. A.</given-names></name></person-group> (<year>1972</year>). <source>Charles Henry and the Formation of a Psychophysical Aesthetic</source>. <publisher-loc>Chicago, IL</publisher-loc>: <publisher-name>University of Chicago Press</publisher-name>.</citation>
</ref>
<ref id="B13">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Arnheim</surname> <given-names>R.</given-names></name></person-group> (<year>1957</year>). <article-title>&#x0201C;New Laocoon: artistic composites and the talking film,&#x0201D;</article-title> in <source>Film as Art</source> (<publisher-loc>Berkeley, CA</publisher-loc>: <publisher-name>University of California Press</publisher-name>), <fpage>199</fpage>&#x02013;<lpage>230</lpage>.</citation>
</ref>
<ref id="B14">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Babbitt</surname> <given-names>I.</given-names></name></person-group> (<year>1910</year>). <source>The new Laokoon: An Essay on the Confusion of the Arts</source>. <publisher-loc>Boston, MA</publisher-loc>: <publisher-name>Houghton</publisher-name>.</citation>
</ref>
<ref id="B15">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Baker</surname> <given-names>J.</given-names></name></person-group> (<year>2002</year>). <article-title>&#x0201C;Prometheus and the quest for color-music: the world premiere of Scriabin&#x00027;s Poem of Fire with lights, New York, March 20, 1915,&#x0201D;</article-title> in <source>Music and Modern Art</source>, ed. J. Leggio (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Routledge</publisher-name>), <fpage>61</fpage>&#x02013;<lpage>95</lpage>.</citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Banes</surname> <given-names>S.</given-names></name></person-group> (<year>2001</year>). <article-title>Olfactory performances</article-title>. <source>TDR/The Drama Review</source> <volume>45</volume>, <fpage>68</fpage>&#x02013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1162/105420401300079040</pub-id></citation>
</ref>
<ref id="B17">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Barlow</surname> <given-names>H.</given-names></name> <name><surname>Mollon</surname> <given-names>J.</given-names></name></person-group> (<year>1982</year>). <source>The Senses</source>. <publisher-loc>Cambridge, UK</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>.</citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baron-Cohen</surname> <given-names>S.</given-names></name> <name><surname>Harrison</surname> <given-names>J.</given-names></name> <name><surname>Goldstein</surname> <given-names>L. H.</given-names></name> <name><surname>Wyke</surname> <given-names>M.</given-names></name></person-group> (<year>1993</year>). <article-title>Coloured speech perception: Is synaesthesia what happens when modularity breaks down?</article-title> <source>Perception</source> <volume>22</volume>, <fpage>419</fpage>&#x02013;<lpage>426</lpage>. <pub-id pub-id-type="doi">10.1068/p220419</pub-id><pub-id pub-id-type="pmid">8378132</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Basquin</surname> <given-names>K. S.</given-names></name></person-group> (<year>2020</year>). <source>Mary Ellen Bute: Pioneer Animator</source>. <publisher-loc>New Barnet, UK</publisher-loc>: <publisher-name>John Libbey Publishing</publisher-name>.</citation>
</ref>
<ref id="B20">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Battey</surname> <given-names>B.</given-names></name> <name><surname>Fischman</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Convergence of time and space,&#x0201D;</article-title> in <source>The Oxford Handbook of Sound and Image in Western Art</source>, ed. T. Kaduri (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>61</fpage>&#x02013;<lpage>82</lpage>.</citation>
</ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Baumgarten</surname> <given-names>A. G.</given-names></name></person-group> (<year>1750</year>). <source>Aesthetica.</source> Kleyb.</citation>
</ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Behne</surname> <given-names>K.-E.</given-names></name></person-group> (<year>1999</year>). <article-title>Zu einer Theorie der Wirkungslosigkeit von (Hintergrund-)Musik [To a theory of the ineffectiveness of (background) music]</article-title>. <source>Musikpsychologie</source> <volume>14</volume>, <fpage>7</fpage>&#x02013;<lpage>23</lpage>.</citation>
</ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ben&#x0010D;i&#x00107;</surname> <given-names>L.</given-names></name></person-group> (<year>2021</year>). <article-title>Graphical sound &#x02013; From inception up to the masterpieces</article-title>. <source>Academia Lett.</source> <volume>2021</volume>:<fpage>1108</fpage>. <pub-id pub-id-type="doi">10.20935/AL1108</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bender</surname> <given-names>M. B.</given-names></name> <name><surname>Green</surname> <given-names>M. A.</given-names></name> <name><surname>Fink</surname> <given-names>M.</given-names></name></person-group> (<year>1954</year>). <article-title>Patterns of perceptual organization with simultaneous stimuli</article-title>. <source>Arch. Neurol. Psychiatry</source> <volume>72</volume>, <fpage>233</fpage>&#x02013;<lpage>255</lpage>. <pub-id pub-id-type="doi">10.1001/archneurpsyc.1954.02330020101009</pub-id><pub-id pub-id-type="pmid">13180058</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bernard</surname> <given-names>J. W.</given-names></name></person-group> (<year>1986</year>). <article-title>Messiaen&#x00027;s synaesthesia: the correspondence between colour and sound structure in his music</article-title>. <source>Music Percept.</source> <volume>4</volume>, <fpage>41</fpage>&#x02013;<lpage>68</lpage>. <pub-id pub-id-type="doi">10.2307/40285351</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bernstein</surname> <given-names>I. H.</given-names></name> <name><surname>Eason</surname> <given-names>T. R.</given-names></name> <name><surname>Schurman</surname> <given-names>D. L.</given-names></name></person-group> (<year>1971</year>). <article-title>Hue-tone interaction: A negative result</article-title>. <source>Percep. Motor Skills</source> <volume>33</volume>, <fpage>1327</fpage>&#x02013;<lpage>1330</lpage>. <pub-id pub-id-type="doi">10.2466/pms.1971.33.3f.1327</pub-id><pub-id pub-id-type="pmid">5160057</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bhattacharya</surname> <given-names>J.</given-names></name> <name><surname>Lindsen</surname> <given-names>J. P.</given-names></name></person-group> (<year>2016</year>). <article-title>Music for a brighter world: brightness judgment bias by musical emotion</article-title>. <source>PLoS ONE</source> <volume>11</volume>:<fpage>e0148959</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0148959</pub-id><pub-id pub-id-type="pmid">26863420</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bolivar</surname> <given-names>V. J.</given-names></name> <name><surname>Cohen</surname> <given-names>A. J.</given-names></name> <name><surname>Fentress</surname> <given-names>J. C.</given-names></name></person-group> (<year>1994</year>). <article-title>Semantic and formal congruency in music and motion pictures: effects on the interpretation of visual action</article-title>. <source>Psychomusicology</source> <volume>13</volume>, <fpage>28</fpage>&#x02013;<lpage>59</lpage>. <pub-id pub-id-type="doi">10.1037/h0094102</pub-id></citation>
</ref>
<ref id="B29">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Boltz</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Music videos and visual influences on music perception and appreciation: Should you want your MTV?,&#x0201D;</article-title> in <source>The Psychology of Music in Multimedia</source>, eds. S. L. Tan, A. Cohen, S. Lipscomb, and R. Kendall (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>217</fpage>&#x02013;<lpage>235</lpage>.</citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boltz</surname> <given-names>M.</given-names></name> <name><surname>Schulkind</surname> <given-names>M.</given-names></name> <name><surname>Kantra</surname> <given-names>S.</given-names></name></person-group> (<year>1991</year>). <article-title>Effects of background music on the remembering of filmed events</article-title>. <source>Mem. Cognit.</source> <volume>19</volume>, <fpage>593</fpage>&#x02013;<lpage>606</lpage>. <pub-id pub-id-type="doi">10.3758/BF03197154</pub-id><pub-id pub-id-type="pmid">1721996</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boltz</surname> <given-names>M. G.</given-names></name></person-group> (<year>2001</year>). <article-title>Musical soundtracks as a schematic influence on the cognitive processing of filmed events</article-title>. <source>Music Percept.</source> <volume>18</volume>, <fpage>427</fpage>&#x02013;<lpage>454</lpage>. <pub-id pub-id-type="doi">10.1525/mp.2001.18.4.427</pub-id></citation>
</ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boltz</surname> <given-names>M. G.</given-names></name></person-group> (<year>2004</year>). <article-title>The cognitive processing of film and musical soundtracks</article-title>. <source>Mem. Cognit.</source> <volume>32</volume>, <fpage>1194</fpage>&#x02013;<lpage>1205</lpage>. <pub-id pub-id-type="doi">10.3758/BF03196892</pub-id><pub-id pub-id-type="pmid">15813500</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boltz</surname> <given-names>M. G.</given-names></name></person-group> (<year>2017</year>). <article-title>Auditory driving in cinematic art</article-title>. <source>Music Percept.</source> <volume>35</volume>, <fpage>77</fpage>&#x02013;<lpage>93</lpage>. <pub-id pub-id-type="doi">10.1525/mp.2017.35.1.77</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boltz</surname> <given-names>M. G.</given-names></name></person-group> (<year>2018</year>). <article-title>Auditory driving and affective influences</article-title>. <source>Appl. Cogn. Psychol.</source> <volume>32</volume>, <fpage>512</fpage>&#x02013;<lpage>517</lpage>. <pub-id pub-id-type="doi">10.1002/acp.3420</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bond</surname> <given-names>B.</given-names></name> <name><surname>Stevens</surname> <given-names>S. S.</given-names></name></person-group> (<year>1969</year>). <article-title>Cross-modality matching of brightness to loudness by 5-year-olds</article-title>. <source>Percept. Psychophys.</source> <volume>6</volume>, <fpage>337</fpage>&#x02013;<lpage>339</lpage>. <pub-id pub-id-type="doi">10.3758/BF03212787</pub-id><pub-id pub-id-type="pmid">5217429</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonetti</surname> <given-names>L.</given-names></name> <name><surname>Costa</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>Pitch-verticality and pitch-size cross-modal interactions</article-title>. <source>Psychol. Music</source> <volume>46</volume>, <fpage>340</fpage>&#x02013;<lpage>356</lpage>. <pub-id pub-id-type="doi">10.1177/0305735617710734</pub-id></citation>
</ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonetti</surname> <given-names>L.</given-names></name> <name><surname>Costa</surname> <given-names>M.</given-names></name></person-group> (<year>2019</year>). <article-title>Musical mode and visual-spatial cross-modal associations in infants and adults</article-title>. <source>Musicae Scientiae</source> <volume>23</volume>, <fpage>50</fpage>&#x02013;<lpage>68</lpage>. <pub-id pub-id-type="doi">10.1177/1029864917705001</pub-id></citation>
</ref>
<ref id="B38">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Boswell</surname> <given-names>S.</given-names></name></person-group> (<year>2012</year>). <article-title>&#x0201C;Music in cinema: how soundtrack composers act on the way people feel, keynote speech on Music and emotions: compositions perspectives,&#x0201D;</article-title> in <source>Presented at the 9th International Symposium CMMR 2012 on Music and Emotions</source>, London, UK, June <fpage>19</fpage>&#x02013;<lpage>22</lpage> (<publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>).</citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bragdon</surname> <given-names>C.</given-names></name></person-group> (<year>1916</year>). <article-title>Song and light</article-title>. <source>Architect. Record</source> <volume>4</volume>, <fpage>169</fpage>&#x02013;<lpage>172</lpage>.</citation>
</ref>
<ref id="B40">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bragdon</surname> <given-names>C.</given-names></name></person-group> (<year>1918</year>). <source>Architecture and Democracy</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Lector House</publisher-name>.</citation>
</ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Braun Janzen</surname> <given-names>T.</given-names></name> <name><surname>de Oliveira</surname> <given-names>B.</given-names></name> <name><surname>Ventorim Ferreira</surname> <given-names>G.</given-names></name> <name><surname>Sato</surname> <given-names>J. R.</given-names></name> <name><surname>Feitosa-Santana</surname> <given-names>C.</given-names></name> <name><surname>Vanzella</surname> <given-names>P.</given-names></name></person-group> (<year>2023</year>). <article-title>The effect of background music on the aesthetic experience of a visual artwork in a naturalistic environment</article-title>. <source>Psychology of Music</source> <volume>51</volume>, <fpage>16</fpage>&#x02013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1177/03057356221079866</pub-id></citation>
</ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brent-Smith</surname> <given-names>A.</given-names></name></person-group> (<year>1926a</year>). <article-title>Some reflections on the work of Scriabin</article-title>. <source>The Musical Times</source> <volume>67</volume>, <fpage>593</fpage>&#x02013;<lpage>595</lpage>. <pub-id pub-id-type="doi">10.2307/911826</pub-id></citation>
</ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brent-Smith</surname> <given-names>A.</given-names></name></person-group> (<year>1926b</year>). <article-title>Some reflections on the work of Scriabin</article-title>. <source>The Musical Times</source> <volume>67</volume>, <fpage>692</fpage>&#x02013;<lpage>694</lpage>. <pub-id pub-id-type="doi">10.2307/911978</pub-id></citation>
</ref>
<ref id="B44">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bresin</surname> <given-names>R.</given-names></name></person-group> (<year>2005</year>). <article-title>&#x0201C;What is the color of that music performance?,&#x0201D;</article-title> <italic>in Proceedings of the International Computer Music Conference</italic> (<publisher-loc>Barcelona, Spain</publisher-loc>: <publisher-name>International Computer Music Association</publisher-name>), <fpage>367</fpage>-<lpage>370</lpage>.<pub-id pub-id-type="pmid">35369229</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brunel</surname> <given-names>L.</given-names></name> <name><surname>Carvalho</surname> <given-names>P. F.</given-names></name> <name><surname>Goldstone</surname> <given-names>R. L.</given-names></name></person-group> (<year>2015</year>). <article-title>It does belong together: Cross-modal correspondences influence cross-modal integration during perceptual learning</article-title>. <source>Front. Psychol.</source> <volume>6</volume>:<fpage>358</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2015.00358</pub-id><pub-id pub-id-type="pmid">25914653</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bullerjahn</surname> <given-names>C.</given-names></name> <name><surname>G&#x000FC;ldenring</surname> <given-names>M.</given-names></name></person-group> (<year>1994</year>). <article-title>An empirical investigation of effects of film music using qualitative content analysis</article-title>. <source>Psychomusicology</source> <volume>13</volume>, <fpage>99</fpage>&#x02013;<lpage>118</lpage>. <pub-id pub-id-type="doi">10.1037/h0094100</pub-id></citation>
</ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Burchett</surname> <given-names>K. E.</given-names></name></person-group> (<year>2002</year>). <article-title>Color harmony</article-title>. <source>Color Research and Application</source> <volume>29</volume>, <fpage>28</fpage>&#x02013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1002/col.10004</pub-id></citation>
</ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caivano</surname> <given-names>J. L.</given-names></name></person-group> (<year>1994</year>). <article-title>Color and sound: physical and psychophysical relations</article-title>. <source>Color Res. Appl.</source> <volume>19</volume>, <fpage>126</fpage>&#x02013;<lpage>133</lpage>. <pub-id pub-id-type="doi">10.1111/j.1520-6378.1994.tb00072.x</pub-id></citation>
</ref>
<ref id="B49">
<citation citation-type="book"><person-group person-group-type="editor"><name><surname>Calvert</surname> <given-names>G.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Stein</surname> <given-names>B. E.</given-names></name></person-group> (eds.). (<year>2004</year>). The handbook of multisensory processing. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation>
</ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cardullo</surname> <given-names>R. J.</given-names></name></person-group> (<year>2017</year>). <article-title>Gesamtkunstwerk, synesthesia, and the Avant-garde: Wassily Kandinsky&#x00027;s The Yellow Sound as a work of art</article-title>. <source>Hermeneia</source> <volume>18</volume>, <fpage>5</fpage>&#x02013;<lpage>21</lpage>.</citation>
</ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carr</surname> <given-names>S. M.</given-names></name> <name><surname>Rickard</surname> <given-names>N. S.</given-names></name></person-group> (<year>2016</year>). <article-title>The use of emotionally arousing music to enhance memory for subsequently presented images</article-title>. <source>Psychol. Music</source> <volume>44</volume>, <fpage>1145</fpage>&#x02013;<lpage>1157</lpage>. <pub-id pub-id-type="doi">10.1177/0305735615613846</pub-id></citation>
</ref>
<ref id="B52">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Carraturo</surname> <given-names>G.</given-names></name> <name><surname>Pando-Naude</surname> <given-names>V.</given-names></name> <name><surname>Costa</surname> <given-names>M.</given-names></name> <name><surname>Vuust</surname> <given-names>P.</given-names></name> <name><surname>Bonetti</surname> <given-names>L.</given-names></name> <name><surname>Brattico</surname> <given-names>E.</given-names></name></person-group> (<year>2025</year>). <article-title>The major-minor mode dichotomy in music perception</article-title>. <source>Phys. Life Rev.</source> <volume>52</volume>, <fpage>80</fpage>&#x02013;<lpage>106</lpage>. <pub-id pub-id-type="doi">10.1016/j.plrev.2024.11.017</pub-id><pub-id pub-id-type="pmid">39721138</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cavazos Quero</surname> <given-names>L.</given-names></name> <name><surname>Lee</surname> <given-names>C.-H.</given-names></name> <name><surname>Cho</surname> <given-names>J.-D.</given-names></name></person-group> (<year>2021</year>). <article-title>Multi-sensory color code based on sound and scent for visual art appreciation</article-title>. <source>Electronics</source> <volume>10</volume>:<fpage>1696</fpage>. <pub-id pub-id-type="doi">10.3390/electronics10141696</pub-id></citation>
</ref>
<ref id="B54">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>L.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Synaesthetic correspondence between auditory clips and colors: an empirical study,&#x0201D;</article-title> in <source>Intelligent Science and Intelligent Data Engineering. IScIDE 2012. Lecture Notes in Computer Science</source>, eds. J. Yang, F. Fang, and C. Sun (<publisher-loc>Berlin, Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>7751</fpage>.</citation>
</ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Y.-C.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2010</year>). <article-title>When hearing the bark helps to identify the dog: Semantically-congruent sounds modulate the identification of masked pictures</article-title>. <source>Cognition</source> <volume>114</volume>, <fpage>389</fpage>&#x02013;<lpage>404</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2009.10.012</pub-id><pub-id pub-id-type="pmid">19909945</pub-id></citation></ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Y.-C.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>Crossmodal semantic priming by naturalistic sounds and spoken words enhances visual sensitivity</article-title>. <source>J. Exp. Psychol.</source> <volume>37</volume>, <fpage>1554</fpage>&#x02013;<lpage>1568</lpage>. <pub-id pub-id-type="doi">10.1037/a0024329</pub-id><pub-id pub-id-type="pmid">21688942</pub-id></citation></ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Wu</surname> <given-names>L.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name></person-group> (<year>2025</year>). <article-title>Can sounds change the perception of pictures? Exploring the influence of semantic and spatial congruency on aesthetic perception</article-title>. <source>Collabra: Psychol.</source> <volume>11</volume>:<fpage>133241</fpage>. <pub-id pub-id-type="doi">10.1525/collabra.133241</pub-id></citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chiat</surname> <given-names>L. F.</given-names></name> <name><surname>Ying</surname> <given-names>L. F.</given-names></name> <name><surname>Piaw</surname> <given-names>C. Y.</given-names></name></person-group> (<year>2013</year>). <article-title>Perception of congruence between music and movement in a rhythmic gymnastics routine</article-title>. <source>J. Basic Appl. Scient. Res.</source> <volume>3</volume>, <fpage>259</fpage>&#x02013;<lpage>268</lpage>.</citation>
</ref>
<ref id="B59">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chion</surname> <given-names>M.</given-names></name></person-group> (<year>1990</year>). <source>Audio-Vision: Sound on Screen</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Columbia University Press</publisher-name>.</citation>
</ref>
<ref id="B60">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chion</surname> <given-names>M.</given-names></name></person-group> (<year>2000</year>). <article-title>&#x0201C;Audio-vision and sound,&#x0201D;</article-title> in Sound, eds. P. Kruth, and H. Stobart (<publisher-loc>Cambridge, UK</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>), <fpage>201</fpage>&#x02013;<lpage>221</lpage>.</citation>
</ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chiou</surname> <given-names>R.</given-names></name> <name><surname>Rich</surname> <given-names>A. N.</given-names></name></person-group> (<year>2012</year>). <article-title>Cross-modality correspondence between pitch and spatial location modulates attentional orienting</article-title>. <source>Perception</source> <volume>41</volume>, <fpage>339</fpage>&#x02013;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1068/p7161</pub-id><pub-id pub-id-type="pmid">22808586</pub-id></citation></ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chmiel</surname> <given-names>A.</given-names></name> <name><surname>Schubert</surname> <given-names>E.</given-names></name></person-group> (<year>2019</year>). <article-title>Psycho-historical contextualization for music and visual works: a literature review and comparison between artistic mediums</article-title>. <source>Front. Psychol.</source> <volume>10</volume>:<fpage>182</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2019.00182</pub-id><pub-id pub-id-type="pmid">30837907</pub-id></citation></ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cho</surname> <given-names>J. D.</given-names></name> <name><surname>Jeong</surname> <given-names>J.</given-names></name> <name><surname>Kim</surname> <given-names>J. H.</given-names></name> <name><surname>Lee</surname> <given-names>H.</given-names></name></person-group> (<year>2020</year>). <article-title>Sound coding color to improve artwork appreciation by people with visual impairments</article-title>. <source>Electronics</source> <volume>9</volume>:<fpage>1981</fpage>. <pub-id pub-id-type="doi">10.3390/electronics9111981</pub-id></citation>
</ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clark</surname> <given-names>C. C.</given-names></name> <name><surname>Lawless</surname> <given-names>H. T.</given-names></name></person-group> (<year>1994</year>). <article-title>Limiting response alternatives in time-intensity scaling: an examination of the halo dumping effect</article-title>. <source>Chem. Senses</source> <volume>19</volume>, <fpage>583</fpage>&#x02013;<lpage>594</lpage>. <pub-id pub-id-type="doi">10.1093/chemse/19.6.583</pub-id><pub-id pub-id-type="pmid">7735838</pub-id></citation></ref>
<ref id="B65">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Classen</surname> <given-names>C.</given-names></name></person-group> (<year>1998</year>). <source>The Color of Angels: Cosmology, Gender and the Aesthetic Imagination</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Routledge</publisher-name>.</citation>
</ref>
<ref id="B66">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Clemente</surname> <given-names>A.</given-names></name> <name><surname>Friberg</surname> <given-names>A.</given-names></name> <name><surname>Holzapfel</surname> <given-names>A.</given-names></name></person-group> (<year>2023</year>). <article-title>Relations between perceived affect and liking for melodies and visual designs</article-title>. <source>Emotion</source> <volume>23</volume>, <fpage>1584</fpage>&#x02013;<lpage>1605</lpage>. <pub-id pub-id-type="doi">10.1037/emo0001141</pub-id><pub-id pub-id-type="pmid">36227314</pub-id></citation></ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>A. J.</given-names></name></person-group> (<year>1990</year>). <article-title>Associationism and musical soundtrack phenomena</article-title>. <source>Contemp. Music Rev.</source> <volume>9</volume>, <fpage>163</fpage>&#x02013;<lpage>178</lpage>. <pub-id pub-id-type="doi">10.1080/07494469300640421</pub-id></citation>
</ref>
<ref id="B68">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>A. J.</given-names></name></person-group> (<year>2010</year>). <article-title>&#x0201C;Music as a source of emotion in film,&#x0201D;</article-title> in <source>Handbook of Music and Emotion: Theory, Research, Applications</source>, eds. P. N. Juslin and J. A. Sloboda (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>879</fpage>&#x02013;<lpage>908</lpage>.</citation>
</ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cohen</surname> <given-names>N. E.</given-names></name></person-group> (<year>1934</year>). <article-title>Equivalence of brightness across modalities</article-title>. <source>Am. J. Psychol.</source> <volume>46</volume>, <fpage>117</fpage>&#x02013;<lpage>119</lpage>. <pub-id pub-id-type="doi">10.2307/1416240</pub-id><pub-id pub-id-type="pmid">23039636</pub-id></citation></ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Collier</surname> <given-names>W. G.</given-names></name> <name><surname>Hubbard</surname> <given-names>T. L.</given-names></name></person-group> (<year>2001</year>). <article-title>Musical scales and evaluations of happiness and awkwardness: Effects of pitch, direction, and scale mode</article-title>. <source>Am. J. Psychol.</source> <volume>114</volume>, <fpage>355</fpage>&#x02013;<lpage>375</lpage>. <pub-id pub-id-type="doi">10.2307/1423686</pub-id><pub-id pub-id-type="pmid">11641885</pub-id></citation></ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Collopy</surname> <given-names>F.</given-names></name></person-group> (<year>2000</year>). <article-title>Colour, form, and motion &#x02013; dimensions of a musical art of light</article-title>. <source>Leonardo</source> <volume>33</volume>, <fpage>355</fpage>&#x02013;<lpage>360</lpage>. <pub-id pub-id-type="doi">10.1162/002409400552829</pub-id></citation>
</ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Conrad</surname> <given-names>D.</given-names></name></person-group> (<year>1999</year>). <article-title>The Dichromaccord: reinventing the elusive color organ</article-title>. <source>Leonardo</source> <volume>32</volume>, <fpage>393</fpage>&#x02013;<lpage>398</lpage>. <pub-id pub-id-type="doi">10.1162/002409499553631</pub-id></citation>
</ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cook</surname> <given-names>L. A.</given-names></name> <name><surname>Van Valkenburg</surname> <given-names>D. L.</given-names></name></person-group> (<year>2009</year>). <article-title>Audio-visual organization and the temporal ventriloquism effect between grouped sequences: evidence that unimodal grouping precedes cross-modal integration</article-title>. <source>Perception</source> <volume>38</volume>, <fpage>1220</fpage>&#x02013;<lpage>1233</lpage>. <pub-id pub-id-type="doi">10.1068/p6344</pub-id><pub-id pub-id-type="pmid">19817154</pub-id></citation></ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cowles</surname> <given-names>J. T.</given-names></name></person-group> (<year>1935</year>). <article-title>An experimental study of the pairing of certain auditory and visual stimuli</article-title>. <source>J. Exp. Psychol.</source> <volume>18</volume>, <fpage>461</fpage>&#x02013;<lpage>469</lpage>. <pub-id pub-id-type="doi">10.1037/h0062202</pub-id></citation>
</ref>
<ref id="B75">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Critchley</surname> <given-names>M.</given-names></name></person-group> (<year>1977</year>). <article-title>&#x0201C;Ecstatic and synaesthetic experiences during music perception,&#x0201D;</article-title> in <source>Music and the Brain</source>, eds. M. Critchley, and R. A. Henson (<publisher-loc>Springfield, IL</publisher-loc>: <publisher-name>C. C. Thomas</publisher-name>), <fpage>217</fpage>&#x02013;<lpage>232</lpage>.</citation>
</ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cuddy</surname> <given-names>L. L.</given-names></name></person-group> (<year>1985</year>). <article-title>The color of melody</article-title>. <source>Music Percept.</source> <volume>2</volume>, <fpage>345</fpage>&#x02013;<lpage>360</lpage>. <pub-id pub-id-type="doi">10.2307/40285303</pub-id></citation>
</ref>
<ref id="B77">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cutietta</surname> <given-names>R. A.</given-names></name> <name><surname>Haggerty</surname> <given-names>K. J.</given-names></name></person-group> (<year>1987</year>). <article-title>A comparative study of color association with music at various age levels</article-title>. <source>J. Res. Music Educ.</source> <volume>35</volume>, <fpage>78</fpage>&#x02013;<lpage>91</lpage>. <pub-id pub-id-type="doi">10.2307/3344984</pub-id></citation>
</ref>
<ref id="B78">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Danto</surname> <given-names>A. C.</given-names></name></person-group> (<year>1981</year>). <source>The Transfiguration of the Commonplace: A Philosophy of Art</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>Harvard University Press</publisher-name>.</citation>
</ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Davis</surname> <given-names>J. W.</given-names></name></person-group> (<year>1979</year>). <article-title>A response to Garner&#x00027;s observations on the relationship between colour and music</article-title>. <source>Leonardo</source> <volume>12</volume>, <fpage>218</fpage>&#x02013;<lpage>219</lpage>. <pub-id pub-id-type="doi">10.2307/1574213</pub-id></citation>
</ref>
<ref id="B80">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Deroy</surname> <given-names>O.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2013</year>). <article-title>Why we are not all synesthetes (not even weakly so)</article-title>. <source>Psychon. Bull. Rev.</source> <volume>20</volume>, <fpage>643</fpage>&#x02013;<lpage>664</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-013-0387-2</pub-id><pub-id pub-id-type="pmid">23413012</pub-id></citation></ref>
<ref id="B81">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Deutsch</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). &#x0201C;Synaesthesia and synergy in art. Gustav Mahler&#x00027;s &#x0201C;Symphony No. 2 in C minor&#x0201D; as an example of interactive music visualization,&#x0201D; in <source>Sensory Perception&#x02014;Mind and Matter</source>, eds. F. G. Barth, P. Giampieri-Deutsch, and H.-D. Klein (<publisher-loc>Vienna</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>215</fpage>&#x02013;<lpage>235</lpage>.</citation>
</ref>
<ref id="B82">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Di Stefano</surname> <given-names>N.</given-names></name> <name><surname>Ansani</surname> <given-names>A.</given-names></name> <name><surname>Schiavio</surname> <given-names>A.</given-names></name> <name><surname>Saarikallio</surname> <given-names>S.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2025</year>). <article-title>Audiovisual associations in Saint-Sa&#x000EB;ns&#x00027; carnival of the animals: a cross-cultural investigation on the role of timbre</article-title>. <source>Empi. Stud. Arts</source>. <volume>43</volume>:<fpage>2</fpage>. <pub-id pub-id-type="doi">10.1177/02762374241308810</pub-id></citation>
</ref>
<ref id="B83">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Di Stefano</surname> <given-names>N.</given-names></name> <name><surname>Ansani</surname> <given-names>A.</given-names></name> <name><surname>Schiavio</surname> <given-names>A.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2024</year>). <article-title>Prokofiev was (almost) right: A cross-cultural exploration of auditory-conceptual associations in Peter and the Wolf</article-title>. <source>Psychon. Bull. Rev.</source> <volume>31</volume>, <fpage>1735</fpage>&#x02013;<lpage>1744</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-023-02435-7</pub-id><pub-id pub-id-type="pmid">38267741</pub-id></citation></ref>
<ref id="B84">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Di Stefano</surname> <given-names>N.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2022</year>). <article-title>Roughness: A multisensory/crossmodal perspective</article-title>. <source>Atten. Percept. Psychophys.</source> <volume>84</volume>, <fpage>2087</fpage>&#x02013;<lpage>2114</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-022-02550-y</pub-id><pub-id pub-id-type="pmid">36028614</pub-id></citation></ref>
<ref id="B85">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Di Stefano</surname> <given-names>N.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2024a</year>). <article-title>Perceptual similarity: Insights from the crossmodal correspondences</article-title>. <source>Rev. Philos. Psychol.</source> <volume>15</volume>, <fpage>997</fpage>&#x02013;<lpage>1026</lpage>. <pub-id pub-id-type="doi">10.1007/s13164-023-00692-y</pub-id></citation>
</ref>
<ref id="B86">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Di Stefano</surname> <given-names>N.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2024b</year>). <article-title>Should absolute pitch be considered as a unique example of context-free sensory judgments in humans?</article-title> <source>Cognition</source> <volume>249</volume>:<fpage>105805</fpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2024.105805</pub-id><pub-id pub-id-type="pmid">38761646</pub-id></citation></ref>
<ref id="B87">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Di Stefano</surname> <given-names>N.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2025</year>). <article-title>&#x0201C;Perceiving temporal structure within and between the senses: A crossmodal/multisensory perspective,&#x0201D;</article-title> in <source>Attention, Perception, and Psychophysics</source> (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name>). <pub-id pub-id-type="doi">10.3758/s13414-025-03045-2</pub-id><pub-id pub-id-type="pmid">40295425</pub-id></citation></ref>
<ref id="B88">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Di Stefano</surname> <given-names>N.</given-names></name> <name><surname>Vuust</surname> <given-names>P.</given-names></name> <name><surname>Brattico</surname> <given-names>E.</given-names></name></person-group> (<year>2022</year>). <article-title>Consonance and dissonance perception. A critical review of the historical sources, multidisciplinary findings, and main hypotheses</article-title>. <source>Phys. Life Rev.</source> <volume>43</volume>, <fpage>273</fpage>&#x02013;<lpage>304</lpage>. <pub-id pub-id-type="doi">10.1016/j.plrev.2022.10.004</pub-id><pub-id pub-id-type="pmid">36372030</pub-id></citation></ref>
<ref id="B89">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Dimova</surname> <given-names>P.</given-names></name></person-group> (<year>2024</year>). <source>At the Crossroads of the Senses: The Synaesthetic Metaphor Across the arts in European Modernism.</source> <publisher-loc>University Park, PA</publisher-loc>: <publisher-name>Pennsylvania State University Press</publisher-name>.</citation>
</ref>
<ref id="B90">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Donnell-Kotrozo</surname> <given-names>C.</given-names></name></person-group> (<year>1978</year>). <article-title>Intersensory perception of music: Color me trombone</article-title>. <source>Music Educators J.</source> <volume>65</volume>, <fpage>32</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.2307/3395546</pub-id></citation>
</ref>
<ref id="B91">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Duthie</surname> <given-names>A. C.</given-names></name></person-group> (<year>2013</year>). <source>Do Music and Art Influence One Another? Measuring Cross-Modal Similarities in Music and Art</source> (PhD thesis). <publisher-loc>Ame</publisher-loc>: <publisher-name>Iowa State University</publisher-name>.</citation>
</ref>
<ref id="B92">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Duthie</surname> <given-names>C.</given-names></name> <name><surname>Duthie</surname> <given-names>B.</given-names></name></person-group> (<year>2015</year>). <article-title>Do music and art influence one another? Measuring cross-modal similarities in music and art</article-title>. <source>Polymath</source>. <volume>5</volume>, <fpage>1</fpage>&#x02013;<lpage>22</lpage>.</citation>
</ref>
<ref id="B93">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Ehrenzweig</surname> <given-names>A.</given-names></name></person-group> (<year>1965</year>). <source>The Psychoanalysis of Artistic Vision and Hearing</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Braziller</publisher-name>.<pub-id pub-id-type="pmid">40483550</pub-id></citation></ref>
<ref id="B94">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Eisenstein</surname> <given-names>S.</given-names></name> <name><surname>Pudovkin</surname> <given-names>W. I.</given-names></name> <name><surname>Alexandrov</surname> <given-names>G. V.</given-names></name></person-group> (<year>1999</year>). <article-title>&#x0201C;Statement on sound,&#x0201D;</article-title> in <source>Close up: 1927-1933</source>, eds. J. Donald, A. Friedberg, and L. Marcus (<publisher-loc>Princeton</publisher-loc>: <publisher-name>Princeton University Press</publisher-name>), <fpage>83</fpage>&#x02013;<lpage>86</lpage>.</citation>
</ref>
<ref id="B95">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Eisenstein</surname> <given-names>S. M.</given-names></name></person-group> (<year>1947</year>). <source>The Film Sense</source> (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Harcourt</publisher-name>).</citation>
</ref>
<ref id="B96">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ellermeier</surname> <given-names>W.</given-names></name> <name><surname>Kattner</surname> <given-names>F.</given-names></name> <name><surname>Raum</surname> <given-names>A.</given-names></name></person-group> (<year>2021</year>). <article-title>Cross-modal commutativity of magnitude productions of loudness and brightness</article-title>. <source>Attent. Percep. Psychophys.</source> <volume>83</volume>, <fpage>2955</fpage>&#x02013;<lpage>2967</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-021-02324-y</pub-id><pub-id pub-id-type="pmid">34105093</pub-id></citation></ref>
<ref id="B97">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ellis</surname> <given-names>R. J.</given-names></name> <name><surname>Simons</surname> <given-names>R. F.</given-names></name></person-group> (<year>2005</year>). <article-title>The impact of music on subjective and physiological indices of emotion while viewing films</article-title>. <source>Psychomusicology</source> <volume>19</volume>, <fpage>15</fpage>&#x02013;<lpage>40</lpage>. <pub-id pub-id-type="doi">10.1037/h0094042</pub-id></citation>
</ref>
<ref id="B98">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Evans</surname> <given-names>K. K.</given-names></name> <name><surname>Treisman</surname> <given-names>A.</given-names></name></person-group> (<year>2010</year>). <article-title>Natural cross-modal mappings between visual and auditory features</article-title>. <source>J. Vision</source> <volume>10</volume>, <fpage>1</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1167/10.1.6</pub-id><pub-id pub-id-type="pmid">20143899</pub-id></citation></ref>
<ref id="B99">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eysenck</surname> <given-names>H. J.</given-names></name></person-group> (<year>1940</year>). <article-title>The general factor in aesthetic judgements</article-title>. <source>Br. J. Psychol.</source> <volume>31</volume>, <fpage>94</fpage>&#x02013;<lpage>102</lpage>. <pub-id pub-id-type="doi">10.1111/j.2044-8295.1940.tb00977.x</pub-id></citation>
</ref>
<ref id="B100">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Fekete</surname> <given-names>A.</given-names></name> <name><surname>Specker</surname> <given-names>E.</given-names></name> <name><surname>Mikuni</surname> <given-names>J.</given-names></name> <name><surname>Trupp</surname> <given-names>M. D.</given-names></name> <name><surname>Leder</surname> <given-names>H.</given-names></name></person-group> (<year>2023</year>). <article-title>&#x0201C;When the painting meets its musical inspiration: The impact of multimodal art experience on aesthetic enjoyment and subjective well-being in the museum,&#x0201D;</article-title> in <source>Psychology of Aesthetics, Creativity, and the Arts</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://psycnet.apa.org/doi/10.1037/aca0000641">https://psycnet.apa.org/doi/10.1037/aca0000641</ext-link></citation>
</ref>
<ref id="B101">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Field</surname> <given-names>G.</given-names></name></person-group> (<year>1835</year>). <source>Chromatics: Or the Analogy, Harmony, and Philosophy of Colours</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>David Bogue</publisher-name>.</citation>
</ref>
<ref id="B102">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fink</surname> <given-names>L.</given-names></name> <name><surname>Fiehn</surname> <given-names>H.</given-names></name> <name><surname>Wald-Fuhrmann</surname> <given-names>M.</given-names></name></person-group> (<year>2024</year>). <article-title>The role of audiovisual congruence in aesthetic appreciation of contemporary music and visual art</article-title>. <source>Sci. Rep.</source> <volume>14</volume>:<fpage>20923</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-024-71399-y</pub-id><pub-id pub-id-type="pmid">39251764</pub-id></citation></ref>
<ref id="B103">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Frings</surname> <given-names>C.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2010</year>). <article-title>Crossmodal congruency effects based on stimulus identity</article-title>. <source>Brain Res.</source> <volume>1354</volume>, <fpage>113</fpage>&#x02013;<lpage>122</lpage>. <pub-id pub-id-type="doi">10.1016/j.brainres.2010.07.058</pub-id><pub-id pub-id-type="pmid">20674555</pub-id></citation></ref>
<ref id="B104">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gaiger</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>Can a painting have a rhythm?</article-title> <source>Br. J. Aesthet.</source> <volume>58</volume>, <fpage>363</fpage>&#x02013;<lpage>383</lpage>. <pub-id pub-id-type="doi">10.1093/aesthj/ayy026</pub-id></citation>
</ref>
<ref id="B105">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galeyev</surname> <given-names>B. M.</given-names></name></person-group> (<year>1988</year>). <article-title>The fire of Prometheus: Music-kinetic art experimentation in the USSR</article-title>. <source>Leonardo</source> <volume>21</volume>, <fpage>383</fpage>&#x02013;<lpage>391</lpage>. <pub-id pub-id-type="doi">10.2307/1578701</pub-id></citation>
</ref>
<ref id="B106">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galeyev</surname> <given-names>B. M.</given-names></name></person-group> (<year>2003</year>). <article-title>Evolution of gravitational synesthesia in music: to color and light</article-title>. <source>Leonardo</source> <volume>36</volume>, <fpage>129</fpage>&#x02013;<lpage>134</lpage>. <pub-id pub-id-type="doi">10.1162/002409403321554198</pub-id></citation>
</ref>
<ref id="B107">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Galeyev</surname> <given-names>B. M.</given-names></name> <name><surname>Vanechkina</surname> <given-names>I. L.</given-names></name></person-group> (<year>2001</year>). <article-title>Was Scriabin a synesthete?</article-title> <source>Leonardo</source> <volume>34</volume>, <fpage>357</fpage>&#x02013;<lpage>361</lpage>. <pub-id pub-id-type="doi">10.1162/00240940152549357</pub-id></citation>
</ref>
<ref id="B108">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gallace</surname> <given-names>A.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2006</year>). <article-title>Multisensory synesthetic interactions in the speeded classification of visual size</article-title>. <source>Percept. Psychophys.</source> <volume>68</volume>, <fpage>1191</fpage>&#x02013;<lpage>1203</lpage>. <pub-id pub-id-type="doi">10.3758/BF03193720</pub-id><pub-id pub-id-type="pmid">17355042</pub-id></citation></ref>
<ref id="B109">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gallace</surname> <given-names>A.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2014</year>). <article-title>&#x0201C;The neglected power of touch: What cognitive neuroscience tells us about the importance of touch in artistic communication,&#x0201D;</article-title> in <source>Sculpture and Touch</source>, ed. P. Dent (<publisher-loc>Farnham</publisher-loc>: <publisher-name>Ashgate Publishers</publisher-name>), <fpage>107</fpage>&#x02013;<lpage>124</lpage>.</citation>
</ref>
<ref id="B110">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garner</surname> <given-names>W.</given-names></name></person-group> (<year>1978</year>). <article-title>The relationship between colour and music</article-title>. <source>Leonardo</source> <volume>11</volume>, <fpage>225</fpage>&#x02013;<lpage>226</lpage>. <pub-id pub-id-type="doi">10.2307/1574153</pub-id></citation>
</ref>
<ref id="B111">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gawboy</surname> <given-names>A. M.</given-names></name> <name><surname>Townsend</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>Scriabin and the possible</article-title>. <source>MTO</source> <volume>18</volume>, <fpage>1</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.30535/mto.18.2.2</pub-id></citation>
</ref>
<ref id="B112">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gebhard</surname> <given-names>J. W.</given-names></name> <name><surname>Mowbray</surname> <given-names>G. H.</given-names></name></person-group> (<year>1959</year>). <article-title>On discriminating the rate of visual flicker and auditory flutter</article-title>. <source>Am. J. Psychol.</source> <volume>72</volume>, <fpage>521</fpage>&#x02013;<lpage>528</lpage>. <pub-id pub-id-type="doi">10.2307/1419493</pub-id></citation>
</ref>
<ref id="B113">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Giannos</surname> <given-names>K.</given-names></name> <name><surname>Athanasopoulos</surname> <given-names>G.</given-names></name> <name><surname>Cambouropoulos</surname> <given-names>E.</given-names></name></person-group> (<year>2021</year>). <article-title>Cross-modal associations between harmonic dissonance and visual roughness</article-title>. <source>Music Sci.</source> <volume>4</volume>:<fpage>20592043211055484</fpage>. <pub-id pub-id-type="doi">10.1177/20592043211055484</pub-id></citation>
</ref>
<ref id="B114">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gibson</surname> <given-names>S.</given-names></name></person-group> (<year>2023</year>). <article-title>&#x0201C;Moving towards the performed image (colour organs, synesthesia and visual music): early Modernism (1900&#x02013;1955),&#x0201D;</article-title> in <source>Live Visuals: History, Theory, Practice</source>, eds. S. <person-group person-group-type="author"><name><surname>Gibson</surname> <given-names>S.</given-names></name></person-group>Arisona, D. Leishman, and A. Tanaka (<publisher-loc>London, UK</publisher-loc>: <publisher-name>Routledge</publisher-name>), <fpage>41</fpage>&#x02013;<lpage>61</lpage>.</citation>
</ref>
<ref id="B115">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gilbert</surname> <given-names>G. M.</given-names></name></person-group> (<year>1938</year>). <article-title>A study in inter-sensory Gestalten</article-title>. <source>Psychol. Bull.</source> <volume>35</volume>:<fpage>698</fpage>.</citation>
</ref>
<ref id="B116">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gombrich</surname> <given-names>E. H.</given-names></name></person-group> (<year>1960</year>). <source>Art and Illusion</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Phaidon Press</publisher-name>.</citation>
</ref>
<ref id="B117">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Gorbman</surname> <given-names>C.</given-names></name></person-group> (<year>1987</year>). <source>Unheard Melodies: Narrative Film Music</source>. <publisher-loc>Bloomington, IA</publisher-loc>: <publisher-name>Indiana University Press</publisher-name>.</citation>
</ref>
<ref id="B118">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>Q.</given-names></name> <name><surname>Jiang</surname> <given-names>T.</given-names></name></person-group> (<year>2023</year>). <article-title>&#x0201C;Using crossmodal correspondence between colors and music to enhance online art exhibition visitors&#x00027; experience,&#x0201D;</article-title> in <source>Information for a Better World: Normality, Virtuality, Physicality, Inclusivity</source> (<publisher-loc>Cham</publisher-loc>: <publisher-name>Springer Nature Switzerland</publisher-name>).</citation>
</ref>
<ref id="B119">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Guo</surname> <given-names>X.</given-names></name> <name><surname>Qu</surname> <given-names>J.</given-names></name> <name><surname>Liu</surname> <given-names>M.</given-names></name> <name><surname>Liu</surname> <given-names>C.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name></person-group> (<year>2023</year>). <article-title>Dynamic audio-visual correspondence in musicians and non-musicians</article-title>. <source>Psychol. Music</source> <volume>52</volume>, <fpage>175</fpage>&#x02013;<lpage>186</lpage>. <pub-id pub-id-type="doi">10.1177/03057356231185467</pub-id></citation>
</ref>
<ref id="B120">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gurman</surname> <given-names>D.</given-names></name> <name><surname>McCormick</surname> <given-names>C.</given-names></name> <name><surname>Klein</surname> <given-names>R. M.</given-names></name></person-group> (<year>2021</year>). <article-title>Crossmodal correspondence between auditory timbre and visual shape</article-title>. <source>Multisens. Res.</source> <volume>35</volume>, <fpage>221</fpage>&#x02013;<lpage>241</lpage>. <pub-id pub-id-type="doi">10.1163/22134808-bja10067</pub-id><pub-id pub-id-type="pmid">35065536</pub-id></citation></ref>
<ref id="B121">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>HaCohen</surname> <given-names>R.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;Between generation and suspension,&#x0201D;</article-title> in <source>The Oxford Handbook of Sound and Image in Western Art</source>, ed. T. Kaduri (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>36</fpage>&#x02013;<lpage>60</lpage>.</citation>
</ref>
<ref id="B122">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hagtvedt</surname> <given-names>H.</given-names></name> <name><surname>Brasel</surname> <given-names>S. A.</given-names></name></person-group> (<year>2016</year>). <article-title>Crossmodal communication: Sound frequency influences consumer responses to color lightness</article-title>. <source>J. Market. Res.</source> <volume>53</volume>, <fpage>551</fpage>&#x02013;<lpage>562</lpage>. <pub-id pub-id-type="doi">10.1509/jmr.14.0414</pub-id><pub-id pub-id-type="pmid">11670861</pub-id></citation></ref>
<ref id="B123">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hankins</surname> <given-names>T. L.</given-names></name></person-group> (<year>1994</year>). <article-title>The ocular harpsichord of Louis-Bertrand Castel; Or, the instrument that wasn&#x00027;t</article-title>. <source>Osiris</source> <volume>9</volume>, <fpage>141</fpage>&#x02013;<lpage>156</lpage>. <pub-id pub-id-type="doi">10.1086/368734</pub-id></citation>
</ref>
<ref id="B124">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Harrison</surname> <given-names>J.</given-names></name></person-group> (<year>2001</year>). <source>Synaesthesia: The Strangest Thing</source>. <publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B125">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Harrison</surname> <given-names>J.</given-names></name> <name><surname>Baron-Cohen</surname> <given-names>S.</given-names></name></person-group> (<year>1996</year>). <article-title>Acquired and inherited forms of cross-modal correspondence</article-title>. <source>Neurocase</source> <volume>2</volume>, <fpage>245</fpage>&#x02013;<lpage>249</lpage>. <pub-id pub-id-type="doi">10.1080/13554799608402401</pub-id></citation>
</ref>
<ref id="B126">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hartshorne</surname> <given-names>C.</given-names></name></person-group> (<year>1934</year>). <source>The Philosophy and Psychology of Sensation</source>. <publisher-loc>Chicago, IL</publisher-loc>: <publisher-name>University of Chicago Press</publisher-name>.</citation>
</ref>
<ref id="B127">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hasenfus</surname> <given-names>N.</given-names></name> <name><surname>Martindale</surname> <given-names>C.</given-names></name> <name><surname>Birnbaum</surname> <given-names>D.</given-names></name></person-group> (<year>1983</year>). <article-title>Psychological reality of cross-media artistic styles</article-title>. <source>J. Exp. Psychol.</source> <volume>9</volume>, <fpage>841</fpage>&#x02013;<lpage>863</lpage>. <pub-id pub-id-type="doi">10.1037//0096-1523.9.6.841</pub-id><pub-id pub-id-type="pmid">6227696</pub-id></citation></ref>
<ref id="B128">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hauck</surname> <given-names>P.</given-names></name> <name><surname>von Castell</surname> <given-names>C.</given-names></name> <name><surname>Hecht</surname> <given-names>H.</given-names></name></person-group> (<year>2022</year>). <article-title>Crossmodal correspondence between music and ambient color is mediated by emotion</article-title>. <source>Multisens. Res.</source> <volume>35</volume>, <fpage>407</fpage>&#x02013;<lpage>446</lpage>. <pub-id pub-id-type="doi">10.1163/22134808-bja10077</pub-id><pub-id pub-id-type="pmid">35985652</pub-id></citation></ref>
<ref id="B129">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haverkamp</surname> <given-names>M.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Light, color and motion as crossmodal elements of baroque music,&#x0201D;</article-title> in <source>e-Forum Acusticum</source> 2020<italic>, Lyon, December 7-10</italic><sup><italic>th</italic></sup>. <pub-id pub-id-type="doi">10.48465/fa.2020.0021</pub-id></citation>
</ref>
<ref id="B130">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>Z.</given-names></name> <name><surname>Wu</surname> <given-names>L.</given-names></name> <name><surname>Li</surname> <given-names>X. R.</given-names></name></person-group> (<year>2018</year>). <article-title>When art meets tech: The role of augmented reality in enhancing museum experiences and purchase intentions</article-title>. <source>Tour. Managem.</source> <volume>68</volume>, <fpage>127</fpage>&#x02013;<lpage>139</lpage>. <pub-id pub-id-type="doi">10.1016/j.tourman.2018.03.003</pub-id></citation>
</ref>
<ref id="B131">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Helmholtz</surname> <given-names>H.</given-names></name></person-group> (<year>1878/1971</year>). <source>Treatise on Physiological Optics (Vol. II)</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Dover Publications</publisher-name>.</citation>
</ref>
<ref id="B132">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Herget</surname> <given-names>A. K.</given-names></name> <name><surname>Albrecht</surname> <given-names>J.</given-names></name></person-group> (<year>2022</year>). <article-title>Soundtrack for reality? How to use music effectively in non-fictional media formats</article-title>. <source>Psychol. Music</source> <volume>50</volume>, <fpage>508</fpage>&#x02013;<lpage>529</lpage>. <pub-id pub-id-type="doi">10.1177/0305735621999091</pub-id></citation>
</ref>
<ref id="B133">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Holm</surname> <given-names>J.</given-names></name> <name><surname>Aaltonen</surname> <given-names>A.</given-names></name> <name><surname>Siirtola</surname> <given-names>H.</given-names></name></person-group> (<year>2009</year>). <article-title>Associating colours with musical genres</article-title>. <source>J. New Music Res.</source> <volume>38</volume>, <fpage>87</fpage>&#x02013;<lpage>100</lpage>. <pub-id pub-id-type="doi">10.1080/09298210902940094</pub-id></citation>
</ref>
<ref id="B134">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hong</surname> <given-names>Y. J.</given-names></name> <name><surname>Choi</surname> <given-names>A.</given-names></name> <name><surname>Lee</surname> <given-names>C.-E.</given-names></name> <name><surname>Cho</surname> <given-names>W.</given-names></name> <name><surname>Yoon</surname> <given-names>S.</given-names></name> <name><surname>Lee</surname> <given-names>K.</given-names></name></person-group> (<year>2024</year>). <article-title>Concurrent musical pitch height biases judgment of visual brightness</article-title>. <source>Psychol. Music</source>. <volume>53</volume>:<fpage>3</fpage>. <pub-id pub-id-type="doi">10.1177/03057356231216950</pub-id></citation>
</ref>
<ref id="B135">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Howes</surname> <given-names>D.</given-names></name></person-group> (<year>2025</year>). <article-title>The new intersensory music and art history: Review of &#x0201C;Sensuous Consonance: The Visual Arts in Conjunction with Music Around 1900&#x0201D;, organized by Linda Hinners and Tobias Lund, Royal Swedish Academy of Music and the Swedish Nationalmuseum, Stockholm, 4 October 2024</article-title>. <source>Senses Soc.</source> <volume>20</volume>, <fpage>143</fpage>&#x02013;<lpage>150</lpage>. <pub-id pub-id-type="doi">10.1080/17458927.2025.2463766</pub-id></citation>
</ref>
<ref id="B136">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Howes</surname> <given-names>D.</given-names></name> <name><surname>Classen</surname> <given-names>C.</given-names></name></person-group> (<year>2014</year>). <source>Ways of Sensing: Understanding the Senses in Society</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Routledge</publisher-name>.</citation>
</ref>
<ref id="B137">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Howes</surname> <given-names>H.</given-names></name></person-group> (<year>2023</year>). <article-title>The collideroscopic sensorium</article-title>. <source>HAU: J. Ethnographic Theory</source> <volume>13</volume>, <fpage>730</fpage>&#x02013;<lpage>734</lpage>. <pub-id pub-id-type="doi">10.1086/728409</pub-id></citation>
</ref>
<ref id="B138">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Gamble</surname> <given-names>D.</given-names></name> <name><surname>Sarnlertsophon</surname> <given-names>K.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name> <name><surname>Hsiao</surname> <given-names>S.</given-names></name></person-group> (<year>2012</year>). <article-title>Feeling music: Integration of auditory and tactile inputs in musical meter perception</article-title>. <source>PLoS ONE</source> <volume>7</volume>:<fpage>e48496</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0048496</pub-id><pub-id pub-id-type="pmid">23119038</pub-id></citation></ref>
<ref id="B139">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hubbard</surname> <given-names>T. L.</given-names></name></person-group> (<year>1996</year>). <article-title>Synesthesia-like mappings of lightness, pitch, and melodic interval</article-title>. <source>Am. J. Psychol.</source> <volume>109</volume>, <fpage>219</fpage>&#x02013;<lpage>238</lpage>. <pub-id pub-id-type="doi">10.2307/1423274</pub-id><pub-id pub-id-type="pmid">8644886</pub-id></citation></ref>
<ref id="B140">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huddleston</surname> <given-names>W. E.</given-names></name> <name><surname>Lewis</surname> <given-names>J. W.</given-names></name> <name><surname>Phinney</surname> <given-names>R. E.</given-names></name> <name><surname>DeYoe</surname> <given-names>E. A.</given-names></name></person-group> (<year>2008</year>). <article-title>Auditory and visual attention-based apparent motion share functional parallels</article-title>. <source>Percept. Psychophys.</source> <volume>70</volume>, <fpage>1207</fpage>&#x02013;<lpage>1216</lpage>. <pub-id pub-id-type="doi">10.3758/PP.70.7.1207</pub-id><pub-id pub-id-type="pmid">18927004</pub-id></citation></ref>
<ref id="B141">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hull</surname> <given-names>A. E.</given-names></name></person-group> (<year>1927</year>). <source>A great Russian tone poet: Scriabin</source> (2nd Ed.). <publisher-loc>London, UK</publisher-loc>: <publisher-name>Tr&#x000FC;bner and Co</publisher-name>.</citation>
</ref>
<ref id="B142">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hung</surname> <given-names>K.</given-names></name></person-group> (<year>2000</year>). <article-title>Narrative music in congruent and incongruent TV advertising</article-title>. <source>J. Advert.</source> <volume>29</volume>, <fpage>25</fpage>&#x02013;<lpage>34</lpage>. <pub-id pub-id-type="doi">10.1080/00913367.2000.10673601</pub-id></citation>
</ref>
<ref id="B143">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Iosifyan</surname> <given-names>M.</given-names></name> <name><surname>Sidoroff-Dorso</surname> <given-names>A.</given-names></name> <name><surname>Wolfe</surname> <given-names>J.</given-names></name></person-group> (<year>2022</year>). <article-title>Cross-modal associations between paintings and sounds: effects of embodiment</article-title>. <source>Perception</source> <volume>51</volume>, <fpage>871</fpage>&#x02013;<lpage>888</lpage>. <pub-id pub-id-type="doi">10.1177/03010066221126452</pub-id><pub-id pub-id-type="pmid">36217800</pub-id></citation></ref>
<ref id="B144">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Isbilen</surname> <given-names>E. S.</given-names></name> <name><surname>Krumhansl</surname> <given-names>C. L.</given-names></name></person-group> (<year>2016</year>). <article-title>The color of music: emotion-mediated associations to Bach&#x00027;s Well-Tempered Clavier</article-title>. <source>Psychomusicol.: Music, Mind Brain</source> <volume>26</volume>, <fpage>149</fpage>&#x02013;<lpage>161</lpage>. <pub-id pub-id-type="doi">10.1037/pmu0000147</pub-id></citation>
</ref>
<ref id="B145">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Iwamiya</surname> <given-names>S.</given-names></name></person-group> (<year>1994</year>). <article-title>Interaction between auditory and visual processing when listening to music in an audiovisual context: 1. Matching 2. Audio quality</article-title>. <source>Psychomusicology</source> <volume>13</volume>, <fpage>133</fpage>&#x02013;<lpage>154</lpage>. <pub-id pub-id-type="doi">10.1037/h0094098</pub-id></citation>
</ref>
<ref id="B146">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Iwamiya</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Perceived congruence between auditory and visual elements in multimedia,&#x0201D;</article-title> in <source>The Psychology of Music in Multimedia</source>, eds.S.-L. Tan, A. J. Cohen, S. D. Lipscomb, and R. A. Kendall (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>141</fpage>&#x02013;<lpage>164</lpage>.</citation>
</ref>
<ref id="B147">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jewanski</surname> <given-names>J.</given-names></name></person-group> (<year>2010</year>). <article-title>&#x0201C;Color-tone analogies: A systematic presentation of the principles of correspondence,&#x0201D;</article-title> in <source>Audiovisuology: A Multidisciplinary Survey of Audiovisual Culture</source>, eds. D. Daniels, S. Naumann, and J. Thoben (<publisher-loc>K&#x000F6;ln</publisher-loc>: <publisher-name>K&#x000F6;nig</publisher-name>), <fpage>77</fpage>&#x02013;<lpage>87</lpage>.</citation>
</ref>
<ref id="B148">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Johnson</surname> <given-names>D.</given-names></name> <name><surname>Allison</surname> <given-names>C.</given-names></name> <name><surname>Baron-Cohen</surname> <given-names>S.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;The prevalence of synesthesia: The consistency revolution,&#x0201D;</article-title> in <source>Oxford Handbook of Synaesthesia</source>, eds. J. Simner and E. Hubbard (<publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford Academic</publisher-name>).</citation>
</ref>
<ref id="B149">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jones</surname> <given-names>C. A.</given-names></name> <name><surname>Hasegawa</surname> <given-names>Y.</given-names></name> <name><surname>Jacobson</surname> <given-names>M.</given-names></name> <name><surname>Arning</surname> <given-names>B.</given-names></name></person-group> (<year>2006</year>). <source>Sensorium: Embodied Experience, Technology, and Contemporary Art.</source> <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation>
</ref>
<ref id="B150">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Julesz</surname> <given-names>B.</given-names></name> <name><surname>Hirsh</surname> <given-names>I. J.</given-names></name></person-group> (<year>1972</year>). <article-title>&#x0201C;Visual and auditory perception - An essay of comparison,&#x0201D;</article-title> in <source>Human Communication: A Unified View</source>, eds. E. E. David, Jr., and P. B. Denes (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>McGraw-Hill</publisher-name>), <fpage>283</fpage>&#x02013;<lpage>340</lpage>.</citation>
</ref>
<ref id="B151">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kaduri</surname> <given-names>T.</given-names></name></person-group> (<year>2016</year>). <source>The Oxford Handbook of Sound and Image in Western Art</source>. <publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B152">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>K&#x000E4;mpfe</surname> <given-names>J.</given-names></name> <name><surname>Sedlmeier</surname> <given-names>P.</given-names></name> <name><surname>Renkewitz</surname> <given-names>F.</given-names></name></person-group> (<year>2011</year>). <article-title>The impact of background music on adult listeners. A meta-analysis</article-title>. <source>Psychol. Music</source> <volume>39</volume>, <fpage>424</fpage>&#x02013;<lpage>448</lpage>. <pub-id pub-id-type="doi">10.1177/0305735610376261</pub-id></citation>
</ref>
<ref id="B153">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kandinsky</surname> <given-names>W.</given-names></name></person-group> (<year>1977</year>). <source>Concerning the Spiritual in art, Especially in Painting.</source> <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Dover Publications</publisher-name>.</citation>
</ref>
<ref id="B154">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kargon</surname> <given-names>J.</given-names></name></person-group> (<year>2011</year>). <article-title>Harmonizing these two arts: Edmund Lind&#x00027;s The Music of Color</article-title>. <source>J. Des. Hist.</source> <volume>24</volume>, <fpage>1</fpage>&#x02013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1093/jdh/epq042</pub-id></citation>
</ref>
<ref id="B155">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Karwoski</surname> <given-names>T. F.</given-names></name> <name><surname>Odbert</surname> <given-names>H. S.</given-names></name> <name><surname>Osgood</surname> <given-names>C. E.</given-names></name></person-group> (<year>1942</year>). <article-title>Studies in synesthetic thinking. II. The r&#x000F4;le of form in visual responses to music</article-title>. <source>J. General Psychol.</source> <volume>26</volume>, <fpage>199</fpage>&#x02013;<lpage>222</lpage>. <pub-id pub-id-type="doi">10.1080/00221309.1942.10545166</pub-id></citation>
</ref>
<ref id="B156">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kastner</surname> <given-names>F.</given-names></name></person-group> (<year>1875</year>). <source>Invention du pyrophone: Exp&#x000E9;riences nouvelles sir les flammes chantantes</source> (2nd Ed.). <publisher-loc>Paris</publisher-loc>: <publisher-name>Typographie de A. Parent</publisher-name>.</citation>
</ref>
<ref id="B157">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kierkegaard</surname> <given-names>S.</given-names></name></person-group> (<year>1988</year>). <source>Either/Or: Part 1</source>, eds. H. V. Hong and E. H. Hong (<publisher-loc>Princeton, NJ</publisher-loc>: <publisher-name>Princeton University Press</publisher-name>), <fpage>119</fpage>&#x02013;<lpage>120</lpage>.</citation>
</ref>
<ref id="B158">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klapetek</surname> <given-names>A.</given-names></name> <name><surname>Ngo</surname> <given-names>M. K.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>Does crossmodal correspondence modulate the facilitatory effect of auditory cues on visual search?</article-title> <source>Attent. Percept. Psychophys.</source> <volume>74</volume>, <fpage>1154</fpage>&#x02013;<lpage>1167</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-012-0317-9</pub-id><pub-id pub-id-type="pmid">22648604</pub-id></citation></ref>
<ref id="B159">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Klein</surname> <given-names>A. B.</given-names></name></person-group> (<year>1937</year>). <source>Coloured Light: An Art Medium (Third Enlarged Edition of The Art of Light: Colour Music)</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Crosby Lockwood and Son</publisher-name>.</citation>
</ref>
<ref id="B160">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Klein</surname> <given-names>K.</given-names></name> <name><surname>Melnyk</surname> <given-names>V.</given-names></name> <name><surname>V&#x000F6;lckner</surname> <given-names>F.</given-names></name></person-group> (<year>2021</year>). <article-title>Effects of background music on evaluations of visual images</article-title>. <source>Psychol. Market.</source> <volume>38</volume>, <fpage>2240</fpage>&#x02013;<lpage>2246</lpage>. <pub-id pub-id-type="doi">10.1002/mar.21588</pub-id></citation>
</ref>
<ref id="B161">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kubovy</surname> <given-names>M.</given-names></name> <name><surname>Yu</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>Multistability, cross-modal binding and the additivity of conjoint grouping principles</article-title>. <source>Philosoph. Trans. Royal Soc. B.</source> <volume>367</volume>, <fpage>954</fpage>&#x02013;<lpage>964</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2011.0365</pub-id><pub-id pub-id-type="pmid">22371617</pub-id></citation></ref>
<ref id="B162">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lauwrens</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>Welcome to the revolution: the sensory turn and art history</article-title>. <source>J. Art Historiography</source> <volume>7</volume>, <fpage>1</fpage>&#x02013;<lpage>17</lpage>.</citation>
</ref>
<ref id="B163">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Legrenzi</surname> <given-names>P.</given-names></name> <name><surname>Umilt&#x000E0;</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <source>Neuromania: On the limits of brain science (translated by F. Anderson).</source> <publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B164">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Leibniz</surname> <given-names>G. W.</given-names></name></person-group> (<year>1896</year>). <source>On Solidity. New Essays Concerning Human Understanding (Originally published, 1704).</source> <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Macmillan</publisher-name>.</citation>
</ref>
<ref id="B165">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>L&#x000E9;vi-Strauss</surname> <given-names>C.</given-names></name></person-group> (<year>1997</year>). <source>Look, Listen, and Read</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Basic Books</publisher-name>.</citation>
</ref>
<ref id="B166">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lewkowicz</surname> <given-names>D. J.</given-names></name> <name><surname>Turkewitz</surname> <given-names>G.</given-names></name></person-group> (<year>1980</year>). <article-title>Cross-modal equivalence in early infancy: Auditory-visual intensity matching</article-title>. <source>Dev. Psychol.</source> <volume>16</volume>, <fpage>597</fpage>&#x02013;<lpage>607</lpage>. <pub-id pub-id-type="doi">10.1037/0012-1649.16.6.597</pub-id></citation>
</ref>
<ref id="B167">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Limbert</surname> <given-names>W. M.</given-names></name> <name><surname>Polzella</surname> <given-names>D. J.</given-names></name></person-group> (<year>1998</year>). <article-title>Effects of music on the perception of paintings</article-title>. <source>Empir. Stud. Arts</source> <volume>16</volume>, <fpage>33</fpage>&#x02013;<lpage>39</lpage>. <pub-id pub-id-type="doi">10.2190/V8BL-GBJK-TLFP-R321</pub-id><pub-id pub-id-type="pmid">22612255</pub-id></citation></ref>
<ref id="B168">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>C.</given-names></name> <name><surname>Yeh</surname> <given-names>M.</given-names></name> <name><surname>Shams</surname> <given-names>L.</given-names></name></person-group> (<year>2022</year>). <article-title>Subliminal audio visual temporal congruency in music videos enhances perceptual pleasure</article-title>. <source>Neurosci. Lett.</source> <volume>779</volume>:<fpage>136623</fpage>. <pub-id pub-id-type="doi">10.1016/j.neulet.2022.136623</pub-id><pub-id pub-id-type="pmid">35398533</pub-id></citation></ref>
<ref id="B169">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindborg</surname> <given-names>P. M.</given-names></name> <name><surname>Friberg</surname> <given-names>A. K.</given-names></name></person-group> (<year>2015</year>). <article-title>Colour association with music is mediated by emotion: Evidence from an experiment using a CIE Lab interface and interviews</article-title>. <source>PLoS ONE</source> <volume>10</volume>:<fpage>e0144013</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0144013</pub-id><pub-id pub-id-type="pmid">26642050</pub-id></citation></ref>
<ref id="B170">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindholm</surname> <given-names>P.</given-names></name></person-group> (<year>2022</year>). <article-title>&#x0201C;A light in sound, a sound-like power in light&#x0201D;: Poetry, synaesthesia and multimedia</article-title>. <source>&#x000C9;tudes de lettres</source> <volume>319</volume>, <fpage>77</fpage>&#x02013;<lpage>103</lpage>. <pub-id pub-id-type="doi">10.4000/edl.4009</pub-id></citation>
</ref>
<ref id="B171">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindner</surname> <given-names>D.</given-names></name> <name><surname>Hynan</surname> <given-names>M. T.</given-names></name></person-group> (<year>1987</year>). <article-title>Perceived structure of abstract paintings as a function of structure of music listened to on initial viewing</article-title>. <source>Bull. Psychon. Soc.</source> <volume>25</volume>, <fpage>44</fpage>&#x02013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.3758/BF03330072</pub-id></citation>
</ref>
<ref id="B172">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lippman</surname> <given-names>E. A.</given-names></name></person-group> (<year>1963</year>). <article-title>Hellenic conceptions of harmony</article-title>. <source>J. Am. Musicol. Soc.</source> <volume>16</volume>, <fpage>3</fpage>&#x02013;<lpage>35</lpage>. <pub-id pub-id-type="doi">10.2307/829917</pub-id></citation>
</ref>
<ref id="B173">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lipscomb</surname> <given-names>S. D.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Cross-modal alignment of accent structures in multimedia. The psychology of music in multimedia,&#x0201D;</article-title> in <source>The psychology of Music in Multimedia</source>, eds. S. Tan, A. J. Cohen, S. D. Lipscomb, and R. A. Kendall (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>192</fpage>&#x02013;<lpage>215</lpage>.</citation>
</ref>
<ref id="B174">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lipscomb</surname> <given-names>S. D.</given-names></name> <name><surname>Kendall</surname> <given-names>R. A.</given-names></name></person-group> (<year>1994</year>). <article-title>Perceptual judgment of the relationship between musical and visual components in film</article-title>. <source>Psychomusicology</source> <volume>13</volume>, <fpage>60</fpage>&#x02013;<lpage>98</lpage>. <pub-id pub-id-type="doi">10.1037/h0094101</pub-id></citation>
</ref>
<ref id="B175">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lipscomb</surname> <given-names>S. D.</given-names></name> <name><surname>Kim</surname> <given-names>E. M.</given-names></name></person-group> (<year>2004</year>). <article-title>&#x0201C;Perceived match between visual parameters and auditory correlates: an experimental multimedia investigation,&#x0201D;</article-title> in <source>Proceedings of the 8th International Conference on Music Perception and Cognition</source> (<publisher-loc>Evanston, IL</publisher-loc>), <fpage>72</fpage>&#x02013;<lpage>75</lpage>.</citation>
</ref>
<ref id="B176">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lipscomb</surname> <given-names>S. D.</given-names></name> <name><surname>Tolchinsky</surname> <given-names>D. E.</given-names></name></person-group> (<year>2005</year>). <article-title>&#x0201C;The role of music communication in cinema,&#x0201D;</article-title> in <source>Musical Communication</source>, eds. D. Miell, R. Macdonald, and D. J. Hargreaves (<publisher-loc>Oxford</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>383</fpage>&#x02013;<lpage>404</lpage>.</citation>
</ref>
<ref id="B177">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>M.</given-names></name></person-group> (<year>2022</year>). <article-title>Paintings of music</article-title>. <source>J. Aesthet. Art Critic.</source> <volume>80</volume>, <fpage>151</fpage>&#x02013;<lpage>163</lpage>. <pub-id pub-id-type="doi">10.1093/jaac/kpac003</pub-id></citation>
</ref>
<ref id="B178">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Locke</surname> <given-names>J.</given-names></name></person-group> (<year>1690</year>). <source>An Essay Concerning Human Understanding, Book III</source>. <publisher-loc>Paris, France</publisher-loc>: <publisher-name>Basset</publisher-name>.<pub-id pub-id-type="pmid">16769554</pub-id></citation></ref>
<ref id="B179">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Logeswaran</surname> <given-names>N.</given-names></name> <name><surname>Bhattacharya</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>Crossmodal transfer of emotion by music</article-title>. <source>Neurosci. Lett.</source> <volume>455</volume>, <fpage>129</fpage>&#x02013;<lpage>133</lpage>. <pub-id pub-id-type="doi">10.1016/j.neulet.2009.03.044</pub-id><pub-id pub-id-type="pmid">19368861</pub-id></citation></ref>
<ref id="B180">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>London</surname> <given-names>I. D.</given-names></name></person-group> (<year>1954</year>). <article-title>Research on sensory interaction in the Soviet Union</article-title>. <source>Psychol. Bull.</source> <volume>51</volume>, <fpage>531</fpage>&#x02013;<lpage>568</lpage>. <pub-id pub-id-type="doi">10.1037/h0056730</pub-id><pub-id pub-id-type="pmid">13215683</pub-id></citation></ref>
<ref id="B181">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Loureiro</surname> <given-names>S. M. C.</given-names></name> <name><surname>Roschk</surname> <given-names>H.</given-names></name> <name><surname>Lima</surname> <given-names>F.</given-names></name></person-group> (<year>2019</year>). <article-title>The role of background music in visitors&#x00027; experience of art exhibitions: Music, memory and art appraisal</article-title>. <source>Int. J. Arts Managem.</source> <volume>22</volume>, <fpage>4</fpage>&#x02013;<lpage>24</lpage>.</citation>
</ref>
<ref id="B182">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lupton</surname> <given-names>E.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Visualizing sound,&#x0201D;</article-title> in <source>The Senses: Design Beyond Vision</source>, eds. E. Lupton and A. Lipps (<publisher-loc>Hudson, NY</publisher-loc>: <publisher-name>Princeton Architectural Press</publisher-name>), <fpage>204</fpage>&#x02013;<lpage>217</lpage>.</citation>
</ref>
<ref id="B183">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Maeda</surname> <given-names>F.</given-names></name> <name><surname>Kanai</surname> <given-names>R.</given-names></name> <name><surname>Shimojo</surname> <given-names>S.</given-names></name></person-group> (<year>2004</year>). <article-title>Changing pitch induced visual motion illusion</article-title>. <source>Curr. Biol.</source> <volume>14</volume>, <fpage>R990</fpage>&#x02013;<lpage>R991</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2004.11.018</pub-id><pub-id pub-id-type="pmid">15589145</pub-id></citation></ref>
<ref id="B184">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Malhotra</surname> <given-names>N. K.</given-names></name></person-group> (<year>1984</year>). <article-title>Information and sensory overload. Information and sensory overload in psychology and marketing</article-title>. <source>Psychol. Market.</source> <volume>1</volume>, <fpage>9</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1002/mar.4220010304</pub-id></citation>
</ref>
<ref id="B185">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marin</surname> <given-names>M. M.</given-names></name> <name><surname>Gingras</surname> <given-names>B.</given-names></name> <name><surname>Bhattacharya</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>Crossmodal transfer of arousal, but not pleasantness, from the musical to the visual domain</article-title>. <source>Emotion</source> <volume>12</volume>, <fpage>618</fpage>&#x02013;<lpage>631</lpage>. <pub-id pub-id-type="doi">10.1037/a0025020</pub-id><pub-id pub-id-type="pmid">21859191</pub-id></citation></ref>
<ref id="B186">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marks</surname> <given-names>L. E.</given-names></name></person-group> (<year>1974</year>). <article-title>On associations of light and sound: the mediation of brightness, pitch, and loudness</article-title>. <source>Am. J. Psychol</source>. <volume>87</volume>, <fpage>173</fpage>&#x02013;<lpage>188</lpage>. <pub-id pub-id-type="doi">10.2307/1422011</pub-id><pub-id pub-id-type="pmid">4451203</pub-id></citation></ref>
<ref id="B187">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marks</surname> <given-names>L. E.</given-names></name></person-group> (<year>1987</year>). <article-title>On cross-modal similarity: Auditory&#x02013;visual interactions in speeded discrimination</article-title>. <source>J. Exp. Psychol.</source> <volume>13</volume>, <fpage>384</fpage>&#x02013;<lpage>394</lpage>. <pub-id pub-id-type="doi">10.1037//0096-1523.13.3.384</pub-id><pub-id pub-id-type="pmid">2958587</pub-id></citation></ref>
<ref id="B188">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marks</surname> <given-names>L. E.</given-names></name></person-group> (<year>1989</year>). <article-title>On cross-modal similarity: The perceptual structure of pitch, loudness, and brightness</article-title>. <source>J. Exp. Psychol.</source> <volume>15</volume>, <fpage>586</fpage>&#x02013;<lpage>602</lpage>. <pub-id pub-id-type="doi">10.1037//0096-1523.15.3.586</pub-id><pub-id pub-id-type="pmid">2527964</pub-id></citation></ref>
<ref id="B189">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marks</surname> <given-names>L. E.</given-names></name></person-group> (<year>2011</year>). <article-title>Synesthesia, then and now</article-title>. <source>Intellectica</source> <volume>55</volume>, <fpage>47</fpage>&#x02013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.3406/intel.2011.1161</pub-id></citation>
</ref>
<ref id="B190">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marks</surname> <given-names>L. E.</given-names></name> <name><surname>Hammeal</surname> <given-names>R. J.</given-names></name> <name><surname>Bornstein</surname> <given-names>M. H.</given-names></name> <name><surname>Smith</surname> <given-names>L. B.</given-names></name></person-group> (<year>1987</year>). <article-title>Perceiving similarity and comprehending metaphor</article-title>. <source>Monogr. Soc. Res. Child Dev.</source> <volume>52</volume>, <fpage>1</fpage>&#x02013;<lpage>100</lpage>. <pub-id pub-id-type="doi">10.2307/1166084</pub-id></citation>
</ref>
<ref id="B191">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McDonald</surname> <given-names>J.</given-names></name> <name><surname>Canazza</surname> <given-names>S.</given-names></name> <name><surname>Chmiel</surname> <given-names>A.</given-names></name> <name><surname>De Poli</surname> <given-names>G.</given-names></name> <name><surname>Houbert</surname> <given-names>E.</given-names></name> <name><surname>Murari</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2022</year>). <article-title>Illuminating music: Impact of color hue for background lighting on emotional arousal in piano performance videos</article-title>. <source>Front. Psychol.</source><volume>13</volume>:<fpage>828699</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2022.828699</pub-id><pub-id pub-id-type="pmid">35369229</pub-id></citation></ref>
<ref id="B192">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McNeill Whistler</surname> <given-names>J. A.</given-names></name></person-group> (<year>1978</year>). <source>Letter to the world. May 22</source><sup><italic>nd</italic></sup><italic>. [Reprinted in the Gentle Art of Making Enemies]</italic>. New York, NY: <volume>Dover</volume>, <fpage>127</fpage>&#x02013;<lpage>128</lpage>.</citation>
</ref>
<ref id="B193">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McQuarrie</surname> <given-names>E.</given-names></name> <name><surname>Mick</surname> <given-names>D. G.</given-names></name></person-group> (<year>1992</year>). <article-title>On resonance: A critical pluralistic inquiry into advertising rhetoric</article-title>. <source>J. Consumer Res.</source> <volume>19</volume>, <fpage>180</fpage>&#x02013;<lpage>197</lpage>. <pub-id pub-id-type="doi">10.1086/209295</pub-id></citation>
</ref>
<ref id="B194">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Melara</surname> <given-names>R. D.</given-names></name></person-group> (<year>1989</year>). <article-title>Dimensional interaction between color and pitch</article-title>. <source>J. Exp. Psychol.</source> <volume>15</volume>, <fpage>69</fpage>&#x02013;<lpage>79</lpage>. <pub-id pub-id-type="doi">10.1037//0096-1523.15.1.69</pub-id><pub-id pub-id-type="pmid">2522534</pub-id></citation></ref>
<ref id="B195">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Menouti</surname> <given-names>K.</given-names></name> <name><surname>Akiva-Kabiri</surname> <given-names>L.</given-names></name> <name><surname>Banissy</surname> <given-names>M. J.</given-names></name> <name><surname>Stewart</surname> <given-names>L.</given-names></name></person-group> (<year>2015</year>). <article-title>Timbre-colour synaesthesia: exploring the consistency of associations based on timbre</article-title>. <source>Cortex</source> <volume>63</volume>, <fpage>1</fpage>&#x02013;<lpage>3</lpage>. <pub-id pub-id-type="doi">10.1016/j.cortex.2014.08.009</pub-id><pub-id pub-id-type="pmid">25241393</pub-id></citation></ref>
<ref id="B196">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Millet</surname> <given-names>B.</given-names></name> <name><surname>Chattah</surname> <given-names>J.</given-names></name> <name><surname>Ahn</surname> <given-names>S.</given-names></name></person-group> (<year>2021</year>). <article-title>Soundtrack design: The impact of music on visual attention and affective responses</article-title>. <source>Appl. Ergon.</source> <volume>93</volume>:<fpage>103301</fpage>. <pub-id pub-id-type="doi">10.1016/j.apergo.2020.103301</pub-id><pub-id pub-id-type="pmid">33516045</pub-id></citation></ref>
<ref id="B197">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Minnigerode</surname> <given-names>F. A.</given-names></name> <name><surname>Ciancio</surname> <given-names>D. W.</given-names></name> <name><surname>Sbarboro</surname> <given-names>L. A.</given-names></name></person-group> (<year>1976</year>). <article-title>Matching music with paintings by Klee</article-title>. <source>Percept. Mot. Skills</source> <volume>42</volume>, <fpage>269</fpage>&#x02013;<lpage>270</lpage>. <pub-id pub-id-type="doi">10.2466/pms.1976.42.1.269</pub-id></citation>
</ref>
<ref id="B198">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Mok</surname> <given-names>J.</given-names></name></person-group> (<year>2022</year>). <source>Cross-Modal Correspondences: Different Modes, Common Codes? Investigating Musical Engagement with an Ecological Cognitive Approach</source> (<publisher-loc>Master&#x00027;s programme in Music, Communication and Technology</publisher-loc>). Department of Musicology, University of Oslo, Oslo, Norway.</citation>
</ref>
<ref id="B199">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mondloch</surname> <given-names>C. J.</given-names></name> <name><surname>Maurer</surname> <given-names>D.</given-names></name></person-group> (<year>2004</year>). <article-title>Do small white balls squeak? Pitch-object correspondences in young children</article-title>. <source>Cognit. Affect. Behav. Neurosci.</source> <volume>4</volume>, <fpage>133</fpage>&#x02013;<lpage>136</lpage>. <pub-id pub-id-type="doi">10.3758/CABN.4.2.133</pub-id><pub-id pub-id-type="pmid">15460920</pub-id></citation></ref>
<ref id="B200">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Moritz</surname> <given-names>W.</given-names></name></person-group> (<year>1997</year>). <article-title>&#x0201C;The dream of color music, and machines that made it possible,&#x0201D;</article-title> in <source>Animation World Magazine, 2.1, April</source>.</citation>
</ref>
<ref id="B201">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Motoki</surname> <given-names>K.</given-names></name> <name><surname>Marks</surname> <given-names>L. E.</given-names></name> <name><surname>Velasco</surname> <given-names>C.</given-names></name></person-group> (<year>2023</year>). <article-title>Reflections on cross-modal correspondences: Current understanding and issues for future research</article-title>. <source>Multisens. Res.</source> <volume>37</volume>, <fpage>1</fpage>&#x02013;<lpage>23</lpage>. <pub-id pub-id-type="doi">10.1163/22134808-bja10114</pub-id><pub-id pub-id-type="pmid">37963487</pub-id></citation></ref>
<ref id="B202">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Muecke</surname> <given-names>M. W.</given-names></name> <name><surname>Zach</surname> <given-names>M. S.</given-names></name></person-group> (<year>2007</year>). <source>Resonance: Essays on the Intersection of Music and Architecture.</source> <publisher-loc>Ames, IA</publisher-loc>: <publisher-name>Culicidae Press</publisher-name>.</citation>
</ref>
<ref id="B203">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Muller</surname> <given-names>J. P.</given-names></name></person-group> (<year>2010</year>). <article-title>&#x0201C;Synchronization as a sound-image relationship,&#x0201D;</article-title> in <source>See This Sound: Audiovisuology An Interdisciplinary Survey of Audiovisual Culture</source>, eds. D. Daniels, S. Naumann, and J. Thoben (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Walter K&#x000F6;nig</publisher-name>), <fpage>401</fpage>&#x02013;<lpage>413</lpage>.</citation>
</ref>
<ref id="B204">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>M&#x000FC;ller-Eberstein</surname> <given-names>M.</given-names></name> <name><surname>van Noord</surname> <given-names>N.</given-names></name></person-group> (<year>2019</year>). <article-title>&#x0201C;Translating visual art into music,&#x0201D;</article-title> in <source>IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)</source> (<publisher-loc>Seoul</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>3117</fpage>&#x02013;<lpage>3120</lpage>.</citation>
</ref>
<ref id="B205">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Murari</surname> <given-names>M.</given-names></name> <name><surname>Chmiel</surname> <given-names>A.</given-names></name> <name><surname>Tiepolo</surname> <given-names>E.</given-names></name> <name><surname>Zhang</surname> <given-names>J. D.</given-names></name> <name><surname>Canazza</surname> <given-names>S.</given-names></name> <name><surname>Rod,&#x000E0;</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2020</year>). <article-title>&#x0201C;Key clarity is blue, relaxed, and maluma: Machine learning used to discover cross-modal connections between sensory items and the music they spontaneously evoke,&#x0201D;</article-title> in <source>KEER 2020, AISC 1256</source>, eds. H. Shoji, S. Koyama, T. Kato, K. Muramatsu, T. Yamanaka, P. L&#x000E9;vy, K. Chen, and A. M. Lokman (<publisher-loc>Singapore</publisher-loc>: <publisher-name>Springer Nature Singapore</publisher-name>), <fpage>214</fpage>&#x02013;<lpage>223</lpage>.</citation>
</ref>
<ref id="B206">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Murari</surname> <given-names>M.</given-names></name> <name><surname>Rod,&#x000E0;</surname> <given-names>A.</given-names></name> <name><surname>Canazza</surname> <given-names>S.</given-names></name> <name><surname>De Poli</surname> <given-names>G.</given-names></name> <name><surname>Da Pos</surname> <given-names>O.</given-names></name></person-group> (<year>2015</year>). <article-title>Is Vivaldi smooth and takete? Nonverbal sensory scales for describing music qualities</article-title>. <source>J. New Music Res.</source> <volume>44</volume>, <fpage>359</fpage>&#x02013;<lpage>372</lpage>. <pub-id pub-id-type="doi">10.1080/09298215.2015.1101475</pub-id></citation>
</ref>
<ref id="B207">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Myers</surname> <given-names>C. S.</given-names></name></person-group> (<year>1911</year>). <article-title>A case of synaesthesia</article-title>. <source>Br. J. Psychol.</source> <volume>4</volume>, <fpage>228</fpage>&#x02013;<lpage>238</lpage>. <pub-id pub-id-type="doi">10.1111/j.2044-8295.1911.tb00045.x</pub-id></citation>
</ref>
<ref id="B208">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Myers</surname> <given-names>C. S.</given-names></name></person-group> (<year>1914</year>). <article-title>Two cases of synaesthesia</article-title>. <source>Br. J. Psychol</source> <volume>7</volume>, <fpage>112</fpage>&#x02013;<lpage>117</lpage>. <pub-id pub-id-type="doi">10.1111/j.2044-8295.1914.tb00246.x</pub-id></citation>
</ref>
<ref id="B209">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Newton</surname> <given-names>I.</given-names></name></person-group> (<year>1704</year>). <source>Opticks</source>. Available online at <ext-link ext-link-type="uri" xlink:href="https://archive.org/details/opticksortreatis00newt">https://archive.org/details/opticksortreatis00newt</ext-link> (Accessed June 15, 2025).</citation>
</ref>
<ref id="B210">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Nordau</surname> <given-names>M.</given-names></name></person-group> (<year>1898</year>). <source>Degeneration [English Translation of Nordau, 1892]</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Heinemann</publisher-name>.</citation>
</ref>
<ref id="B211">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nosal</surname> <given-names>A. P.</given-names></name> <name><surname>Keenan</surname> <given-names>E. A.</given-names></name> <name><surname>Hastings</surname> <given-names>P. A.</given-names></name> <name><surname>Gneezy</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). <article-title>The effect of background music in shark documentaries on viewers&#x00027; perceptions of sharks</article-title>. <source>PLoS ONE</source> <volume>11</volume>:<fpage>e0159279</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0159279</pub-id><pub-id pub-id-type="pmid">27487003</pub-id></citation></ref>
<ref id="B212">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Occelli</surname> <given-names>V.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Zampini</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Audiotactile interactions in temporal perception</article-title>. <source>Psychon. Bull. Rev.</source> <volume>18</volume>, <fpage>429</fpage>&#x02013;<lpage>454</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-011-0070-4</pub-id><pub-id pub-id-type="pmid">21400125</pub-id></citation></ref>
<ref id="B213">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Odgaard</surname> <given-names>E. C.</given-names></name> <name><surname>Arieh</surname> <given-names>Y.</given-names></name> <name><surname>Marks</surname> <given-names>L. E.</given-names></name></person-group> (<year>2003</year>). <article-title>Cross-modal enhancement of perceived brightness: sensory interaction versus response bias</article-title>. <source>Percept. Psychophys.</source> <volume>65</volume>, <fpage>123</fpage>&#x02013;<lpage>132</lpage>. <pub-id pub-id-type="doi">10.3758/BF03194789</pub-id><pub-id pub-id-type="pmid">12699315</pub-id></citation></ref>
<ref id="B214">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Odgaard</surname> <given-names>E. C.</given-names></name> <name><surname>Arieh</surname> <given-names>Y.</given-names></name> <name><surname>Marks</surname> <given-names>L. E.</given-names></name></person-group> (<year>2004</year>). <article-title>Brighter noise: Sensory enhancement of perceived loudness by concurrent visual stimulation</article-title>. <source>Cognit. Affect. Behav. Neurosci.</source> <volume>4</volume>, <fpage>127</fpage>&#x02013;<lpage>132</lpage>. <pub-id pub-id-type="doi">10.3758/CABN.4.2.127</pub-id><pub-id pub-id-type="pmid">15460919</pub-id></citation></ref>
<ref id="B215">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x00027;Leary</surname> <given-names>A.</given-names></name> <name><surname>Rhodes</surname> <given-names>G.</given-names></name></person-group> (<year>1984</year>). <article-title>Cross-modal effects on visual and auditory object perception</article-title>. <source>Percept. Psychophys.</source> <volume>35</volume>, <fpage>565</fpage>&#x02013;<lpage>569</lpage>. <pub-id pub-id-type="doi">10.3758/BF03205954</pub-id><pub-id pub-id-type="pmid">6483560</pub-id></citation></ref>
<ref id="B216">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x00027;Malley</surname> <given-names>G.</given-names></name></person-group> (<year>1957</year>). <article-title>Literary synesthesia</article-title>. <source>J. Aesthet. Art Critic.</source> <volume>15</volume>, <fpage>391</fpage>&#x02013;<lpage>411</lpage>. <pub-id pub-id-type="doi">10.2307/427153</pub-id></citation>
</ref>
<ref id="B217">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ortmann</surname> <given-names>O.</given-names></name></person-group> (<year>1933</year>). <article-title>Theories of synesthesia in the light of a case of colored hearing</article-title>. <source>Hum. Biol.</source> <volume>5</volume>, <fpage>155</fpage>&#x02013;<lpage>211</lpage>.</citation>
</ref>
<ref id="B218">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Osgood</surname> <given-names>C. E.</given-names></name> <name><surname>Suci</surname> <given-names>G. J.</given-names></name> <name><surname>Tannenbaum</surname> <given-names>P. H.</given-names></name></person-group> (<year>1957</year>). <source>The Measurement of Meaning</source>. <publisher-loc>Urbana</publisher-loc>: <publisher-name>University of Illinois Press</publisher-name>.</citation>
</ref>
<ref id="B219">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Palmer</surname> <given-names>S. E.</given-names></name> <name><surname>Griscom</surname> <given-names>W.</given-names></name></person-group> (<year>2013</year>). <article-title>Accounting for taste: Individual differences in preference for harmony</article-title>. <source>Psychon. Bull. Rev.</source> <volume>20</volume>, <fpage>453</fpage>&#x02013;<lpage>461</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-012-0355-2</pub-id><pub-id pub-id-type="pmid">23242798</pub-id></citation></ref>
<ref id="B220">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Palmer</surname> <given-names>S. E.</given-names></name> <name><surname>Langlois</surname> <given-names>T. A.</given-names></name> <name><surname>Schloss</surname> <given-names>K. B.</given-names></name></person-group> (<year>2016</year>). <article-title>Music-to-color associations of single-line piano melodies in non-synesthetes</article-title>. <source>Multisens. Res.</source> <volume>29</volume>, <fpage>157</fpage>&#x02013;<lpage>193</lpage>. <pub-id pub-id-type="doi">10.1163/22134808-00002486</pub-id><pub-id pub-id-type="pmid">27311295</pub-id></citation></ref>
<ref id="B221">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Palmer</surname> <given-names>S. E.</given-names></name> <name><surname>Schloss</surname> <given-names>K. B.</given-names></name> <name><surname>Xu</surname> <given-names>Z.</given-names></name> <name><surname>Prado-Le&#x000F3;n</surname> <given-names>L. R.</given-names></name></person-group> (<year>2013</year>). <article-title>Music-color associations are mediated by emotion</article-title>. <source>Proc. Natl. Acad. Sci. USA.</source> <volume>110</volume>, <fpage>8836</fpage>&#x02013;<lpage>8841</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1212562110</pub-id><pub-id pub-id-type="pmid">23671106</pub-id></citation></ref>
<ref id="B222">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parise</surname> <given-names>C.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2009</year>). <article-title>&#x02018;When birds of a feather flock together&#x00027;: Synesthetic correspondences modulate audiovisual integration in non-synesthetes</article-title>. <source>PLoS ONE</source> <volume>4</volume>:<fpage>e5664</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0005664</pub-id><pub-id pub-id-type="pmid">19471644</pub-id></citation></ref>
<ref id="B223">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parise</surname> <given-names>C. V.</given-names></name> <name><surname>Harrar</surname> <given-names>V.</given-names></name> <name><surname>Ernst</surname> <given-names>M. O.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2013</year>). <article-title>Cross-correlation between auditory and visual signals promotes multisensory integration</article-title>. <source>Multisens. Res.</source> <volume>26</volume>, <fpage>307</fpage>&#x02013;<lpage>316</lpage>. <pub-id pub-id-type="doi">10.1163/22134808-00002417</pub-id><pub-id pub-id-type="pmid">23964482</pub-id></citation></ref>
<ref id="B224">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parise</surname> <given-names>C. V.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>Audiovisual crossmodal correspondences and sound symbolism: A study using the implicit association test</article-title>. <source>Exp. Brain Res.</source> <volume>220</volume>, <fpage>319</fpage>&#x02013;<lpage>333</lpage>. <pub-id pub-id-type="doi">10.1007/s00221-012-3140-6</pub-id><pub-id pub-id-type="pmid">22706551</pub-id></citation></ref>
<ref id="B225">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parise</surname> <given-names>C. V.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Ernst</surname> <given-names>M.</given-names></name></person-group> (<year>2012</year>). <article-title>When correlation implies causation in multisensory integration</article-title>. <source>Curr. Biol.</source> <volume>22</volume>, <fpage>46</fpage>&#x02013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2011.11.039</pub-id><pub-id pub-id-type="pmid">22177899</pub-id></citation></ref>
<ref id="B226">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parke</surname> <given-names>R.</given-names></name> <name><surname>Chew</surname> <given-names>E.</given-names></name> <name><surname>Kyriakakis</surname> <given-names>C.</given-names></name></person-group> (<year>2007</year>). <article-title>Quantitative and visual analysis of the impact of music on perceived emotion of film</article-title>. <source>Comp. Entertainm.</source> <volume>5</volume>:<fpage>5</fpage>. <pub-id pub-id-type="doi">10.1145/1316511.1316516</pub-id></citation>
</ref>
<ref id="B227">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parncutt</surname> <given-names>R.</given-names></name></person-group> (<year>2014</year>). <article-title>The emotional connotations of major versus minor tonality: one or more origins?</article-title> <source>Musicae Scientiae</source> <volume>18</volume>, <fpage>324</fpage>&#x02013;<lpage>353</lpage>. <pub-id pub-id-type="doi">10.1177/1029864914542842</pub-id></citation>
</ref>
<ref id="B228">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Parncutt</surname> <given-names>R.</given-names></name></person-group> (<year>2024</year>). <source>Psychoacoustic Foundations of Major-Minor Tonality</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation>
</ref>
<ref id="B229">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parrott</surname> <given-names>A. C.</given-names></name></person-group> (<year>1982</year>). <article-title>Effects of paintings and music, both alone and in combination, on emotional judgment</article-title>. <source>Percept. Mot. Skills</source> <volume>54</volume>, <fpage>635</fpage>&#x02013;<lpage>641</lpage>. <pub-id pub-id-type="doi">10.2466/pms.1982.54.2.635</pub-id></citation>
</ref>
<ref id="B230">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peacock</surname> <given-names>K.</given-names></name></person-group> (<year>1985</year>). <article-title>Synesthetic perception: Alexander Scriabin&#x00027;s color hearing</article-title>. <source>Music Percept.</source> <volume>2</volume>, <fpage>483</fpage>&#x02013;<lpage>506</lpage>. <pub-id pub-id-type="doi">10.2307/40285315</pub-id></citation>
</ref>
<ref id="B231">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peacock</surname> <given-names>K.</given-names></name></person-group> (<year>1988</year>). <article-title>Instruments to perform color-music: Two centuries of technological experimentation</article-title>. <source>Leonardo</source> <volume>21</volume>, <fpage>397</fpage>&#x02013;<lpage>406</lpage>. <pub-id pub-id-type="doi">10.2307/1578702</pub-id></citation>
</ref>
<ref id="B232">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peretti</surname> <given-names>P.</given-names></name></person-group> (<year>1972</year>). <article-title>A study of student correlations between music and six paintings by Klee</article-title>. <source>J. Res. Music Educ.</source><volume>20</volume>, <fpage>501</fpage>&#x02013;<lpage>504</lpage>. <pub-id pub-id-type="doi">10.2307/3343810</pub-id></citation>
</ref>
<ref id="B233">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Piesse</surname> <given-names>G. W. S.</given-names></name></person-group> (<year>1867</year>). <source>The Art of Perfumery and the Methods of Obtaining the Odors of Plants: With Instructions for the Manufacture of Perfumes for the Handkerchief, Scented Powders, Odorous Vinegars, Dentifrices, Pomatums, Cosmetics, Perfumed Soap, Etc., to which is Added an Appendix on Preparing Artificial Fruit-essences, Etc</source>. <publisher-loc>Philadelphia</publisher-loc>: <publisher-name>Lindsay and Blakiston</publisher-name>.</citation>
</ref>
<ref id="B234">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pimentel</surname> <given-names>O.</given-names></name> <name><surname>Chuquichambi</surname> <given-names>E. G.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Velasco</surname> <given-names>C.</given-names></name></person-group> (<year>2025</year>). <article-title>The diatonic sound of scent imagery</article-title>. <source>Perception</source>. <volume>3</volume>:<fpage>3010066251342011</fpage>. <pub-id pub-id-type="doi">10.1177/03010066251342011</pub-id><pub-id pub-id-type="pmid">40457964</pub-id></citation></ref>
<ref id="B235">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Plummer</surname> <given-names>H. C.</given-names></name></person-group> (<year>1915</year>). <article-title>Color music&#x02013;A new art created with the aid of science. The color organ used in Scriabine&#x00027;s symphony &#x0201C;Prometheus&#x0201D;</article-title>. <source>Scient. Am.</source> 112, <volume>343</volume>, <fpage>350</fpage>&#x02013;<lpage>351</lpage>. <pub-id pub-id-type="doi">10.1038/scientificamerican04101915-343</pub-id></citation>
</ref>
<ref id="B236">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Poast</surname> <given-names>M.</given-names></name></person-group> (<year>2000</year>). <article-title>Color music: Visual color notation for musical expression</article-title>. <source>Leonardo</source> <volume>33</volume>, <fpage>215</fpage>&#x02013;<lpage>221</lpage>. <pub-id pub-id-type="doi">10.1162/002409400552531</pub-id></citation>
</ref>
<ref id="B237">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pridmore</surname> <given-names>R. W.</given-names></name></person-group> (<year>1992</year>). <article-title>Music and color: Relations in the psychophysical perspective</article-title>. <source>Color Res. Appl.</source> <volume>17</volume>, <fpage>57</fpage>&#x02013;<lpage>61</lpage>. <pub-id pub-id-type="doi">10.1002/col.5080170110</pub-id><pub-id pub-id-type="pmid">24653707</pub-id></citation></ref>
<ref id="B238">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pursey</surname> <given-names>T.</given-names></name> <name><surname>Lomas</surname> <given-names>D.</given-names></name></person-group> (<year>2018</year>). <article-title>Tate Sensorium: An experiment in multisensory immersive design</article-title>. <source>Senses Soc.</source> <volume>13</volume>, <fpage>354</fpage>&#x02013;<lpage>366</lpage>. <pub-id pub-id-type="doi">10.1080/17458927.2018.1516026</pub-id></citation>
</ref>
<ref id="B239">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rahne</surname> <given-names>T.</given-names></name> <name><surname>B&#x000F6;ckmann</surname> <given-names>M.</given-names></name> <name><surname>Von Specht</surname> <given-names>H.</given-names></name> <name><surname>Sussman</surname> <given-names>E. S.</given-names></name></person-group> (<year>2007</year>). <article-title>Visual cues can modulate integration and segregation of objects in auditory scene analysis</article-title>. <source>Brain Res.</source> <volume>1144</volume>, <fpage>127</fpage>&#x02013;<lpage>135</lpage>. <pub-id pub-id-type="doi">10.1016/j.brainres.2007.01.074</pub-id><pub-id pub-id-type="pmid">17306232</pub-id></citation></ref>
<ref id="B240">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rahne</surname> <given-names>T.</given-names></name> <name><surname>Deike</surname> <given-names>S.</given-names></name> <name><surname>Selezneva</surname> <given-names>E.</given-names></name> <name><surname>Brosch</surname> <given-names>M.</given-names></name> <name><surname>K&#x000F6;nig</surname> <given-names>R.</given-names></name> <name><surname>Scheich</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2008</year>). <article-title>A multilevel and cross-modal approach towards neuronal mechanisms of auditory streaming</article-title>. <source>Brain Res.</source> <volume>1220</volume>, <fpage>118</fpage>&#x02013;<lpage>131</lpage>. <pub-id pub-id-type="doi">10.1016/j.brainres.2007.08.011</pub-id><pub-id pub-id-type="pmid">17765207</pub-id></citation></ref>
<ref id="B241">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ran&#x0010D;i&#x00107;</surname> <given-names>K.</given-names></name> <name><surname>Markovi&#x00107;</surname> <given-names>S.</given-names></name></person-group> (<year>2019</year>). <article-title>The perceptual and aesthetic aspects of the music-paintings congruence</article-title>. <source>Vision</source> <volume>3</volume>:<fpage>65</fpage>. <pub-id pub-id-type="doi">10.3390/vision3040065</pub-id><pub-id pub-id-type="pmid">31756887</pub-id></citation></ref>
<ref id="B242">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ravignani</surname> <given-names>A.</given-names></name> <name><surname>Sonnweber</surname> <given-names>R.</given-names></name></person-group> (<year>2017</year>). <article-title>Chimpanzees process structural isomorphisms across sensory modalities</article-title>. <source>Cognition</source> <volume>161</volume>, <fpage>74</fpage>&#x02013;<lpage>79</lpage>. <pub-id pub-id-type="doi">10.1016/j.cognition.2017.01.005</pub-id><pub-id pub-id-type="pmid">28135575</pub-id></citation></ref>
<ref id="B243">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Recanzone</surname> <given-names>G. H.</given-names></name></person-group> (<year>2003</year>). <article-title>Auditory influences on visual temporal rate perception</article-title>. <source>J. Neurophysiol.</source> <volume>89</volume>, <fpage>1078</fpage>&#x02013;<lpage>1093</lpage>. <pub-id pub-id-type="doi">10.1152/jn.00706.2002</pub-id><pub-id pub-id-type="pmid">12574482</pub-id></citation></ref>
<ref id="B244">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Recanzone</surname> <given-names>G. H.</given-names></name></person-group> (<year>2009</year>). <article-title>Interactions of auditory and visual stimuli in space and time</article-title>. <source>Hear. Res.</source> <volume>258</volume>, <fpage>89</fpage>&#x02013;<lpage>99</lpage>. <pub-id pub-id-type="doi">10.1016/j.heares.2009.04.009</pub-id><pub-id pub-id-type="pmid">19393306</pub-id></citation></ref>
<ref id="B245">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Reuter</surname> <given-names>C.</given-names></name> <name><surname>Jewanski</surname> <given-names>J.</given-names></name> <name><surname>Saitis</surname> <given-names>C.</given-names></name> <name><surname>Czedik-Eysenberg</surname> <given-names>I.</given-names></name> <name><surname>Siddiq</surname> <given-names>S.</given-names></name> <name><surname>Oehler</surname> <given-names>M.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Colors and timbres&#x02014;consistent color-timbre mappings at non-synesthetic individuals,&#x0201D;</article-title> in <source>Proceedings of the 34</source><sup><italic>th</italic></sup> <italic>Jahrestagung der Deutschen Gesellschaft f&#x000FC;r Musikpsychologie: Musik im audiovisuellen Kontext</italic> (<publisher-loc>Gie&#x000DF;en</publisher-loc>: <publisher-name>Waxmann Verlag</publisher-name>).</citation>
</ref>
<ref id="B246">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reymore</surname> <given-names>L.</given-names></name> <name><surname>Lindsey</surname> <given-names>D. T.</given-names></name></person-group> (<year>2025</year>). <article-title>Color and tone color: audiovisual crossmodal correspondences with musical instrument timbre</article-title>. <source>Front. Psychol.</source><volume>15</volume>:<fpage>1520131</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2024.1520131</pub-id><pub-id pub-id-type="pmid">39839933</pub-id></citation></ref>
<ref id="B247">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Reymore</surname> <given-names>L.</given-names></name> <name><surname>Noble</surname> <given-names>J.</given-names></name> <name><surname>Saitis</surname> <given-names>C.</given-names></name> <name><surname>Traube</surname> <given-names>C.</given-names></name> <name><surname>Wallmark</surname> <given-names>Z.</given-names></name></person-group> (<year>2023</year>). <article-title>Timbre semantic associations vary both between and within instruments: an empirical study empirical study incorporating register and pitch height</article-title>. <source>Music Percept.</source> <volume>40</volume>, <fpage>253</fpage>&#x02013;<lpage>274</lpage>. <pub-id pub-id-type="doi">10.1525/mp.2023.40.3.253</pub-id></citation>
</ref>
<ref id="B248">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Rimington</surname> <given-names>A. W.</given-names></name></person-group> (<year>1895</year>). <article-title>&#x0201C;A new art: Colour-music,&#x0201D;</article-title> in <source>A paper read at St. James&#x00027;s Hall on June 6, 1895, published in pamphlet form by Messrs Spottiswoode and Co., New St. Square. June 13, 1895. Reprinted in &#x02018;Colour Music, the Art of Light</source>&#x00027;, ed. A. B. Klein (<publisher-loc>London</publisher-loc>: <publisher-name>Lockwood</publisher-name>), <fpage>256</fpage>&#x02013;<lpage>261</lpage>.</citation>
</ref>
<ref id="B249">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Rimington</surname> <given-names>A. W.</given-names></name></person-group> (<year>1911</year>). <source>Colour Music: The art of Mobile Colour</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Frederick A</publisher-name>. Stokes.</citation>
</ref>
<ref id="B250">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rimington</surname> <given-names>A. W.</given-names></name></person-group> (<year>1915</year>). <article-title>Color music</article-title>. <source>Sci. Am.</source> <volume>113</volume>, <fpage>79</fpage>. <pub-id pub-id-type="doi">10.1038/scientificamerican07241915-79</pub-id></citation>
</ref>
<ref id="B251">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Root</surname> <given-names>N.</given-names></name> <name><surname>Chkhaidze</surname> <given-names>A.</given-names></name> <name><surname>Melero</surname> <given-names>H.</given-names></name> <name><surname>Sidoroff-Dorso</surname> <given-names>A.</given-names></name> <name><surname>Volberg</surname> <given-names>G.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2025</year>). <article-title>How &#x0201C;diagnostic&#x0201D; criteria interact to shape synesthetic behavior: the role of self-report and test&#x02013;retest consistency in synesthesia research</article-title>. <source>Conscious. Cogn.</source> <volume>129</volume>:<fpage>103819</fpage>. <pub-id pub-id-type="doi">10.1016/j.concog.2025.103819</pub-id><pub-id pub-id-type="pmid">39922140</pub-id></citation></ref>
<ref id="B252">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Root</surname> <given-names>R. T.</given-names></name> <name><surname>Ross</surname> <given-names>S.</given-names></name></person-group> (<year>1965</year>). <article-title>Further validation of subjective scales for loudness and brightness by means of cross-modality matching</article-title>. <source>Am. J. Psychol</source>. <volume>78</volume>, <fpage>285</fpage>&#x02013;<lpage>289</lpage>. <pub-id pub-id-type="doi">10.2307/1420502</pub-id><pub-id pub-id-type="pmid">14290758</pub-id></citation></ref>
<ref id="B253">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rosenthal</surname> <given-names>R.</given-names></name></person-group> (<year>1979</year>). <article-title>The &#x0201C;file drawer problem&#x0201D; and tolerance for null results</article-title>. <source>Psychol. Bull.</source> <volume>86</volume>, <fpage>638</fpage>&#x02013;<lpage>641</lpage>. <pub-id pub-id-type="doi">10.1037/0033-2909.86.3.638</pub-id></citation>
</ref>
<ref id="B254">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Russet</surname> <given-names>R.</given-names></name> <name><surname>Starr</surname> <given-names>C.</given-names></name></person-group> (<year>1988</year>). <source>Experimental Animation: Origins of a New Art</source>. <publisher-loc>Boston</publisher-loc>: <publisher-name>Da Capo Press</publisher-name>.</citation>
</ref>
<ref id="B255">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sabaneev</surname> <given-names>L.</given-names></name> <name><surname>Pring</surname> <given-names>S. W.</given-names></name></person-group> (<year>1929</year>). <article-title>The relation between sound and colour</article-title>. <source>Music Letters</source> <volume>10</volume>, <fpage>266</fpage>&#x02013;<lpage>277</lpage>. <pub-id pub-id-type="doi">10.1093/ml/10.3.266</pub-id></citation>
</ref>
<ref id="B256">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sabaneyev</surname> <given-names>L.</given-names></name></person-group> (<year>1911</year>). <article-title>On sound-color accordance</article-title>. <source>Muzyka</source> <volume>9</volume>, <fpage>196</fpage>&#x02013;<lpage>200</lpage>.</citation>
</ref>
<ref id="B257">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saitis</surname> <given-names>C.</given-names></name> <name><surname>Wallmark</surname> <given-names>Z.</given-names></name></person-group> (<year>2024</year>). <article-title>Timbral brightness perception investigated through multimodal interference</article-title>. <source>Attent. Percept. Psychophys.</source> <volume>86</volume>, <fpage>1835</fpage>&#x02013;<lpage>1845</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-024-02934-2</pub-id><pub-id pub-id-type="pmid">39090510</pub-id></citation></ref>
<ref id="B258">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saitis</surname> <given-names>C.</given-names></name> <name><surname>Weinzierl</surname> <given-names>S.</given-names></name> <name><surname>von Kriegstein</surname> <given-names>K.</given-names></name> <name><surname>Ystad</surname> <given-names>S.</given-names></name> <name><surname>Cuskley</surname> <given-names>C.</given-names></name></person-group> (<year>2020</year>). <article-title>Timbre semantics through the lens of crossmodal correspondences: a new way of asking old questions</article-title>. <source>Acoust. Sci. Technol.</source> <volume>41</volume>, <fpage>365</fpage>&#x02013;<lpage>368</lpage>. <pub-id pub-id-type="doi">10.1250/ast.41.365</pub-id></citation>
</ref>
<ref id="B259">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schifferstein</surname> <given-names>H. N. J.</given-names></name> <name><surname>Tanudjaja</surname> <given-names>I.</given-names></name></person-group> (<year>2004</year>). <article-title>Visualizing fragrances through colors: The mediating role of emotions</article-title>. <source>Perception</source> <volume>33</volume>, <fpage>1249</fpage>&#x02013;<lpage>1266</lpage>. <pub-id pub-id-type="doi">10.1068/p5132</pub-id><pub-id pub-id-type="pmid">15693669</pub-id></citation></ref>
<ref id="B260">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schloss</surname> <given-names>K. B.</given-names></name> <name><surname>Palmer</surname> <given-names>S. E.</given-names></name></person-group> (<year>2011</year>). <article-title>Aesthetic response to color combinations: Preference, harmony, and similarity</article-title>. <source>Attent. Percept. Psychophys</source> <volume>73</volume>, <fpage>551</fpage>&#x02013;<lpage>571</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-010-0027-0</pub-id><pub-id pub-id-type="pmid">21264737</pub-id></citation></ref>
<ref id="B261">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Scholes</surname> <given-names>P.</given-names></name></person-group> (<year>1970</year>). <article-title>&#x0201C;Colour and music,&#x0201D;</article-title> in <source>The Oxford Companion to Music</source> (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>202</fpage>&#x02013;<lpage>210</lpage>.</citation>
</ref>
<ref id="B262">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schwartz</surname> <given-names>J. L.</given-names></name> <name><surname>Grimault</surname> <given-names>N.</given-names></name> <name><surname>Hup,&#x000E9;</surname> <given-names>J. M.</given-names></name> <name><surname>Moore</surname> <given-names>B. C.</given-names></name> <name><surname>Pressnitzer</surname> <given-names>D.</given-names></name></person-group> (<year>2012</year>). <article-title>Multistability in perception: Binding sensory modalities, an overview</article-title>. <source>Philosoph. Trans. Royal Soc. B: Biol. Sci.</source> <volume>367</volume>, <fpage>896</fpage>&#x02013;<lpage>905</lpage>. <pub-id pub-id-type="doi">10.1098/rstb.2011.0254</pub-id><pub-id pub-id-type="pmid">22371612</pub-id></citation></ref>
<ref id="B263">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Scriabin</surname> <given-names>A.</given-names></name></person-group> (<year>1910</year>). <article-title>&#x0201C;Prometheus, op. 60, tone-poem for orchestra and projected colour lights,&#x0201D;</article-title> in <source>Original Score in British Museum Library</source>.</citation>
</ref>
<ref id="B264">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sebba</surname> <given-names>R.</given-names></name></person-group> (<year>1991</year>). <article-title>Structural correspondence between music and color</article-title>. <source>Color Res. Appl.</source> <volume>16</volume>, <fpage>81</fpage>&#x02013;<lpage>88</lpage>. <pub-id pub-id-type="doi">10.1002/col.5080160206</pub-id></citation>
</ref>
<ref id="B265">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shams</surname> <given-names>L.</given-names></name> <name><surname>Kamitani</surname> <given-names>Y.</given-names></name> <name><surname>Shimojo</surname> <given-names>S.</given-names></name></person-group> (<year>2000</year>). <article-title>What you see is what you hear</article-title>. <source>Nature</source> <volume>408</volume>, <fpage>788</fpage>&#x02013;<lpage>788</lpage>. <pub-id pub-id-type="doi">10.1038/35048669</pub-id><pub-id pub-id-type="pmid">11130706</pub-id></citation></ref>
<ref id="B266">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shaw Miller</surname> <given-names>S.</given-names></name></person-group> (<year>2006</year>). <article-title>Visual music and the case for rigorous thinking</article-title>. <source>The Art Book</source> <volume>13</volume>, <fpage>3</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-8357.2006.00623.x</pub-id></citation>
</ref>
<ref id="B267">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Shaw-Miller</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <source>Eye hEar: The Visual in Music</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Routledge</publisher-name>.</citation>
</ref>
<ref id="B268">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shipley</surname> <given-names>T.</given-names></name></person-group> (<year>1964</year>). <article-title>Auditory flutter-driving of visual flicker</article-title>. <source>Science</source> <volume>145</volume>, <fpage>1328</fpage>&#x02013;<lpage>1330</lpage>. <pub-id pub-id-type="doi">10.1126/science.145.3638.1328</pub-id><pub-id pub-id-type="pmid">14173429</pub-id></citation></ref>
<ref id="B269">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Siefkes</surname> <given-names>M.</given-names></name> <name><surname>Arielli</surname> <given-names>E.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x0201C;An experimental approach to multimodality: How musical and architectural styles interact in aesthetic perception,&#x0201D;</article-title> in <source>Building Bridges for Multimodal Research: International Perspectives on Theories and Practices of Multimodal Analysis</source>, <fpage>246</fpage>&#x02013;<lpage>265</lpage>.</citation>
</ref>
<ref id="B270">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Siegel</surname> <given-names>L.</given-names></name></person-group> (<year>1974</year>). <article-title>Synaesthesia and the paintings of Caspar David Friedrich</article-title>. <source>Art J.</source> <volume>33</volume>, <fpage>196</fpage>&#x02013;<lpage>204</lpage>. <pub-id pub-id-type="doi">10.1080/00043249.1974.10793214</pub-id></citation>
</ref>
<ref id="B271">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Silas</surname> <given-names>S.</given-names></name> <name><surname>Baker</surname> <given-names>D. J.</given-names></name> <name><surname>M&#x000FC;llensiefen</surname> <given-names>D.</given-names></name></person-group> (<year>2024</year>). <article-title>Musical manipulation of visual scenes in video, film, and TV advertisements: a large-scale investigation into the implicit effects of sonic branding</article-title>. <source>J. Advert. Res.</source> <volume>64</volume>, <fpage>192</fpage>&#x02013;<lpage>212</lpage>. <pub-id pub-id-type="doi">10.2501/JAR-2024-013</pub-id></citation>
</ref>
<ref id="B272">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Simpson</surname> <given-names>R. H.</given-names></name> <name><surname>Quinn</surname> <given-names>M.</given-names></name> <name><surname>Ausubel</surname> <given-names>D. P.</given-names></name></person-group> (<year>1956</year>). <article-title>Synaesthesia in children: association of colors with pure tone frequencies</article-title>. <source>J. Genetic Psychol.</source> <volume>89</volume>, <fpage>95</fpage>&#x02013;<lpage>103</lpage>. <pub-id pub-id-type="doi">10.1080/00221325.1956.10532990</pub-id><pub-id pub-id-type="pmid">13429016</pub-id></citation></ref>
<ref id="B273">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sirius</surname> <given-names>G.</given-names></name> <name><surname>Clarke</surname> <given-names>E. F.</given-names></name></person-group> (<year>1994</year>). <article-title>The perception of audiovisual relationships: a preliminary study</article-title>. <source>Psychomusicology</source> <volume>13</volume>, <fpage>119</fpage>&#x02013;<lpage>132</lpage>. <pub-id pub-id-type="doi">10.1037/h0094099</pub-id></citation>
</ref>
<ref id="B274">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Smith</surname> <given-names>M. W.</given-names></name></person-group> (<year>2007</year>). <source>The Total Work of Art: From Bayreuth to Cyberspace</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Routledge</publisher-name>.</citation>
</ref>
<ref id="B275">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Soto-Faraco</surname> <given-names>S.</given-names></name> <name><surname>Kingstone</surname> <given-names>A.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2003</year>). <article-title>Multisensory contributions to the perception of motion</article-title>. <source>Neuropsychologia</source> <volume>41</volume>, <fpage>1847</fpage>&#x02013;<lpage>1862</lpage>. <pub-id pub-id-type="doi">10.1016/S0028-3932(03)00185-4</pub-id><pub-id pub-id-type="pmid">14527547</pub-id></citation></ref>
<ref id="B276">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sourav</surname> <given-names>S.</given-names></name> <name><surname>R&#x000F6;der</surname> <given-names>B.</given-names></name> <name><surname>Ambsdorf</surname> <given-names>F.</given-names></name> <name><surname>Melissari</surname> <given-names>A.</given-names></name> <name><surname>Arvaniti</surname> <given-names>M.</given-names></name> <name><surname>Vatakis</surname> <given-names>A.</given-names></name></person-group> (<year>2025</year>). <article-title>Five experiments, including a pre-registered replication, and mini meta-analyses find no evidence that sound-shape associations modulate audiovisual temporal order judgements</article-title>. <source>J. Exp. Psychol.: General</source> <volume>154</volume>, <fpage>522</fpage>&#x02013;<lpage>532</lpage>. <pub-id pub-id-type="doi">10.1037/xge0001641</pub-id><pub-id pub-id-type="pmid">39298202</pub-id></citation></ref>
<ref id="B277">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>South</surname> <given-names>W.</given-names></name></person-group> (<year>2001</year>). <source>Color, Myth, and Music: Stanton Macdonald-Wright and Synchromism</source>. <publisher-loc>Raleigh</publisher-loc>: <publisher-name>North Carolina Museum of Art</publisher-name>.</citation>
</ref>
<ref id="B278">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>Crossmodal correspondences: a tutorial review</article-title>. <source>Attent. Percept. Psychophys.</source> <volume>73</volume>, <fpage>971</fpage>&#x02013;<lpage>995</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-010-0073-7</pub-id><pub-id pub-id-type="pmid">21264748</pub-id></citation></ref>
<ref id="B279">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2012</year>). <article-title>Managing sensory expectations concerning products and brands: Capitalizing on the potential of sound and shape symbolism</article-title>. <source>J. Consumer Psychol.</source> <volume>22</volume>, <fpage>37</fpage>&#x02013;<lpage>54</lpage>. <pub-id pub-id-type="doi">10.1016/j.jcps.2011.09.004</pub-id></citation>
</ref>
<ref id="B280">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2015</year>). <article-title>&#x0201C;Cross-modal perceptual organization,&#x0201D;</article-title> in <source>The Oxford Handbook of Perceptual Organization</source>, ed. J. Wagemans (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>649</fpage>&#x02013;<lpage>664</lpage>.</citation>
</ref>
<ref id="B281">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Multisensory perception,&#x0201D;</article-title> in <source>The Stevens&#x00027; Handbook of Experimental Psychology and Cognitive Neuroscience</source>, eds. J. Wixted and J. Serences (<publisher-loc>Hoboken, NJ</publisher-loc>: <publisher-name>John Wiley and Sons</publisher-name>), <fpage>1</fpage>&#x02013;<lpage>56</lpage>.</citation>
</ref>
<ref id="B282">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2019</year>). <article-title>On the relative nature of (pitch-based) crossmodal correspondences</article-title>. <source>Multisens. Res.</source> <volume>32</volume>, <fpage>235</fpage>&#x02013;<lpage>265</lpage>. <pub-id pub-id-type="doi">10.1163/22134808-20191407</pub-id><pub-id pub-id-type="pmid">31059485</pub-id></citation></ref>
<ref id="B283">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2020a</year>). <article-title>Assessing the role of emotional mediation in explaining crossmodal correspondences involving musical stimuli</article-title>. <source>Multisens. Res.</source> <volume>33</volume>, <fpage>1</fpage>&#x02013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1163/22134808-20191469</pub-id><pub-id pub-id-type="pmid">31648195</pub-id></citation></ref>
<ref id="B284">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2020b</year>). <article-title>Olfactory-colour crossmodal correspondences in art, science, and design</article-title>. <source>Cognit. Res.: Princi. Implicat.</source> <volume>5</volume>:<fpage>52</fpage>. <pub-id pub-id-type="doi">10.1186/s41235-020-00246-1</pub-id><pub-id pub-id-type="pmid">33113051</pub-id></citation></ref>
<ref id="B285">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2021</year>). <source>Sensehacking: How to use the Power of Your Senses for Happier, Heathier Living</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Viking Penguin</publisher-name>.</citation>
</ref>
<ref id="B286">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Deroy</surname> <given-names>O.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Crossmodal mental imagery,&#x0201D;</article-title> in <source>Multisensory Imagery: Theory and Applications</source>, eds. S. Lacey and R. Lawson (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name>), <fpage>157</fpage>&#x02013;<lpage>183</lpage>.</citation>
</ref>
<ref id="B287">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Di Stefano</surname> <given-names>N.</given-names></name></person-group> (<year>2022a</year>). <article-title>Coloured hearing, colour music, colour organs, and the search for perceptually meaningful correspondences between colour and pitch</article-title>. <source>i-Perception.</source> <volume>13</volume>, <fpage>1</fpage>&#x02013;<lpage>42</lpage>. <pub-id pub-id-type="doi">10.1177/20416695221092802</pub-id><pub-id pub-id-type="pmid">35572076</pub-id></citation></ref>
<ref id="B288">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Di Stefano</surname> <given-names>N.</given-names></name></person-group> (<year>2022b</year>). <article-title>Crossmodal harmony: Looking for the meaning of harmony beyond hearing</article-title>. <source>i-Perception.</source> <volume>13</volume>, <fpage>1</fpage>&#x02013;<lpage>40</lpage>. <pub-id pub-id-type="doi">10.1177/20416695211073817</pub-id><pub-id pub-id-type="pmid">35186248</pub-id></citation></ref>
<ref id="B289">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Di Stefano</surname> <given-names>N.</given-names></name></person-group> (<year>2024a</year>). <article-title>Sensory translation between audition and vision</article-title>. <source>Psychon. Bull. Rev.</source> <volume>31</volume>, <fpage>599</fpage>&#x02013;<lpage>626</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-023-02343-w</pub-id><pub-id pub-id-type="pmid">37803233</pub-id></citation></ref>
<ref id="B290">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Di Stefano</surname> <given-names>N.</given-names></name></person-group> (<year>2024b</year>). <article-title>What, if anything, can be considered an amodal sensory dimension?</article-title> <source>Psychon. Bull. Rev.</source> <volume>31</volume>, <fpage>1915</fpage>&#x02013;<lpage>1933</lpage>. <pub-id pub-id-type="doi">10.3758/s13423-023-02447-3</pub-id><pub-id pub-id-type="pmid">38381301</pub-id></citation></ref>
<ref id="B291">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Di Stefano</surname> <given-names>N.</given-names></name></person-group> (<year>2025</year>). <article-title>&#x0201C;Gestalt perceptual grouping and crossmodal art,&#x0201D;</article-title> in <source>Handbook of Gestalt-Theoretic Psychology of Art</source>, ed. W. Coppola (<publisher-loc>London</publisher-loc>: <publisher-name>Routledge</publisher-name>), <fpage>202</fpage>&#x02013;<lpage>230</lpage>.</citation>
</ref>
<ref id="B292">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Levitan</surname> <given-names>C. A.</given-names></name></person-group> (<year>2021</year>). <article-title>Explaining crossmodal correspondences between colours and tastes</article-title>. <source>i-Perception.</source> <volume>12</volume>, <fpage>1</fpage>&#x02013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.1177/20416695211018223</pub-id><pub-id pub-id-type="pmid">34211685</pub-id></citation></ref>
<ref id="B293">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Sanabria</surname> <given-names>D.</given-names></name> <name><surname>Soto-Faraco</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>&#x0201C;Intersensory Gestalten and crossmodal scene perception,&#x0201D;</article-title> in <source>Psychology of Beauty and Kansei: New Horizons of Gestalt Perception</source>, ed. K. Noguchi (<publisher-loc>Tokyo</publisher-loc>: <publisher-name>Fuzanbo International</publisher-name>), <fpage>519</fpage>&#x02013;<lpage>579</lpage>.</citation>
</ref>
<ref id="B294">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Sathian</surname> <given-names>K.</given-names></name></person-group> (<year>2020</year>). <article-title>&#x0201C;Audiovisual crossmodal correspondences: Behavioural consequences and neural underpinnings,&#x0201D;</article-title> in <source>Multisensory Perception: From Laboratory to Clinic</source>, eds. K. Sathian and V. S. Ramachandran (<publisher-loc>San Diego, CA</publisher-loc>: <publisher-name>Elsevier</publisher-name>), <fpage>239</fpage>&#x02013;<lpage>258</lpage>.</citation>
</ref>
<ref id="B295">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stechow</surname> <given-names>W.</given-names></name></person-group> (<year>1953</year>). <article-title>Problems of structure in some relations between the visual arts and music</article-title>. <source>J. Aesthet. Art Critic</source>. <volume>11</volume>, <fpage>324</fpage>&#x02013;<lpage>333</lpage>. <pub-id pub-id-type="doi">10.1111/1540_6245.jaac11.4.0324</pub-id><pub-id pub-id-type="pmid">38452409</pub-id></citation></ref>
<ref id="B296">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steffens</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). <article-title>The influence of film music on moral judgments of movie scenes and felt emotions</article-title>. <source>Psychol. Music</source> <volume>48</volume>, <fpage>3</fpage>&#x02013;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.1177/0305735618779443</pub-id></citation>
</ref>
<ref id="B297">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stein</surname> <given-names>B. E.</given-names></name> <name><surname>Burr</surname> <given-names>D.</given-names></name> <name><surname>Constantinides</surname> <given-names>C.</given-names></name> <name><surname>Laurienti</surname> <given-names>P. J.</given-names></name> <name><surname>Meredith</surname> <given-names>A. M.</given-names></name> <name><surname>Perrault</surname> <given-names>T. J.</given-names> <suffix>Jr</suffix></name> <name><surname>Ramachandran</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2010</year>). <article-title>Semantic confusion regarding the development of multisensory integration: a practical solution</article-title>. <source>Eur. J. Neurosci.</source> <volume>31</volume>, <fpage>1713</fpage>&#x02013;<lpage>1720</lpage>. <pub-id pub-id-type="doi">10.1111/j.1460-9568.2010.07206.x</pub-id><pub-id pub-id-type="pmid">20584174</pub-id></citation></ref>
<ref id="B298">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stein</surname> <given-names>B. E.</given-names></name> <name><surname>London</surname> <given-names>N.</given-names></name> <name><surname>Wilkinson</surname> <given-names>L. K.</given-names></name> <name><surname>Price</surname> <given-names>D. P.</given-names></name></person-group> (<year>1996</year>). <article-title>Enhancement of perceived visual intensity by auditory stimuli: a psychophysical analysis</article-title>. <source>J. Cogn. Neurosci.</source> <volume>8</volume>, <fpage>497</fpage>&#x02013;<lpage>506</lpage>. <pub-id pub-id-type="doi">10.1162/jocn.1996.8.6.497</pub-id><pub-id pub-id-type="pmid">23961981</pub-id></citation></ref>
<ref id="B299">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sullivan</surname> <given-names>J. W. N.</given-names></name></person-group> (<year>1914</year>). <article-title>An organ on which color compositions are played. The new art of color music and its mechanism</article-title>. <source>Scient. Am.</source> <volume>110</volume>, <fpage>16</fpage><volume>3</volume>:<fpage>170</fpage>. <pub-id pub-id-type="doi">10.1038/scientificamerican02211914-163</pub-id></citation>
</ref>
<ref id="B300">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>X.</given-names></name> <name><surname>Ji</surname> <given-names>L.</given-names></name> <name><surname>Han</surname> <given-names>F.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>An extended research of crossmodal correspondence between color and sound in psychology and cognitive ergonomics</article-title>. <source>PeerJ</source> <volume>6</volume>:<fpage>e4443</fpage>. <pub-id pub-id-type="doi">10.7717/peerj.4443</pub-id><pub-id pub-id-type="pmid">29507834</pub-id></citation></ref>
<ref id="B301">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Svartdal</surname> <given-names>F.</given-names></name> <name><surname>Iversen</surname> <given-names>T.</given-names></name></person-group> (<year>1989</year>). <article-title>Consistency in synesthetic experience to vowels and consonants: five case studies</article-title>. <source>Scand. J. Psychol.</source> <volume>30</volume>, <fpage>220</fpage>&#x02013;<lpage>227</lpage>. <pub-id pub-id-type="doi">10.1111/j.1467-9450.1989.tb01084.x</pub-id><pub-id pub-id-type="pmid">2587966</pub-id></citation></ref>
<ref id="B302">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Takeshima</surname> <given-names>Y.</given-names></name> <name><surname>Gyoba</surname> <given-names>J.</given-names></name></person-group> (<year>2013</year>). <article-title>Changing pitch of sounds alters perceived visual motion trajectory</article-title>. <source>Multisens. Res.</source> <volume>26</volume>, <fpage>317</fpage>&#x02013;<lpage>332</lpage>. <pub-id pub-id-type="doi">10.1163/22134808-00002422</pub-id><pub-id pub-id-type="pmid">24319926</pub-id></citation></ref>
<ref id="B303">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Tan</surname> <given-names>S. L.</given-names></name> <name><surname>Cohen</surname> <given-names>A. J.</given-names></name> <name><surname>Lipscomb</surname> <given-names>S. D.</given-names></name> <name><surname>Kendall</surname> <given-names>R. A.</given-names></name></person-group> (<year>2013</year>). <source>The Psychology of Music in Multimedia.</source> <publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation>
</ref>
<ref id="B304">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Tan</surname> <given-names>S. L.</given-names></name> <name><surname>Pfordresher</surname> <given-names>P.</given-names></name> <name><surname>Harr&#x000E9;</surname> <given-names>R.</given-names></name></person-group> (<year>2017</year>). <source>Psychology of Music: From Sound to Significance</source> (<publisher-loc>London</publisher-loc>: <publisher-name>Routledge</publisher-name>).</citation>
</ref>
<ref id="B305">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tan</surname> <given-names>S. L.</given-names></name> <name><surname>Spackman</surname> <given-names>M. P.</given-names></name> <name><surname>Bezdek</surname> <given-names>M. A.</given-names></name></person-group> (<year>2007</year>). <article-title>Viewers&#x00027; interpretations of film characters&#x00027; emotions: effects of presenting film music before or after a character is shown</article-title>. <source>Music Percept.</source> <volume>25</volume>, <fpage>135</fpage>&#x02013;<lpage>152</lpage>. <pub-id pub-id-type="doi">10.1525/mp.2007.25.2.135</pub-id></citation>
</ref>
<ref id="B306">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thayer</surname> <given-names>J. F.</given-names></name> <name><surname>Levenson</surname> <given-names>R. W.</given-names></name></person-group> (<year>1983</year>). <article-title>Effects of music on psychophysiological responses to a stressful film</article-title>. <source>Psychomusicology</source> <volume>3</volume>, <fpage>44</fpage>&#x02013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1037/h0094256</pub-id></citation>
</ref>
<ref id="B307">
<citation citation-type="web"><person-group person-group-type="author"><collab>The Getty Research Institute</collab></person-group> (<year>2013</year>). <source>Philipp Otto Runge&#x00027;s Times of Day</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.getty.edu/research/special_collections/notable/runge.html">http://www.getty.edu/research/special_collections/notable/runge.html</ext-link> (Accessed June 15, 2025).</citation>
</ref>
<ref id="B308">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Timmers</surname> <given-names>R.</given-names></name></person-group> (<year>2022</year>). <article-title>&#x0201C;Cross-modality and embodiment of tempo and timing,&#x0201D;</article-title> in <source>The Oxford Handbook of Time in Music</source>, eds. M. Doffman, E. Payne, and T. Young (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>215</fpage>&#x02013;<lpage>234</lpage>.</citation>
</ref>
<ref id="B309">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Triarhou</surname> <given-names>L. C.</given-names></name></person-group> (<year>2016</year>). <article-title>Neuromusicology or musiconeurology? &#x02018;Omni-art&#x00027; in Alexander Scriabin as a fount of ideas</article-title>. <source>Front. Psychol.</source><volume>7</volume>:<fpage>364</fpage>. <pub-id pub-id-type="doi">10.3389/fpsyg.2016.00364</pub-id><pub-id pub-id-type="pmid">27014167</pub-id></citation></ref>
<ref id="B310">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Truckenbrod</surname> <given-names>J.</given-names></name></person-group> (<year>1992</year>). <article-title>Integrated creativity: Transcending the boundaries of visual art, music and literature</article-title>. <source><italic>Leonardo Music J</italic>.</source> <volume>2</volume>, <fpage>89</fpage>&#x02013;<lpage>95</lpage>. <pub-id pub-id-type="doi">10.2307/1513214</pub-id></citation>
</ref>
<ref id="B311">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Vanechkina</surname> <given-names>I.</given-names></name></person-group> (<year>1968</year>). <article-title>&#x0201C;A r&#x000E9;sum&#x000E9; of inquiry on consistencies in &#x02018;colored hearing&#x00027; among members of composers&#x00027; association of the USSR,&#x0201D;</article-title> in <source>Papers of VI All-Union Acoustic Conference</source> (<publisher-loc>Moscow</publisher-loc>: <publisher-name>Academy of Sciences of the U.S.S.R</publisher-name>).</citation>
</ref>
<ref id="B312">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Vanechkina</surname> <given-names>I.</given-names></name></person-group> (<year>1973</year>). <article-title>&#x0201C;Soviet musicians and light-music,&#x0201D;</article-title> in <source>The Art of Luminous Sounds</source> (<publisher-loc>Kazan, Russia</publisher-loc>: <publisher-name>KAI</publisher-name>), <fpage>89</fpage>&#x02013;<lpage>110</lpage>.</citation>
</ref>
<ref id="B313">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Veldhorst</surname> <given-names>N.</given-names></name></person-group> (<year>2018</year>). <source>Van Gogh and Music: A Symphony of Blue and Yellow</source>. <publisher-loc>New Haven</publisher-loc>: <publisher-name>Yale University Press</publisher-name>.</citation>
</ref>
<ref id="B314">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Vergo</surname> <given-names>P.</given-names></name></person-group> (<year>2012</year>). <source>The Music of Painting: Music, Modernism, and the Visual Arts from the Romantics to John Cage</source>. <publisher-loc>London, UK</publisher-loc>: <publisher-name>Phaidon</publisher-name>.</citation>
</ref>
<ref id="B315">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vuoskowski</surname> <given-names>J.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Thompson</surname> <given-names>M.</given-names></name> <name><surname>Clarke</surname> <given-names>E.</given-names></name></person-group> (<year>2014</year>). <article-title>Cross-modal interactions in the perception of expressivity in musical performance</article-title>. <source>Attent. Percep. Psychophys.</source> <volume>76</volume>, <fpage>591</fpage>&#x02013;<lpage>604</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-013-0582-2</pub-id><pub-id pub-id-type="pmid">24233641</pub-id></citation></ref>
<ref id="B316">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vuoskowski</surname> <given-names>J.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name> <name><surname>Thompson</surname> <given-names>M.</given-names></name> <name><surname>Clarke</surname> <given-names>E.</given-names></name></person-group> (<year>2016</year>). <article-title>Interaction of sight and sound in the perception and experience of musical performance</article-title>. <source>Music Percept.</source> <volume>33</volume>, <fpage>457</fpage>&#x02013;<lpage>471</lpage>. <pub-id pub-id-type="doi">10.1525/mp.2016.33.4.457</pub-id></citation>
</ref>
<ref id="B317">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wagner</surname> <given-names>S.</given-names></name> <name><surname>Winner</surname> <given-names>E.</given-names></name> <name><surname>Cicchetti</surname> <given-names>D.</given-names></name> <name><surname>Gardner</surname> <given-names>H.</given-names></name></person-group> (<year>1981</year>). <article-title>&#x0201C;Metaphorical&#x0201D; mapping in human infants</article-title>. <source>Child Dev.</source> <volume>52</volume>, <fpage>728</fpage>&#x02013;<lpage>731</lpage>. <pub-id pub-id-type="doi">10.2307/1129200</pub-id></citation>
</ref>
<ref id="B318">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Walker</surname> <given-names>E.</given-names></name></person-group> (<year>1927</year>). <source>Das musikalische Erlebnis und seine Entwicklung</source>. <publisher-loc>Gottingen, Germany</publisher-loc>: <publisher-name>Vandenhoeck and Ruprecht</publisher-name>.</citation>
</ref>
<ref id="B319">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walker</surname> <given-names>P.</given-names></name></person-group> (<year>2012</year>). <article-title>Cross-sensory correspondences and cross talk between dimensions of connotative meaning: visual angularity is hard, high-pitched, and bright</article-title>. <source>Attent. Percep. Psychophys.</source> <volume>74</volume>, <fpage>1792</fpage>&#x02013;<lpage>1809</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-012-0341-9</pub-id><pub-id pub-id-type="pmid">22833281</pub-id></citation></ref>
<ref id="B320">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walker</surname> <given-names>R.</given-names></name></person-group> (<year>1987</year>). <article-title>The effects of culture, environment, age, and musical training on choices of visual metaphors for sound</article-title>. <source>Percept. Psychophys.</source> <volume>42</volume>, <fpage>491</fpage>&#x02013;<lpage>502</lpage>. <pub-id pub-id-type="doi">10.3758/BF03209757</pub-id><pub-id pub-id-type="pmid">2447557</pub-id></citation></ref>
<ref id="B321">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Walker-Andrews</surname> <given-names>A.</given-names></name></person-group> (<year>1994</year>). <article-title>&#x0201C;Taxonomy for intermodal relations,&#x0201D;</article-title> in <source>The Development of Intersensory Perception: Comparative Perspectives</source>, eds. D. J. Lewkowicz and R. Lickliter (<publisher-loc>Hillsdale, NJ</publisher-loc>: <publisher-name>Lawrence Erlbaum</publisher-name>), <fpage>39</fpage>&#x02013;<lpage>56</lpage>.</citation>
</ref>
<ref id="B322">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wallmark</surname> <given-names>Z.</given-names></name></person-group> (<year>2019</year>). <article-title>Semantic crosstalk in timbre perception</article-title>. <source>Music Sci.</source> <volume>2</volume>, <fpage>1</fpage>&#x02013;<lpage>18</lpage>. <pub-id pub-id-type="doi">10.1177/2059204319846617</pub-id></citation>
</ref>
<ref id="B323">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wallmark</surname> <given-names>Z.</given-names></name> <name><surname>Allen</surname> <given-names>S. E.</given-names></name></person-group> (<year>2020</year>). <article-title>Preschoolers&#x00027; crossmodal mappings of timbre</article-title>. <source>Attent. Percep. Psychophys.</source> <volume>82</volume>, <fpage>2230</fpage>&#x02013;<lpage>2236</lpage>. <pub-id pub-id-type="doi">10.3758/s13414-020-02015-0</pub-id><pub-id pub-id-type="pmid">32166645</pub-id></citation></ref>
<ref id="B324">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wallmark</surname> <given-names>Z.</given-names></name> <name><surname>Nghiem</surname> <given-names>L.</given-names></name> <name><surname>Marks</surname> <given-names>L. E.</given-names></name></person-group> (<year>2021</year>). <article-title>Does timbre modulate visual perception? Exploring crossmodal interactions</article-title>. <source>Music Perception</source> <volume>39</volume>, <fpage>1</fpage>&#x02013;<lpage>20</lpage>. <pub-id pub-id-type="doi">10.1525/mp.2021.39.1.1</pub-id></citation>
</ref>
<ref id="B325">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wanke</surname> <given-names>R.</given-names></name> <name><surname>Ansani</surname> <given-names>A.</given-names></name> <name><surname>Di Stefano</surname> <given-names>N.</given-names></name> <name><surname>Spence</surname> <given-names>C.</given-names></name></person-group> (<year>2025</year>). <article-title>Exploring auditory morphodynamics: audio-visual associations in sound-based music</article-title>. <source>i-Perception.</source> <volume>16</volume>, <fpage>1</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.1177/20416695251338718</pub-id><pub-id pub-id-type="pmid">40606057</pub-id></citation></ref>
<ref id="B326">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Watanabe</surname> <given-names>K.</given-names></name> <name><surname>Shimojo</surname> <given-names>S.</given-names></name></person-group> (<year>2001</year>). <article-title>When sound affects vision: Effects of auditory grouping on visual motion perception</article-title>. <source>Psychol. Sci.</source> <volume>12</volume>, <fpage>109</fpage>&#x02013;<lpage>116</lpage>. <pub-id pub-id-type="doi">10.1111/1467-9280.00319</pub-id><pub-id pub-id-type="pmid">11340918</pub-id></citation></ref>
<ref id="B327">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Waterworth</surname> <given-names>J. A.</given-names></name></person-group> (<year>1997</year>). <article-title>Creativity and sensation: The case for synaesthetic media</article-title>. <source>Leonardo</source> <volume>30</volume>, <fpage>327</fpage>&#x02013;<lpage>330</lpage>. <pub-id pub-id-type="doi">10.2307/1576481</pub-id></citation>
</ref>
<ref id="B328">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Watkins</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Creating affective visual music,&#x0201D;</article-title> in <source>EVA &#x00027;18: Proceedings of the Conference on Electronic Visualisation and the Arts</source>, <fpage>374</fpage>&#x02013;<lpage>381</lpage>. <pub-id pub-id-type="doi">10.14236/ewic/EVA2018.70</pub-id></citation>
</ref>
<ref id="B329">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wehner</surname> <given-names>W. L.</given-names></name></person-group> (<year>1966</year>). <article-title>The relation between six paintings by Paul Klee and selected musical compositions</article-title>. <source>J. Res. Music Educ.</source> <volume>14</volume>, <fpage>220</fpage>&#x02013;<lpage>224</lpage>. <pub-id pub-id-type="doi">10.2307/3344055</pub-id></citation>
</ref>
<ref id="B330">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Welch</surname> <given-names>R. B.</given-names></name> <name><surname>DuttonHurt</surname> <given-names>L. D.</given-names></name> <name><surname>Warren</surname> <given-names>D. H.</given-names></name></person-group> (<year>1986</year>). <article-title>Contributions of audition and vision to temporal rate perception</article-title>. <source>Percept. Psychophys.</source> <volume>39</volume>, <fpage>294</fpage>&#x02013;<lpage>300</lpage>. <pub-id pub-id-type="doi">10.3758/BF03204939</pub-id><pub-id pub-id-type="pmid">3737359</pub-id></citation></ref>
<ref id="B331">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Welch</surname> <given-names>R. B.</given-names></name> <name><surname>Warren</surname> <given-names>D. H.</given-names></name></person-group> (<year>1980</year>). <article-title>Immediate perceptual response to intersensory discrepancy</article-title>. <source>Psychol. Bull.</source> <volume>88</volume>, <fpage>638</fpage>&#x02013;<lpage>667</lpage>. <pub-id pub-id-type="doi">10.1037/0033-2909.88.3.638</pub-id></citation>
</ref>
<ref id="B332">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Welch</surname> <given-names>R. B.</given-names></name> <name><surname>Warren</surname> <given-names>D. H.</given-names></name></person-group> (<year>1986</year>). <article-title>&#x0201C;Intersensory interactions,&#x0201D;</article-title> in <source>Handbook of Perception and Performance: Vol. 1. Sensory Processes and Perception</source>, eds. K. R. Boff, L. Kaufman, and J. P. Thomas (<publisher-loc>New York, NY</publisher-loc>: <publisher-name>Wiley</publisher-name>).</citation>
</ref>
<ref id="B333">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wells</surname> <given-names>A.</given-names></name></person-group> (<year>1980</year>). <article-title>Music and visual color: a proposed correlation</article-title>. <source>Leonardo</source> <volume>13</volume>, <fpage>101</fpage>&#x02013;<lpage>107</lpage>. <pub-id pub-id-type="doi">10.2307/1577978</pub-id><pub-id pub-id-type="pmid">36562072</pub-id></citation></ref>
<ref id="B334">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Whiteford</surname> <given-names>K. L.</given-names></name> <name><surname>Schloss</surname> <given-names>K. B.</given-names></name> <name><surname>Helwig</surname> <given-names>N. E.</given-names></name> <name><surname>Palmer</surname> <given-names>S. E.</given-names></name></person-group> (<year>2018</year>). <article-title>Color, music, and emotion: Bach to the blues</article-title>. <source>i-Perception.</source> <volume>9</volume>, <fpage>1</fpage>&#x02013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.1177/2041669518808535</pub-id><pub-id pub-id-type="pmid">30479734</pub-id></citation></ref>
<ref id="B335">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Whitelaw</surname> <given-names>M.</given-names></name></person-group> (<year>2008</year>). <article-title>Synesthesia and cross-modality in contemporary audiovisuals</article-title>. <source>Senses Soc.</source> <volume>3</volume>, <fpage>259</fpage>&#x02013;<lpage>276</lpage>. <pub-id pub-id-type="doi">10.2752/174589308X331314</pub-id></citation>
</ref>
<ref id="B336">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Whitney</surname> <given-names>J.</given-names></name></person-group> (<year>1980</year>). <source>Digital Harmony: On the Complementarity of Music and Visual Art</source>. <publisher-loc>Peterborough, NH</publisher-loc>: <publisher-name>Byte Books</publisher-name>.</citation>
</ref>
<ref id="B337">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wicker</surname> <given-names>F. W.</given-names></name></person-group> (<year>1968</year>). <article-title>Mapping the intersensory regions of perceptual space</article-title>. <source>Am. J. Psychol.</source> <volume>81</volume>, <fpage>178</fpage>&#x02013;<lpage>188</lpage>. <pub-id pub-id-type="doi">10.2307/1421262</pub-id><pub-id pub-id-type="pmid">5747961</pub-id></citation></ref>
<ref id="B338">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Witztum</surname> <given-names>E.</given-names></name> <name><surname>Lerner</surname> <given-names>V.</given-names></name></person-group> (<year>2016</year>). <article-title>Alexander Nicolaevich Scriabin (1872-1915): Enlightenment or illness?</article-title> <source>J. Med. Biogr.</source> <volume>24</volume>, <fpage>331</fpage>&#x02013;<lpage>338</lpage>. <pub-id pub-id-type="doi">10.1177/0967772014537151</pub-id><pub-id pub-id-type="pmid">24906401</pub-id></citation></ref>
<ref id="B339">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wrembel</surname> <given-names>M.</given-names></name></person-group> (<year>2009</year>). <article-title>On hearing colours&#x02014;cross-modal associations in vowel perception in a non-synaesthetic population</article-title>. <source>Pozna&#x00144; Studi. Contemp. Linguist.</source><volume>45</volume>, <fpage>595</fpage>&#x02013;<lpage>612</lpage>. <pub-id pub-id-type="doi">10.2478/v10010-009-0028-0</pub-id></citation>
</ref>
<ref id="B340">
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Young</surname> <given-names>D.</given-names></name></person-group> (<year>2005</year>). <article-title>The smell of greenness: Cultural synaesthesia in the Western desert</article-title>. <source>Etnofoor</source> <volume>18</volume>, <fpage>61</fpage>&#x02013;<lpage>77</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.jstor.org/stable/25758086">https://www.jstor.org/stable/25758086</ext-link></citation>
</ref>
<ref id="B341">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhou</surname> <given-names>L.</given-names></name> <name><surname>Jiang</surname> <given-names>C.</given-names></name> <name><surname>Delogu</surname> <given-names>F.</given-names></name> <name><surname>Yang</surname> <given-names>Y.</given-names></name></person-group> (<year>2014</year>). <article-title>Spatial conceptual associations between music and pictures as revealed by N400 effect</article-title>. <source>Psychophysiology</source> <volume>51</volume>, <fpage>520</fpage>&#x02013;<lpage>528</lpage>. <pub-id pub-id-type="doi">10.1111/psyp.12195</pub-id><pub-id pub-id-type="pmid">24611598</pub-id></citation></ref>
<ref id="B342">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zika</surname> <given-names>F.</given-names></name></person-group> (<year>2013</year>). <article-title>&#x0201C;Color and sound: Transcending the limits of the senses,&#x0201D;</article-title> in <source>Light, Image, Imagination</source>, eds. M. Blassnigg, G. Deutsch, and H. Schimek (<publisher-loc>Amsterdam</publisher-loc>: <publisher-name>Amsterdam University Press</publisher-name>), <fpage>29</fpage>&#x02013;<lpage>46</lpage>.</citation>
</ref>
<ref id="B343">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zika</surname> <given-names>F.</given-names></name></person-group> (<year>2018</year>). <article-title>&#x0201C;Colour and sound: transcending the limits of the senses,&#x0201D;</article-title> in <source>Senses and Sensation: Critical and Primary Sources, Vol. 2: History and Sociology</source>, ed. D. Howes (<publisher-loc>Abingdon</publisher-loc>: <publisher-name>Routledge</publisher-name>), <fpage>303</fpage>&#x02013;<lpage>316</lpage>.</citation>
</ref>
<ref id="B344">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zilczer</surname> <given-names>J.</given-names></name></person-group> (<year>1987</year>). <article-title>&#x0201C;Color music&#x0201D;: Synaesthesia and nineteenth-century sources for abstract art</article-title>. <source>Artibus et Historiae</source> <volume>8</volume>, <fpage>101</fpage>&#x02013;<lpage>126</lpage>. <pub-id pub-id-type="doi">10.2307/1483303</pub-id></citation>
</ref>
<ref id="B345">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Zilczer</surname> <given-names>J.</given-names></name></person-group> (<year>2016</year>). <article-title>&#x0201C;American Rhapsody: From Modern to Postmodern in visual music,&#x0201D;</article-title> in <source>The Oxford Handbook of Sound and Image in Western Art</source>, ed. T. Kaduri (<publisher-loc>Oxford, UK</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>), <fpage>17</fpage>&#x02013;<lpage>35</lpage>.</citation>
</ref>
</ref-list>
</back>
</article> 