<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Neurosci.</journal-id>
<journal-title>Frontiers in Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1662-453X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnins.2023.1258024</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neuroscience</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>A general dual-pathway network for EEG denoising</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Xiong</surname>
<given-names>Wenjing</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/2376279/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/data-curation/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/software/"/>
<role content-type="https://credit.niso.org/contributor-roles/validation/"/>
<role content-type="https://credit.niso.org/contributor-roles/visualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Ma</surname>
<given-names>Lin</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/resources/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Li</surname>
<given-names>Haifeng</given-names>
</name>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/900653/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/funding-acquisition/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/resources/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
</contrib-group>
<aff><institution>Faculty of Computing, Harbin Institute of Technology</institution>, <addr-line>Harbin</addr-line>, <country>China</country></aff>
<author-notes>
<fn id="fn0001" fn-type="edited-by"><p>Edited by: Prasanna Parvathaneni, Flagship Biosciences, Inc, United States</p></fn>
<fn id="fn0002" fn-type="edited-by"><p>Reviewed by: Tuba Akt&#x00FC;rk, Istanbul Medipol University, T&#x00FC;rkiye</p>
<p>Th&#x00E9;odore Papadopoulo, Research Centre Inria Sophia Antipolis M&#x00E9;diterran&#x00E9;e, France</p></fn>
<corresp id="c001">&#x002A;Correspondence: Haifeng Li, <email>lihaifeng@hit.edu.cn</email></corresp>
</author-notes>
<pub-date pub-type="epub">
<day>24</day>
<month>01</month>
<year>2024</year>
</pub-date>
<pub-date pub-type="collection">
<year>2023</year>
</pub-date>
<volume>17</volume>
<elocation-id>1258024</elocation-id>
<history>
<date date-type="received">
<day>13</day>
<month>07</month>
<year>2023</year>
</date>
<date date-type="accepted">
<day>29</day>
<month>12</month>
<year>2023</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2024 Xiong, Ma and Li.</copyright-statement>
<copyright-year>2024</copyright-year>
<copyright-holder>Xiong, Ma and Li</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<sec id="sec1001">
<title>Introduction</title>
<p>Scalp electroencephalogram (EEG) analysis and interpretation are crucial for tracking and analyzing brain activity. The collected scalp EEG signals, however, are weak and frequently tainted with various sorts of artifacts. The models based on deep learning provide comparable performance with that of traditional techniques. However, current deep learning networks applied to scalp EEG noise reduction are large in scale and suffer from overfitting.</p>
</sec>
<sec id="sec2001">
<title>Methods</title>
<p>Here, we propose a dual-pathway autoencoder modeling framework named DPAE for scalp EEG signal denoising and demonstrate the superiority of the model on multi-layer perceptron (MLP), convolutional neural network (CNN) and recurrent neural network (RNN), respectively. We validate the denoising performance on benchmark scalp EEG artifact datasets.</p>
</sec>
<sec id="sec3001">
<title>Results</title>
<p>The experimental results show that our model architecture not only significantly reduces the computational effort but also outperforms existing deep learning denoising algorithms in root relative mean square error (RRMSE)metrics, both in the time and frequency domains.</p>
</sec>
<sec id="sec4001">
<title>Discussion</title>
<p>The DPAE architecture does not require a priori knowledge of the noise distribution nor is it limited by the network layer structure, which is a general network model oriented toward blind source separation.</p>
</sec>
</abstract>
<kwd-group>
<kwd>EEG denoising</kwd>
<kwd>dual-pathway structure</kwd>
<kwd>general network model</kwd>
<kwd>light weight autoencoder</kwd>
<kwd>blind source separation</kwd>
</kwd-group>
<counts>
<fig-count count="9"/>
<table-count count="2"/>
<equation-count count="8"/>
<ref-count count="44"/>
<page-count count="12"/>
<word-count count="7542"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Brain Imaging Methods</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="sec1">
<label>1</label>
<title>Introduction</title>
<p>Scalp electroencephalogram (EEG) is a time-varying non-linear and non-stationary physiological signal with weak amplitude and is highly susceptible to contamination by extraneous noise, resulting in various artifacts (<xref ref-type="bibr" rid="ref30">Sanei and Chambers, 2013</xref>). The effective removal of artifacts in scalp EEG and the preservation of the pristine scalp EEG information are of great significance for brain science research and clinical applications (<xref ref-type="bibr" rid="ref38">Wolpaw et al., 1991</xref>; <xref ref-type="bibr" rid="ref26">Oberman et al., 2005</xref>; <xref ref-type="bibr" rid="ref4">Cai et al., 2020</xref>). Artifacts can be classified into two categories: physiological artifacts and non-physiological artifacts (<xref ref-type="bibr" rid="ref35">Urig&#x00FC;en and Garcia-Zapirain, 2015</xref>). Physiological artifacts are usually caused by the activity of body parts close to the head, such as the eyes (<xref ref-type="bibr" rid="ref6">Croft and Barry, 2000</xref>), muscles (<xref ref-type="bibr" rid="ref24">Muthukumaraswamy, 2013</xref>), and heart (<xref ref-type="bibr" rid="ref8">Dirlich et al., 1997</xref>). In addition, the relative motion of the subject&#x2019;s head and the electrode cap can produce physiological artifacts (<xref ref-type="bibr" rid="ref27">Oliveira et al., 2016</xref>). Non-physiological artifacts are usually caused by environmental and equipment factors, such as whether the electrodes are in good contact with the scalp, whether the acquisition equipment is in good operation, and utility interference. Non-physiological artifacts can be reduced by acquiring in a shielded room, filtering, and subject self-suppression of the head and other body movements. In contrast, the physiological artifacts of electrooculogram (EOG) and electromyogram (EMG) are difficult to remove by simple filtering methods due to the irregularity of their movements and the frequency band overlap with the commonly used scalp EEG rhythm signals and need to be processed by suitable artifact removal algorithms.</p>
<p>Classical artifact removal methods include independent component analysis (ICA) (<xref ref-type="bibr" rid="ref17">Jung et al., 2000</xref>; <xref ref-type="bibr" rid="ref34">Ullsperger and Debener, 2010</xref>), wavelet transform (WT) (<xref ref-type="bibr" rid="ref13">Hamaneh et al., 2014</xref>; <xref ref-type="bibr" rid="ref44">Zhao et al., 2014</xref>), empirical mode decomposition (EMD) (<xref ref-type="bibr" rid="ref15">Huang et al., 1998</xref>; <xref ref-type="bibr" rid="ref29">Patel et al., 2016</xref>), and common average reference (CAR) (<xref ref-type="bibr" rid="ref20">Ludwig et al., 2009</xref>). Among them, the hybrid optimization method is a newer research hotspot for scalp EEG denoising, which mainly focuses on EEG noise reduction by combining the classical algorithms mentioned above to extract different artifact component features. <xref ref-type="bibr" rid="ref21">Mahajan and Morshed (2015)</xref> combined blind source separation and wavelet transform, which avoids distortion in reconstructing scalp EEG signals due to the removal of useful EEG information. <xref ref-type="bibr" rid="ref2">Bono et al. (2014)</xref> proposed the WPT-EMD and WPT-ICA algorithms by combining the concepts of wavelet packet transform (WPT), EMD, and ICA, which has a significant increase in artifact removal ability using signal-to-noise ratio as a measure. <xref ref-type="bibr" rid="ref5">Chen et al. (2014)</xref> combined the ensemble empirical mode decomposition (EEMD) and the joint blind source separation (JBSS) for muscular artifact cancelation in single-channel EEG recordings, which demonstrated the performance of these single-channel solutions. <xref ref-type="bibr" rid="ref16">Islam et al. (2020)</xref> proposed an approach based on ICA to remove motion-related artifacts in the context of epilepsy and succeeded in improving the accuracy of epilepsy detection and prediction.</p>
<p>The advantages of applying the classical scalp EEG signal denoising methods mentioned above include high interpretability and good removal of specific artifacts, but the computational complexity is large and the generalization and robustness of the methods are not strong. The advantages of the generalization and robustness of deep learning are rapidly making it become a new development direction for scalp EEG noise reduction methods (<xref ref-type="bibr" rid="ref23">Mumtaz et al., 2021</xref>). <xref ref-type="bibr" rid="ref39">Yang et al. (2018)</xref> proposed a shallow multilayer perceptron (MLP) network for removing scalp EEG artifacts, which applies to the automatic removal of online scalp EEG by training the network offline, with the advantage of channel-independence and strong generalization ability. <xref ref-type="bibr" rid="ref1">Anon Hanrahan (2019)</xref> used the one-dimensional convolutional neural network (CNN) to reduce noise in scalp EEG signals. Based on the signal-to-noise ratio (SNR) before and after denoising, this method has successfully surpassed principal component analysis (PCA). The complex CNN network structure of EEGdenoiseNet proposed by <xref ref-type="bibr" rid="ref43">Zhang et al. (2020)</xref> outperformed the EMD algorithm in denoising the standardized EEG artifact dataset constructed by them. <xref ref-type="bibr" rid="ref33">Sun et al. (2020)</xref> used a one-dimensional residual convolutional neural network (1D-ResCNN) model for denoising the raw time-domain waveforms of scalp EEG, which was effective in removing ocular artifacts but ineffective for myogenic artifacts. <xref ref-type="bibr" rid="ref41">Zhang et al. (2021)</xref> proposed a new CNN network structure combining progressively increasing feature dimensions and downsampling for EMG denoising. The results show that structure design can help avoid overfitting and perform better.</p>
<p>In general, the removal of scalp EEG artifacts by deep learning is not controlled by the number of electrodes and does not require <italic>a priori</italic> knowledge of physiological artifact reference channels or some artifact signals, but due to the complexity and multiple sources of artifact signals, achieving good results for both ocular and myogenic artifacts using existing deep learning methods is difficult. These deep learning network models are complex and computationally intensive. In light of the recent explosion in portable BCI (brain-computer interfaces) (<xref ref-type="bibr" rid="ref12">Guneysu and Akin, 2013</xref>; <xref ref-type="bibr" rid="ref25">Nakanishi et al., 2017</xref>; <xref ref-type="bibr" rid="ref42">Zhang et al., 2019</xref>; <xref ref-type="bibr" rid="ref37">Wang et al., 2023</xref>), we suggested a lightweight scalp EEG noise reduction network model based on autoencoders. Autoencoders were first introduced by <xref ref-type="bibr" rid="ref14">Hinton and Salakhutdinov (2006)</xref> as neural network trainers to reconstruct its input and are widely used in computer vision (<xref ref-type="bibr" rid="ref32">Sinha et al., 2018</xref>), audio processing (<xref ref-type="bibr" rid="ref19">Leglaive et al., 2020</xref>), and natural language processing (<xref ref-type="bibr" rid="ref18">Lebret and Collobert, 2015</xref>). To alleviate the problem that the classic autoencoder tends to be overfitted, <xref ref-type="bibr" rid="ref36">Vincent et al. (2010)</xref> proposed the denoising autoencoder. The denoising autoencoder uses data corrupted by noise to reconstruct the original data, which makes the abstract features more robust. Here, we modify the encoder module of the noise-reducing self-encoder. Instead of stacked encoding layers, we design parallel dual-pathway network layers with different neuron shrinkage ratios and combine them with a symmetry fusion block module. In this way, we construct a robust feature extraction encoder to remove the non-linear and multi-source scalp EEG artifact noise features and obtain a high-level representation of clean EEG. Key contributions of this work are summarized by:</p>
<list list-type="bullet">
<list-item><p>A general dual-pathway network structure separately was built using MLP, CNN, and RNN for automatic scalp EEG artifact removal and we demonstrate that it outperforms other deep learning methods.</p></list-item>
<list-item><p>Modeling of scalp EEG contaminated with artifacts based on coupled systems.</p></list-item>
<list-item><p>Blind source separation without <italic>a priori</italic> knowledge of the source signal.</p></list-item>
</list>
<p>The rest of the paper is organized as follows. Section 2 presents the scalp EEG artifact coupled system model, the dual-pathway autoencoder network separately constructed using MLP, CNN, and RNN, and the benchmark scalp EEG artifact dataset. Section 3 presents different evaluation metrics, whose results show that our approach outperforms other deep learning denoising methods. Also, we conducted ablation experiments for the proposed new network structure and validated the denoising performance on multichannel real EEG data. Concluding remarks follow in Section 4.</p>
</sec>
<sec sec-type="materials|methods" id="sec2">
<label>2</label>
<title>Methods and materials</title>
<sec id="sec3">
<label>2.1</label>
<title>The scalp EEG artifact hypothesis</title>
<p>As a time-varying non-linear and non-stationary signal, scalp EEG is susceptible to exogenous artifacts and physiological artifacts. Physiological artifacts are caused by a variety of internal body sources: EOG, EMG, electrocardiographic (ECG), etc. Traditional methods assume that the artifacts and scalp EEG signals are statistically independent of each other and that the signals we observe are composed of linear combinations of source signals as <xref ref-type="disp-formula" rid="EQ1">equation (1)</xref> (<xref ref-type="bibr" rid="ref28">Parra et al., 2005</xref>), where <inline-formula><mml:math id="M1"><mml:mi>f</mml:mi></mml:math></inline-formula> denotes the loss function of the transmission medium (skull) to the clean EEG signal and <inline-formula><mml:math id="M2"><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:math></inline-formula> denotes the loss function of the transmission medium (skin surface) to each artifact signal.</p>
<disp-formula id="EQ1"><label>(1)</label><mml:math id="M3"><mml:mtext mathvariant="italic">Signal</mml:mtext><mml:mo>=</mml:mo><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>e</mml:mi><mml:mi>e</mml:mi><mml:mi>g</mml:mi></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mstyle displaystyle="true"><mml:munderover><mml:mo stretchy="true">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover></mml:mstyle><mml:msub><mml:mi>&#x03BC;</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x2217;</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:mtext mathvariant="italic">artifac</mml:mtext><mml:msub><mml:mi>t</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:math></disp-formula>
<p>In reality, however, there is a bidirectional contamination between scalp EEG and artifacts, forming a coupled system (<xref ref-type="bibr" rid="ref7">David and Friston, 2003</xref>). Exogenous artifacts can be removed by adjusting the acquisition environment and filtering. In contrast, endogenous artifacts and clean scalp EEG activity signals are interacting with each other during transmission through the scalp to the electrodes and are more difficult to remove. This can be modeled as <xref ref-type="disp-formula" rid="EQ2">equation (2)</xref>:</p>
<disp-formula id="EQ2"><label>(2)</label><mml:math id="M4"><mml:mtext mathvariant="italic">Signal</mml:mtext><mml:mo>=</mml:mo><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>e</mml:mi><mml:mi>e</mml:mi><mml:mi>g</mml:mi><mml:mo>+</mml:mo><mml:mstyle displaystyle="true"><mml:munderover><mml:mo stretchy="true">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>M</mml:mi></mml:munderover></mml:mstyle><mml:msub><mml:mi>&#x03BC;</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x2217;</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:mtext mathvariant="italic">artifac</mml:mtext><mml:msubsup><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mtext mathvariant="italic">endo</mml:mtext></mml:msubsup></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>+</mml:mo><mml:mstyle displaystyle="true"><mml:munderover><mml:mo stretchy="true">&#x2211;</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:munderover></mml:mstyle><mml:msub><mml:mi>&#x03B4;</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x2217;</mml:mo><mml:msub><mml:mi>g</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mrow><mml:mtext mathvariant="italic">artifac</mml:mtext><mml:msubsup><mml:mi>t</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mi>e</mml:mi><mml:mi>x</mml:mi><mml:mi>o</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mfenced></mml:math></disp-formula>
<p>where <inline-formula><mml:math id="M5"><mml:mtext mathvariant="italic">artifac</mml:mtext><mml:msubsup><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mtext mathvariant="italic">endo</mml:mtext></mml:msubsup></mml:math></inline-formula> denotes endogenous artifacts and <inline-formula><mml:math id="M6"><mml:mtext mathvariant="italic">artifac</mml:mtext><mml:msubsup><mml:mi>t</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mi>e</mml:mi><mml:mi>x</mml:mi><mml:mi>o</mml:mi></mml:mrow></mml:msubsup></mml:math></inline-formula> denotes exogenous artifacts. <inline-formula><mml:math id="M7"><mml:msub><mml:mi>g</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:math></inline-formula> and <inline-formula><mml:math id="M8"><mml:msub><mml:mi>g</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:math></inline-formula> represent the transmission loss functions for endogenous and exogenous artifacts of different transmission mediums, respectively. We wished to construct a network structure that can simulate this mutual antagonism of the source signal during the scalp transmission process. The schematic diagram illustrating the coupling model framework of scalp electroencephalography can be found in <xref ref-type="fig" rid="fig1">Figure 1</xref>. Therefore, we propose the following dual-pathway denoised autoencoder model. We fit the coupling between the two types of loss functions, by means of a multiscale dual-pathway structure and a fusion block to extract the feature coding of clean scalp EEG signals.</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption><p>The coupled system model of scalp EEG contaminated by artifacts. The endogenous signals contain clean scalp EEG signals and physiological artifacts dominated by electrooculogram (EOG), electromyogram (EMG), and electrocardiographic (ECG). The endogenous signals antagonize each other and are co-transmitted through the skull to the scalp electrodes to be acquired. The exogenous signals, which contain physiological artifacts dominated by industrial frequency interference and motion artifacts, are transmitted via the scalp to the scalp electrodes, and together with the endogenous signals, constitute the observed scalp EEG. The exogenous signals comprised primarily of industrial frequency interference and motion artifacts are conveyed through the scalp to the scalp electrodes. These signals, in conjunction with the endogenous signals, collectively form the observable scalp EEG.</p></caption>
<graphic xlink:href="fnins-17-1258024-g001.tif"/>
</fig>
</sec>
<sec id="sec4">
<label>2.2</label>
<title>Network structure</title>
<p>Considering the coupled system, the clean scalp EEG signals and the artifact signals interfere and contaminate each other, which ultimately constitutes the observed scalp EEG signals composed of signals from multiple sources. In this paper, a dual-pathway autoencoder (DPAE) network architecture is designed for extracting the encoding of the target signal which is also the clean scalp EEG signal.</p>
<p>Firstly, at the time of extracting the primary features, the original input is mapped to subspaces of different dimensions to obtain feature representations at different scales. Secondly, through the fusion module, the joint representation features of different scales are compressed and reconstructed to obtain the scale-independent common feature information. Finally, the clean scalp EEG signal is reconstructed based on this common feature information. Notably, we associate the neural network layers at both ends of the fusion module through residual connections, i.e., the input and output layers of the fusion module are summed before feeding into the decoder. In view of the fact that the depth of the network structure increases with the addition of the fusion module, the residual connection can ensure that there is no gradient vanishing in the backpropagation (<xref ref-type="bibr" rid="ref9">Drozdzal et al., 2016</xref>). Furthermore, the residual connection is added to construct an identity mapping of the joint representation based on the residual connection, which is more conducive to the model learning the common feature information in the different scale features. This model structure has the advantage of automatically modeling the contamination relationship between multiple interference source signals. It also learns multiscale feature representations in a dual pathway format and uses the coding of the target source signals as its learning target. This helps the fusion module to obtain common feature representations at different scales that contain more information about the target source signals.</p>
<p>In addition, to better illustrate the generalization of the network frame, the dual-pathway autoencoder network architectures based on MLP, CNN, and RNN are constructed in this paper, respectively. Specifically, taking CNN as an example, we construct two sets of convolutional layer pathways with different sizes of convolutional kernels (1&#x00D7;3 vs. 1&#x00D7;5) and different steps (1&#x00D7;2 vs. 1&#x00D7;4) on the input layer of the network, respectively. In contrast to MLP, two sets of fully connected layer pathways are constructed with different neuron shrinkage ratios (0.45 vs. 0.75). Compared to RNN, two sets of GRU (gate recurrent unit) pathways are constructed with different unit shrinkage ratios (0.45 vs. 0.75). The two hidden layer pathways are followed by the fusion module. The fusion module is constructed based on a fully connected layer, which constructs the joint representation based on the two hidden layer pathways, and reconstructs the joint representation by compression coding first and then reconstructing the joint representation. The whole fusion module is a symmetric structure, as shown in <xref ref-type="fig" rid="fig2">Figure 2</xref>. The reconstructed joint representation is further compression coded by combining the residual connections after the fusion module. Finally, decoding is performed based on the decoder to synthesize clean scalp EEG signals. In addition, we added a batch normalization (<xref ref-type="bibr" rid="ref31">Santurkar et al., 2018</xref>) layer after the joint representation layer every time, which is to regularize the features from two pathways to a uniform interval, reduce the degree of data divergence, and construct independently and identically distributed recombination features.</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption><p>The dual-pathway denoising autoencoder (DPAE) structure diagram. The DPAE model first extracts multiscale abstract features by stacking network layers of varying scales. Subsequently, it simulates the coupling of multiple source signals through the fusion block. Following this, it employs residual connection structures to extract scale-independent generic encoding features of the target source signals. Finally, it reconstructs the target source signals based on the decoder. It is worth noting that to make the network structure look more intuitive, we ignore the joint representation layer. FN denotes a fully connected hidden neural layer, BN denotes a batch normalization layer, and the widths of the individual layers indicate the relative proportions of the layer neuron sizes. The encoder layers can be replaced by MLP, CNN, and RNN layers.</p></caption>
<graphic xlink:href="fnins-17-1258024-g002.tif"/>
</fig>
<p>In terms of parameter details, if we take the number of neurons at the input as 512, the specific parameters of the DPAE network model based on MLP are shown in <xref ref-type="table" rid="tab1">Table 1</xref>. Additionally, it should be noted that the number of neurons in the high-dimensional layer is 682 (512/0.75) and the number of neurons in the low-dimensional layer is 230 (512&#x002A;0.45). This is to map the input data to the low and high dimensional spaces, respectively, and thus ensure feature variability extracted by the subsequent dual-pathway structure. After that, the shrinkage ratio of the two pathway network layers stacked is always set to 0.45 and 0.75, respectively, so the shape of each network layer can be calculated in turn. In other words, our pathway with the high ratio is the first to carry out an expansion operation to upgrade dimensionality and a few contraction operations to reduce dimensionality. This is intended to improve the features&#x2019; expressiveness. In contrast, in the low ratio pathway, it is all shrinkage operations. In the decoder, the two-path network layers are concatenated together, then one layer of the network layers is stacked and output directly.</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption><p>Parameters of each layer of the DPAE model.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Layer</th>
<th align="center" valign="top">Shape</th>
<th align="left" valign="top">Connected to</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Input</td>
<td align="center" valign="top">512</td>
<td align="left" valign="top">-</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer1 (Path1)</td>
<td align="center" valign="top">230</td>
<td align="left" valign="top">Input</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer2 (Path1)</td>
<td align="center" valign="top">103</td>
<td align="left" valign="top">Dense_layer1</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer3 (Path1)</td>
<td align="center" valign="top">46</td>
<td align="left" valign="top">Dense_layer2</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer4 (Path2)</td>
<td align="center" valign="top">680</td>
<td align="left" valign="top">Input</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer5 (Path2)</td>
<td align="center" valign="top">511</td>
<td align="left" valign="top">Dense_layer4</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer6 (Path2)</td>
<td align="center" valign="top">383</td>
<td align="left" valign="top">Dense_layer5</td>
</tr>
<tr>
<td align="left" valign="top">Concatenate layer1</td>
<td align="center" valign="top">429</td>
<td align="left" valign="top">Dense_layer3, Dense_layer6</td>
</tr>
<tr>
<td align="left" valign="top">Batch normalization Layer1</td>
<td align="center" valign="top">429</td>
<td align="left" valign="top">Concatenate layer1</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer7 (fusion encoder)</td>
<td align="center" valign="top">193</td>
<td align="left" valign="top">Batch normalization Layer1</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer8 (fusion encoder)</td>
<td align="center" valign="top">86</td>
<td align="left" valign="top">Dense_layer7</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer9 (fusion feature)</td>
<td align="center" valign="top">39</td>
<td align="left" valign="top">Dense_layer8</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer10 (fusion decoder)</td>
<td align="center" valign="top">86</td>
<td align="left" valign="top">Dense_layer9</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer11 (fusion decoder)</td>
<td align="center" valign="top">193</td>
<td align="left" valign="top">Dense_layer10</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer12 (Path1)</td>
<td align="center" valign="top">46</td>
<td align="left" valign="top">Dense_layer11</td>
</tr>
<tr>
<td align="left" valign="top">Add layer1 (Path1)</td>
<td align="center" valign="top">46</td>
<td align="left" valign="top">Dense_layer3, Dense_layer12</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer13 (Path1)</td>
<td align="center" valign="top">20</td>
<td align="left" valign="top">Add layer1</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer14 (Path2)</td>
<td align="center" valign="top">383</td>
<td align="left" valign="top">Dense_layer11</td>
</tr>
<tr>
<td align="left" valign="top">Add layer2 (Path2)</td>
<td align="center" valign="top">383</td>
<td align="left" valign="top">Dense_layer6, Dense_layer14</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer15 (Path2)</td>
<td align="center" valign="top">287</td>
<td align="left" valign="top">Add layer2</td>
</tr>
<tr>
<td align="left" valign="top">Concatenate layer2</td>
<td align="center" valign="top">307</td>
<td align="left" valign="top">Dense_layer14, Dense_layer15</td>
</tr>
<tr>
<td align="left" valign="top">Batch normalization Layer2</td>
<td align="center" valign="top">307</td>
<td align="left" valign="top">Concatenate layer2</td>
</tr>
<tr>
<td align="left" valign="top">Dense_layer16 (decoder)</td>
<td align="center" valign="top">256</td>
<td align="left" valign="top">Batch normalization Layer2</td>
</tr>
<tr>
<td align="left" valign="top">Output</td>
<td align="center" valign="top">512</td>
<td align="left" valign="top">Dense_layer16</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>We propose a training strategy to facilitate the learning of the clean scalp EEG characteristic features from scalp EEG contaminated with artifacts. We designed clean scalp EEG and contaminated scalp EEG input-to-output pairs. We set clean EEG as the expected output and constructed contaminated scalp EEG frames with different signal-to-noise ratio EMG and EOG artifacts as the input. The dual-way autoencoder is trained in a supervised manner, using the Adam optimizer (<xref ref-type="bibr" rid="ref40">Zhang, 2018</xref>) with the mean square error between the outputs and the expected reconstructed clean EEG as a loss function. Furthermore, all activation functions are SeLu.</p>
</sec>
<sec id="sec5">
<label>2.3</label>
<title>Datasets and pre-processing</title>
<sec id="sec6">
<label>2.3.1</label>
<title>Introduction to the dataset</title>
<p>We selected a publicly available standardized scalp EEG artifact removal dataset to validate the effectiveness of the method. The EEGdenoiseNet is a benchmark dataset for scalp EEG artifact removal by deep learning. This dataset has been manually annotated to obtain single-channel pure EEG, EOG, and EMG, and has been standardized for segmentation. <xref ref-type="fig" rid="fig3">Figure 3</xref> shows example waveforms of scalp EEG, EOG, and EMG, respectively. The total of 13,512 segments of scalp EEG with a sampling rate of 256&#x2009;Hz consist of 4,514 segments of pure EEG, 3,400 segments of oculomotor artifacts, and 5,598 segments of EMG artifacts, respectively.</p>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption><p>Typical examples in the EEGdenoiseNet dataset. The waveforms of clean scalp EEG, EOG, and EMG with a sampling rate of 256&#x2009;Hz and a duration of 2&#x2009;s are given here, respectively.</p></caption>
<graphic xlink:href="fnins-17-1258024-g003.tif"/>
</fig>
<p>The clean scalp EEG signals in this dataset were not spontaneous. They were extracted from 52 subjects performing real and imagined motor tasks. The sampled data from the 64 electrode channels were first extracted from the clean scalp EEG signals by ICA and then segmented into 2&#x2009;s one-dimensional EEG signals after band-pass filtering from 1 to 80 Hz. In other words, the final single-channel scalp EEG signal was extracted from different channels of EEG. The EOG signal was obtained from additional EOG channels from multiple publicly available datasets and processed by band-pass filtering from 0.3 to 10&#x2009;Hz. The EMG signal was obtained from a publicly available facial EMG dataset and processed by band-pass filtering from 1 to 120&#x2009;Hz.</p>
<p>With this dataset, we compared our method with the current state-of-the-art methods in power spectral density (PSD), signal-to-noise ratio (SNR), and other classical time series similarity measures for quantitative metrics.</p>
</sec>
<sec id="sec7">
<label>2.3.2</label>
<title>Pre-processing of the dataset</title>
<p>As mentioned previously, the EEGdenoiseNet dataset is divided into clean EEG, EOG, and EMG datasets, all with a sampling rate of 256&#x2009;Hz, and all have been segmented in advance for a 2&#x2009;s time window. We randomly selected 80% of the clean EEG as the training set and the remaining 20% of the EEG as the test set. The EEG contaminated by EOG and EMG is linearly constructed according to <xref ref-type="disp-formula" rid="EQ3">equation (3)</xref>, which is also the noise-added formula used in the original dataset paper. Additionally, it should be noted that the sample counts of EEG, EMG, and EOG were not equal, we adopted a random sampling strategy without replacement for EMG and EOG to construct EEG contaminated with artifacts.</p>
<disp-formula id="EQ3"><label>(3)</label><mml:math id="M9"><mml:mi>E</mml:mi><mml:mi>E</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mtext mathvariant="italic">Contaminated</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mi>E</mml:mi><mml:mi>E</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mtext mathvariant="italic">clean</mml:mtext></mml:msub><mml:mo>+</mml:mo><mml:mi>&#x03BB;</mml:mi><mml:mo>&#x2217;</mml:mo><mml:mtext mathvariant="italic">Artifac</mml:mtext><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>E</mml:mi><mml:mi>M</mml:mi><mml:mi>G</mml:mi><mml:mo stretchy="true">/</mml:mo><mml:mi>E</mml:mi><mml:mi>O</mml:mi><mml:mi>G</mml:mi></mml:mrow></mml:msub></mml:math></disp-formula>
<p>where &#x03BB; is the artifact coefficient inferred from the SNR of the noise-added EEG. The SNR value is calculated as <xref ref-type="disp-formula" rid="EQ4">equation (4)</xref>, with lg representing the logarithm with a base of 10.</p>
<disp-formula id="EQ4"><label>(4)</label><mml:math id="M10"><mml:mi>S</mml:mi><mml:mi>N</mml:mi><mml:mi>R</mml:mi><mml:mo>=</mml:mo><mml:mn>10</mml:mn><mml:mi>lg</mml:mi><mml:mfenced open="(" close=")"><mml:mfrac><mml:mrow><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>E</mml:mi><mml:mi>E</mml:mi><mml:msub><mml:mi>G</mml:mi><mml:mtext mathvariant="italic">clean</mml:mtext></mml:msub></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>&#x03BB;</mml:mi><mml:mo>&#x2217;</mml:mo><mml:mtext mathvariant="italic">Artifac</mml:mtext><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>E</mml:mi><mml:mi>M</mml:mi><mml:mi>G</mml:mi><mml:mo stretchy="true">/</mml:mo><mml:mi>E</mml:mi><mml:mi>O</mml:mi><mml:mi>G</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac></mml:mfenced></mml:math></disp-formula>
<p>RMS denotes the root mean square and represents the power of this segment of the signal, calculated as <xref ref-type="disp-formula" rid="EQ5">equation (5)</xref>.</p>
<disp-formula id="EQ5"><label>(5)</label><mml:math id="M11"><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mfenced open="(" close=")"><mml:mtext mathvariant="italic">Signal</mml:mtext></mml:mfenced><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mi>N</mml:mi></mml:mfrac><mml:mstyle displaystyle="true"><mml:munderover><mml:mo stretchy="true">&#x2211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover></mml:mstyle><mml:mtext mathvariant="italic">Signa</mml:mtext><mml:msubsup><mml:mi>l</mml:mi><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:msqrt></mml:math></disp-formula>
<p>Therefore, the smaller the value of &#x03BB;, the larger the SNR value. The authors who proposed this dataset set the signal-to-noise ratio interval to [-7&#x2009;dB, 2&#x2009;dB], and we adopted the same setting scheme. As a side note, the data segmentation of our EMG-contaminated EEG is as follows: set the number of training sets for each SNR level to 4,478 and the number of test sets to 560. Set the data segmentation of our EOG-contaminated EEG as follows: set the number of training sets for each SNR level to 2,720 and the number of test sets to 340. This is also consistent with the way the authors constructed the segmentation of this dataset. We also set up separate validation sets equal to the size of the test set.</p>
</sec>
</sec>
</sec>
<sec sec-type="results" id="sec8">
<label>3</label>
<title>Results</title>
<sec id="sec9">
<label>3.1</label>
<title>Performance evaluation metrics</title>
<p>Here, we adopted the evaluation criteria established by the creators of the dataset, consistent with their criteria for assessing denoising performance. Evaluating the network performance in terms of the root mean square error (RRMSE) of correlation on the time and frequency domains of the denoised EEG data, as well as the Pearson correlation coefficient values with the paired clean EEG.</p>
<p>First, we calculate the RRMSE of the denoised EEG from both the time domain and frequency domain, respectively. Here, we denote the denoised EEG by <inline-formula><mml:math id="M12"><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mi>y</mml:mi></mml:mfenced></mml:math></inline-formula> and the paired clean EEG by x as ground truth. Then, the RRMSE in the time domain is calculated as <xref ref-type="disp-formula" rid="EQ6">equation (6)</xref>:</p>
<disp-formula id="EQ6"><label>(6)</label><mml:math id="M13"><mml:mtext mathvariant="italic">RRMS</mml:mtext><mml:msub><mml:mi>E</mml:mi><mml:mtext mathvariant="italic">temporal</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mi>y</mml:mi></mml:mfenced><mml:mo>&#x2212;</mml:mo><mml:mi>x</mml:mi></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mfenced open="(" close=")"><mml:mi>x</mml:mi></mml:mfenced></mml:mrow></mml:mfrac></mml:math></disp-formula>
<p>Considering that the scalp EEG signal is a non-linear time-variant signal, the RRMSE in the frequency domain introduces the power spectral density for calculating as <xref ref-type="disp-formula" rid="EQ7">equation (7)</xref>:</p>
<disp-formula id="EQ7"><label>(7)</label><mml:math id="M14"><mml:mtext mathvariant="italic">RRMS</mml:mtext><mml:msub><mml:mi>E</mml:mi><mml:mtext mathvariant="italic">spectral</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>P</mml:mi><mml:mi>S</mml:mi><mml:mi>D</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mi>y</mml:mi></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>&#x2212;</mml:mo><mml:mi>P</mml:mi><mml:mi>S</mml:mi><mml:mi>D</mml:mi><mml:mfenced open="(" close=")"><mml:mi>x</mml:mi></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow><mml:mrow><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>P</mml:mi><mml:mi>S</mml:mi><mml:mi>D</mml:mi><mml:mfenced open="(" close=")"><mml:mi>x</mml:mi></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mfrac></mml:math></disp-formula>
<p>The Pearson correlation coefficient (CC) is the classic measure of correlation of time-series data and is calculated as <xref ref-type="disp-formula" rid="EQ8">equation (8)</xref>:</p>
<disp-formula id="EQ8"><label>(8)</label><mml:math id="M15"><mml:mi>C</mml:mi><mml:mi>C</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>v</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mi>y</mml:mi></mml:mfenced><mml:mo>,</mml:mo><mml:mi>x</mml:mi></mml:mrow></mml:mfenced></mml:mrow><mml:msqrt><mml:mrow><mml:mi>V</mml:mi><mml:mi>a</mml:mi><mml:mi>r</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mi>y</mml:mi></mml:mfenced></mml:mrow></mml:mfenced><mml:mi>V</mml:mi><mml:mi>a</mml:mi><mml:mi>r</mml:mi><mml:mfenced open="(" close=")"><mml:mi>x</mml:mi></mml:mfenced></mml:mrow></mml:msqrt></mml:mfrac></mml:math></disp-formula>
<p>We start by explaining the source of the test data, which was constructed with the training set by sampling without replacement, which ensures that for the testing stage, either clean scalp EEG, EOG, or EMG are completely new and unseen. Some samples of our model noise reduction performance are given in <xref ref-type="fig" rid="fig4">Figures 4</xref>&#x2013;<xref ref-type="fig" rid="fig6">6</xref>. For each segment of data, the normalization method has two steps: subtracting the standard deviation and then dividing by the maximum of the absolute values. It is important to highlight that when presenting the results of noise reduction through power spectrograms, we differentiated the various frequency bands of the EEG by employing distinct colors. This was done to accurately depict our method&#x2019;s capacity to preserve the essential information within the power spectrum of each band. The frequency distribution intervals of each band are as follows: delta [1&#x2013;4&#x2009;Hz], theta [4&#x2013;8&#x2009;Hz], alpha [8&#x2013;13&#x2009;Hz], beta [13&#x2013;30&#x2009;Hz], and gamma [30&#x2013;80&#x2009;Hz].</p>
<fig position="float" id="fig4">
<label>Figure 4</label>
<caption><p>Example waveforms of the denoising results of EMG (left) and EOG (right). It is evident that the time-domain waveforms of the denoised EEG signal closely resemble those of the clean scalp EEG signal, irrespective of the presence of contamination from EOG or EMG.</p></caption>
<graphic xlink:href="fnins-17-1258024-g004.tif"/>
</fig>
<fig position="float" id="fig5">
<label>Figure 5</label>
<caption><p>Example PSD plots of denoising result of EMG. We presented power spectral density (PSD) plots for scalp EEG signals contaminated by EMG across various frequency bands both before and after noise reduction. The delineated frequency intervals for each band are as follows: delta [1&#x2013;4&#x2009;Hz], theta [4&#x2013;8&#x2009;Hz], alpha [8&#x2013;13&#x2009;Hz], beta [13&#x2013;30&#x2009;Hz], and gamma [30&#x2013;80&#x2009;Hz]. The efficacy in eliminating artifacts in the higher-energy EEG bands, specifically delta, theta, and alpha, exhibited greater strength and displayed more consistent curves. Conversely, in the beta and gamma bands, the ability to remove artifacts was less pronounced due to the comparatively lower energy levels inherent in the clean EEG signals within these bands.</p></caption>
<graphic xlink:href="fnins-17-1258024-g005.tif"/>
</fig>
<fig position="float" id="fig6">
<label>Figure 6</label>
<caption><p>Example PSD plots of the denoising results of EOG. We present power spectral density (PSD) plots for scalp EEG signals contaminated by EOG across various frequency bands both before and after noise reduction. The delineated frequency intervals for each band are as follows: delta [1&#x2013;4&#x2009;Hz], theta [4&#x2013;8&#x2009;Hz], alpha [8&#x2013;13&#x2009;Hz], beta [13&#x2013;30&#x2009;Hz], and gamma [30&#x2013;80&#x2009;Hz]. The frequency range of EOG typically falls within 0&#x2013;13&#x2009;Hz, which is relatively narrow. As depicted in the figure, EOG primarily introduces interference in the delta, theta, and alpha frequency bands. The results demonstrate that our denoising method consistently exhibits exceptional performance across all frequency bands.</p></caption>
<graphic xlink:href="fnins-17-1258024-g006.tif"/>
</fig>
</sec>
<sec id="sec10">
<label>3.2</label>
<title>Contrast with other deep learning methods</title>
<p>To have a more comprehensive view of the performance of our model, we compare it with a fully convolutional neural network (FCNN), simple CNN, complex CNN (<xref ref-type="bibr" rid="ref10">Emara et al., 2019</xref>), and RNN, which are proposed with the benchmark dataset and the latest novel CNN. <xref ref-type="table" rid="tab2">Table 2</xref> shows the scales in parameters and FLOPs (floating point operations) (<xref ref-type="bibr" rid="ref11">Fr&#x00F8;lich and Dowding, 2018</xref>) of each network. Firstly, considering the number of parameters, it is evident that our methods belong to the forefront, characterized by a minimal parameter count. However, compared with the RNN model, the number of parameters is slightly increased, considering that the RNN only has three hidden layers, while our DPAE model has 16 hidden layers, so it can be said that the DPAE framework can increase the depth of the network while effectively reducing the number of parameters. FLOPs represent the amount of network computation when the model is propagated forward, which can be used to measure the computational complexity of the model. It can be seen that the FLOPs of the three network frameworks of the DPAE are extremely small, and are one-tenth or even one-twentieth of the other models, among which the FLOPs value of the DPAE MLP model is the smallest. This indicates that our method consumes very little memory and is very fast, which is conducive for applications.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption><p>Comparison of the network scales.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="middle">Model</th>
<th align="center" valign="middle">FCNN</th>
<th align="center" valign="middle">RNN</th>
<th align="center" valign="middle">Simple CNN</th>
<th align="center" valign="middle">Complex CNN</th>
<th align="center" valign="middle">Novel CNN</th>
<th align="center" valign="middle">DPAE MLP</th>
<th align="center" valign="middle">DPAE 1D-CNN</th>
<th align="center" valign="middle">DPAE 1D-RNN</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Params</td>
<td align="center" valign="top">1.0&#x2009;M</td>
<td align="center" valign="top"><bold>0.7&#x2009;M</bold></td>
<td align="center" valign="top">16.8&#x2009;M</td>
<td align="center" valign="top">8.5&#x2009;M</td>
<td align="center" valign="top">33.6&#x2009;M</td>
<td align="center" valign="top">1.5&#x2009;M</td>
<td align="center" valign="top">2.0&#x2009;M</td>
<td align="center" valign="top">2.3&#x2009;M</td>
</tr>
<tr>
<td align="left" valign="top">FLOPs</td>
<td align="center" valign="top">57.4&#x2009;M</td>
<td align="center" valign="top">55.3&#x2009;M</td>
<td align="center" valign="top">53.7&#x2009;M</td>
<td align="center" valign="top">20.1&#x2009;M</td>
<td align="center" valign="top">124.5&#x2009;M</td>
<td align="center" valign="top"><bold>3.1&#x2009;M</bold></td>
<td align="center" valign="top">3.9&#x2009;M</td>
<td align="center" valign="top">5.8&#x2009;M</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p>We highlights the minimum values for parameters and FLOPs through bold formatting.</p>
</table-wrap-foot>
</table-wrap>
<p>We trained the above methods all in 200 epochs with batch size at 128, using the same optimizer and loss function as our method. We present the benchmark assessment metrics proposed with this dataset against the RRMSE temporal, RRMSE spectral, and Pearson correlation coefficients graphs at all SNR levels in <xref ref-type="fig" rid="fig7">Figure 7</xref>. The first column gives the performance for scalp EEG contaminated by EOG under the three noise reduction metrics, while the second column corresponds to the noise reduction results for EEG contaminated by EMG.</p>
<fig position="float" id="fig7">
<label>Figure 7</label>
<caption><p>Evaluation metrics curves of the denoising results from different artifacts of the EEGdenoiseNet dataset. The figure presents the denoising results of different neural network models based on time-domain and frequency-domain root relative mean square error (RRMSE) and Pearson correlation coefficients. These results are obtained for scalp EEG signals contaminated by varying signal-to-noise ratios of EOG and EMG.</p></caption>
<graphic xlink:href="fnins-17-1258024-g007.tif"/>
</fig>
<p>In terms of trend, we can see that our noise reduction performance gets better as the SNR level increases. The fluctuation of our method is much more stable compared to other methods, which indicates the higher robustness of our method. The DPAE CNN model stably outperforms other models at all SNR levels in both temporal and spectral RRMSE metrics for EMG removal. The DPAE CNN also obtained the best performance on the removal of EOG as the signal-to-noise ratio went beyond -3&#x2009;dB. In addition, the DPAE MLP model and the DPAE RNN model significantly outperform the baseline FCNN and RNN models, confirming the generalization of the DPAE frame across network structures. Secondly, the DPAE MLP model stably outperforms other deep learning algorithms on EOG removal after the SNR level exceeds -6&#x2009;dB both on RRMSE temporal and spectral. In EMG removal, it outperforms other models in RRMSE spectral after the SNR level exceeds -3&#x2009;dB, and in RRMSE temporal after the SNR level exceeds zero. The DPAE RNN, on the other hand, has a weaker performance improvement and does not outperform other deep learning algorithms, but only manages to completely outperform the simple CNN and outperform the complex RNN at the same SNR levels. As a side note, our method does not perform well on the Pearson correlation coefficient.</p>
</sec>
<sec id="sec11">
<label>3.3</label>
<title>Ablation study</title>
<p>In addition, our encoder design scheme mainly consists of three elements: a dual-pathway network structure, fusion block module, and residual connection. We took the two-pathway network as the baseline and the complete network as the ablation object based on the MLP layer. The ablation study (<xref ref-type="bibr" rid="ref22">Meyes et al., 2019</xref>) was set on two occasions: removing the fusion block module or removing the residual connection. In terms of control variables, we set the three networks under the same random seed with the same training set (using the EMG dataset) and consistent hyperparameters (batch size&#x2009;=&#x2009;128, leaning rate&#x2009;=&#x2009;0.001, epoch&#x2009;=&#x2009;200). The experimental results are shown in <xref ref-type="fig" rid="fig8">Figure 8</xref>.</p>
<fig position="float" id="fig8">
<label>Figure 8</label>
<caption><p>Ablation study on loss comparison. We conducted the ablation study on two separate instances: one involved the removal of the fusion block module, and the other involved the removal of the residual connection. Both instances utilized the same random seed and the same training set (utilizing the EMG dataset). The observed metric is the training losses.</p></caption>
<graphic xlink:href="fnins-17-1258024-g008.tif"/>
</fig>
<p>As we can see, the model with the fusion block removed has the slowest convergence speed and the worst final loss level. The model with the residual connections removed has a slightly higher convergence speed, and the final loss is better but not as good as the complete model. First, this result is consistent with the fact that residual connections can improve the convergence speed of the network. Second, it shows that the fusion block module can indeed effectively fuse and update the coding representation of scalp EEG signals, which can not only improve the convergence efficiency of the network jointly with the residual connection but also make the network converge to a better local optimum. Finally, according to the results of the comparison between eliminating the fusion block and eliminating the residual connections for the network performance degradation, the fusion block contributes more to the performance improvement of the neural network compared to the residual connections.</p>
</sec>
<sec id="sec12">
<label>3.4</label>
<title>Multichannel noise reduction experiment</title>
<p>The clean scalp EEG of the EEGdenoiseNet dataset is one-dimensional although it comes from different electrode channels. That is, our model is currently not based on specific channels for noise reduction, but in the form of single-channel reuse of the same model. However, most of the real scalp EEG datasets are multichannel, coupled with the fact that the clean scalp EEG signals of EEGdenoiseNet extracted by ICA differ from the real scalp EEG data. Therefore, in order to verify the practicality of our method, we did another experiment on multichannel scalp EEG noise reduction.</p>
<p>The scalp EEG data were obtained from the BCI Competition IV dataset 2a (<xref ref-type="bibr" rid="ref3">Brunner et al., 2008</xref>), a four-class motor image dataset with nine subjects and 288 trials per subject. The number of electrodes was 22, and the sampling rate was 250&#x2009;Hz. We randomly sampled 192 trials of data from two subjects (with equal proportions for the four motor imagery tasks). We intercepted 2&#x2009;s of EEG data from which the subjects started motor imagery and resampled the scalp EEG to 256&#x2009;Hz as ground truth. Those data were then combined with EMG data from EEGdenoiseNet to simulate contaminated EEG data at different signal-to-noise levels.</p>
<p>For the training of the network model, we made the following changes: the original 1d DPAE model structure was still used, and the data from each trial were fed into the network channel by channel for noise reduction. <xref ref-type="fig" rid="fig9">Figure 9</xref> shows the noise reduction results of the three model frameworks under different signal-to-noise levels. It can be seen that all three models maintain consistent noise reduction with the EEGDenoseNet dataset at different signal-to-noise ratio levels when facing real multichannel EEG data. Among them, 1dCNN DPAE consistently outperforms the other two models in EMG artifact removal until the signal-to-noise level is below 0&#x2009;dB. This further illustrates the greater robustness of the 1dCNN DPAE model. The MLP DPAE model, on the other hand, maintains the trend of increasing noise reduction ability with increasing signal-to-noise ratio, whereas the other two models show a decrease in noise reduction ability at signal-to-noise ratios greater than 0&#x2009;dB, suggesting that the MLP DPAE model is more capable of generalization.</p>
<fig position="float" id="fig9">
<label>Figure 9</label>
<caption><p>Denoising results of myoelectric artifacts on multichannel scalp electroencephalogram at different signal-to-noise ratios. The figure presents the denoising results of different DPAE models based on time-domain and frequency-domain root relative mean square error (RRMSE) and Pearson correlation coefficients. These results are obtained for multichannel scalp EEG signals contaminated by varying signal-to-noise ratios of EMG.</p></caption>
<graphic xlink:href="fnins-17-1258024-g009.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="conclusions" id="sec13">
<label>4</label>
<title>Conclusion</title>
<p>In this paper, we propose a general dual-pathway autoencoder network for automatic scalp EEG denoising and validating the model separately constructed using MLP, CNN, and RNN on benchmark EEG datasets.</p>
<p>The experimental results show that the DPAE model constructed based on the above three architectures significantly outperforms the model constructed on the basic architecture, proving the effectiveness of our method. Secondly, the DPAE CNN model outperforms the existing deep learning noise reduction methods in terms of RRMSE temporal and spectral at almost all SNR levels, both in the EOG removal experiments and in the EMG removal experiments. Both the DPAE MLP and DPAE RNN also show competitive performance. Finally, in the comparison of the number of parameters in the network model and the computational complexity, the scale of our model is much smaller than existing deep learning methods, which proves the efficiency of the DPAE framework. Our method has the advantage of not limiting the number of channels and performs stably on the noise reduction of 22-channel EEG data. However, we have not yet started noise reduction experiments on real-time data, which is one of our future enhancement directions. In this way, we can combine the advantages of a lightweight network and apply it to portable brain&#x2013;computer interface devices.</p>
<p>Our feature extraction structure of dual pathways, combined with residual connection and constant mapping of fusion modules, enables the DPAE model to efficiently guide the network to learn scale-independent common feature representations of the target source signals for reconstruction. Our network model does not require <italic>a priori</italic> distributional knowledge of multiple source signals and can be well migrated for application to blind source signal separation.</p>
<p>Also, since the overall framework of our method is an autoencoder, it is not difficult to envision that the encoder structure and weight design in this paper can be used as an upstream task for deep learning of scalp EEG data based on transfer learning without complicated manual filtering and can be applied directly to remove artifacts to improve the performance of downstream tasks.</p>
</sec>
<sec sec-type="data-availability" id="sec14">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec sec-type="ethics-statement" id="sec15">
<title>Ethics statement</title>
<p>The studies involving humans were approved by the Research Ethics Committee of Keio University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.</p>
</sec>
<sec sec-type="author-contributions" id="sec16">
<title>Author contributions</title>
<p>WX: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing &#x2013; original draft, Writing &#x2013; review &#x0026; editing. LM: Resources, Supervision, Writing &#x2013; original draft, Writing &#x2013; review &#x0026; editing. HL: Conceptualization, Funding acquisition, Methodology, Resources, Supervision, Writing &#x2013; review &#x0026; editing.</p>
</sec>
</body>
<back>
<sec sec-type="funding-information" id="sec17">
<title>Funding</title>
<p>The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work is supported by the Key Research and Development Program of Heilongjiang (Grant No. GY2021ZB0206).</p>
</sec>
<sec sec-type="COI-statement" id="sec18">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec id="sec100" sec-type="disclaimer">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec sec-type="supplementary-material" id="sec20">
<title>Supplementary material</title>
<p>The Supplementary material for this article can be found online at: <ext-link xlink:href="https://www.frontiersin.org/articles/10.3389/fpsyg.2023.1258024/full#supplementary-material" ext-link-type="uri">https://www.frontiersin.org/articles/10.3389/fpsyg.2023.1258024/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Data_Sheet_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Anon Hanrahan</surname> <given-names>C</given-names></name></person-group>. (<year>2019</year>) <source>Noise reduction in EEG signals using convolutional autoencoding techniques</source>, <publisher-name>Technological University Dublin</publisher-name>. <publisher-loc>Dublin.</publisher-loc></citation></ref>
<ref id="ref2"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Bono</surname> <given-names>V.</given-names></name> <name><surname>Jamal</surname> <given-names>W.</given-names></name> <name><surname>Das</surname> <given-names>S.</given-names></name> <name><surname>Maharatna</surname> <given-names>K.</given-names></name></person-group> (<year>2014</year>) <article-title>Artifact reduction in multichannel pervasive EEG using hybrid WPT-ICA and WPT-EMD signal decomposition techniques</article-title>. In: <conf-name>2014 EEE international conference on acoustics, speech and signal processing (ICASSP)</conf-name> pp. <fpage>5864</fpage>&#x2013;<lpage>5868</lpage>. <publisher-loc>Florence, Italy</publisher-loc>: <publisher-name>IEEE</publisher-name>.</citation></ref>
<ref id="ref3"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Brunner</surname> <given-names>C.</given-names></name> <name><surname>Leeb</surname> <given-names>R.</given-names></name> <name><surname>M&#x00FC;ller-Putz</surname> <given-names>G.</given-names></name> <name><surname>Schl&#x00F6;gl</surname> <given-names>A.</given-names></name> <name><surname>Pfurtscheller</surname> <given-names>G.</given-names></name></person-group> (<year>2008</year>). <source>BCI competition 2008&#x2013;Graz data set a. Institute for Knowledge Discovery</source>, <publisher-name>Graz University of Technology</publisher-name>, <publisher-loc>Graz</publisher-loc>, <fpage>1</fpage>&#x2013;<lpage>6</lpage>.</citation></ref>
<ref id="ref4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cai</surname> <given-names>H.</given-names></name> <name><surname>Qu</surname> <given-names>Z.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Hu</surname> <given-names>X.</given-names></name> <name><surname>Hu</surname> <given-names>B.</given-names></name></person-group> (<year>2020</year>). <article-title>Feature-level fusion approaches based on multimodal EEG data for depression recognition</article-title>. <source>Inf. Fusion</source> <volume>59</volume>, <fpage>127</fpage>&#x2013;<lpage>138</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.inffus.2020.01.008</pub-id></citation></ref>
<ref id="ref5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>A.</given-names></name> <name><surname>Peng</surname> <given-names>H.</given-names></name> <name><surname>Ward</surname> <given-names>R. K.</given-names></name></person-group> (<year>2014</year>). <article-title>A preliminary study of muscular artifact cancellation in single-channel EEG</article-title>. <source>Sensors</source> <volume>14</volume>, <fpage>18370</fpage>&#x2013;<lpage>18389</lpage>. doi: <pub-id pub-id-type="doi">10.3390/s141018370</pub-id>, PMID: <pub-id pub-id-type="pmid">25275348</pub-id></citation></ref>
<ref id="ref6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Croft</surname> <given-names>R. J.</given-names></name> <name><surname>Barry</surname> <given-names>R. J.</given-names></name></person-group> (<year>2000</year>). <article-title>Removal of ocular artifact from the EEG: a review</article-title>. <source>Neurophysiol. Clin.</source> <volume>30</volume>, <fpage>5</fpage>&#x2013;<lpage>19</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0987-7053(00)00055-1</pub-id>, PMID: <pub-id pub-id-type="pmid">10740792</pub-id></citation></ref>
<ref id="ref7"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>David</surname> <given-names>O.</given-names></name> <name><surname>Friston</surname> <given-names>K. J.</given-names></name></person-group> (<year>2003</year>). <article-title>A neural mass model for MEG/EEG: coupling and neuronal dynamics</article-title>. <source>Neuroimage</source> <volume>20</volume>, <fpage>1743</fpage>&#x2013;<lpage>1755</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.neuroimage.2003.07.015</pub-id>, PMID: <pub-id pub-id-type="pmid">14642484</pub-id></citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dirlich</surname> <given-names>G.</given-names></name> <name><surname>Vogl</surname> <given-names>L.</given-names></name> <name><surname>Plaschke</surname> <given-names>M.</given-names></name> <name><surname>Strian</surname> <given-names>F.</given-names></name></person-group> (<year>1997</year>). <article-title>Cardiac field effects on the EEG</article-title>. <source>Electroencephalogr. Clin. Neurophysiol.</source> <volume>102</volume>, <fpage>307</fpage>&#x2013;<lpage>315</lpage>. doi: <pub-id pub-id-type="doi">10.1016/S0013-4694(96)96506-2</pub-id></citation></ref>
<ref id="ref9"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Drozdzal</surname> <given-names>M.</given-names></name> <name><surname>Vorontsov</surname> <given-names>E.</given-names></name> <name><surname>Chartrand</surname> <given-names>G.</given-names></name> <name><surname>Kadoury</surname> <given-names>S.</given-names></name> <name><surname>Pal</surname> <given-names>C.</given-names></name></person-group> (<year>2016</year>) The importance of skip connections in biomedical image segmentation. Available at: (<ext-link xlink:href="https://arxiv.org/abs/1608.04117" ext-link-type="uri">https://arxiv.org/abs/1608.04117</ext-link>).</citation></ref>
<ref id="ref10"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Emara</surname> <given-names>T.</given-names></name> <name><surname>Munim</surname> <given-names>H. E. A. E.</given-names></name> <name><surname>Abbas</surname> <given-names>H. M.</given-names></name></person-group> (<year>2019</year>). &#x201C;<article-title>LiteSeg: a novel lightweight ConvNet for semantic segmentation</article-title>&#x201D; in <source>2019 digital image computing: Techniques and applications (DICTA)</source> (<publisher-loc>Perth, Australia</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1</fpage>&#x2013;<lpage>7</lpage>.</citation></ref>
<ref id="ref11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fr&#x00F8;lich</surname> <given-names>L.</given-names></name> <name><surname>Dowding</surname> <given-names>I.</given-names></name></person-group> (<year>2018</year>). <article-title>Removal of muscular artifacts in EEG signals: a comparison of linear decomposition methods</article-title>. <source>Brain Inf.</source> <volume>5</volume>, <fpage>13</fpage>&#x2013;<lpage>22</lpage>. doi: <pub-id pub-id-type="doi">10.1007/s40708-017-0074-6</pub-id>, PMID: <pub-id pub-id-type="pmid">29322469</pub-id></citation></ref>
<ref id="ref12"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Guneysu</surname> <given-names>A.</given-names></name> <name><surname>Akin</surname> <given-names>H. L.</given-names></name></person-group> (<year>2013</year>) <article-title>An SSVEP based BCI to control a humanoid robot by using portable EEG device</article-title>. In: <conf-name>2013 35th annual international conference of the IEEE engineering in medicine and biology society (EMBC)</conf-name>, pp. <fpage>6905</fpage>&#x2013;<lpage>6908</lpage>. <publisher-loc>Osaka</publisher-loc>: <publisher-name>IEEE</publisher-name>.</citation></ref>
<ref id="ref13"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hamaneh</surname> <given-names>M. B.</given-names></name> <name><surname>Chitravas</surname> <given-names>N.</given-names></name> <name><surname>Kaiboriboon</surname> <given-names>K.</given-names></name> <name><surname>Lhatoo</surname> <given-names>S. D.</given-names></name> <name><surname>Loparo</surname> <given-names>K. A.</given-names></name></person-group> (<year>2014</year>). <article-title>Automated removal of EKG artifact from EEG data using independent component analysis and continuous wavelet transformation</article-title>. <source>I.E.E.E. Trans. Biomed. Eng.</source> <volume>61</volume>, <fpage>1634</fpage>&#x2013;<lpage>1641</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TBME.2013.2295173</pub-id>, PMID: <pub-id pub-id-type="pmid">24845273</pub-id></citation></ref>
<ref id="ref14"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hinton</surname> <given-names>G. E.</given-names></name> <name><surname>Salakhutdinov</surname> <given-names>R. R.</given-names></name></person-group> (<year>2006</year>). <article-title>Reducing the dimensionality of data with neural networks</article-title>. <source>Science</source> <volume>313</volume>, <fpage>504</fpage>&#x2013;<lpage>507</lpage>. doi: <pub-id pub-id-type="doi">10.1126/science.1127647</pub-id></citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>N. E.</given-names></name> <name><surname>Shen</surname> <given-names>Z.</given-names></name> <name><surname>Long</surname> <given-names>S. R.</given-names></name> <name><surname>Wu</surname> <given-names>M. C.</given-names></name> <name><surname>Shih</surname> <given-names>H. H.</given-names></name> <name><surname>Zheng</surname> <given-names>Q.</given-names></name> <etal/></person-group>. (<year>1998</year>). <article-title>The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis</article-title>. <source>Proc. R. Soc. Lond. A</source> <volume>454</volume>, <fpage>903</fpage>&#x2013;<lpage>995</lpage>. doi: <pub-id pub-id-type="doi">10.1098/rspa.1998.0193</pub-id></citation></ref>
<ref id="ref16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Islam</surname> <given-names>M. S.</given-names></name> <name><surname>El-Hajj</surname> <given-names>A. M.</given-names></name> <name><surname>Alawieh</surname> <given-names>H.</given-names></name> <name><surname>Dawy</surname> <given-names>Z.</given-names></name> <name><surname>Abbas</surname> <given-names>N.</given-names></name> <name><surname>El-Imad</surname> <given-names>J.</given-names></name></person-group> (<year>2020</year>). <article-title>EEG mobility artifact removal for ambulatory epileptic seizure prediction applications</article-title>. <source>Biomed. Signal Process. Control</source> <volume>55</volume>:<fpage>101638</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.bspc.2019.101638</pub-id></citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jung</surname> <given-names>T.-P.</given-names></name> <name><surname>Makeig</surname> <given-names>S.</given-names></name> <name><surname>Humphries</surname> <given-names>C.</given-names></name> <name><surname>Lee</surname> <given-names>T.-W.</given-names></name> <name><surname>McKeown</surname> <given-names>M. J.</given-names></name> <name><surname>Iragui</surname> <given-names>V.</given-names></name> <etal/></person-group>. (<year>2000</year>). <article-title>Removing electroencephalographic artifacts by blind source separation</article-title>. <source>Psychophysiology</source> <volume>37</volume>, <fpage>163</fpage>&#x2013;<lpage>178</lpage>. doi: <pub-id pub-id-type="doi">10.1111/1469-8986.3720163</pub-id>, PMID: <pub-id pub-id-type="pmid">10731767</pub-id></citation></ref>
<ref id="ref18"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Lebret</surname> <given-names>R.</given-names></name> <name><surname>Collobert</surname> <given-names>R.</given-names></name></person-group> (<year>2015</year>) &#x201C;The sum of its parts&#x201D;: joint learning of word and phrase representations with autoencoders. Available at: (<ext-link xlink:href="https://arxiv.org/abs/1506.05703" ext-link-type="uri">https://arxiv.org/abs/1506.05703</ext-link>).</citation></ref>
<ref id="ref19"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Leglaive</surname> <given-names>S.</given-names></name> <name><surname>Alameda-Pineda</surname> <given-names>X.</given-names></name> <name><surname>Girin</surname> <given-names>L.</given-names></name> <name><surname>Horaud</surname> <given-names>R.</given-names></name></person-group> (<year>2020</year>) <article-title>A recurrent Variational autoencoder for speech enhancement</article-title>. In: <conf-name>ICASSP 2020&#x2013;2020 IEEE international conference on acoustics, speech and signal processing</conf-name>, pp. <fpage>371</fpage>&#x2013;<lpage>375</lpage>. <publisher-loc>Barcelona, Spain</publisher-loc>: <publisher-name>IEEE</publisher-name>.</citation></ref>
<ref id="ref20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ludwig</surname> <given-names>K. A.</given-names></name> <name><surname>Miriani</surname> <given-names>R. M.</given-names></name> <name><surname>Langhals</surname> <given-names>N. B.</given-names></name> <name><surname>Joseph</surname> <given-names>M. D.</given-names></name> <name><surname>Anderson</surname> <given-names>D. J.</given-names></name> <name><surname>Kipke</surname> <given-names>D. R.</given-names></name></person-group> (<year>2009</year>). <article-title>Using a common average reference to improve cortical neuron recordings from microelectrode arrays</article-title>. <source>J. Neurophysiol.</source> <volume>101</volume>, <fpage>1679</fpage>&#x2013;<lpage>1689</lpage>. doi: <pub-id pub-id-type="doi">10.1152/jn.90989.2008</pub-id>, PMID: <pub-id pub-id-type="pmid">19109453</pub-id></citation></ref>
<ref id="ref21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mahajan</surname> <given-names>R.</given-names></name> <name><surname>Morshed</surname> <given-names>B. I.</given-names></name></person-group> (<year>2015</year>). <article-title>Unsupervised eye blink artifact denoising of EEG data with modified multiscale sample entropy, kurtosis, and wavelet-ICA</article-title>. <source>IEEE J. Biomed. Health Inform.</source> <volume>19</volume>, <fpage>158</fpage>&#x2013;<lpage>165</lpage>. doi: <pub-id pub-id-type="doi">10.1109/JBHI.2014.2333010</pub-id>, PMID: <pub-id pub-id-type="pmid">24968340</pub-id></citation></ref>
<ref id="ref22"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Meyes</surname> <given-names>R.</given-names></name> <name><surname>Lu</surname> <given-names>M.</given-names></name> <name><surname>de Puiseau</surname> <given-names>C. W.</given-names></name> <name><surname>Meisen</surname> <given-names>T.</given-names></name></person-group> (<year>2019</year>) Ablation studies in artificial neural networks. Available at: (<ext-link xlink:href="https://arxiv.org/abs/1901.08644" ext-link-type="uri">https://arxiv.org/abs/1901.08644</ext-link>).</citation></ref>
<ref id="ref23"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mumtaz</surname> <given-names>W.</given-names></name> <name><surname>Rasheed</surname> <given-names>S.</given-names></name> <name><surname>Irfan</surname> <given-names>A.</given-names></name></person-group> (<year>2021</year>). <article-title>Review of challenges associated with the EEG artifact removal methods</article-title>. <source>Biomed. Signal Process. Control</source> <volume>68</volume>:<fpage>102741</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.bspc.2021.102741</pub-id></citation></ref>
<ref id="ref24"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Muthukumaraswamy</surname> <given-names>S. D.</given-names></name></person-group> (<year>2013</year>). <article-title>High-frequency brain activity and muscle artifacts in MEG/EEG: a review and recommendations</article-title>. <source>Front. Hum. Neurosci.</source> <volume>7</volume>:<fpage>138</fpage>. doi: <pub-id pub-id-type="doi">10.3389/fnhum.2013.00138</pub-id>, PMID: <pub-id pub-id-type="pmid">23596409</pub-id></citation></ref>
<ref id="ref25"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nakanishi</surname> <given-names>M.</given-names></name> <name><surname>Wang</surname> <given-names>Y.-T.</given-names></name> <name><surname>Jung</surname> <given-names>T.-P.</given-names></name> <name><surname>Zao</surname> <given-names>J. K.</given-names></name> <name><surname>Chien</surname> <given-names>Y.-Y.</given-names></name> <name><surname>Diniz-Filho</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Detecting Glaucoma with a portable brain-computer Interface for objective assessment of visual function loss</article-title>. <source>JAMA Ophthalmol.</source> <volume>135</volume>, <fpage>550</fpage>&#x2013;<lpage>557</lpage>. doi: <pub-id pub-id-type="doi">10.1001/jamaophthalmol.2017.0738</pub-id>, PMID: <pub-id pub-id-type="pmid">28448641</pub-id></citation></ref>
<ref id="ref26"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oberman</surname> <given-names>L. M.</given-names></name> <name><surname>Hubbard</surname> <given-names>E. M.</given-names></name> <name><surname>McCleery</surname> <given-names>J. P.</given-names></name> <name><surname>Altschuler</surname> <given-names>E. L.</given-names></name> <name><surname>Ramachandran</surname> <given-names>V. S.</given-names></name> <name><surname>Pineda</surname> <given-names>J. A.</given-names></name></person-group> (<year>2005</year>). <article-title>EEG evidence for mirror neuron dysfunction in autism spectrum disorders</article-title>. <source>Cogn. Brain Res.</source> <volume>24</volume>, <fpage>190</fpage>&#x2013;<lpage>198</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.cogbrainres.2005.01.014</pub-id>, PMID: <pub-id pub-id-type="pmid">15993757</pub-id></citation></ref>
<ref id="ref27"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oliveira</surname> <given-names>A. S.</given-names></name> <name><surname>Schlink</surname> <given-names>B. R.</given-names></name> <name><surname>Hairston</surname> <given-names>W. D.</given-names></name> <name><surname>K&#x00F6;nig</surname> <given-names>P.</given-names></name> <name><surname>Ferris</surname> <given-names>D. P.</given-names></name></person-group> (<year>2016</year>). <article-title>Induction and separation of motion artifacts in EEG data using a mobile phantom head device</article-title>. <source>J. Neural Eng.</source> <volume>13</volume>:<fpage>036014</fpage>. doi: <pub-id pub-id-type="doi">10.1088/1741-2560/13/3/036014</pub-id>, PMID: <pub-id pub-id-type="pmid">27137818</pub-id></citation></ref>
<ref id="ref28"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Parra</surname> <given-names>L. C.</given-names></name> <name><surname>Spence</surname> <given-names>C. D.</given-names></name> <name><surname>Gerson</surname> <given-names>A. D.</given-names></name> <name><surname>Sajda</surname> <given-names>P.</given-names></name></person-group> (<year>2005</year>). <article-title>Recipes for the linear analysis of EEG</article-title>. <source>Neuroimage</source> <volume>28</volume>, <fpage>326</fpage>&#x2013;<lpage>341</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.neuroimage.2005.05.032</pub-id>, PMID: <pub-id pub-id-type="pmid">16084117</pub-id></citation></ref>
<ref id="ref29"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Patel</surname> <given-names>R.</given-names></name> <name><surname>Sengottuvel</surname> <given-names>S.</given-names></name> <name><surname>Janawadkar</surname> <given-names>M. P.</given-names></name> <name><surname>Gireesan</surname> <given-names>K.</given-names></name> <name><surname>Radhakrishnan</surname> <given-names>T. S.</given-names></name> <name><surname>Mariyappa</surname> <given-names>N.</given-names></name></person-group> (<year>2016</year>). <article-title>Ocular artifact suppression from EEG using ensemble empirical mode decomposition with principal component analysis</article-title>. <source>Comput. Electr. Eng.</source> <volume>54</volume>, <fpage>78</fpage>&#x2013;<lpage>86</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.compeleceng.2015.08.019</pub-id></citation></ref>
<ref id="ref30"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Sanei</surname> <given-names>S.</given-names></name> <name><surname>Chambers</surname> <given-names>J. A.</given-names></name></person-group> (<year>2013</year>) <source>EEG signal processing</source>. <publisher-loc>Somerset</publisher-loc>: <publisher-name>Wiley</publisher-name>.</citation></ref>
<ref id="ref31"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Santurkar</surname> <given-names>S.</given-names></name> <name><surname>Tsipras</surname> <given-names>D.</given-names></name> <name><surname>Ilyas</surname> <given-names>A.</given-names></name> <name><surname>Madry</surname> <given-names>A.</given-names></name></person-group> (<year>2018</year>) How does batch normalization help optimization? Available at:(<ext-link xlink:href="https://arxiv.org/abs/1805.11604" ext-link-type="uri">https://arxiv.org/abs/1805.11604</ext-link>).</citation></ref>
<ref id="ref32"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Sinha</surname> <given-names>R. K.</given-names></name> <name><surname>Pandey</surname> <given-names>R.</given-names></name> <name><surname>Pattnaik</surname> <given-names>R.</given-names></name></person-group> (<year>2018</year>) Deep learning for computer vision tasks: a review. Available at: (<ext-link xlink:href="https://arxiv.org/abs/1804.03928" ext-link-type="uri">https://arxiv.org/abs/1804.03928</ext-link>)</citation></ref>
<ref id="ref33"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname> <given-names>W.</given-names></name> <name><surname>Su</surname> <given-names>Y.</given-names></name> <name><surname>Wu</surname> <given-names>X.</given-names></name> <name><surname>Wu</surname> <given-names>X.</given-names></name></person-group> (<year>2020</year>). <article-title>A novel end-to-end 1D-ResCNN model to remove artifact from EEG signals</article-title>. <source>Neurocomputing</source> <volume>404</volume>, <fpage>108</fpage>&#x2013;<lpage>121</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.neucom.2020.04.029</pub-id></citation></ref>
<ref id="ref34"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Ullsperger</surname> <given-names>M.</given-names></name> <name><surname>Debener</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>) <source>Simultaneous EEG and fMRI: recording, analysis, and application</source>. <publisher-loc>New York</publisher-loc>: <publisher-name>Oxford University Press</publisher-name>.</citation></ref>
<ref id="ref35"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Urig&#x00FC;en</surname> <given-names>J. A.</given-names></name> <name><surname>Garcia-Zapirain</surname> <given-names>B.</given-names></name></person-group> (<year>2015</year>). <article-title>EEG artifact removal-state-of-the-art and guidelines</article-title>. <source>J. Neural Eng.</source> <volume>12</volume>:<fpage>031001</fpage>. doi: <pub-id pub-id-type="doi">10.1088/1741-2560/12/3/031001</pub-id>, PMID: <pub-id pub-id-type="pmid">25834104</pub-id></citation></ref>
<ref id="ref36"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vincent</surname> <given-names>P.</given-names></name> <name><surname>Larochelle</surname> <given-names>H.</given-names></name> <name><surname>Lajoie</surname> <given-names>I.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name> <name><surname>Manzagol</surname> <given-names>P.-A.</given-names></name></person-group> (<year>2010</year>). <article-title>Stacked denoising autoencoders: learning useful representations in a deep network with a local denoising criterion</article-title>. <source>J. Mach. Learn. Res.</source> <volume>11</volume>, <fpage>3371</fpage>&#x2013;<lpage>3408</lpage>. doi: <pub-id pub-id-type="doi">10.5555/1756006.1953039</pub-id></citation></ref>
<ref id="ref37"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>F.</given-names></name> <name><surname>Wen</surname> <given-names>Y.</given-names></name> <name><surname>Bi</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Sun</surname> <given-names>J.</given-names></name></person-group> (<year>2023</year>). <article-title>A portable SSVEP-BCI system for rehabilitation exoskeleton in augmented reality environment</article-title>. <source>Biomed. Signal Process. Control</source> <volume>83</volume>:<fpage>104664</fpage>. doi: <pub-id pub-id-type="doi">10.1016/j.bspc.2023.104664</pub-id></citation></ref>
<ref id="ref38"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wolpaw</surname> <given-names>J. R.</given-names></name> <name><surname>McFarland</surname> <given-names>D. J.</given-names></name> <name><surname>Neat</surname> <given-names>G. W.</given-names></name> <name><surname>Forneris</surname> <given-names>C. A.</given-names></name></person-group> (<year>1991</year>). <article-title>An EEG-based brain-computer interface for cursor control</article-title>. <source>Electroencephalogr. Clin. Neurophysiol.</source> <volume>78</volume>, <fpage>252</fpage>&#x2013;<lpage>259</lpage>. doi: <pub-id pub-id-type="doi">10.1016/0013-4694(91)90040-B</pub-id></citation></ref>
<ref id="ref39"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>B.</given-names></name> <name><surname>Duan</surname> <given-names>K.</given-names></name> <name><surname>Fan</surname> <given-names>C.</given-names></name> <name><surname>Hu</surname> <given-names>C.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name></person-group> (<year>2018</year>). <article-title>Automatic ocular artifacts removal in EEG using deep learning</article-title>. <source>Biomed. Signal Process. Control</source> <volume>43</volume>, <fpage>148</fpage>&#x2013;<lpage>158</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.bspc.2018.02.021</pub-id></citation></ref>
<ref id="ref40"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Z.</given-names></name></person-group> (<year>2018</year>) <article-title>Improved Adam optimizer for deep neural networks</article-title>. In: <conf-name>2018 IEEE/ACM 26th international symposium on quality of service (IWQoS)</conf-name>, pp. <fpage>1</fpage>&#x2013;<lpage>2</lpage>. <publisher-loc>Banff, AB, Canada</publisher-loc>: <publisher-name>IEEE</publisher-name>.</citation></ref>
<ref id="ref41"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Wei</surname> <given-names>C.</given-names></name> <name><surname>Zhao</surname> <given-names>M.</given-names></name> <name><surname>Liu</surname> <given-names>Q.</given-names></name> <name><surname>Wu</surname> <given-names>H.</given-names></name></person-group> (<year>2021</year>) <article-title>A novel convolutional neural network model to remove muscle artifacts from EEG</article-title>. In: <conf-name>ICASSP 2021&#x2013;2021 IEEE international conference on acoustics, speech and signal processing (ICASSP)</conf-name>, pp. <fpage>1265</fpage>&#x2013;<lpage>1269</lpage>. <publisher-loc>Toronto, ON, Canada</publisher-loc>: <publisher-name>IEEE</publisher-name>.</citation></ref>
<ref id="ref42"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Sun</surname> <given-names>H.</given-names></name> <name><surname>Fan</surname> <given-names>Z.</given-names></name> <name><surname>Zhong</surname> <given-names>X.</given-names></name></person-group> (<year>2019</year>). <article-title>Portable brain-computer interface based on novel convolutional neural network</article-title>. <source>Comput. Biol. Med.</source> <volume>107</volume>, <fpage>248</fpage>&#x2013;<lpage>256</lpage>. doi: <pub-id pub-id-type="doi">10.1016/j.compbiomed.2019.02.023</pub-id>, PMID: <pub-id pub-id-type="pmid">30856388</pub-id></citation></ref>
<ref id="ref43"><citation citation-type="web"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Zhao</surname> <given-names>M.</given-names></name> <name><surname>Wei</surname> <given-names>C.</given-names></name> <name><surname>Mantini</surname> <given-names>D.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Liu</surname> <given-names>Q.</given-names></name></person-group> (<year>2020</year>) EEGdenoiseNet: a benchmark dataset for end-to-end deep learning solutions of EEG denoising. Available at: (<ext-link xlink:href="https://arxiv.org/abs/2009.11662" ext-link-type="uri">https://arxiv.org/abs/2009.11662</ext-link>).</citation></ref>
<ref id="ref44"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhao</surname> <given-names>Q.</given-names></name> <name><surname>Hu</surname> <given-names>B.</given-names></name> <name><surname>Shi</surname> <given-names>Y.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Moore</surname> <given-names>P.</given-names></name> <name><surname>Sun</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Automatic identification and removal of ocular artifacts in EEG&#x2014;improved adaptive predictor filtering for portable applications</article-title>. <source>IEEE Trans. Nanobioscience</source> <volume>13</volume>, <fpage>109</fpage>&#x2013;<lpage>117</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TNB.2014.2316811</pub-id>, PMID: <pub-id pub-id-type="pmid">24802943</pub-id></citation></ref>
</ref-list>
</back>
</article>
