<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Neurosci.</journal-id>
<journal-title>Frontiers in Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1662-453X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnins.2022.869522</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neuroscience</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>EEG-TNet: An End-To-End Brain Computer Interface Framework for Mental Workload Estimation</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Fan</surname> <given-names>Chaojie</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1544628/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Hu</surname> <given-names>Jin</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Huang</surname> <given-names>Shufang</given-names></name>
<xref ref-type="aff" rid="aff4"><sup>4</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Peng</surname> <given-names>Yong</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1630921/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Kwong</surname> <given-names>Sam</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1384993/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Key Laboratory of Traffic Safety on Track of Ministry of Education, School of Traffic and Transportation Engineering, Central South University</institution>, <addr-line>Changsha</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>Department of Computer Science, City University of Hong Kong, Kowloon</institution>, <addr-line>Hong Kong SAR</addr-line>, <country>China</country></aff>
<aff id="aff3"><sup>3</sup><institution>Hunan Communications Research Institute Co., Ltd., Hunan Communication &#x00026; Water Conservancy Group Ltd.</institution>, <addr-line>Changsha</addr-line>, <country>China</country></aff>
<aff id="aff4"><sup>4</sup><institution>School of Business and Trade, Hunan Industry Polytechnic</institution>, <addr-line>Changsha</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Gui Ye, Chongqing University, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Lizhuang Ma, Shanghai Jiao Tong University, China; Yufei Chen, Xi&#x00027;an Jiaotong University, China; Jianbo Zhu, Southeast University, China</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Yong Peng <email>yong_peng&#x00040;csu.edu.cn</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Front. Neurosci, a section of the journal Frontiers in Neuroscience</p></fn></author-notes>
<pub-date pub-type="epub">
<day>25</day>
<month>04</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>16</volume>
<elocation-id>869522</elocation-id>
<history>
<date date-type="received">
<day>04</day>
<month>02</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>18</day>
<month>03</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Fan, Hu, Huang, Peng and Kwong.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Fan, Hu, Huang, Peng and Kwong</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>The mental workload (MWL) of different occupational groups&#x00027; workers is the main and direct factor of unsafe behavior, which may cause serious accidents. One of the new and useful technologies to estimate MWL is the Brain computer interface (BCI) based on EEG signals, which is regarded as the gold standard of cognitive status. However, estimation systems involving handcrafted EEG features are time-consuming and unsuitable to apply in real-time. The purpose of this study was to propose an end-to-end BCI framework for MWL estimation. First, a new automated data preprocessing method was proposed to remove the artifact without human interference. Then a new neural network structure named EEG-TNet was designed to extract both the temporal and frequency information from the original EEG. Furthermore, two types of experiments and ablation studies were performed to prove the effectiveness of this model. In the subject-dependent experiment, the estimation accuracy of dual-task estimation (No task vs. TASK) and triple-task estimation (Lo vs. Mi vs. Hi) reached 99.82 and 99.21%, respectively. In contrast, the accuracy of different tasks reached 82.78 and 66.83% in subject-independent experiments. Additionally, the ablation studies proved that preprocessing method and network structure had significant contributions to estimation MWL. The proposed method is convenient without any human intervention and outperforms other related studies, which becomes an effective way to reduce human factor risks.</p></abstract>
<kwd-group>
<kwd>mental workload</kwd>
<kwd>brain computer interface</kwd>
<kwd>deep neural network</kwd>
<kwd>occupational safety</kwd>
<kwd>ergonomics</kwd>
</kwd-group>
<counts>
<fig-count count="5"/>
<table-count count="3"/>
<equation-count count="12"/>
<ref-count count="35"/>
<page-count count="11"/>
<word-count count="6808"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>1. Introduction</title>
<p>Information systems are increasingly approaching the boundaries of human competence due to their increasing complexity and autonomy. A dynamic and automated adaptation of the system to the user state is required to minimize user overload in high-demand scenarios (M&#x000FC;hl et al., <xref ref-type="bibr" rid="B21">2014</xref>). A reliable understanding of the user&#x00027;s current status, particularly the workload, is essential for timely and appropriate system adaptation (van Erp et al., <xref ref-type="bibr" rid="B29">2010</xref>). The workload is a direct factor in unsafe operations. Workers of special occupational groups such as construction workers, car drivers, pilots are prone to physical exhaustion and lack of consciousness under high workloads for a long time, leading to numbness of safety conditions and causing great insecurity. Therefore, it is extremely important to effectively assess and reduce the workload of operators in preventing unsafe behaviors and reducing dangerous accidents. Thus, workload estimation is an actively growing research field, for it possesses numerous human factor applications in many occupational groups to reduce safety risks (Roy et al., <xref ref-type="bibr" rid="B28">2016</xref>; Yin et al., <xref ref-type="bibr" rid="B33">2019</xref>).</p>
<p>The workload is mainly divided into physical workload and mental workload (MWL). When the human body is under different physical workloads, various physiological parameters such as oxygen consumption, heart rate, pulmonary ventilation, energy expenditure rate, and various chemical enzymes related to energy conversion show changes. (Roscoe, <xref ref-type="bibr" rid="B27">1992</xref>; Abdelhamid and Everett, <xref ref-type="bibr" rid="B1">2002</xref>). However, the estimation of MWL is more complicated than physical workload, while the former is more closely associated with safety.</p>
<p>There are two types of MWL estimation methods, subjective and objective estimation methods (Hogervorst et al., <xref ref-type="bibr" rid="B13">2014</xref>; Charles and Nixon, <xref ref-type="bibr" rid="B4">2019</xref>). The subjective test is a self-recorded and a questionnaire-based test in which the subject&#x00027;s workload is scored. Among a large number of subjective estimation methods, the National Aeronautics and Space Administration&#x00027;s Task Load Index (NASA TLX) (Hart, <xref ref-type="bibr" rid="B11">2006</xref>) and Subjective Workload Assessment Technique (SWAT) (Reid and Nygren, <xref ref-type="bibr" rid="B25">1988</xref>) are the most popular subjective estimation methods. Additionally, the objective estimation methods are used to estimate their workload by collecting the object&#x00027;s physiological signals.</p>
<p>The objective test has developed rapidly in recent years due to developments in sensor technology. The rationality for the objective test based on physiological signals is that when people are under MWL, the parameters of each physiological condition deviate from the normal state. Thus, it is possible to detect changes in the body&#x00027;s physiological signals to estimate MWL. The changed physiological parameters include cardiac activity, electrical brain activity, eye movements, and metabolic changes (Fairclough and Houston, <xref ref-type="bibr" rid="B9">2004</xref>). Therefore, many physiological indicators have been used to estimate MWL, such as electrocardiograms (ECG), eye movements, electroencephalography (EEG) measurements, respiration, and electromyography (EMG). Among these physiological indicators, EEG is widely used because MWL changes are closely linked to brain cortical activity and because it is non-smooth, non-invasive, and highly discriminative (Wilson et al., <xref ref-type="bibr" rid="B31">1994</xref>; Dehais et al., <xref ref-type="bibr" rid="B8">2020</xref>; Pieper et al., <xref ref-type="bibr" rid="B24">2021</xref>; Liu et al., <xref ref-type="bibr" rid="B18">2022</xref>; Yu et al., <xref ref-type="bibr" rid="B34">2022</xref>). This is why EEG is also known as the gold standard. In conclusion, EEG had the best and most reliable estimation performance of MWL.</p>
<p>To sum up, in terms of accuracy and practicality, EEG is optimal for estimating the MWL. The entire framework also can be referred to as a brain-computer interface (BCI) by means of computer algorithms that decode information from the brain and thus access the state of the human. In this study, an end-to-end BCI framework using EEG is proposed to estimate workers&#x00027; MWL continuously, which can directly decode EEG without feature extraction.</p>
<sec>
<title>1.1. Related Study</title>
<sec>
<title>1.1.1. Handcrafted Features-Based BCI Framework</title>
<p>Brain-computer interface, as a new human-computer interaction technology, provides a new method of communication with the outside world and enables direct human control of machines. In recent years, with deep cross-fertilization of artificial intelligence technology in neuroscience, cybernetics, computer science, and other related fields, research on BCI cognitive status computing systems based on EEG. There are a large number of BCI frameworks for MWL assessment have been proposed in recent years, and most of the research has used handcrafted features. Lim et al. (<xref ref-type="bibr" rid="B17">2018</xref>) assessed the MWL induced by the single-session simultaneous capacity (SIMKAP) experiment. They collected the 14-channels of EEG and extracted different bands&#x00027; power spectral density (PSD). The Neighborhood Component Analysis (NCA) was used to select critical features and the Support Vector Regression (SVR) model was trained to assess the MWL. A helmet with EEG sensors was designed by Wang et al. (<xref ref-type="bibr" rid="B30">2017</xref>) to meet the requirements of the construction industry, and they designed different construction activities to induce different levels of MWL. The results showed that Gamma waves and Fp1 and Tp10 channels are good candidates for MWL estimation in the frequency domain. However, the unavoidable limitations of current BCI frameworks which use handcrafted features should not be ignored. The extraction of EEG signal features requires researchers to master interdisciplinary theories and research results in stochastic signal analysis and cognitive neuroscience, raising the threshold for researching this field (Cheng et al., <xref ref-type="bibr" rid="B5">2022</xref>). Thus, the incomprehensibility of domain knowledge can limit the extracted features that cannot effectively represent the implicit MWL-related information in the original signal. In addition, restricted by the performance of computing units of wearable devices, algorithms with high computational complexity cannot be applied on brain-computer interface systems. The computation of EEG features, especially non-linear features such as entropy value and complexity, requires much time and, thus, cannot meet the needs of brain-computer interface systems.</p>
</sec>
<sec>
<title>1.1.2. Deep Learning-Based BCI Framework</title>
<p>To solve the feature extraction problem, inspired by the success of the feature extraction ability of convolutional neural network, decoding EEG according to CNN, which constructing the end-to-end BCI framework are receiving increasing attention. There are some related end-to-end studies in the field of EEG-based BCI frameworks for other tasks, such as emotion recognition, word imagined, and epileptic seizure recognition (Xu et al., <xref ref-type="bibr" rid="B32">2020</xref>; Datta and Boulgouris, <xref ref-type="bibr" rid="B7">2021</xref>; Hu et al., <xref ref-type="bibr" rid="B14">2021</xref>). Furthermore, unlike the images, EEG signals are typically time-series signals, and the evolutionary trends in neural activity during complex or simple cognitive processing are of equal interest. Therefore, combined models by merging CNN and Long short-term memory (LSTM) network was proposed and attempted to extract features by CNN and obtain the temporal information by LSTM layers. However, most CNN-LSTM studies use 1-D or 2-D convolutional kernels and full connected layers to process EEG data. The original EEG was transformed into 1-D or 2-D tensors, which were then fed into the LSTM layers. The above algorithms disrupt the temporal information and the transformed data does not have a real time sequence in the &#x0201C;time step&#x0201D; dimension (Xu et al., <xref ref-type="bibr" rid="B32">2020</xref>). Therefore, the effect of LSTM layers is weakened because of the wrong temporal information.</p>
</sec>
</sec>
<sec>
<title>1.2. Contribution</title>
<p>To fill the research gap mentioned above, in this study, a convenient and efficient end-to-end BCI framework for MWL estimation was proposed. The contributions of the article can be summarized as follows:</p>
<p>First, our proposed end-to-end BCI framework for workers&#x00027; MWL estimation, which decodes mental workload related relevant information directly from raw EEG, is able to avoid the time consumption associated with complex feature extraction and thus meet the hardware requirements of brain-computer interface systems.</p>
<p>Second, this method uses a combination of filters, ASR, and ICA with ADJUST to obtain relatively pure EEG signals without manual involvement. Additionally, the MWL related neural information is decoded smoothly from the original EEG by the designed time fixed 3-D-CNN layers while the temporal dimension is unchanged. Then the following bi-LSTM layers can be used to extract temporal features.</p>
<p>Third, according to two types of comparison experiments and ablation studies, the estimation effectiveness of EEG-TNet can be proved.</p>
</sec>
</sec>
<sec id="s2">
<title>2. Neural Network Preliminary</title>
<p>In this section, some preliminary knowledge about the neural network including convolutional layer, LSTM, and fully connected layers were introduced, which are the basis of our EEG-TNet method.</p>
<sec>
<title>2.1. Convolutional Layer</title>
<p>In this study, convolutional layers include four normal convolutional layers, a depthwise convolutional, and a pointwise convolutional. For the normal convolutional, the input of the convolutional layer is <italic>X</italic><sub><italic>in</italic></sub>(<italic>C</italic><sub><italic>in</italic></sub>, <italic>D</italic><sub><italic>in</italic></sub>, <italic>H</italic><sub><italic>in</italic></sub>, <italic>W</italic><sub><italic>in</italic></sub>), and the output <italic>y</italic><sub><italic>out</italic></sub>(<italic>C</italic><sub><italic>out</italic></sub>, <italic>D</italic><sub><italic>out</italic></sub>, <italic>H</italic><sub><italic>out</italic></sub>, <italic>W</italic><sub><italic>out</italic></sub>). The formula of the convolutional layer can be described as follows:</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M1"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>y</mml:mi></mml:mrow><mml:mrow><mml:mi>o</mml:mi><mml:mi>u</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>b</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mstyle displaystyle="true"><mml:munderover accentunder="false" accent="false"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:munderover></mml:mstyle><mml:mi>w</mml:mi><mml:mo>&#x022C6;</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Where the &#x022C6; is the valid 3D cross-correlation operation. The shape of <italic>y</italic><sub><italic>out</italic></sub>(<italic>C</italic><sub><italic>out</italic></sub>, <italic>D</italic><sub><italic>out</italic></sub>, <italic>H</italic><sub><italic>out</italic></sub>, <italic>W</italic><sub><italic>out</italic></sub>) can be calculated according to the kernel size (<italic>K</italic><sub><italic>D</italic></sub>, <italic>K</italic><sub><italic>H</italic></sub>, <italic>K</italic><sub><italic>W</italic></sub>) and the kernel number <italic>C</italic><sub><italic>out</italic></sub>. Specifically, the depthwise separable convolution (Chollet, <xref ref-type="bibr" rid="B6">2017</xref>) which consists of depthwise convolutional and pointwise convolutional was used in our research to extract spatial information from EEG with a lower number of convolutional parameters (Chollet, <xref ref-type="bibr" rid="B6">2017</xref>).</p>
</sec>
<sec>
<title>2.2. LSTM Layer</title>
<p>By designing time-fixed 3D convolutional layers, we retain the EEG information in each time step and further analyze the temporal information using LSTM networks (Hochreiter and Schmidhuber, <xref ref-type="bibr" rid="B12">1997</xref>). Recurrent neural networks (RNN) have an excellent memory capability owing to their distinctive self-connected structure, which has an absolute advantage in processing temporal data (Mikolov et al., <xref ref-type="bibr" rid="B19">2010</xref>). The LSTM network is a popular expansion of RNN to address the gradient disappearance problem while RNN only processes long-term data. The LSTM introduces a gating mechanism to control the rate of accumulation of information, including adding new information and forgetting previous information by using the gates. There are three gates including input gate <italic>i</italic><sub><italic>t</italic></sub>, forget gate <italic>f</italic><sub><italic>t</italic></sub>, cell gate <italic>g</italic><sub><italic>t</italic></sub>, and output gate <italic>o</italic><sub><italic>t</italic></sub>, respectively. Specifically, the forget gate <italic>f</italic><sub><italic>t</italic></sub> controls the rates of previous information required to be forgotten about the internal state <italic>c</italic><sub><italic>t</italic>&#x02212;1</sub> at the last moment.</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M2"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>f</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>&#x003C3;</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mrow><mml:mi>f</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>The input gate <italic>i</italic><sub><italic>t</italic></sub> determines the rates of new information which is allowed to be added to the current <italic>c</italic><sub><italic>t</italic></sub>. Two steps are required to achieve this. First, calculate the input gate <italic>i</italic><sub><italic>t</italic></sub> and cell gate <italic>g</italic><sub><italic>t</italic></sub> are calculated. Second, update memory cells <italic>C</italic><sub><italic>t</italic></sub> by combining forgetting gates <italic>f</italic><sub><italic>t</italic></sub> and input gates <italic>i</italic><sub><italic>t</italic></sub>.</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M3"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>&#x003C3;</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E5"><label>(4)</label><mml:math id="M5"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>g</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>n</mml:mi><mml:mi>h</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mrow><mml:mi>C</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E7"><label>(5)</label><mml:math id="M7"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>f</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>*</mml:mo><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>i</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>*</mml:mo><mml:msub><mml:mrow><mml:mi>g</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>Ultimately, we need to determine the output, which is based on the state of our memory cells <italic>C</italic><sub><italic>t</italic></sub>. First, a sigmoid layer is used to determine which parts of the memory cell state will be output. Second, the memory cell is processed through <italic>tanh</italic> and multiplied by <italic>o</italic><sub><italic>t</italic></sub>.</p>
<disp-formula id="E8"><label>(6)</label><mml:math id="M8"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>o</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>&#x003C3;</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>W</mml:mi></mml:mrow><mml:mrow><mml:mi>o</mml:mi></mml:mrow></mml:msub><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>]</mml:mo></mml:mrow><mml:mo>&#x0002B;</mml:mo><mml:msub><mml:mrow><mml:mi>b</mml:mi></mml:mrow><mml:mrow><mml:mi>o</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<disp-formula id="E10"><label>(7)</label><mml:math id="M10"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>h</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mrow><mml:mi>o</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:mo>*</mml:mo><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>n</mml:mi><mml:mi>h</mml:mi><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>C</mml:mi></mml:mrow><mml:mrow><mml:mi>t</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
</sec>
<sec>
<title>2.3. Fully Connected Layer</title>
<p>The fully connected layer serves as an &#x0201C;estimator&#x0201D; in the entire neural network structure. The procedures such as convolutional layers, pooling, LSTM, and activation function translate the original data to the hidden feature space. The fully connected layer transfers them to the sample labeling space. As Equation (8) shows, the fully connected layer multiplies the weight matrix with the input vector and then adds the bias.</p>
<disp-formula id="E11"><label>(8)</label><mml:math id="M11"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mi>y</mml:mi><mml:mo>=</mml:mo><mml:mi>x</mml:mi><mml:msup><mml:mrow><mml:mi>A</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi></mml:mrow></mml:msup><mml:mo>&#x0002B;</mml:mo><mml:mi>b</mml:mi></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>A</italic><sup><italic>T</italic></sup> is the learnable parameter and <italic>b</italic> is the bias. In addition, a softmax activation function may be used to calculate the likely distribution of the output classes. In the final FC layer, the softmax function is utilized, which is defined as follows:</p>
<disp-formula id="E12"><label>(9)</label><mml:math id="M12"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msup><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msup></mml:mrow><mml:mrow><mml:mstyle displaystyle="true"><mml:msubsup><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>k</mml:mi></mml:mrow></mml:msubsup></mml:mstyle><mml:msup><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>j</mml:mi></mml:mrow></mml:msup></mml:mrow></mml:mfrac><mml:mtext>&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;</mml:mtext><mml:mi>f</mml:mi><mml:mi>o</mml:mi><mml:mi>r</mml:mi><mml:mtext>&#x000A0;&#x000A0;&#x000A0;&#x000A0;&#x000A0;</mml:mtext><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mi>k</mml:mi><mml:mo>.</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>i</italic> is the input vector, the output <italic>S</italic><sub><italic>i</italic></sub> is between o to 1, and <inline-formula><mml:math id="M13"><mml:munder class="msub"><mml:mrow><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:munder><mml:msub><mml:mrow><mml:mi>S</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula></p>
</sec>
</sec>
<sec sec-type="methods" id="s3">
<title>3. Methods</title>
<p>The detailed procedure of this study can be summarized in several steps, which are described by the detailed flowchart shown in <xref ref-type="fig" rid="F1">Figure 1</xref>. In this study, first, we preprocessed the data from the STEW database (Lim et al., <xref ref-type="bibr" rid="B17">2018</xref>) by our designed automated methods. Then the processed EEG was directly imported to the proposed EEG-TNet to estimate the MWL. The comparison studies and ablation studies were performed to prove the effectiveness of the proposed end-to-end EEG-TNet model.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>The detailed flowchart of this study.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-869522-g0001.tif"/>
</fig>
<sec>
<title>3.1. MWL EEG Database</title>
<p>The database used in this study is STEW (Lim et al., <xref ref-type="bibr" rid="B17">2018</xref>), which contains EEG data of 48 subjects under different MWL levels. Specifically, the subjects performed the Simultaneous Capacity (SIMKAP) test to induce MWL. After the test, all the subjects were required to finish the subjective questionnaire to report their MWL, which is a 9-point rating scale. During the whole experiments, the EEG signals were recorded using an Emotiv EPOC EEG headset with 14 electrodes (AF3, F7, F3, FC5, T7, P7, O1, O2, P8, T8, FC6, F4, F8, AF4) and two reference channels (CMS, DRL). The sampling frequency was 128 Hz and the resolution was 16-bit A/D. In this study, the classifier was proposed to finish two tasks, the first one is classified &#x0201C;No Task&#x0201D; vs. &#x0201C;SIMKAP Task&#x0201D;, which was a binary classification. The second task was classifying Low vs. Moderate vs. High MWL, which was divided by a rating scale. A detailed definition of the label can be found in the article (Lim et al., <xref ref-type="bibr" rid="B17">2018</xref>).</p>
</sec>
<sec>
<title>3.2. Data Preprocessing</title>
<p>To meet the requirements of the automation process, we eliminated parts of the preprocessing process that require manual intervention, such as manual artifact removal and manual judgment of ICA components to remove artifacts, especially eye movement artifacts (Fan et al., <xref ref-type="bibr" rid="B10">2021</xref>; Peng et al., <xref ref-type="bibr" rid="B23">2021</xref>). This undoubtedly reduces the quality of the data and, therefore, the accuracy of the recognition, but it makes sense for real-world applications (Rosanne et al., <xref ref-type="bibr" rid="B26">2021</xref>). <xref ref-type="table" rid="T1">Table 1</xref> shows the comparison of traditional preprocessing steps and ours. The whole preprocessing steps are</p>
<list list-type="order">
<list-item><p>High-pass filter raw data at 1 Hz and low-pass filter raw data at 40 Hz.</p></list-item>
<list-item><p>Notch filter raw data at 50 Hz to avoid power line interference.</p></list-item>
<list-item><p>Perform Artifact Subspace Reconstruction (ASR) (Chang et al., <xref ref-type="bibr" rid="B3">2018</xref>).</p></list-item>
<list-item><p>Perform Independent Component Analysis (ICA).</p></list-item>
<list-item><p>ADJUST (Mognon et al., <xref ref-type="bibr" rid="B20">2011</xref>) was performed to automated inspect the artifact component from ICA.</p></list-item>
<list-item><p>Average re-reference the data channels.</p></list-item>
</list>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Comparison of traditional preprocessing steps and ours.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Traditional steps</bold></th>
<th valign="top" align="left"><bold>Human interference</bold></th>
<th valign="top" align="left"><bold>Ours</bold></th>
<th valign="top" align="left"><bold>Human interference</bold></th>
<th valign="top" align="left"><bold>Goal</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Filters</td>
<td valign="top" align="left">NO</td>
<td valign="top" align="left">Filters</td>
<td valign="top" align="left">NO</td>
<td valign="top" align="left">Remove low-frequency drifts and power line interference</td>
</tr>
<tr>
<td valign="top" align="left">Manual artifacts remove</td>
<td valign="top" align="left">YES</td>
<td valign="top" align="left">ASR</td>
<td valign="top" align="left">NO</td>
<td valign="top" align="left">Remove drifts like muscle activity, sensor motion</td>
</tr>
<tr>
<td valign="top" align="left">ICA with manual judgment</td>
<td valign="top" align="left">YES</td>
<td valign="top" align="left">ICA with ADJUST</td>
<td valign="top" align="left">NO</td>
<td valign="top" align="left">Remove artifacts especially eye movement artifact</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>3.3. EEG-TNet Architecture</title>
<p>The architecture of our EEG-TNet framework is inspired by the network architecture EEGNet of Lawhern et al. (<xref ref-type="bibr" rid="B16">2018</xref>), which is a widely used end-to-end EEG BCI framework. The detailed framework of our proposed EEG-TNet model can be summarized in three steps, which are shown in <xref ref-type="fig" rid="F2">Figure 2</xref>. Step 1 is to segment the raw EEG to the required size and expand a new dimension for the model need, which is used to keep the temporal dimension stable. Step 2 is to extract the temporal and spatial information from each EEG fragment without between-fragments temporal information loss according to the designed temporal fixed 3-D-CNN layers. Step 3 is to extract the temporal information between each EEG fragment by using the LSTM layer. The output of the last time step in the last layer is used to compute the final status according to the fully connected layer and the softmax function.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>The framework of the EEG-TNet model. The EEG-TNet model consists of Data segmentation, dimension expansion, time fixed 3-D-CNN layers, Bi-LSTM layer, fully connected layers, and softmax operation.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-869522-g0002.tif"/>
</fig>
<sec>
<title>3.3.1. Data Conversion</title>
<p>The original EEG signals are defined as <inline-formula><mml:math id="M14"><mml:mi>D</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:msub><mml:mrow><mml:mi>d</mml:mi></mml:mrow><mml:mrow><mml:mi>S</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>S</mml:mi></mml:mrow></mml:msup><mml:mo>&#x000D7;</mml:mo><mml:mi>C</mml:mi></mml:math></inline-formula>, where <italic>S</italic> is the time- series length of the original EEG, and <italic>C</italic> denotes the channel number. Similar to the previous BCI task, the original EEG was segmented and constructed by using the overlapping sliding window and non-overlapping sliding window. The input dataset <inline-formula><mml:math id="M15"><mml:mover accent="true"><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mtext>&#x000A0;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x000A0;</mml:mtext></mml:math></inline-formula> (<inline-formula><mml:math id="M16"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>M</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula>) &#x02208; &#x0211D;<sup><italic>M</italic> &#x000D7; <italic>T</italic> &#x000D7; <italic>C</italic> &#x000D7; <italic>L</italic></sup>, where <italic>M</italic> is the number of samples. The sample size of each sample <inline-formula><mml:math id="M17"><mml:msub><mml:mrow><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02208;</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi>X</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mo>&#x02208;</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mn>1</mml:mn><mml:mo>,</mml:mo><mml:mn>2</mml:mn><mml:mo>,</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:mi>M</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> was (<italic>T</italic> &#x000D7; <italic>C</italic> &#x000D7; <italic>L</italic>), the <italic>C</italic> &#x000D7; <italic>L</italic> is the size of per EEG fragments, which is set as 64 &#x000D7; 14 in this study, which means the 0.5s EEG signals of two forehead channels. In addition, the number of EEG fragments was <italic>T</italic>, the larger T represents the longer EEG data considered per sample. Most of the related studies analyze the input sample as a graph, where <italic>C</italic> &#x000D7; <italic>L</italic> is the height and width of the graph, and the dimension <italic>T</italic> is the channel size of EEG, such as the RGB. However, after 2-d convolutional layers, the temporal information between each fragment might be lost. In this step, we expand a new dimension whose size is 1 to meet the requirement of channel size. Furthermore, dimension <italic>T</italic> was considered the depth of the sample, which is stale during the whole convolutional process. Finally, the dataset <inline-formula><mml:math id="M18"><mml:mi>X</mml:mi><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mn>1</mml:mn></mml:mrow></mml:msub><mml:mo>.</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn></mml:mrow></mml:msub><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>.</mml:mo><mml:mo>,</mml:mo><mml:msub><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow><mml:mo>&#x02208;</mml:mo><mml:msup><mml:mrow><mml:mi>&#x0211D;</mml:mi></mml:mrow><mml:mrow><mml:mi>N</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>T</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x000D7;</mml:mo><mml:mi>C</mml:mi><mml:mo>&#x000D7;</mml:mo><mml:mi>L</mml:mi></mml:mrow></mml:msup></mml:math></inline-formula>.</p>
</sec>
<sec>
<title>3.3.2. Time-Fixed Convolutional Layer</title>
<p>Four convolutional layers were used in the EEG-TNet model. To ensure the temporal dimension is unchanged, the <italic>K</italic><sub><italic>D</italic></sub> of the kernel size (<italic>K</italic><sub><italic>D</italic></sub>, <italic>K</italic><sub><italic>H</italic></sub>, <italic>K</italic><sub><italic>D</italic></sub>) is set as 1 during the whole convolutional processing. First, eight 3-d normal convolutional filters of size (1,1,2/L) were used to extract frequency features from the EEG signal (Lawhern et al., <xref ref-type="bibr" rid="B16">2018</xref>). Then 16 convolutional filers of size (1,<italic>C</italic>,1) are fitted for the channel information aggregation. Subsequently, an average pooling operation (kernel size = 1 &#x000D7; 1 &#x000D7; 4) is performed to aggregate information and reduce the data dimension. Then, 16 depthwise separable convolutions are constructed, which consists of 16 depthwise convolutional filters (1 &#x000D7; 1 &#x000D7; 16) and pointwise convolutional filters (1 &#x000D7; 1 &#x000D7; 1).</p>
</sec>
<sec>
<title>3.3.3. Bi-LSTM Layer</title>
<p>As we introduced before, the traditional LSTM layer receives the inputs solely in the forward direction through hidden states, which only retains the past information. Bidirectional LSTM (BLSTM) has been proposed to solve the problem, which has two layers named forward layer and backward layer. The forward layer is computed forward from moment 1 to moment <italic>t</italic>, and the output of the forward hidden layer is obtained and saved at each moment. In the backward layer, the output of the backward implicit layer is obtained and stored at each moment by computing the backward layer from moment <italic>t</italic> to moment 1. The final output is obtained at each moment by combining the output of the forward and backward layers at the corresponding moment. In this study, as shown in <xref ref-type="fig" rid="F2">Figure 2</xref>, the output sample shape of the time-fixed convolutional layers is <italic>T</italic> &#x000D7; 1 &#x000D7; 1 &#x000D7; 8, so that both the forward layer and backward of Bi-LSTM have 8 cells to fill the data shape.</p>
</sec>
</sec>
</sec>
<sec sec-type="results" id="s4">
<title>4. Results</title>
<p>In this section, two types of experiments were conducted to evaluate the MWL estimation performance using the proposed EEG-TNet BCI framework. The first type of experiment is subject-dependent while the second one is subject-independent.</p>
<sec>
<title>4.1. Subject-Dependent Experiment</title>
<p>In the subject-dependent experiment, we adopt a similar experimental protocol as that of Chakladar et al. (<xref ref-type="bibr" rid="B2">2020</xref>); Kingphai and Moshfeghi (<xref ref-type="bibr" rid="B15">2021</xref>); Zhu et al. (<xref ref-type="bibr" rid="B35">2021</xref>). The five-fold cross-validation method was applied to evaluate the performance of the framework. As shown in <xref ref-type="fig" rid="F2">Figure 2</xref>, specifically, the first 80% of all samples were selected as the training set and the remaining 20% of the samples were kept aside as the test dataset. Then the other 20% of the samples were selected as the second test set, the last 80% of samples were the second training set. Divide all samples like this five times and the average accuracy of the five experiments was taken as the final result. In addition, five-fold cross-validation was also applied to find optimal hyperparameters.</p>
<p>The proposed EEG-TNet framework was compared with the other four baseline methods for MWL estimation on the STEW dataset under the subject-dependent experiments setting, as shown in <xref ref-type="table" rid="T2">Table 2</xref>. The results showed that the proposed EEG-TNet framework achieved higher estimation accuracies than the other four methods. Although most of the recent studies use many kinds of features like frequency features [PSD (Chakladar et al., <xref ref-type="bibr" rid="B2">2020</xref>; Kingphai and Moshfeghi, <xref ref-type="bibr" rid="B15">2021</xref>)], non-linear features [Approximate Entropy (ApEn) (Chakladar et al., <xref ref-type="bibr" rid="B2">2020</xref>; Kingphai and Moshfeghi, <xref ref-type="bibr" rid="B15">2021</xref>)], linear features [autoregressive coefficient (AR) (Chakladar et al., <xref ref-type="bibr" rid="B2">2020</xref>)], and the graph-based features (clustering coefficient, mean degree) (Zhu et al., <xref ref-type="bibr" rid="B35">2021</xref>). However, traditional machine learning models (SVM and random forest cannot learn the full EEG information. Moreover, some combined deep neural networks (CNN-LSTM, BLSTM-LSTM) show better performance than machine learning models, it is still well below our proposed model.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Comparisons of the estimation accuracy (%) of subject-dependent experiments among the various methods.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>References</bold></th>
<th valign="top" align="left"><bold>Method</bold></th>
<th valign="top" align="left"><bold>Features</bold></th>
<th valign="top" align="center"><bold>No task vs. task</bold></th>
<th valign="top" align="center"><bold>Lo vs. Mi vs. Hi</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Zhu et al. (<xref ref-type="bibr" rid="B35">2021</xref>)</td>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">Graph features</td>
<td valign="top" align="center">89.60</td>
<td valign="top" align="center">79.50</td>
</tr>
<tr>
<td valign="top" align="left">Chakladar et al. (<xref ref-type="bibr" rid="B2">2020</xref>)</td>
<td valign="top" align="left">Random Forest</td>
<td valign="top" align="left">Frequency features Linear features Non-Linear features</td>
<td valign="top" align="center">83.00</td>
<td valign="top" align="center">78.46</td>
</tr>
<tr>
<td valign="top" align="left">Chakladar et al. (<xref ref-type="bibr" rid="B2">2020</xref>)</td>
<td valign="top" align="left">CNN-LSTM</td>
<td valign="top" align="left">Frequency features Linear features Non-Linear features</td>
<td valign="top" align="center">85.21</td>
<td valign="top" align="center">76.76</td>
</tr>
<tr>
<td valign="top" align="left">Kingphai and Moshfeghi (<xref ref-type="bibr" rid="B15">2021</xref>)</td>
<td valign="top" align="left">BLSTM-LSTM</td>
<td valign="top" align="left">Frequency features Non-Linear features</td>
<td valign="top" align="center">91.15</td>
<td valign="top" align="center">89.44</td>
</tr>
<tr>
<td/>
<td valign="top" align="left"><bold>EEG-TNet</bold></td>
<td valign="top" align="left">None</td>
<td valign="top" align="center"><bold>99.82</bold></td>
<td valign="top" align="center"><bold>99.21</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The bold values indicate the best performance accuracies</italic>.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>4.2. Subject-Independent Experiment</title>
<p>The leave-subject-out (LSO) cross-validation method was used to evaluate the performance of the proposed framework in the subject-independent experiments. As shown in <xref ref-type="fig" rid="F3">Figure 3</xref>, in the LSO cross-validation experimental protocol, the EEG samples of 36 subjects (80% of a total of 45 subjects) were selected for training the model and the last EEG samples of 8 subjects (20% of a total of 45 subjects) were used to test the model performance. The whole process was repeated five times so that all the subjects&#x00027; samples were taken as the test set. The average accuracy of the five experiments was taken as the final result. Similarly, the hyperparameters were found according to the five-fold cross-validation in the training set.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>The divided methods of different experiments.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-869522-g0003.tif"/>
</fig>
<p>Compared with the subject-dependent experiments, there are fewer studies that perform the subject-independent experiments because of their difficulty. The proposed EEG-TNet framework was compared with the other three baseline methods under the subject-independent experiments setting, as shown in <xref ref-type="table" rid="T3">Table 3</xref>. It is worth noting that no studies were conducted with either dual-task estimation (No Task vs. Task) or triple task estimation (Lo vs. Mi vs. Hi) subject-independent experiments simultaneously until now (Lim et al., <xref ref-type="bibr" rid="B17">2018</xref>; Pandey et al., <xref ref-type="bibr" rid="B22">2020</xref>).</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Comparisons of the estimation accuracy (%) of subject-independent experiments among the various methods.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>References</bold></th>
<th valign="top" align="left"><bold>Method</bold></th>
<th valign="top" align="left"><bold>Features</bold></th>
<th valign="top" align="center"><bold>No task vs. task</bold></th>
<th valign="top" align="center"><bold>Lo vs. Mi vs. Hi</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Pandey et al. (<xref ref-type="bibr" rid="B22">2020</xref>)</td>
<td valign="top" align="left">KNN</td>
<td valign="top" align="left">None</td>
<td valign="top" align="center">61.08</td>
<td valign="top" align="center">\</td>
</tr>
<tr>
<td valign="top" align="left">Pandey et al. (<xref ref-type="bibr" rid="B22">2020</xref>)</td>
<td valign="top" align="left">MLP</td>
<td valign="top" align="left">None</td>
<td valign="top" align="center">58.68</td>
<td valign="top" align="center">\</td>
</tr>
<tr>
<td valign="top" align="left">Pandey et al. (<xref ref-type="bibr" rid="B22">2020</xref>)</td>
<td valign="top" align="left">LSTM</td>
<td valign="top" align="left">None</td>
<td valign="top" align="center">57.30</td>
<td valign="top" align="center">\</td>
</tr>
<tr>
<td valign="top" align="left">Lim et al. (<xref ref-type="bibr" rid="B17">2018</xref>)</td>
<td valign="top" align="left">SVM</td>
<td valign="top" align="left">Frequency features</td>
<td valign="top" align="center">\</td>
<td valign="top" align="center"><bold>69.00</bold></td>
</tr>
<tr>
<td/>
<td valign="top" align="left"><bold>EEG-TNet</bold></td>
<td valign="top" align="left">None</td>
<td valign="top" align="center"><bold>82.78</bold></td>
<td valign="top" align="center">66.83</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>The bold values indicate the best performance accuracies</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="table" rid="T3">Table 3</xref> summarized the comparative results regarding the average estimation accuracy under the subject-independent experiments setting. Pandey et al. (<xref ref-type="bibr" rid="B22">2020</xref>) realized that handcrafted features would slow down the speed of computing and significantly increased the evaluation time, making it challenging to apply them in practical scenarios. Therefore, he used an end-to-end structure similar to ours to finish the dual-task estimation(No Task vs. Task). However, the results were unsatisfactory, with his best assessment only reaching 61.08%, which is only a tiny improvement over the random classification (50%).</p>
<p>For practical reasons, we need to focus more on triple-task estimation (Lo vs. Mi vs. Hi) than on the dual-task estimation (No Task vs. Task). Lim et al. (<xref ref-type="bibr" rid="B17">2018</xref>) contributed the STEW dataset, where he extracted the PSD of different bans as features and used SVM as a classifier. Although their recognition results were slightly higher than our results, comparing the confusion matrices shows that our estimation results are more balanced and valid. As <xref ref-type="fig" rid="F4">Figure 4</xref> shows, the accuracy of their results was 99.54% for low MWL, down to 46.15% for medium workloads and only 31.07% for high MWL, which was even lower than the random results (33.33%). In contrast, our estimation accuracies range from 52.00 to 74.72%. On the most difficult estimation task with high MWL, our results were nearly twice as good as theirs.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Confusion matrix of EEG-TNet and Lim&#x00027;s work.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-869522-g0004.tif"/>
</fig>
</sec>
</sec>
<sec sec-type="discussion" id="s5">
<title>5. Discussion</title>
<sec>
<title>5.1. Practicability</title>
<p>The main objective of this study is to propose a practical and effective MWL estimation method for workers of special occupational groups, which can be used to ensure safety during the course of their work. EEG signals are regarded as the gold standard and BCI systems based on EEG signals have natural advantages. However, most BCI systems cannot meet the requirements of online evaluation due to the high manual involvement in signal noise reduction, complex and time-consuming feature extraction, and other disadvantages. In order to solve the disadvantage of manual involvement in signal noise reduction, this system uses a combination of filters, ASR, and ICA with ADJUST to obtain relatively pure EEG signals without manual involvement.</p>
<p>The classification model is at the heart of the BCI system, because of the system&#x00027;s computing time, deployment method, and evaluation accuracy depends on it. In most related BCI systems, the handcrafted features were used to estimate the MWL, which may be computationally demanding and not suitable for a real-time system. This study exploited deep neural networks&#x00027; powerful feature extraction and classification capabilities to design the EEG-TNet network as the computational core of an end-to-end BCI framework. This study improves the traditional neural network model named EEG-Net and extracts features through the processes of data segmentation, dimension expansion, time fixed 3-D-CNN layers, Bi-LSTM layer, fully connected layer, and softmax operation. The time-fixed method was designed to ensure the temporal segment order, and a Bi-LSTM layer was added at the end for temporal information analysis. Moreover, the total time cost of this model is only 386.74 ms in our machine [System: Ubuntu 20.04, CPU: Intel(R) Core(TM) i7-8700K CPU &#x00040; 3.70GHz, Memory: 32 GB, GPU: GeForce RTX 2080Ti]. The low time cost proves that the proposed EEG-TNet can meet the requirements of real-time application.</p>
</sec>
<sec>
<title>5.2. Estimation Performance</title>
<p>The most important metric for evaluating a model is estimation accuracy. Unlike other research areas, both subject-dependent experiments and subject-independent experiments need to be considered in the field of human factors engineering. In most cases, subject-dependent experiments are more accurate because such experimental methods allow the model to obtain EEG data for each individual during the training phase. Information on individual differences can be extracted. As shown in <xref ref-type="table" rid="T2">Table 2</xref>, most studies have achieved more than 80% or even more than 90% recognition accuracy, while our recognition accuracy is close to 100%.</p>
<p>However, the subject-dependent experimental approach is often not applicable to practical scenarios. Training a unique classification model for each worker would be time-consuming and costly, so subject-independent experiments use unseen subjects&#x00027; data as the test set, satisfying the need for &#x0201C;Plug-and-Play&#x0201D;. However, head shape, scalp impedance, and psychological state can all affect the EEG data, resulting in large variations in EEG data among subjects. The accuracy of the method is poor. Therefore, as shown in <xref ref-type="table" rid="T3">Table 3</xref>, almost all of the previous methods do not apply to estimating the MWL of workers. In the dual-task estimation, the method proposed by Pandey et al. (<xref ref-type="bibr" rid="B22">2020</xref>) was only marginally more accurate than random. In contrast, our method was able to achieve 82.78%, which is sufficient for use in realistic scenarios.</p>
<p>The triple task estimation(Lo vs. Mi vs. Hi) is the most urgent from the point of view of ensuring safety in different fields like transportation and construction. Accurate assessment of the high or moderate MWL of workers helps managers to allocate tasks rationally and to avoid overloading workers with work that could lead to human factor accidents. Although the estimation accuracy of the method proposed by Lim et al. (<xref ref-type="bibr" rid="B17">2018</xref>) appears to be slightly higher than our proposed EEG-TNet method. By analyzing and comparing the confusion matrices of the two methods in <xref ref-type="fig" rid="F4">Figure 4</xref>, the method of Lim et al. (<xref ref-type="bibr" rid="B17">2018</xref>) may not apply to worker workload estimation. According to their confusion matrix, we can find that the low MWL was assessed at 99.54%, which means that almost all low load situations were successfully identified. However, the estimation accuracy for moderate MWL dropped to 46.15%, and only 31.07% of the samples with high MWL were correctly estimated, with an accuracy rate even lower than the random results (33.33%). More notably, 37.57% of the medium workload samples and 38.40% of the high MWL samples were misclassified as low MWL. In many occupational fields, underestimating workers&#x00027; workload by managers can lead to their scheduling of excessive workloads, leading to workers being overloaded, making the chances of unsafe behavior much higher. In our assessment results, just 5.25% of the moderately loaded and 19.78% of the high loaded sample were incorrectly underestimated as low workloads. Compared to Lim et al. (<xref ref-type="bibr" rid="B17">2018</xref>), the likelihood of underestimation was 7 times and 2 times lower, respectively.</p>
<p>To verify the effectiveness of our model, ablation experiments are performed on the STEW database. There are two kinds of ablation experiments: (1) Ablation experiments on the effectiveness of designed automated preprocessing method (2) Ablation experiments on the effectiveness of Bi-LSTM. As <xref ref-type="fig" rid="F5">Figure 5</xref> shows, estimation accuracy significantly decreased when the data pre-processing process or LSTM layer was removed, not only in subject-dependent experiments but also in subject-independent experiments.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Ablation studies.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fnins-16-869522-g0005.tif"/>
</fig>
<p>However, there were some limitations to this study. First, this study used the multi-channel EEG to build the EEG-TNet model. However, collecting multi-channel EEG needs gel-based EEG caps or clumsy dry electrode EEG caps, which is too troublesome to fill the practical usage. Additionally, the increase in the number of channels also brings a significant increase in computational complexity. Furthermore, although the estimation accuracy for high MWL is much larger than the previous study, it is still not enough for application scenarios. Finally, the STEW database only contains 48 students&#x00027; EEG data under different levels of MWL, which were not selected to be representative. It is important to analyze the EEG signals of different occupational groups, ages, and work experiences.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s6">
<title>6. Conclusion</title>
<p>This study proposed an end-to-end BCI framework named EEG-TNet for the estimation of worker MWL using EEG signals and conducted different types of experiments to assess the effectiveness of the EEG-TNet framework. In the subject-dependent experiments, the estimation accuracy of dual-task estimation (No task vs. TASK) and that of triple-task estimation (Lo vs. Mi vs. Hi) reach 99.82 and 99.21% respectively. Compared with the state-of-the-art methods proposed in previous studies, the accuracy is improved by 8.67 and 9.77%, respectively. Although there is a substantial decrease in estimation accuracy in subject-independent experiments, the accuracy of different tasks still reaches 82.78 and 66.83% respectively. Especially, in the subject-independent experiments, compared to previous study, the likelihood of underestimation was 7 times and 2 times lower respectively, which means that our proposed EEG-TNet model can fill the requirement of real-time application. In the future, we will extend the research by designing new network structures such as graph neural networks (GNN) to improve the estimation accuracy of high MWL and designing a closed-loop system that includes real-time estimation and feedback systems. Additionally, building a new database that includes more occupational groups will also be our future direction.</p>
</sec>
<sec sec-type="data-availability" id="s7">
<title>Data Availability Statement</title>
<p>The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: <ext-link ext-link-type="uri" xlink:href="https://ieee-dataport.org/open-access/stew-simultaneous-task-eeg-workload-dataset">https://ieee-dataport.org/open-access/stew-simultaneous-task-eeg-workload-dataset</ext-link>.</p>
</sec>
<sec id="s8">
<title>Author Contributions</title>
<p>CF responsible for the conceptualization, performed the majority of the experiments and analyzes, made the figures, and wrote the first draft of the manuscript. JH and SH performed some experiments, updated the figures, performed the statistics, and edited the manuscript. YP responsible for the methodology, project administration, resources, funding acquisition, and validation. SK responsible for the conceptualization and supervision. All authors contributed to the article and approved the submitted version.</p>
</sec>
<sec sec-type="funding-information" id="s9">
<title>Funding</title>
<p>This study was supported by the National Natural Science Foundation of China [Grant Numbers: 52075553]; the Natural Science Foundation of Hunan [Grant Numbers: 2020JJ7030]; the Hunan Science Foundation for Distinguished Young Scholars of China [Grant Numbers: 2021JJ10059]; the Postgraduate Scientific Research Innovation Project of Hunan Province [Grant Numbers: CX20210099].</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>JH was employed by the Hunan Communications Research Institute Co., Ltd., Hunan Communication &#x00026; Water Conservancy Group Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x00027;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack><p>Our sincere thanks to W. L. Lim, O. Sourina, and L. P. Wang at Nanyang Technological University for providing the STEW dataset.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abdelhamid</surname> <given-names>T. S.</given-names></name> <name><surname>Everett</surname> <given-names>J. G.</given-names></name></person-group> (<year>2002</year>). <article-title>Physiological demands during construction work</article-title>. <source>J. Construct. Eng. Manage</source>. <volume>128</volume>, <fpage>427</fpage>&#x02013;<lpage>437</lpage>. <pub-id pub-id-type="doi">10.1061/(ASCE)0733-9364(2002)128:5(427)</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chakladar</surname> <given-names>D. D.</given-names></name> <name><surname>Dey</surname> <given-names>S.</given-names></name> <name><surname>Roy</surname> <given-names>P. P.</given-names></name> <name><surname>Dogra</surname> <given-names>D. P.</given-names></name></person-group> (<year>2020</year>). <article-title>EEG-based mental workload estimation using deep BLSTM-LSTM network and evolutionary algorithm</article-title>. <source>Biomed. Signal Process. Control</source> <volume>60</volume>, <fpage>101989</fpage>. <pub-id pub-id-type="doi">10.1016/j.bspc.2020.101989</pub-id></citation>
</ref>
<ref id="B3">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chang</surname> <given-names>C.-Y.</given-names></name> <name><surname>Hsu</surname> <given-names>S.-H.</given-names></name> <name><surname>Pion-Tonachini</surname> <given-names>L.</given-names></name> <name><surname>Jung</surname> <given-names>T.-P.</given-names></name></person-group> (<year>2018</year>). <article-title>Evaluation of artifact subspace reconstruction for automatic EEG artifact removal</article-title>, in <source>2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)</source> (<publisher-loc>Honolulu, HI</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1242</fpage>&#x02013;<lpage>1245</lpage>. <pub-id pub-id-type="doi">10.1109/EMBC.2018.8512547</pub-id><pub-id pub-id-type="pmid">31329105</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Charles</surname> <given-names>R. L.</given-names></name> <name><surname>Nixon</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Measuring mental workload using physiological measures: a systematic review</article-title>. <source>Appl. Ergon</source>. <volume>74</volume>, <fpage>221</fpage>&#x02013;<lpage>232</lpage>. <pub-id pub-id-type="doi">10.1016/j.apergo.2018.08.028</pub-id><pub-id pub-id-type="pmid">30487103</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cheng</surname> <given-names>B.</given-names></name> <name><surname>Fan</surname> <given-names>C.</given-names></name> <name><surname>Fu</surname> <given-names>H.</given-names></name> <name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Chen</surname> <given-names>H.</given-names></name> <name><surname>Luo</surname> <given-names>X.</given-names></name></person-group> (<year>2022</year>). <article-title>Measuring and computing cognitive statuses of construction workers based on electroencephalogram: a critical review</article-title>. <source>IEEE Trans. Comput. Soc. Syst</source>. <fpage>1</fpage>&#x02013;<lpage>16</lpage>. <pub-id pub-id-type="doi">10.1109/TCSS.2022.3158585</pub-id></citation>
</ref>
<ref id="B6">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Chollet</surname> <given-names>F.</given-names></name></person-group> (<year>2017</year>). <article-title>Xception: deep learning with depthwise separable convolutions</article-title>, in <source>Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition</source>, (<publisher-loc>Honolulu, HI</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>1251</fpage>&#x02013;<lpage>1258</lpage>. <pub-id pub-id-type="doi">10.1109/CVPR.2017.195</pub-id></citation>
</ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Datta</surname> <given-names>S.</given-names></name> <name><surname>Boulgouris</surname> <given-names>N. V.</given-names></name></person-group> (<year>2021</year>). <article-title>Recognition of grammatical class of imagined words from EEG signals using convolutional neural network</article-title>. <source>Neurocomputing</source> <volume>465</volume>, <fpage>301</fpage>&#x02013;<lpage>309</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2021.08.035</pub-id></citation>
</ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dehais</surname> <given-names>F.</given-names></name> <name><surname>Lafont</surname> <given-names>A.</given-names></name> <name><surname>Roy</surname> <given-names>R.</given-names></name> <name><surname>Fairclough</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>A neuroergonomics approach to mental workload, engagement and human performance</article-title>. <source>Front. Neurosci</source>. 14, 268. <pub-id pub-id-type="doi">10.3389/fnins.2020.00268</pub-id><pub-id pub-id-type="pmid">32317914</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fairclough</surname> <given-names>S. H.</given-names></name> <name><surname>Houston</surname> <given-names>K.</given-names></name></person-group> (<year>2004</year>). <article-title>A metabolic measure of mental effort</article-title>. <source>Biol. Psychol</source>. <volume>66</volume>, <fpage>177</fpage>&#x02013;<lpage>190</lpage>. <pub-id pub-id-type="doi">10.1016/j.biopsycho.2003.10.001</pub-id><pub-id pub-id-type="pmid">15041139</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fan</surname> <given-names>C.</given-names></name> <name><surname>Peng</surname> <given-names>Y.</given-names></name> <name><surname>Peng</surname> <given-names>S.</given-names></name> <name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Wu</surname> <given-names>Y.</given-names></name> <name><surname>Sam</surname> <given-names>K.</given-names></name></person-group> (<year>2021</year>). <article-title>Detection of train driver fatigue and distraction based on forehead EEG: A time-series ensemble learning method</article-title>. <source>IEEE Trans. Intell. Transport. Syst</source>. <fpage>1</fpage>&#x02013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1109/TITS.2021.3125737</pub-id></citation>
</ref>
<ref id="B11">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Hart</surname> <given-names>S. G.</given-names></name></person-group> (<year>2006</year>). <article-title>Nasa-task load index (NASA-TLX); 20 years later</article-title>, in <source>Proceedings of the Human Factors and Ergonomics Society Annual Meeting</source> (<publisher-loc>Sage, CA; Los Angeles, CA</publisher-loc>: <publisher-name>Sage Publications</publisher-name>), <fpage>904</fpage>&#x02013;<lpage>908</lpage>. <pub-id pub-id-type="doi">10.1177/154193120605000909</pub-id></citation>
</ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hochreiter</surname> <given-names>S.</given-names></name> <name><surname>Schmidhuber</surname> <given-names>J.</given-names></name></person-group> (<year>1997</year>). <article-title>Long short-term memory</article-title>. <source>Neural Comput</source>. <volume>9</volume>, <fpage>1735</fpage>&#x02013;<lpage>1780</lpage>. <pub-id pub-id-type="doi">10.1162/neco.1997.9.8.1735</pub-id><pub-id pub-id-type="pmid">9377276</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hogervorst</surname> <given-names>M. A.</given-names></name> <name><surname>Brouwer</surname> <given-names>A.-M.</given-names></name> <name><surname>Van Erp</surname> <given-names>J. B.</given-names></name></person-group> (<year>2014</year>). <article-title>Combining and comparing EEG, peripheral physiology and eye-related measures for the assessment of mental workload</article-title>. <source>Front. Neurosci</source>. 8, 322. <pub-id pub-id-type="doi">10.3389/fnins.2014.00322</pub-id><pub-id pub-id-type="pmid">25352774</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hu</surname> <given-names>J.</given-names></name> <name><surname>Wang</surname> <given-names>C.</given-names></name> <name><surname>Jia</surname> <given-names>Q.</given-names></name> <name><surname>Bu</surname> <given-names>Q.</given-names></name> <name><surname>Sutcliffe</surname> <given-names>R.</given-names></name> <name><surname>Feng</surname> <given-names>J.</given-names></name></person-group> (<year>2021</year>). <article-title>ScalingNet: extracting features from raw EEG data for emotion recognition</article-title>. <source>Neurocomputing</source> <volume>463</volume>, <fpage>177</fpage>&#x02013;<lpage>184</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2021.08.018</pub-id></citation>
</ref>
<ref id="B15">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kingphai</surname> <given-names>K.</given-names></name> <name><surname>Moshfeghi</surname> <given-names>Y.</given-names></name></person-group> (<year>2021</year>). <article-title>On EEG preprocessing role in deep learning effectiveness for mental workload classification</article-title>, in <source>International Symposium on Human Mental Workload: Models and Applications</source> (<publisher-loc>Springer</publisher-loc>), <fpage>81</fpage>&#x02013;<lpage>98</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-030-91408-0_6</pub-id></citation>
</ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lawhern</surname> <given-names>V. J.</given-names></name> <name><surname>Solon</surname> <given-names>A. J.</given-names></name> <name><surname>Waytowich</surname> <given-names>N. R.</given-names></name> <name><surname>Gordon</surname> <given-names>S. M.</given-names></name> <name><surname>Hung</surname> <given-names>C. P.</given-names></name> <name><surname>Lance</surname> <given-names>B. J.</given-names></name></person-group> (<year>2018</year>). <article-title>EEGNet: a compact convolutional neural network for EEG-based brain-computer interfaces</article-title>. <source>J. Neural Eng</source>. 15, 056013. <pub-id pub-id-type="doi">10.1088/1741-2552/aace8c</pub-id><pub-id pub-id-type="pmid">29932424</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lim</surname> <given-names>W.</given-names></name> <name><surname>Sourina</surname> <given-names>O.</given-names></name> <name><surname>Wang</surname> <given-names>L.</given-names></name></person-group> (<year>2018</year>). <article-title>Stew: simultaneous task EEG workload data set</article-title>. <source>IEEE Trans. Neural Syst. Rehabil. Eng</source>. <volume>26</volume>, <fpage>2106</fpage>&#x02013;<lpage>2114</lpage>. <pub-id pub-id-type="doi">10.1109/TNSRE.2018.2872924</pub-id><pub-id pub-id-type="pmid">30281467</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Liu</surname> <given-names>X.</given-names></name> <name><surname>Chen</surname> <given-names>S.</given-names></name> <name><surname>Guo</surname> <given-names>X.</given-names></name> <name><surname>Fu</surname> <given-names>H.</given-names></name></person-group> (<year>2022</year>). <article-title>Can social norms promote recycled water use on campus? The evidence from event-related potentials</article-title>. <source>Front. Psychol</source>. 13, 818292. <pub-id pub-id-type="doi">10.3389/fpsyg.2022.818292</pub-id><pub-id pub-id-type="pmid">35185735</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Mikolov</surname> <given-names>T.</given-names></name> <name><surname>Karafi&#x000E1;t</surname> <given-names>M.</given-names></name> <name><surname>Burget</surname> <given-names>L.</given-names></name> <name><surname>Cernock&#x000FD;</surname> <given-names>J.</given-names></name> <name><surname>Khudanpur</surname> <given-names>S.</given-names></name></person-group> (<year>2010</year>). <article-title>Recurrent neural network based language model</article-title>, in <source>Interspeech</source> (<publisher-loc>Makuhari</publisher-loc>), <fpage>1045</fpage>&#x02013;<lpage>1048</lpage>. <pub-id pub-id-type="doi">10.21437/Interspeech.2010-343</pub-id></citation>
</ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mognon</surname> <given-names>A.</given-names></name> <name><surname>Jovicich</surname> <given-names>J.</given-names></name> <name><surname>Bruzzone</surname> <given-names>L.</given-names></name> <name><surname>Buiatti</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Adjust: An automatic EEG artifact detector based on the joint use of spatial and temporal features</article-title>. <source>Psychophysiology</source> <volume>48</volume>, <fpage>229</fpage>&#x02013;<lpage>240</lpage>. <pub-id pub-id-type="doi">10.1111/j.1469-8986.2010.01061.x</pub-id><pub-id pub-id-type="pmid">20636297</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>M&#x000FC;hl</surname> <given-names>C.</given-names></name> <name><surname>Jeunet</surname> <given-names>C.</given-names></name> <name><surname>Lotte</surname> <given-names>F.</given-names></name></person-group> (<year>2014</year>). <article-title>EEG-based workload estimation across affective contexts</article-title>. <source>Front. Neurosci</source>. 8, 114. <pub-id pub-id-type="doi">10.3389/fnins.2014.00114</pub-id><pub-id pub-id-type="pmid">24971046</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Pandey</surname> <given-names>V.</given-names></name> <name><surname>Choudhary</surname> <given-names>D. K.</given-names></name> <name><surname>Verma</surname> <given-names>V.</given-names></name> <name><surname>Sharma</surname> <given-names>G.</given-names></name> <name><surname>Singh</surname> <given-names>R.</given-names></name> <name><surname>Chandra</surname> <given-names>S.</given-names></name></person-group> (<year>2020</year>). <article-title>Mental workload estimation using EEG</article-title>, in <source>2020 Fifth International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN)</source> (<publisher-loc>Bangalore</publisher-loc>: <publisher-name>IEEE</publisher-name>), <fpage>83</fpage>&#x02013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1109/ICRCICN50933.2020.9296150</pub-id><pub-id pub-id-type="pmid">31947126</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname> <given-names>Y.</given-names></name> <name><surname>Lin</surname> <given-names>Y.</given-names></name> <name><surname>Fan</surname> <given-names>C.</given-names></name> <name><surname>Xu</surname> <given-names>Q.</given-names></name> <name><surname>Xu</surname> <given-names>D.</given-names></name> <name><surname>Yi</surname> <given-names>S.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Passenger overallcomfort in high-speed railway environments based on EEG: assessment and degradation mechanism</article-title>. <source>Build. Environ</source>. 2021, 108711. <pub-id pub-id-type="doi">10.1016/j.buildenv.2021.108711</pub-id></citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pieper</surname> <given-names>K.</given-names></name> <name><surname>Spang</surname> <given-names>R. P.</given-names></name> <name><surname>Prietz</surname> <given-names>P.</given-names></name> <name><surname>M&#x000F6;ller</surname> <given-names>S.</given-names></name> <name><surname>Paajanen</surname> <given-names>E.</given-names></name> <name><surname>Vaalgamaa</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2021</year>). <article-title>Working with environmental noise and noise-cancelation: a workload assessment with EEG and subjective measures</article-title>. <source>Front. Neurosci</source>. 15, 771533. <pub-id pub-id-type="doi">10.3389/fnins.2021.771533</pub-id><pub-id pub-id-type="pmid">34790093</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Reid</surname> <given-names>G. B.</given-names></name> <name><surname>Nygren</surname> <given-names>T. E.</given-names></name></person-group> (<year>1988</year>). <article-title>The subjective workload assessment technique: a scaling procedure for measuring mental workload</article-title>, in <source>Advances in Psychology</source>, eds <person-group person-group-type="editor"><name><surname>Hancock</surname> <given-names>P. A.</given-names></name> <name><surname>Meshkati</surname> <given-names>N.</given-names></name></person-group> (<publisher-name>Elsevier</publisher-name>), <fpage>185</fpage>&#x02013;<lpage>218</lpage>. <pub-id pub-id-type="doi">10.1016/S0166-4115(08)62387-0</pub-id></citation>
</ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rosanne</surname> <given-names>O.</given-names></name> <name><surname>Albuquerque</surname> <given-names>I.</given-names></name> <name><surname>Cassani</surname> <given-names>R.</given-names></name> <name><surname>Gagnon</surname> <given-names>J.-F.</given-names></name> <name><surname>Tremblay</surname> <given-names>S.</given-names></name> <name><surname>Falk</surname> <given-names>T. H.</given-names></name></person-group> (<year>2021</year>). <article-title>Adaptive filtering for improved EEG-based mental workload assessment of ambulant users</article-title>. <source>Front. Neurosci</source>. <volume>15</volume>:<fpage>341</fpage>. <pub-id pub-id-type="doi">10.3389/fnins.2021.611962</pub-id><pub-id pub-id-type="pmid">33897342</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roscoe</surname> <given-names>A. H.</given-names></name></person-group> (<year>1992</year>). <article-title>Assessing pilot workload. Why measure heart rate, hrv and respiration?</article-title> <source>Biol. Psychol</source>. <volume>34</volume>, <fpage>259</fpage>&#x02013;<lpage>287</lpage>. <pub-id pub-id-type="doi">10.1016/0301-0511(92)90018-P</pub-id><pub-id pub-id-type="pmid">1467396</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roy</surname> <given-names>R. N.</given-names></name> <name><surname>Charbonnier</surname> <given-names>S.</given-names></name> <name><surname>Campagne</surname> <given-names>A.</given-names></name> <name><surname>Bonnet</surname> <given-names>S.</given-names></name></person-group> (<year>2016</year>). <article-title>Efficient mental workload estimation using task-independent EEG features</article-title>. <source>J. Neural Eng</source>. 13, 026019. <pub-id pub-id-type="doi">10.1088/1741-2560/13/2/026019</pub-id><pub-id pub-id-type="pmid">26877162</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>van Erp</surname> <given-names>J. B.</given-names></name> <name><surname>Veltman</surname> <given-names>H. J.</given-names></name> <name><surname>Grootjen</surname> <given-names>M.</given-names></name></person-group> (<year>2010</year>). <article-title>Brain-based indices for user system symbiosis</article-title>, in <source>Brain-Computer Interfaces</source>, eds <person-group person-group-type="editor"><name><surname>Vanderdonckt</surname> <given-names>J.</given-names></name> <name><surname>Liao</surname> <given-names>Q. V.</given-names></name></person-group> (<publisher-name>Springer</publisher-name>), <fpage>201</fpage>&#x02013;<lpage>219</lpage>. <pub-id pub-id-type="doi">10.1007/978-1-84996-272-8_12</pub-id></citation>
</ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>D.</given-names></name> <name><surname>Chen</surname> <given-names>J.</given-names></name> <name><surname>Zhao</surname> <given-names>D.</given-names></name> <name><surname>Dai</surname> <given-names>F.</given-names></name> <name><surname>Zheng</surname> <given-names>C.</given-names></name> <name><surname>Wu</surname> <given-names>X.</given-names></name></person-group> (<year>2017</year>). <article-title>Monitoring workers&#x00027; attention and vigilance in construction activities through a wireless and wearable electroencephalography system</article-title>. <source>Automat. Construct</source>. <volume>82</volume>, <fpage>122</fpage>&#x02013;<lpage>137</lpage>. <pub-id pub-id-type="doi">10.1016/j.autcon.2017.02.001</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wilson</surname> <given-names>G. F.</given-names></name> <name><surname>Fullenkamp</surname> <given-names>P.</given-names></name> <name><surname>Davis</surname> <given-names>I.</given-names></name></person-group> (<year>1994</year>). <article-title>Evoked potential, cardiac, blink, and respiration measures of pilot workload in air-to-ground missions</article-title>. <source>Aviat. Space Environ. Med</source>. <volume>65</volume>, <fpage>100</fpage>&#x02013;<lpage>105</lpage>. <pub-id pub-id-type="pmid">8161318</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xu</surname> <given-names>G.</given-names></name> <name><surname>Ren</surname> <given-names>T.</given-names></name> <name><surname>Chen</surname> <given-names>Y.</given-names></name> <name><surname>Che</surname> <given-names>W.</given-names></name></person-group> (<year>2020</year>). <article-title>A one-dimensional cnn-lstm model for epileptic seizure recognition using EEG signal analysis</article-title>. <source>Front. Neurosci</source>. 14, 1253. <pub-id pub-id-type="doi">10.3389/fnins.2020.578126</pub-id><pub-id pub-id-type="pmid">33390878</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yin</surname> <given-names>Z.</given-names></name> <name><surname>Zhao</surname> <given-names>M.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Zhang</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Physiological-signal-based mental workload estimation via transfer dynamical autoencoders in a deep learning framework</article-title>. <source>Neurocomputing</source> <volume>347</volume>, <fpage>212</fpage>&#x02013;<lpage>229</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2019.02.061</pub-id></citation>
</ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>T.</given-names></name> <name><surname>He</surname> <given-names>L.</given-names></name> <name><surname>Yajie</surname> <given-names>L.</given-names></name></person-group> (<year>2022</year>). <article-title>Micro-foundations of strategic decision-making in family business organisations: a cognitive neuroscience perspective</article-title>. <source>Long Range Plann</source>. 2022, 102198. <pub-id pub-id-type="doi">10.1016/j.lrp.2022.102198</pub-id></citation>
</ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhu</surname> <given-names>G.</given-names></name> <name><surname>Zong</surname> <given-names>F.</given-names></name> <name><surname>Zhang</surname> <given-names>H.</given-names></name> <name><surname>Wei</surname> <given-names>B.</given-names></name> <name><surname>Liu</surname> <given-names>F.</given-names></name></person-group> (<year>2021</year>). <article-title>Cognitive load during multitasking can be accurately assessed based on single channel electroencephalography using graph methods</article-title>. <source>IEEE Access</source> <volume>9</volume>, <fpage>33102</fpage>&#x02013;<lpage>33109</lpage>. <pub-id pub-id-type="doi">10.1109/ACCESS.2021.3058271</pub-id></citation>
</ref>
</ref-list> 
</back>
</article>