<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Neurosci.</journal-id>
<journal-title>Frontiers in Neuroscience</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Neurosci.</abbrev-journal-title>
<issn pub-type="epub">1662-453X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fnins.2017.00103</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neuroscience</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Improving EEG-Based Driver Fatigue Classification Using Sparse-Deep Belief Networks</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Chai</surname> <given-names>Rifai</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/381442/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Ling</surname> <given-names>Sai Ho</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/400332/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>San</surname> <given-names>Phyo Phyo</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/418245/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Naik</surname> <given-names>Ganesh R.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/364003/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Nguyen</surname> <given-names>Tuan N.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/418168/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Tran</surname> <given-names>Yvonne</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/418212/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Craig</surname> <given-names>Ashley</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/196244/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Nguyen</surname> <given-names>Hung T.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/418216/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Faculty of Engineering and Information Technology, Centre for Health Technologies, University of Technology</institution> <country>Sydney, NSW, Australia</country></aff>
<aff id="aff2"><sup>2</sup><institution>Data Analytic Department, Institute for Infocomm Research</institution> <country>A&#x0002A;STAR, Singapore, Singapore</country></aff>
<aff id="aff3"><sup>3</sup><institution>Kolling Institute of Medical Research, Sydney Medical School, The University of Sydney</institution> <country>Sydney, NSW, Australia</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Jianhua Zhang, East China University of Science and Technology, China</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Sunan Li, East China University of Science and Technology, China; Zhong Yin, University of Shanghai for Science and Technology, China; Amar R. Marathe, U.S. Army Research Laboratory, USA</p></fn>
<fn fn-type="corresp" id="fn001"><p>&#x0002A;Correspondence: Rifai Chai <email>rifai.chai&#x00040;uts.edu.au</email></p></fn>
<fn fn-type="other" id="fn002"><p>This article was submitted to Neural Technology, a section of the journal Frontiers in Neuroscience</p></fn></author-notes>
<pub-date pub-type="epub">
<day>07</day>
<month>03</month>
<year>2017</year>
</pub-date>
<pub-date pub-type="collection">
<year>2017</year>
</pub-date>
<volume>11</volume>
<elocation-id>103</elocation-id>
<history>
<date date-type="received">
<day>04</day>
<month>10</month>
<year>2016</year>
</date>
<date date-type="accepted">
<day>17</day>
<month>02</month>
<year>2017</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2017 Chai, Ling, San, Naik, Nguyen, Tran, Craig and Nguyen.</copyright-statement>
<copyright-year>2017</copyright-year>
<copyright-holder>Chai, Ling, San, Naik, Nguyen, Tran, Craig and Nguyen</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<p>This paper presents an improvement of classification performance for electroencephalography (EEG)-based driver fatigue classification between fatigue and alert states with the data collected from 43 participants. The system employs autoregressive (AR) modeling as the features extraction algorithm, and sparse-deep belief networks (sparse-DBN) as the classification algorithm. Compared to other classifiers, sparse-DBN is a semi supervised learning method which combines unsupervised learning for modeling features in the pre-training layer and supervised learning for classification in the following layer. The sparsity in sparse-DBN is achieved with a regularization term that penalizes a deviation of the expected activation of hidden units from a fixed low-level prevents the network from overfitting and is able to learn low-level structures as well as high-level structures. For comparison, the artificial neural networks (ANN), Bayesian neural networks (BNN), and original deep belief networks (DBN) classifiers are used. The classification results show that using AR feature extractor and DBN classifiers, the classification performance achieves an improved classification performance with a of sensitivity of 90.8%, a specificity of 90.4%, an accuracy of 90.6%, and an area under the receiver operating curve (AUROC) of 0.94 compared to ANN (sensitivity at 80.8%, specificity at 77.8%, accuracy at 79.3% with AUC-ROC of 0.83) and BNN classifiers (sensitivity at 84.3%, specificity at 83%, accuracy at 83.6% with AUROC of 0.87). Using the sparse-DBN classifier, the classification performance improved further with sensitivity of 93.9%, a specificity of 92.3%, and an accuracy of 93.1% with AUROC of 0.96. Overall, the sparse-DBN classifier improved accuracy by 13.8, 9.5, and 2.5% over ANN, BNN, and DBN classifiers, respectively.</p>
</abstract>
<kwd-group>
<kwd>electroencephalography</kwd>
<kwd>driver fatigue</kwd>
<kwd>autoregressive model</kwd>
<kwd>deep belief networks</kwd>
<kwd>sparse-deep belief networks</kwd>
</kwd-group>
<contract-num rid="cn001">DP150102493</contract-num>
<contract-sponsor id="cn001">Australian Research Council<named-content content-type="fundref-id">10.13039/501100000923</named-content></contract-sponsor>
<counts>
<fig-count count="7"/>
<table-count count="6"/>
<equation-count count="7"/>
<ref-count count="49"/>
<page-count count="14"/>
<word-count count="10183"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Fatigue during driving is a major cause of road accidents in transportation, and therefore poses a significant risk of injury and fatality, not only to the drivers themselves but also to other road users such as passengers, motorbike users, other drivers, and pedestrians (Matthews et al., <xref ref-type="bibr" rid="B34">2012</xref>). Driver fatigue reduces the ability to perform essential driving skills such as vehicle steering control, tracking vehicle speed, visual awareness, and sufficient selective attention during a monotonous driving condition for a long period of time (Lal and Craig, <xref ref-type="bibr" rid="B25">2001</xref>; Wijesuriya et al., <xref ref-type="bibr" rid="B45">2007</xref>; Craig et al., <xref ref-type="bibr" rid="B12">2012</xref>; Jurecki and Sta&#x00144;czyk, <xref ref-type="bibr" rid="B22">2014</xref>). As a result an automated countermeasure for a driver fatigue system with reliable and improved fatigue classification/detection accuracy is needed to overcome the risk of driver fatigue in transportation (Lal et al., <xref ref-type="bibr" rid="B26">2003</xref>; Vanlaar et al., <xref ref-type="bibr" rid="B43">2008</xref>; Touryan et al., <xref ref-type="bibr" rid="B40">2013</xref>, <xref ref-type="bibr" rid="B41">2014</xref>; Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>).</p>
<p>In the digital age, machine learning can be used to provide automated prediction of driver fatigue. Two approaches can be used in machine learning, which are the regression and classification methods. The goal of regression algorithms is the prediction of continuous values to estimate driving performance (Lin et al., <xref ref-type="bibr" rid="B32">2005</xref>; Touryan et al., <xref ref-type="bibr" rid="B40">2013</xref>, <xref ref-type="bibr" rid="B41">2014</xref>). The outcome of classification algorithms is to predict the target class, such as the classification between fatigue and non-fatigue/alert states (Lin et al., <xref ref-type="bibr" rid="B30">2010</xref>; Zhang et al., <xref ref-type="bibr" rid="B49">2014</xref>; Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>; Xiong et al., <xref ref-type="bibr" rid="B48">2016</xref>). The aim of this study is to improve the accuracy of the prediction of fatigue and non-fatigue states. As a result, this study focuses on using an advanced classification method for enhancing the accuracy of a fatigue classification system previously studied (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>).</p>
<p>As described in a previous paper (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>), possible driver fatigue assessment includes psychological and physiological measurements (Lal and Craig, <xref ref-type="bibr" rid="B25">2001</xref>; Borghini et al., <xref ref-type="bibr" rid="B5">2014</xref>). For instance, psychological measurement of driver fatigue involves the need for frequent self-report of fatigue status via brief psychometric questionnaires (Lai et al., <xref ref-type="bibr" rid="B24">2011</xref>). Such an approach would be difficult to implement and may well be biased given its subjective nature (Craig et al., <xref ref-type="bibr" rid="B11">2006</xref>). Physiological measurement of the driver fatigue includes video measurement of the face (Lee and Chung, <xref ref-type="bibr" rid="B28">2012</xref>), brain signal measurement using electroencephalography (EEG; Lal et al., <xref ref-type="bibr" rid="B26">2003</xref>; Lin et al., <xref ref-type="bibr" rid="B32">2005</xref>; Craig et al., <xref ref-type="bibr" rid="B12">2012</xref>; Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>), eye movement tracking system using camera and electrooculography (EOG; Hsieh and Tai, <xref ref-type="bibr" rid="B18">2013</xref>), and heart rate measurement using electrocardiography (ECG; Tran et al., <xref ref-type="bibr" rid="B42">2009</xref>; Jung et al., <xref ref-type="bibr" rid="B21">2014</xref>).</p>
<p>Physiological assessment of facial or eye changes using video recording of the driver&#x00027;s face may lead to privacy issues. Physiological measurement strategies like monitoring eye blink rates using EOG and heart rate variability (HRV) using ECG have been shown to reliably detect fatigue (Tran et al., <xref ref-type="bibr" rid="B42">2009</xref>; Hsieh and Tai, <xref ref-type="bibr" rid="B18">2013</xref>). EEG has also been shown to be a reliable method of detecting fatigue, as it directly measures neurophysiological signals that are correlated with mental fatigue (Wijesuriya et al., <xref ref-type="bibr" rid="B45">2007</xref>; Craig et al., <xref ref-type="bibr" rid="B12">2012</xref>; Zhang et al., <xref ref-type="bibr" rid="B49">2014</xref>; Chuang et al., <xref ref-type="bibr" rid="B10">2015</xref>; He et al., <xref ref-type="bibr" rid="B14">2015</xref>; Xiong et al., <xref ref-type="bibr" rid="B48">2016</xref>). Recently, we have shown a classification of EEG-based driver fatigue with the inclusion of new ICA based pre-processing with a promising classification result (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>), however, it was concluded the classification accuracy needs to be improved. As a result, this paper will extend the work on a potential EEG-based countermeasure driver fatigue system with an improved classification of fatigue vs. alert states.</p>
<p>An EEG-based classification countermeasure system requires several components including EEG signal measurement, signal pre-processing, feature extraction, and classification modules. For feature extraction in EEG analysis, frequency domain data has been widely explored (Lal and Craig, <xref ref-type="bibr" rid="B25">2001</xref>; Craig et al., <xref ref-type="bibr" rid="B12">2012</xref>). Power spectral density (PSD) methods are popular for converting the time domain of EEG signal into the frequency domain (Demandt et al., <xref ref-type="bibr" rid="B13">2012</xref>; Lin et al., <xref ref-type="bibr" rid="B31">2014</xref>). Alternatively, an autoregressive (AR) modeling parametric approach can also be used for feature extraction in an EEG classification system (McFarland and Wolpaw, <xref ref-type="bibr" rid="B35">2008</xref>; Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>; Wang et al., <xref ref-type="bibr" rid="B44">2016</xref>). The advantage of AR modeling is its inherent capacity to model the peak spectra that are characteristic of the EEG signals and it is an all-pole model making it efficient for resolving sharp changes in the spectra. In our previous finding, an AR modeling feature extractor provided superior classification results compared to PSD for EEG-based driver fatigue classification (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>). Therefore, in this paper, we present the results of applying AR for modeling feature extraction in order to improve the accuracy the classification algorithm. The PSD method is also included for comparison. For the classification, non-linear methods, such as artificial neural networks (ANN), have been used widely in a variety of applications involving EEG (Nguyen, <xref ref-type="bibr" rid="B37">2008</xref>; Casson, <xref ref-type="bibr" rid="B7">2014</xref>). Bayesian neural networks (BNN) have also been used in EEG-based driver fatigue classification (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>). The Bayesian regularization framework is able to enhance the generalization of neural networks training regardless of finite and/or noisy data.</p>
<p>Recent attention has been focused on improvement of an ANN approach called deep belief networks (DBN; Hinton and Salakhutdinov, <xref ref-type="bibr" rid="B17">2006</xref>; Hinton et al., <xref ref-type="bibr" rid="B16">2006</xref>; Bengio, <xref ref-type="bibr" rid="B3">2009</xref>; LeCun et al., <xref ref-type="bibr" rid="B27">2015</xref>), which involves a fast, unsupervised learning algorithm for the deep generative model, and supervised learning for a discriminative model. The key advantage of this algorithm is the layer-by-layer training for learning a deep hierarchical probabilistic model efficiently as well as a discriminative fine tuning algorithm to optimize performance on the classification problems (Bengio, <xref ref-type="bibr" rid="B3">2009</xref>; LeCun et al., <xref ref-type="bibr" rid="B27">2015</xref>). A DBN classifier is a promising strategy for improving classification of problems including hand-writing character classification (Hinton et al., <xref ref-type="bibr" rid="B16">2006</xref>), speech recognition (Mohamed et al., <xref ref-type="bibr" rid="B36">2010</xref>; Hinton et al., <xref ref-type="bibr" rid="B15">2012</xref>), visual object recognition (Krizhevsky et al., <xref ref-type="bibr" rid="B23">2012</xref>), and other biomedical applications (O&#x00027;Connor et al., <xref ref-type="bibr" rid="B38">2013</xref>; Stromatias et al., <xref ref-type="bibr" rid="B39">2015</xref>). The training of the DBN is based on the restricted Boltzmann machine (RBM) with layers-wise training of the network per layer at a time from the bottom up (Hinton et al., <xref ref-type="bibr" rid="B16">2006</xref>). Furthermore, the original RBM approach tended to learn a distributed non-sparse representation. A modified version of the RBM using sparse-RBM to form a sparse-deep belief network (sparse-DBN) has shown promising results for modeling low-order features as well as higher-order features for the application of image classification with improved accuracy (Lee et al., <xref ref-type="bibr" rid="B29">2008</xref>; Ji et al., <xref ref-type="bibr" rid="B20">2014</xref>). As a result of this promising advance in classification of complex features, this paper further investigates the classification of EEG signals associated with driver fatigue using the sparse-DBN. For comparison purposes, the results from several different classifiers are included to determine which algorithms are superior with the highest classification performance.</p>
<p>The main contribution of this paper is the combination of the AR modeling feature extractor and sparse-DBN classifier which have not been explored previously for EEG-based driver fatigue classification, with the objective of enhancing the classification performance over past attempts (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>). The motivation to utilize the sparse-DBN classifier was to investigate its potential superiority for classifying fatigue, in comparison to other classifiers. Sparse-DBN is a semi supervised learning method that combines unsupervised learning for modeling the feature in the pre-training layer and supervised learning for discriminating the feature in the following layer. Incorporating the sparsity in sparse-DBN, achieved with a regularization term that penalizes a deviation of the expected activation of hidden units from a fixed low-level, prevents the network from overfitting, and is able to learn low-level structures as well as high-level structures (Ji et al., <xref ref-type="bibr" rid="B20">2014</xref>).</p>
</sec>
<sec id="s2">
<title>Background and methodology</title>
<sec>
<title>General structure</title>
<p>The general structure for the EEG-based driver fatigue classification used in this paper is shown in Figure <xref ref-type="fig" rid="F1">1</xref> which is divided into four components: (i) the first component involves EEG data collection in a simulated driver fatigue environment; (ii) the second component involves data pre-processing for removing EEG artifact and the moving window segmentation; (iii) the third component involves the features extraction module that converts the signals into useful features; (iv) the fourth component involves the classification module to process the feature and which translates into output via training and classification procedures. The output of the classification comprises two states: fatigue state and alert (non-fatigue) state.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p><bold>General structure EEG-based driver fatigue classification in this study</bold>.</p></caption>
<graphic xlink:href="fnins-11-00103-g0001.tif"/>
</fig>
</sec>
<sec>
<title>EEG data collection</title>
<p>The EEG data collection has been described in a previous paper (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>). The study was approved by the Human Research Ethics Committee of the University of Technology Sydney (UTS) obtained from previous experiments of driver fatigue study (Craig et al., <xref ref-type="bibr" rid="B11">2006</xref>, <xref ref-type="bibr" rid="B12">2012</xref>; Wijesuriya et al., <xref ref-type="bibr" rid="B45">2007</xref>). The dataset contains electrophysiological data from 43 healthy participants aged between 18 and 55 years who had a current driver&#x00027;s license. The study involved continuous measurement taken during a monotonous simulated driving task followed by post-EEG measures and post-subjective self-report of fatigue. For the simulated driving task, the divided attention steering simulator (DASS) from Stowood scientific instruments was used (Craig et al., <xref ref-type="bibr" rid="B12">2012</xref>). Participants were asked to keep driving at the center of the road in the simulation task. The participants were also required to respond to a target number that appeared in any of the four corners of the computer screen in front of the participants when they were driving in the experiment, so as to record reaction time.</p>
<p>The simulation driving task was terminated if the participant drove off the simulated road for &#x0003E;15 s, or if they showed consistent facial signs of fatigue such as head nodding and extended eyes closure, both determined by analysis of participants&#x00027; faces that occurred throughout the experiment. Three methods were used to validate fatigue occurrence: (i) using video monitoring for consistent physiological signs of fatigue such as tired eyes, head nodding and extended eye closure, verified further by EOG analysis of blink rate and eye closure; (ii) using performance decrements such as deviation off the road, and (iii) using validated psychometrics such as the Chalder Fatigue Scale and the Stanford Sleepiness Scale. Two participants who did not meet the criterion of becoming fatigued were excluded from the dataset. The validation of fatigue vs. non-fatigue in these participants has been reported in prior studies (Craig et al., <xref ref-type="bibr" rid="B11">2006</xref>, <xref ref-type="bibr" rid="B12">2012</xref>). The EEG signals were recorded using a 32-channel EEG system, the Active-Two system (Biosemi) with electrode positions at: FP1, AF3, F7, F3, FC1, FC5, T7, C3, CP1, CP5, P7, P3, PZ, PO3, O1, OZ, O2, PO4, P4, P8, CP6, CP2, C4, T8, FC6, FC2, F4, F8, AF4, FP2, FZ, and CZ. The recorded EEG data was down sampled from 2,048 to 256 Hz.</p>
</sec>
<sec>
<title>Data pre-processing and segmentation</title>
<p>For the alert status, the first 5 min of EEG data was selected when the driving simulation task began. For the fatigue status, the data was selected from the last 5 min of EEG data before the task was terminated, after consistent signs of fatigue were identified and verified. Then in each group of data (alert and fatigue), 20 s segments were taken with the segment that was chosen being the first 20 s where EEG signals were preserved. For the sample this was all within the first 1 min of the 5 min selected. Further, artifact removal using an ICA-based method was used to remove blinks, heart, and muscle artifact. As a result, 20 s of the alert state and 20 s of the fatigue state data were available from each participant.</p>
<p>In the pre-processing module before feature extraction, the second-order blind identification (SOBI) and canonical correlation analysis (CCA) were utilized to remove artifacts of the eyes, muscle, and heart signals. The pre-processed data were segmented by applying a moving window of 2 s with overlapping 1.75 s to the 20 s EEG data which provided 73 overlapping segments for each state (fatigue and alert states) as shown in Figure <xref ref-type="fig" rid="F2">2</xref>. The pre-processing segments were used in the feature extraction module as described in next section.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p><bold>Moving window segmentation for driver fatigue study</bold>.</p></caption>
<graphic xlink:href="fnins-11-00103-g0002.tif"/>
</fig>
</sec>
<sec>
<title>Feature extraction</title>
<p>For comparison purposes and validity of previous work, a feature extractor using the power spectral density (PSD), a widely used spectral analysis of feature extractor in fatigue studies, is provided in this paper.</p>
<p>An autoregressive (AR) model was also applied as a features extraction algorithm in this study. AR modeling has been used in EEG studies as an alternative to Fourier-based methods, and has been reported to have improved classification accuracy in previous studies compared to spectral analysis of the feature extractor (Brunner et al., <xref ref-type="bibr" rid="B6">2011</xref>; Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>). The advantage of AR modeling is its inherent capacity to model the peak spectra that are characteristic of the EEG signals and it is an all-pole model making it efficient for resolving sharp changes in the spectra. The fast Fourier transform (FFT) is a widely used non-parametric approach that can provide accurate and efficient results, but it does not have acceptable spectral resolution for short data segments (Anderson et al., <xref ref-type="bibr" rid="B1">2009</xref>). AR modeling requires the selection of the model order number. The best AR order number requires consideration of both the signal complexity and the sampling rate. If the AR model order is too low, the whole signal cannot be captured in the model. On the other hand, if the model order is too high, then more noise is captured. In a previous study, the AR order number of five provided the best classification accuracy (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>). The calculation of the AR modeling was as follows:</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M1"><mml:mover accent='true'><mml:mi>x</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mo stretchy='false'>(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>=</mml:mo><mml:munder><mml:mrow><mml:mover><mml:mo>&#x02211;</mml:mo><mml:mi>P</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>1</mml:mn></mml:mrow></mml:munder><mml:mi>a</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>k</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mover accent='true'><mml:mi>x</mml:mi><mml:mo>&#x0005E;</mml:mo></mml:mover><mml:mo stretchy='false'>(</mml:mo><mml:mi>t</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mi>k</mml:mi><mml:mo stretchy='false'>)</mml:mo><mml:mo>&#x0002B;</mml:mo><mml:mi>e</mml:mi><mml:mo stretchy='false'>(</mml:mo><mml:mi>t</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:math></disp-formula>
<p>where <inline-formula><mml:math id="M8"><mml:mover accent="true"><mml:mrow><mml:mi>x</mml:mi></mml:mrow><mml:mo>^</mml:mo></mml:mover><mml:mrow><mml:mo stretchy="false">(</mml:mo><mml:mrow><mml:mi>t</mml:mi></mml:mrow><mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:math></inline-formula> denotes EEG data at time (<italic>t</italic>), <italic>P</italic> denotes the AR order number, <italic>e</italic>(<italic>t</italic>) denotes the white noise with zero means error and finite variance, and <italic>a(k)</italic> denotes the AR coefficients.</p>
</sec>
<sec>
<title>Classification algorithm</title>
<p>The key feature of DBN is the greedy layer-by-layer training to learn a deep, hierarchical model (Hinton et al., <xref ref-type="bibr" rid="B16">2006</xref>). The main structure of the DBN learning is the restricted Boltzmann machine (RBM). A RBM is a type of Markov random field (MRF) which is a graphical model that has a two-layer architecture in which the observed data variables as visible neurons are connected to hidden neurons. A RBM is as shown in which m visible neuron [<italic>v</italic> &#x0003D; (<italic>v</italic><sub>1</sub>, <italic>v</italic><sub>2</sub>, <italic>v</italic><sub>3</sub>,&#x02026;,<italic>v</italic><sub><italic>m</italic></sub>)] and <italic>n</italic> hidden neurons [<italic>h</italic> &#x0003D; (<italic>h</italic><sub>1</sub>, <italic>h</italic><sub>2</sub>,&#x02026;, <italic>h</italic><sub><italic>n</italic></sub>)] are fully connected via symmetric undirected weights and there is no intra-layer connections within either the visible or the hidden layer.</p>
<p>The connections weights and the biases define a probability over the joint states of visible and hidden neurons through energy function <italic>E</italic>(<italic>v,h</italic>), defined as follows:</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M2"><mml:mi>E</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>v</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>;</mml:mo><mml:mi>&#x003B8;</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mo>&#x02212;</mml:mo><mml:munder><mml:mrow><mml:mover><mml:mo>&#x02211;</mml:mo><mml:mi>m</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>1</mml:mn></mml:mrow></mml:munder><mml:munder><mml:mrow><mml:mover><mml:mo>&#x02211;</mml:mo><mml:mi>n</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>1</mml:mn></mml:mrow></mml:munder><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:munder><mml:mrow><mml:mover><mml:mo>&#x02211;</mml:mo><mml:mi>m</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>i</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>1</mml:mn></mml:mrow></mml:munder><mml:msub><mml:mi>a</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x02212;</mml:mo><mml:munder><mml:mrow><mml:mover><mml:mo>&#x02211;</mml:mo><mml:mi>n</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>1</mml:mn></mml:mrow></mml:munder><mml:msub><mml:mi>b</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:math></disp-formula>
<p>where <italic>w</italic><sub><italic>ij</italic></sub> denotes the weight between <italic>v</italic><sub><italic>i</italic></sub> and <italic>h</italic><sub><italic>j</italic></sub> for all <italic>i</italic> &#x000CE; {1,&#x02026;, <italic>m</italic>} and <italic>j</italic> &#x000CE; {1,&#x02026;, <italic>n</italic>}; <italic>a</italic><sub><italic>i</italic></sub> and <italic>b</italic><sub><italic>j</italic></sub> are the bias term associated with the <italic>ith</italic> and <italic>jth</italic> visible and hidden neurons; &#x003B8; &#x0003D; {<italic>W</italic>,<italic>b</italic>,<italic>a</italic>} is the model parameter with symmetric weight parameters <italic>W</italic><sub><italic>nm</italic></sub>.</p>
<p>For RBM training, the gradient of log probability of a visible vector (<italic>v</italic>) over the weight <italic>w</italic><sub><italic>ij</italic></sub> with the updated rule calculated by constructive divergence (CD) method is as follows:</p>
<disp-formula id="E3"><label>(3)</label><mml:math id="M3"><mml:mi>&#x00394;</mml:mi><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi>&#x003B7;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mrow><mml:mo>&#x02329;</mml:mo><mml:msub><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x0232A;</mml:mo></mml:mrow><mml:mrow><mml:mi>d</mml:mi><mml:mi>a</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi></mml:mrow></mml:msub><mml:mo>&#x02212;</mml:mo><mml:msub><mml:mrow><mml:mo>&#x02329;</mml:mo><mml:msub><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>h</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x0232A;</mml:mo></mml:mrow><mml:mrow><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>
<p>where &#x003B7; is a learning rate, &#x02329;<italic>v<sub>i</sub>h<sub>j</sub></italic>&#x0232A;<italic><sub>recon</sub></italic> is the reconstruction of original visible units which is calculated by setting the visible unit to a random training vector. The updating of the hidden and visible states is considered as follows:</p>
<disp-formula id="E4"><label>(4)</label><mml:math id="M4"><mml:mi>p</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x000A0;</mml:mo><mml:mo stretchy='false'>&#x0007C;</mml:mo><mml:mo>&#x000A0;</mml:mo><mml:mi>v</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>&#x003C3;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:munder><mml:mo>&#x02211;</mml:mo><mml:mi>i</mml:mi></mml:munder><mml:msub><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>
<disp-formula id="E5"><label>(5)</label><mml:math id="M5"><mml:mi>p</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn>1</mml:mn><mml:mo>&#x000A0;</mml:mo><mml:mo stretchy='false'>&#x0007C;</mml:mo><mml:mo>&#x000A0;</mml:mo><mml:mi>h</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mi>&#x003C3;</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msub><mml:mi>a</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>&#x0002B;</mml:mo><mml:munder><mml:mo>&#x02211;</mml:mo><mml:mi>i</mml:mi></mml:munder><mml:msub><mml:mi>h</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>
<p>where &#x003C3; is the logistic sigmoid function.</p>
<p>The original RBM tended to learn a distributed, non-sparse representation of the data, however sparse-RBM is able to play an important role in learning algorithms. In an information-theoretic sense, sparse representations are more efficient than the non-sparse ones, allowing for varying of the effective number of bits per example and able to learn useful low- and high-level feature representations for unlabeled data (i.e., unsupervised learning; Lee et al., <xref ref-type="bibr" rid="B29">2008</xref>; Ji et al., <xref ref-type="bibr" rid="B20">2014</xref>).</p>
<p>This paper uses the sparse-RBM to form the sparse-DBN for EEG-based driver fatigue classification. The sparsity in sparse-DBN is achieved with a regularization term that penalizes a deviation of the expected activation of hidden units from a fixed low-level, which prevents the network from overfitting, as well as allowing it to learn low-level structures as well as high-level structures (Ji et al., <xref ref-type="bibr" rid="B20">2014</xref>). The sparse-RBM is obtained by adding a regularization term to the full data negative log likelihood with the following optimization:</p>
<disp-formula id="E6"><label>(6)</label><mml:math id="M6"><mml:mtable columnalign='left'><mml:mtr><mml:mtd><mml:munder><mml:mrow><mml:mi>min</mml:mi></mml:mrow><mml:mrow><mml:mrow><mml:mo>{</mml:mo><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>a</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:msub><mml:mi>b</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow><mml:mo>}</mml:mo></mml:mrow></mml:mrow></mml:munder><mml:mi>E</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>v</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>&#x003B8;</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>&#x02212;</mml:mo><mml:mstyle displaystyle='true'><mml:munder><mml:mrow><mml:mover><mml:mo>&#x02211;</mml:mo><mml:mi>m</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>l</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>1</mml:mn></mml:mrow></mml:munder></mml:mstyle><mml:mtext>&#x02009;</mml:mtext><mml:mstyle mathvariant='bold' mathsize='normal'><mml:mi>l</mml:mi><mml:mi>o</mml:mi><mml:mi>g</mml:mi></mml:mstyle><mml:mstyle displaystyle='true'><mml:munder><mml:mrow><mml:mtext>&#x02009;</mml:mtext><mml:mo>&#x02211;</mml:mo></mml:mrow><mml:mi>h</mml:mi></mml:munder></mml:mstyle><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msup><mml:mi>v</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mi>h</mml:mi><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mi>l</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:msup></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mtext>&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;&#x02009;</mml:mtext><mml:mo>&#x0002B;</mml:mo><mml:mi>&#x003BB;</mml:mi><mml:mstyle displaystyle='true'><mml:munder><mml:mrow><mml:mover><mml:mo>&#x02211;</mml:mo><mml:mi>n</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>j</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>1</mml:mn></mml:mrow></mml:munder></mml:mstyle><mml:msup><mml:mrow><mml:mo>|</mml:mo><mml:mrow><mml:mi>p</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mi>m</mml:mi></mml:mfrac><mml:mstyle displaystyle='true'><mml:munder><mml:mrow><mml:mover><mml:mo>&#x02211;</mml:mo><mml:mi>m</mml:mi></mml:mover></mml:mrow><mml:mrow><mml:mi>l</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>1</mml:mn></mml:mrow></mml:munder></mml:mstyle><mml:mo>&#x1D53C;</mml:mo><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msubsup><mml:mi>h</mml:mi><mml:mi>j</mml:mi><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mi>l</mml:mi><mml:mo stretchy='false'>)</mml:mo></mml:mrow></mml:msubsup><mml:mrow><mml:mo>|</mml:mo><mml:mrow><mml:msup><mml:mi>v</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>l</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mo>|</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where &#x1D53C;[.] is the conditional expectation given the data, &#x003BB; is a regularization constant, and <italic>p</italic> is a constant controlling the sparseness of the hidden neurons <italic>h</italic><sub><italic>j</italic></sub>. The DBN is constructed by stacking a predefined number of RBMs to allow each RBM model in the sequence to receive a different representation of the EEG data. The modeling between visible input (<italic>v</italic>) and <italic>N</italic> hidden layer <italic>h</italic><sub><italic>k</italic></sub> is as follows:</p>
<disp-formula id="E7"><label>(7)</label><mml:math id="M7"><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>v</mml:mi><mml:mo>,</mml:mo><mml:msup><mml:mi>h</mml:mi><mml:mn>1</mml:mn></mml:msup><mml:mo>,</mml:mo><mml:mo>&#x02026;</mml:mo><mml:mo>,</mml:mo><mml:msup><mml:mi>h</mml:mi><mml:mi>l</mml:mi></mml:msup></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:munder><mml:mrow><mml:mover><mml:mo>&#x0220F;</mml:mo><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mn>2</mml:mn></mml:mrow></mml:mover></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mtext>&#x02009;</mml:mtext><mml:mo>=</mml:mo><mml:mtext>&#x02009;</mml:mtext><mml:mn>0</mml:mn></mml:mrow></mml:munder><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mrow><mml:mo>[</mml:mo><mml:mrow><mml:msup><mml:mi>h</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup><mml:mrow><mml:mo>|</mml:mo><mml:mrow><mml:msup><mml:mi>h</mml:mi><mml:mrow><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>&#x0002B;</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:msup></mml:mrow></mml:mrow></mml:mrow><mml:mo>]</mml:mo></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mi>P</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:msup><mml:mi>h</mml:mi><mml:mrow><mml:mi>l</mml:mi><mml:mo>&#x02212;</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup><mml:mo>,</mml:mo><mml:msup><mml:mi>h</mml:mi><mml:mi>l</mml:mi></mml:msup></mml:mrow><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>
<p>where <italic>v</italic> &#x0003D; <italic>h</italic><sup>0</sup>, <italic>P</italic>(<italic>h</italic><sup><italic>k</italic></sup>|<italic>h</italic><sup><italic>k</italic>&#x0002B;1</sup>) is a conditional distribution for the visible units conditioned on the hidden units of the RBM at level <italic>k</italic> and <italic>P</italic>(<italic>h</italic><sup><italic>l</italic>&#x02212;1</sup>,<italic>h</italic><sup><italic>l</italic></sup>) is the visible-hidden joint distribution at the top-level RBM. Two training types of the RBM can be used: generative and discriminative. The generative training of RBM is used as pre-training with un-supervised learning rule. After greedy layer-wise unsupervised learning, the DBN can be used for discriminative ability using the supervised learning. This paper uses a sparse variant of DBN with 2 layers of semi supervised sparse-DBN as shown in Figure <xref ref-type="fig" rid="F3">3</xref> with the first layer using the sparse-RBM for generative mode (un-supervised learning) and the second layer using the sparse-RBM in discriminative mode (supervised learning). After layer-by-layer training in DBN, an ANN with back-propagation method is used through the whole classifier to fine-tune the weights for optimal classification.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p><bold>Structure of sparse-DBN for driver fatigue classification: (A)</bold> Greedy learning stack of sparse-RBM; <bold>(B)</bold> the corresponding sparse-DBN.</p></caption>
<graphic xlink:href="fnins-11-00103-g0003.tif"/>
</fig>
<p>The performance indicators, including, sensitivity or true positive rate [<italic>TPR</italic> &#x0003D; <italic>TP</italic>/(<italic>TP</italic>&#x0002B;<italic>FN</italic>)], specificity or true negative rate [<italic>TNR</italic> &#x0003D; <italic>TN</italic>/(<italic>TN</italic>&#x0002B;<italic>FP</italic>)] and accuracy (TP&#x0002B;TN)/(TP&#x0002B;TN&#x0002B;FP&#x0002B;FN), were used for the performance measurement. <italic>TP</italic> (true positive) denotes the number of the fatigue data correctly classified as fatigue state. <italic>FP</italic> (false positive) is the number of alert datasets classified as a fatigue state. <italic>TN</italic> (true negative) is number of alert datasets correctly classified as an alert state. <italic>FN</italic> (false negative) is the fatigue datasets classified as an alert state.</p>
<p>For network learning generalization, we presented the results based on two cross-validation techniques: an early stopping technique and <italic>k</italic>-fold cross-validation. The early stopping technique used the &#x0201C;hold-out cross validation&#x0201D;&#x02014;one of the widely used cross validations techniques. Basically, it divided the dataset into three subsets (training, validation, and testing sets). The model is trained using the training set while the validation set is periodically used to evaluate the model performance to avoid over-fitting/over-training. The accuracy of the testing set is used as the result of the model&#x00027;s performance. Another cross validation technique is known as <italic>k</italic>-fold cross-validation (<italic>k</italic> &#x0003D; 3). In <italic>k</italic>-fold cross-validation (<italic>k</italic> &#x0003D; 3), the dataset is divided into three equal (or near equal) sized folds. The training of the network uses 2-folds and the testing the network uses the remaining fold. The process of training and testing is repeated for three possible choices of the subset omitted from the training. The average performance on the three omitted subsets is then used as an estimate of the generalization performance.</p>
<p>Furthermore, a receiver operating characteristic (ROC) graph is used to evaluate further the performance of the proposed method with the compared method for this study. The areas under the curve of the ROC (AUROC) were also computed to evaluate quantitatively the classification performance.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<p>From the 32-EEG channel dataset for the 43 participants (2 participants who did not meet the criterion of becoming fatigued were excluded from original 45 participants), 20 s of alert state and 20 s of fatigue state data were available from each participant. This was fed to the pre-processing module including artifact removal and a 2 s moving window segmentation with overlapping 1.75 s to the 20 s EEG data, providing 73 overlapping segments for each state. As a result, from the 43 participants, a total 6,278 units of datasets were formed for the alert and fatigue states (each state having 3,139 units).</p>
<p>The segmented datasets were fed to the feature extraction module. AR modeling with the order number of 5 was used for the feature extractor as it provided an optimum result from the previous study (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>). The size of the AR features equaled the AR order number multiplied with 32 units of EEG channels, thus the AR order number of 5 resulted in 160 units of the AR features. For comparison and validity purposes, this paper includes the PSD, a popular feature extractor in the EEG classification for driver fatigue classification. The spectrum of EEG bands consisted of: delta (0.5&#x02013;3 Hz), theta (3.5&#x02013;7.5 Hz), alpha (8&#x02013;13 Hz), and beta activity (13.5&#x02013;30 Hz). The total power for each EEG activity band was used for the features that were calculated using the numerical integration trapezoidal method, providing 4 units of power values. This resulted in 128 units of total power of PSD for the 32 EEG channels used.</p>
<p>The variant of standard DBN algorithm, sparse-DBN with semi supervised learning used in this paper, comprised of one layer of sparse-RBM with the generative type learning and the second layer of sparse-RBM with discriminative type of learning. The training of the sparse-DBN is done layer-by-layer. The ANN with back-propagation method was used to fine-tune the weights for optimal classification.</p>
<p>For the discriminative learning of sparse-DBN, the total 6,278 datasets were divided into three subsets with similar amounts of number sets: training (2,093 sets) validation (2,093 sets), and testing sets (2,092 sets). The generative learning of sparse-DBN uses unlabeled data from the training sets. For the training of the sparse-DBN using the learning rate (&#x003B7;) of 0.01, the maximum epoch is set to 200, with a regularization constant (&#x003BB;) of 1, and the constant controlling the sparseness (<italic>p</italic>) of 0.02. The selection of these training parameters was chosen by trial-and-error, with the chosen values achieving the best training result. Table <xref ref-type="table" rid="T1">1</xref> shows the selection of the regularization constant (&#x003BB;), with the chosen value of 1 and the constant controlling the sparseness (<italic>p</italic>) with the chosen value of 0.02, providing lowest the mean square error (MSE) values of 0.00119 (training set) and 0.0521 (validation set) with the iteration number of 69. The average of the MSE values was 0.0046 &#x000B1; 0.0018 (training set), and 0.0760 &#x000B1; 0.0124.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p><bold>Testing several values of regularization constant (&#x003BB;) and the constant controlling the sparseness (<italic><bold>p</bold></italic>) in order to select values with the lowest MSE (trial-and-error method)</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Regularization constant<italic>(&#x003BB;)</italic></bold></th>
<th valign="top" align="center"><bold>Sparseness constant <italic>(p)</italic></bold></th>
<th valign="top" align="center"><bold>MSE training</bold></th>
<th valign="top" align="center"><bold>MSE validation</bold></th>
<th valign="top" align="center"><bold>Iteration number</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">0.5</td>
<td valign="top" align="center">0.1</td>
<td valign="top" align="center">0.00492</td>
<td valign="top" align="center">0.06625</td>
<td valign="top" align="center">90</td>
</tr>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">0.1</td>
<td valign="top" align="center">0.00680</td>
<td valign="top" align="center">0.06710</td>
<td valign="top" align="center">82</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">0.1</td>
<td valign="top" align="center">0.00676</td>
<td valign="top" align="center">0.07961</td>
<td valign="top" align="center">64</td>
</tr>
<tr>
<td valign="top" align="left">0.5</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">0.00542</td>
<td valign="top" align="center">0.07365</td>
<td valign="top" align="center">66</td>
</tr>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">0.00507</td>
<td valign="top" align="center">0.08360</td>
<td valign="top" align="center">71</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">0.01</td>
<td valign="top" align="center">0.00395</td>
<td valign="top" align="center">0.06831</td>
<td valign="top" align="center">85</td>
</tr>
<tr>
<td valign="top" align="left">0.5</td>
<td valign="top" align="center">0.02</td>
<td valign="top" align="center">0.00288</td>
<td valign="top" align="center">0.07664</td>
<td valign="top" align="center">73</td>
</tr>
<tr>
<td valign="top" align="left"><bold>1</bold></td>
<td valign="top" align="center"><bold>0.02</bold></td>
<td valign="top" align="center"><bold>0.00119</bold></td>
<td valign="top" align="center"><bold>0.05206</bold></td>
<td valign="top" align="center"><bold>69</bold></td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">0.02</td>
<td valign="top" align="center">0.00288</td>
<td valign="top" align="center">0.07181</td>
<td valign="top" align="center">66</td>
</tr>
<tr>
<td valign="top" align="left">0.5</td>
<td valign="top" align="center">0.03</td>
<td valign="top" align="center">0.00327</td>
<td valign="top" align="center">0.08289</td>
<td valign="top" align="center">88</td>
</tr>
<tr>
<td valign="top" align="left">1</td>
<td valign="top" align="center">0.03</td>
<td valign="top" align="center">0.00574</td>
<td valign="top" align="center">0.09207</td>
<td valign="top" align="center">73</td>
</tr>
<tr>
<td valign="top" align="left">2</td>
<td valign="top" align="center">0.03</td>
<td valign="top" align="center">0.00665</td>
<td valign="top" align="center">0.09825</td>
<td valign="top" align="center">89</td>
</tr>
<tr>
<td valign="top" align="left">Mean</td>
<td/>
<td valign="top" align="center">0.004629</td>
<td valign="top" align="center">0.07615</td>
<td valign="top" align="center">76.42</td>
</tr>
<tr>
<td valign="top" align="left"><italic>SD</italic></td>
<td/>
<td valign="top" align="center">0.001803</td>
<td valign="top" align="center">0.01269</td>
<td valign="top" align="center">9.72</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Bold values signify the chosen parameters</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>In order to prevent over-fitting/over-training in the network, a validation-based early stopping method was used for the proposed classifier of sparse-DBN. The plot of the mean square error (MSE) training set and validation set are shown in Figure <xref ref-type="fig" rid="F4">4</xref> for classification using AR and sparse-DBN. Table <xref ref-type="table" rid="T2">2</xref> shows the best performance of the training in term of the MSE values and iteration numbers. For comparison, the results for ANN, BNN, and DBN classifier are also included.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p><bold>Plot of the training and validation MSE for early stopping of classifiers: (A)</bold> MSE training and validation of ANN. <bold>(B)</bold> MSE training of BNN. <bold>(C)</bold> MSE training of DBN in hidden layer 1 (Generative mode). <bold>(D)</bold> MSE training of sparse-DBN in hidden layer 1 (Generative mode). <bold>(E)</bold> MSE training and validation of DBN in hidden layer 2 (Discriminative mode). <bold>(F)</bold> MSE training and validation of DBN in hidden layer 2 (Discriminative mode).</p></caption>
<graphic xlink:href="fnins-11-00103-g0004.tif"/>
</fig>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p><bold>The best MSE and iteration numbers from the training of the classifiers (ANN, BNN, DBN, and Sparse-DBN)</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Classifiers</bold></th>
<th valign="top" align="center"><bold>Best MSE</bold></th>
<th valign="top" align="center"><bold>Best iteration number</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">ANN</td>
<td valign="top" align="center">0.115</td>
<td valign="top" align="center">110</td>
</tr>
<tr>
<td valign="top" align="left">BNN</td>
<td valign="top" align="center">0.0979</td>
<td valign="top" align="center">77</td>
</tr>
<tr>
<td valign="top" align="left">DBN</td>
<td valign="top" align="center">0.0649</td>
<td valign="top" align="center">68</td>
</tr>
<tr>
<td valign="top" align="left">Sparse-DBN</td>
<td valign="top" align="center">0.0520</td>
<td valign="top" align="center">69</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>ANN, DBN and sparse-DBN classifiers utilized the early stopping framework (with the dataset divided into training validation and test sets) to prevent the overfitting problem, except for BNN (where the dataset was divided into training and testing). The BNN used a different framework for preventing the overfitting problem utilizing adaptive hyper-parameters in the cost function to prevent the neural network weight from being too large, which would have resulted in poor generalization. As a result, the validation set is not required for the BNN. A detailed analysis of BNN for EEG based driver fatigue classification has been addressed in our previous study (Chai et al., <xref ref-type="bibr" rid="B9">2016</xref>). The core parameters for the training classifiers (ANN, BNN, DBN, and sparse-DBN) are the ANN-based classifier which includes the number of hidden nodes, an activation function, and learning rate. In the BNN classifier, an additional hyper-parameter is introduced to fine tune the optimal structure of the ANN. Further, in the sparse-DBN classifier, the regulation constant and constant controlling of sparseness were introduced for the training the DBN classifier. The DBN and sparse-DBN used two hidden layers: the first hidden layer as generative mode (un-supervised learning) and second hidden layer as discriminative mode (supervised learning).</p>
<p>The mean square error (MSE) of the training set decreased smoothly. Using ANN classifier, the training network stopped after 100 iterations as the MSE validation set reached a maximum fail of 10 times the increment value to ensure no over-training happened with the best validation MSE at 0.115. Using a BNN classifier, the training network stopped after 77 iterations as the conditions are met with the BNN parameters with the best validation MSE at 0.0979. Using a DBN classifier in the first hidden layer (generative mode), the training network stopped after 200 iterations with best MSE at 0.434. Using a DBN classifier in the second hidden layer (discriminative mode), the training network stopped after 68 iterations as the MSE validation set reached maximum fail of 10 times increment value to ensure no over-training happened with the best validation MSE at 0.0649. Using the proposed method of sparse-DBN classifier in the first hidden layer (generative mode), the training network stopped after 200 iterations with the best of MSE at 0.388. Using the proposed method of sparse-DBN classifier in the second hidden layer (discriminative mode), the training network stopped after 69 iterations as the MSE validation set reached maximum fail of 10 times increment value to ensure no over-training happened, with the best validation MSE at 0.0520.</p>
<p>Using the classification results from the validation set, the optimal number of hidden neurons of the sparse-DBN is shown in Figure <xref ref-type="fig" rid="F5">5</xref>. For the PSD feature extraction, using 10 hidden nodes resulted in the best classification performance. For the AR feature extraction, using 15 hidden nodes produced the best classification performance. These optimal hidden nodes were then used for the training of the network to classify the test set. Also, the results using a different number of layers (2, 3, 5, and 10 layers) are also provided in Figure <xref ref-type="fig" rid="F5">5</xref>, with the 2 layers (generative mode for the first layer and discriminative mode for second layer) providing the optimal number of layers in this study. This figure shows that using a higher number of layers (3, 5, and 10 layers) results in a lower accuracy compared to results of using only two layers. Therefore, the two layers sparse-DBN was the chosen architecture providing the higher accuracy. The optimal size of sparse-DBN to classify the PSD features of the EEG-based driver fatigue is [128-10-10-2] and the optimal size of sparse DBN to classify the AR feature is [160-15-15-2]. Table <xref ref-type="table" rid="T3">3</xref> shows the results for the classification of the fatigue state vs. alert state using AR feature extractor and sparse-DBN classifier. For a feature extractor comparison and validity of previous result, the result of the classification using PSD feature extractor method is included. Also for classifier comparison, the classification results using original DBN, BNN and ANN are given.</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p><bold>Plot of the optimal number hidden nodes and layers</bold>.</p></caption>
<graphic xlink:href="fnins-11-00103-g0005.tif"/>
</fig>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p><bold>Results classification fatigue state vs. alert state for the test set on different feature extractors and classifiers&#x02014;early stopping approach</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Feature extraction methods</bold></th>
<th valign="top" align="left"><bold>Classification results</bold></th>
<th valign="top" align="center" colspan="4" style="border-bottom: thin solid #000000;"><bold>Classification methods</bold></th>
</tr>
<tr>
<th/>
<th/>
<th valign="top" align="center"><bold>ANN</bold></th>
<th valign="top" align="center"><bold>BNN</bold></th>
<th valign="top" align="center"><bold>DBN</bold></th>
<th valign="top" align="center"><bold>Sparse-DBN</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">PSD</td>
<td valign="top" align="left">TP</td>
<td valign="top" align="center">782</td>
<td valign="top" align="center">808</td>
<td valign="top" align="center">873</td>
<td valign="top" align="center">919</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">FN</td>
<td valign="top" align="center">264</td>
<td valign="top" align="center">238</td>
<td valign="top" align="center">173</td>
<td valign="top" align="center">127</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">TN</td>
<td valign="top" align="center">731</td>
<td valign="top" align="center">791</td>
<td valign="top" align="center">833</td>
<td valign="top" align="center">855</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">FP</td>
<td valign="top" align="center">315</td>
<td valign="top" align="center">255</td>
<td valign="top" align="center">213</td>
<td valign="top" align="center">191</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Sensitivity (%)</td>
<td valign="top" align="center">74.8</td>
<td valign="top" align="center">77.2</td>
<td valign="top" align="center">83.5</td>
<td valign="top" align="center">87.9</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Specificity (%)</td>
<td valign="top" align="center">69.9</td>
<td valign="top" align="center">75.6</td>
<td valign="top" align="center">79.6</td>
<td valign="top" align="center">81.7</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Accuracy (%)</td>
<td valign="top" align="center">72.3</td>
<td valign="top" align="center">76.4</td>
<td valign="top" align="center">81.5</td>
<td valign="top" align="center">84.8</td>
</tr>
<tr style="border-top: thin solid #000000;">
<td valign="top" align="left">AR</td>
<td valign="top" align="left">TP</td>
<td valign="top" align="center">845</td>
<td valign="top" align="center">882</td>
<td valign="top" align="center">950</td>
<td valign="top" align="center">982</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">FN</td>
<td valign="top" align="center">201</td>
<td valign="top" align="center">164</td>
<td valign="top" align="center">96</td>
<td valign="top" align="center">64</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">TN</td>
<td valign="top" align="center">814</td>
<td valign="top" align="center">868</td>
<td valign="top" align="center">946</td>
<td valign="top" align="center">965</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">FP</td>
<td valign="top" align="center">232</td>
<td valign="top" align="center">178</td>
<td valign="top" align="center">100</td>
<td valign="top" align="center">81</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Sensitivity (%)</td>
<td valign="top" align="center">80.8</td>
<td valign="top" align="center">84.3</td>
<td valign="top" align="center">90.8</td>
<td valign="top" align="center"><bold>93.9</bold></td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Specificity (%)</td>
<td valign="top" align="center">77.8</td>
<td valign="top" align="center">83.0</td>
<td valign="top" align="center">90.4</td>
<td valign="top" align="center"><bold>92.3</bold></td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Accuracy (%)</td>
<td valign="top" align="center">79.3</td>
<td valign="top" align="center">83.6</td>
<td valign="top" align="center">90.6</td>
<td valign="top" align="center"><bold>93.1</bold></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Bold values signify improved classification results using proposed method</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>First, for the artificial neural network (ANN) classifier: (i) ANN with PSD, for the fatigue data, of a total with 1,046 units of actual fatigue dataset, 782 units were correctly classified as fatigue states (true positive: TP), resulting in a sensitivity of 74.8%. For the alert group, of a total of 1,046 units of actual alert dataset, 731 units of alert data were correctly classified as alert state (true negative: TN), resulting in a specificity of 69.9%. The combination of ANN and PSD resulted in an accuracy of 72.3%, (ii) ANN with AR, for the fatigue group, of a total of 1,046 units of actual fatigue dataset, 845 units of fatigue data were correctly classified as fatigue states (TP), resulting in a sensitivity of 80.8%. For the alert group, of a total of 1,046 units of actual alert dataset, 814 units of alert data were correctly classified as alert states (TN), resulting in a specificity of 77.8%, while the combination of ANN with AR resulted in an improved accuracy of 79.3% compared to ANN with PSD.</p>
<p>Second, for the Bayesian neural networks (BNN) classifier: (i) BNN with PSD achieved an improvement compared to ANN with PSD, and for the fatigue group, of a total of 1,046 units of actual fatigue dataset, 808 units of fatigue data were correctly classified as fatigue states (TP), resulting in a sensitivity of 77.2%. For the alert state, of a total of 1,046 units of actual alert dataset, 791 units of alert data were correctly classified as alert state (TN), resulting in a specificity of 75.6%. The combination BNN with PSD resulted in an accuracy of 76.4%, (ii) BNN with AR achieved an improvement compared to ANN with AR, and ANN with PSD. BNN with PSD, for the fatigue state, of a total of 1,046 units of actual fatigue data, 882 units were correctly classified as fatigue states (TP), resulting in a sensitivity of 84.3%. For the alert state, of a total of 1,046 units of actual alert data, 868 units of alert data were correctly classified as alert states (TN), resulting in a specificity of 83%. The combination BNN with AR resulted in an accuracy of 83.6%.</p>
<p>Third, when using the deep belief network (DBN) classifier: (i) DBN with PSD achieved a further improvement compared to BNN with PSD, ANN with PSD, and ANN with AR; for the fatigue state, of a total of 1,046 units of actual fatigue data, 873 units of fatigue data were correctly classified as fatigue states (TP), resulting in a sensitivity of 83.5%. For the alert state, of a total of 1,046 units of actual alert data, 833 units of alert data were correctly classified as alert state (TN), resulting in a specificity of 79.6%. The combination DBN with PSD resulted in an accuracy of 81.5%, (ii) DBN with AR achieved further improvement compared to BNN with AR, ANN with AR, DBN with PSD, BNN with PSD, and ANN with PSD, for the fatigue state, of a total of 1,046 units of actual fatigue data, 950 units of fatigue data were correctly classified as fatigue states (TP), resulting in a sensitivity of 90.8%. For the alert state, of a total of 1,046 units of actual alert data, 946 units of alert data were correctly classified as alert states (TN), resulting in a specificity of 90.4%. The combination of DBN with AR resulted in an accuracy of 90.6%.</p>
<p>Fourth, using sparse deep belief networks (sparse-DBN): (i) sparse-DBN with PSD achieved additional improvements compared to DBN with PSD, BNN with PSD, ANN with PSD, BNN with AR, and ANN with AR; for the fatigue state, of a total of 1,046 units of actual fatigue data, 919 units of fatigue data were correctly classified as fatigue states (TP), resulting in a sensitivity of 87.9%. For the alert state, of a total of 1,046 units of actual alert dataset, 855 units of alert data were correctly classified as alert state (TN), resulting in a specificity of 81.7%. The combination sparse-DBN with PSD resulted in an accuracy of 84.8%, (ii) sparse-DBN with AR achieved the most superior result to the other classifier and feature extractor combination with the fatigue state, of a total of 1,046 units of actual fatigue data, 982 units of fatigue data were correctly classified as fatigue states (TP), resulting in a sensitivity of 93.9%. For the alert state, of a total of 1,046 units of actual alert data, 965 units of alert data were correctly classified as alert states (TN), resulting in a specificity of 92.3%. The combination sparse-DBN with AR resulted in best accuracy of 93.1% compared to the other classifier and feature extractor combinations.</p>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>In summary, using the PSD feature extractor: (i) compared to the ANN classifier, the sparse-DBN classifier improved the classification performance with sensitivity by 13.1% (from 74.8 to 87.9%), specificity by 11.8% (from 69.9 to 81.7%), and accuracy by 12.5% (from 72.3 to 84.8%); (ii) compared to the BNN classifier, the sparse-DBN resulted in improved performance indicators for sensitivity by 10.7% (from 77.2 to 87.9%), specificity by 6.1% (from 75.6 to 81.7%), and accuracy by 8.4% (from 76.4 to 84.8%); (iii) compared to the DBN classifier, the sparse-DBN resulted in improved performance indicators for sensitivity by 4.4% (from 83.5 to 87.9%), specificity by 2.1% (from 79.6 to 81.7%), and accuracy by 3.3% (from 81.5 to 84.8%).</p>
<p>Further, using the AR feature extractor: (i) compared to the ANN classifier, the sparse-DBN classifier improved the classification performance with sensitivity by 13.1% (from 80.8 to 93.9%), specificity by 14.5% (from 77.8 to 92.3%), and accuracy by 13.8% (from 79.3 to 93.1%); (ii) compared to the BNN classifier, the sparse-DBN resulted in improved performance indicators for sensitivity by 9.6% (from 84.3 to 93.9%), specificity by 9.3% (from 83.0 to 92.3%), and accuracy by 9.5% (from 83.6 to 93.1%); (iii) compared to the DBN classifier, the sparse-DBN resulted in improved performance indicators for sensitivity by 3.1% (from 90.8 to 93.9%), specificity by 1.9% (from 90.4 to 92.3%), and accuracy by 2.5% (from 90.6 to 93.1%).</p>
<p>The result of sensitivity (TPR) and specificity (TNR) analyses can also be viewed as the false positive rate (FPR &#x0003D; 1&#x02212;specificity) and false negative rate (FNR &#x0003D; 1&#x02212;sensitivity). The FPR is the rate of the non-fatigue (alert) state being incorrectly classified as fatigue state. The FNR is the rate of fatigue state being incorrectly classified as an alert state. As a result, the proposed classifier (sparse-DBN) with the AR feature extractor resulted in a sensitivity (TPR) of 93.9%, FNR of 6.1%, specificity (TNR) of 92.3%, and FPR of 7.7%. For a real-time implementation, an additional debounce algorithm could be implemented. By adding a debounce component, it masks multiple consecutive false positive detection that may decrease the FPR (Bashashati et al., <xref ref-type="bibr" rid="B2">2006</xref>). The real-time implementation with a debounce algorithm will be a future direction in this area of our study.</p>
<p>For the early stopping classifier comparison, a <italic>k</italic>-fold cross-validation, a popular method for EEG machine learning, is evaluated as well (Billinger et al., <xref ref-type="bibr" rid="B4">2012</xref>). As a result, this study used <italic>k</italic>-fold cross-validation (<italic>k</italic> &#x0003D; 3) with the mean value of 10 results of accuracies on each fold. A total of 6,278 datasets were divided into 3-folds (first-fold &#x0003D; 2,093 sets, second-fold &#x0003D; 2,093 sets, and third-fold &#x0003D; 2,092 sets). Overall, the mean value accuracy of 3-folds was reported. Table <xref ref-type="table" rid="T4">4</xref> shows results using <italic>k</italic>-fold cross validation approach with the chosen AR feature extraction and different classifiers. The result shows that the mean accuracy using the <italic>k</italic>-fold cross validation approach is comparable to the early stopping approach with the proposed classifier of sparse-DBN as the best classifier (94.8% &#x000B1; 0.011 of sensitivity, 93.3% &#x000B1; 0.012 of specificity, and 94.1% &#x000B1; 0.011 of accuracy) and followed by DBN (90.9% &#x000B1; 0.005 of sensitivity, 90.5% &#x000B1; 0.005 of specificity, and 90.7% &#x000B1; 0.005 of accuracy), BNN (84.8% &#x000B1; 0.012 of sensitivity, 83.6% &#x000B1; 0.015 of specificity, and 84.2% &#x000B1; 0.014 of accuracy), and ANN (81.4% &#x000B1; 0.010 of sensitivity, 78.4% &#x000B1; 0.012 of specificity, and 79.9% &#x000B1; 0.011 of accuracy).</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p><bold>Results of classification accuracy fatigue state vs. alert state with chosen AR feature extractors and different classifiers&#x02014;<italic><bold>k</bold></italic>-fold cross validation (3-folds) approach</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Classification results</bold></th>
<th valign="top" align="center" colspan="4" style="border-bottom: thin solid #000000;"><bold>Classification methods</bold></th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>ANN (Mean &#x000B1; <italic>SD</italic>)</bold></th>
<th valign="top" align="center"><bold>BNN (Mean &#x000B1; <italic>SD</italic>)</bold></th>
<th valign="top" align="center"><bold>DBN (Mean &#x000B1; <italic>SD</italic>)</bold></th>
<th valign="top" align="center"><bold>Sparse-DBN (Mean &#x000B1; <italic>SD</italic>)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">TP</td>
<td valign="top" align="char" char=".">852.0 &#x000B1; 10.583</td>
<td valign="top" align="char" char=".">888.0 &#x000B1; 13.229</td>
<td valign="top" align="char" char=".">951.3 &#x000B1; 4.933</td>
<td valign="top" align="char" char=".">992 &#x000B1; 11.930</td>
</tr>
<tr>
<td valign="top" align="left">FN</td>
<td valign="top" align="char" char=".">194.7 &#x000B1; 10.408</td>
<td valign="top" align="char" char=".">158.7 &#x000B1; 13.051</td>
<td valign="top" align="char" char=".">95.3 &#x000B1; 4.726</td>
<td valign="top" align="char" char=".">54.3 &#x000B1; 11.719</td>
</tr>
<tr>
<td valign="top" align="left">TN</td>
<td valign="top" align="char" char=".">820.3 &#x000B1; 13.051</td>
<td valign="top" align="char" char=".">874.7 &#x000B1; 15.308</td>
<td valign="top" align="char" char=".">947.0 &#x000B1; 5.292</td>
<td valign="top" align="char" char=".">976.0 &#x000B1; 12.288</td>
</tr>
<tr>
<td valign="top" align="left">FP</td>
<td valign="top" align="char" char=".">225.7 &#x000B1; 13.051</td>
<td valign="top" align="char" char=".">171.3 &#x000B1; 15.308</td>
<td valign="top" align="char" char=".">99.0 &#x000B1; 5.292</td>
<td valign="top" align="char" char=".">70.0 &#x000B1; 12.288</td>
</tr>
<tr>
<td valign="top" align="left">Sensitivity</td>
<td valign="top" align="char" char=".">81.4% &#x000B1; 0.010</td>
<td valign="top" align="char" char=".">84.8% &#x000B1; 0.012</td>
<td valign="top" align="char" char=".">90.9% &#x000B1; 0.005</td>
<td valign="top" align="char" char="."><bold>94.8%</bold> &#x000B1; 0.011</td>
</tr>
<tr>
<td valign="top" align="left">Specificity</td>
<td valign="top" align="char" char=".">78.4% &#x000B1; 0.012</td>
<td valign="top" align="char" char=".">83.6% &#x000B1; 0.015</td>
<td valign="top" align="char" char=".">90.5% &#x000B1; 0.005</td>
<td valign="top" align="char" char="."><bold>93.3%</bold> &#x000B1; 0.012</td>
</tr>
<tr>
<td valign="top" align="left">Accuracy</td>
<td valign="top" align="char" char=".">79.9% &#x000B1; 0.011</td>
<td valign="top" align="char" char=".">84.2% &#x000B1; 0.014</td>
<td valign="top" align="char" char=".">90.7% &#x000B1; 0.005</td>
<td valign="top" align="char" char="."><bold>94.1%</bold> &#x000B1; 0.011</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Bold values signify improved classification results using proposed method</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>One-way ANOVA was used to compare the four classifiers (ANN, BNN, DBN, and sparse-DBN) and the resultant <italic>p</italic>-value was 9.3666e-07. This <italic>p</italic>-value corresponding to the <italic>F</italic>-statistic of one-way ANOVA is much lower than 0.05, suggesting that one or more classifiers are significantly different for which Tukey&#x00027;s HSD test (Tukey&#x02212;Kramer method) was used to detect where the differences were. The critical value of the Tukey&#x02212;Kramer HSD <italic>Q</italic> statistic based on the four classifiers and <italic>v</italic> &#x0003D; 8 degree of freedom for the error term, were significance levels of &#x003B1; &#x0003D; 0.01 and 0.05 (<italic>p</italic>-value). The critical value for <italic>Q</italic>, for &#x003B1; of 0.01 (<italic>Q</italic><sup>&#x003B1;</sup><sup>&#x0003D; 0.01</sup>) is 6.2044 and the critical value for <italic>Q</italic> for &#x003B1; of 0.05 (<italic>Q</italic><sup>&#x003B1;</sup><sup>&#x0003D; 0.05</sup>) is 4.5293. The Tukey HSD Q-statistic (<italic>Q</italic><sub><italic>i, j</italic></sub>) values were calculated for pairwise comparison of the classifiers. In each pair, the statistical significance is found when <italic>Q</italic><sub><italic>i, j</italic></sub> is more than the critical value of <italic>Q</italic>. Table <xref ref-type="table" rid="T5">5</xref> presents the Tukey HSD Q-statistic (<italic>Q</italic><sub><italic>i, j</italic></sub>) and Tukey HSD <italic>p</italic>-value and Tukey HSD inference of the pairwise comparisons. The results in Table <xref ref-type="table" rid="T5">5</xref> show all six pairwise combinations reached statistical significance (<sup>&#x0002A;</sup><italic>p</italic> &#x0003C; 0.05 and <sup>&#x0002A;&#x0002A;</sup><italic>p</italic> &#x0003C; 0.01). In addition, to compare the proposed classifier (sparse-DBN) and other classifiers (DBN, BNN, ANN), the sparse-DBN vs. DBN resulted in a <italic>p</italic>-value of 0.021 (<sup>&#x0002A;</sup><italic>p</italic> &#x0003C; 0.05), while sparse-DBN vs. BNN and sparse-DBN vs. ANN resulted in a <italic>p</italic>-value of 0.001 (<sup>&#x0002A;&#x0002A;</sup><italic>p</italic> &#x0003C; 0.01).</p>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p><bold>Result of Statistical significance of Tukey&#x02013;Kramer HSD in pairwise comparison</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Pairwise comparison</bold></th>
<th valign="top" align="center"><bold>Tukey HSD <italic>Q</italic>-statistic</bold></th>
<th valign="top" align="center"><bold>Tukey HSD <italic>p</italic>-value</bold></th>
<th valign="top" align="center"><bold>Tukey HSD inference</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Sparse DBN vs. DBN</td>
<td valign="top" align="center">5.376</td>
<td valign="top" align="center">0.021</td>
<td valign="top" align="center"><xref ref-type="table-fn" rid="TN1"><sup>&#x0002A;</sup></xref><italic><bold>p</bold></italic> &#x0003C; <bold>0.05</bold></td>
</tr>
<tr>
<td valign="top" align="left">Sparse DBN vs. BNN</td>
<td valign="top" align="center">15.795</td>
<td valign="top" align="center">0.001</td>
<td valign="top" align="center"><xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;</sup></xref><italic><bold>p</bold></italic> &#x0003C; <bold>0.01</bold></td>
</tr>
<tr>
<td valign="top" align="left">Sparse DBN vs. ANN</td>
<td valign="top" align="center">22.733</td>
<td valign="top" align="center">0.001</td>
<td valign="top" align="center"><xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;</sup></xref><italic><bold>p</bold></italic> &#x0003C; <bold>0.01</bold></td>
</tr>
<tr>
<td valign="top" align="left">DBN vs. BNN</td>
<td valign="top" align="center">10.419</td>
<td valign="top" align="center">0.001</td>
<td valign="top" align="center"><xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;</sup></xref><italic>p</italic> &#x0003C; 0.01</td>
</tr>
<tr>
<td valign="top" align="left">DBN vs. ANN</td>
<td valign="top" align="center">17.357</td>
<td valign="top" align="center">0.001</td>
<td valign="top" align="center"><xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;</sup></xref><italic>p</italic> &#x0003C; 0.01</td>
</tr>
<tr>
<td valign="top" align="left">BNN vs. ANN</td>
<td valign="top" align="center">6.938</td>
<td valign="top" align="center">0.005</td>
<td valign="top" align="center"><xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;</sup></xref><italic>p</italic> &#x0003C; 0.01</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Bold values signify statistical significance of proposed method vs. other methods</italic>.</p>
<fn id="TN1">
<label>&#x0002A;</label>
<p><italic>p &#x0003C; 0.05 statistically significant</italic>.</p></fn>
<fn id="TN2">
<label>&#x0002A;&#x0002A;</label>
<p><italic>p &#x0003C; 0.01 statistically highly significant</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
<p>Overall, the combination of the AR feature extractor and sparse-DBN achieved the best result with improved sensitivity, specificity and accuracy for the classification fatigue vs. alert states in a simulated driving scenario.</p>
<p>Figure <xref ref-type="fig" rid="F6">6</xref> shows the results displayed in the receiver operating characteristic (ROC) curve analyses with AR feature extractor and ANN, BNN, DBN, and sparse-DBN classifiers of early stopping (hold-out cross-validation) techniques. The ROC graph is a plot of true positive rate or sensitivity (TPR) on the Y axis and false positive rate (FPR) or 1&#x02013;specificity on the X-axis by varying different threshold ratios as the sweeping variable. A random performance of a classifier would have a straight line connecting (0, 0) to (1, 1). A ROC curve of the classifier appearing in the lower right triangle suggest it performs worse than random guessing and if the ROC curve appears in the upper left, the classifier is believed to have a superior performance classification (Huang and Ling, <xref ref-type="bibr" rid="B19">2005</xref>; Castanho et al., <xref ref-type="bibr" rid="B8">2007</xref>). All ROC curves in Figure <xref ref-type="fig" rid="F6">6</xref> for ANN, BNN, DBN, and sparse-DBN classifier shows the curves plotted in the upper left or above random guess classification. The result also shows that the ROC curve for sparse-DBN classifier achieved the best upper left curve compared to DBN, BNN, and ANN.</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p><bold>ROC plot with AUC values for AR feature extractor and ANN, BNN, DBN, and sparse-DBN classifiers of early stopping (hold-out cross-validation) technique</bold>.</p></caption>
<graphic xlink:href="fnins-11-00103-g0006.tif"/>
</fig>
<p>The areas under the curve of ROC (AUROC) were also computed to evaluate quantitatively the classification performance. AUROC represents the probability that the classifier will rank a randomly chosen positive example higher than a randomly chosen negative example, and it exhibits several interesting properties compared to accuracy measurement (Huang and Ling, <xref ref-type="bibr" rid="B19">2005</xref>). The AUROC value lies between 0 and 1 with a higher AUROC value indicating a better classification performance. Figure <xref ref-type="fig" rid="F6">6</xref> shows that the classifier using sparse-DBN and AR feature extractor achieved the best performance result with the highest AUROC of 0.9624 compared to original DBN classifier with AUROC of 0.9428, BNN classifier with AUROC 0.8725, and conventional ANN with AUROC of 0.8306.</p>
<p>Figure <xref ref-type="fig" rid="F7">7</xref> shows the results displayed in the receiver operating characteristic (ROC) curve analyses with AR feature extractor and ANN, BNN, DBN, and sparse-DBN classifiers of <italic>k</italic>-fold cross-validation (3-folds) technique with three subplots for each fold. Similar with the ROC plot from the hold-out cross validation technique, all ROC curves in Figure <xref ref-type="fig" rid="F7">7</xref> for ANN, BNN, DBN, and sparse-DBN classifier shows the curves plotted in the upper left or above random guess classification, and the ROC curve for the sparse-DBN classifier again had best upper left curve compared to DBN, BNN, and ANN. For the area under the curve analysis, in first-fold (<italic>k</italic> &#x0003D; 1), sparse-DBN achieved the best AUROC of 0.9643 compared to DBN classifier with AUROC of 0.9484, BNN classifier with AUROC of 0.8879, and ANN classifier with AUROC of 0.8419. For second-fold (<italic>k</italic> &#x0003D; 2), the sparse-DBN achieved the best AUROC of 0.9673 compared to DBN classifier with AUROC of 0.9520, BNN classifier with AUROC of 0.8968, and ANN classifier with AUROC of 0.8458. For third-fold (<italic>k</italic> &#x0003D; 3), the sparse-DBN achieved the best AUROC of 0.9627 compared to DBN classifier with AUROC of 0.9434, BNN classifier with AUROC of 0.8858, and ANN classifier with AUROC of 0.8372.</p>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p><bold>ROC plot with AUC values for AR feature extractor and ANN, BNN, DBN, and sparse-DBN classifiers of <italic><bold>k</bold></italic>-fold cross validation (<italic><bold>k</bold></italic> &#x0003D; 3) technique. (A)</bold> ROC plot with AUC value for 1st fold. <bold>(B)</bold> ROC plot with AUC value for 2nd fold. <bold>(C)</bold> ROC plot with AUC value for 3rd fold.</p></caption>
<graphic xlink:href="fnins-11-00103-g0007.tif"/>
</fig>
<p>Our previous work in Chai et al. (<xref ref-type="bibr" rid="B9">2016</xref>) showed a promising result with the inclusion of an additional pre-processing component using a recent independent component analysis (ICA) algorithm, AR feature extractor and BNN classifier. However, it was concluded that the performance of the classification needed to be improved. The findings presented in this paper, strongly suggests that the use of an AR feature extractor provides superior results compared to PSD method, and also extends further the study by improving the reliability including the sensitivity, specificity, and accuracy using sparse-DBN classifier in combination with the AR feature extractor, even without the need to include the ICA pre-processing component.</p>
<p>Using chosen classifier parameters, Table <xref ref-type="table" rid="T6">6</xref> shows the comparison of computation times between the proposed classifier (sparse-DBN) and other classifiers (ANN, BNN, and DBN). The computational time is estimated using the MATLAB <italic>tic</italic>/<italic>toc</italic> function, where the <italic>tic</italic> function was called before the program and the <italic>toc</italic> function afterward on the computer (Intel Core i5&#x02212;4570 processor 3.20 GHz, 8-GB RAM). The result shows that for the training time, the sparse-DBN required 169.23 &#x000B1; 0.93 s which was slower compared to other classifiers (86.79 &#x000B1; 0.24 s for DBN, 55.82 &#x000B1; 2.77 s for BNN and 24.02 &#x000B1; 1.04 for ANN). In terms of the testing (classification) time, all classifiers required the same amount of time of 0.03 s or less than a second to complete the task. Although the proposed sparse-DBN required more time to complete the training process, the classifier was able to perform as fast as other classifiers during the testing process. The reason that the testing times of the classifier are comparable to each other was because, after the training process, the final weights were used as constants and in the classification process all classifiers used the same ANN feed-forward classification routine. For the operation of real-time classification, there is no necessity to perform the classifier training again. The classifier just needs to compute the feed forward ANN routine with the saved weight parameters. Thus, sparse-DBN classification time in the runtime mode (execution) is fast, taking less than a second.</p>
<table-wrap position="float" id="T6">
<label>Table 6</label>
<caption><p><bold>Comparison of the training time and testing time for different classifiers</bold>.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Classifiers</bold></th>
<th valign="top" align="center"><bold>Training time (s) (Mean &#x000B1; <italic>SD</italic>)</bold></th>
<th valign="top" align="center"><bold>Testing time (s) (Mean &#x000B1; <italic>SD</italic>)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">ANN</td>
<td valign="top" align="center">24.02 &#x000B1; 1.04</td>
<td valign="top" align="center">0.0371 &#x000B1; 0.0023</td>
</tr>
<tr>
<td valign="top" align="left">BNN</td>
<td valign="top" align="center">55.82 &#x000B1; 2.77</td>
<td valign="top" align="center">0.0381 &#x000B1; 0.0082</td>
</tr>
<tr>
<td valign="top" align="left">DBN</td>
<td valign="top" align="center">86.79 &#x000B1; 0.24</td>
<td valign="top" align="center">0.0334 &#x000B1; 0.0016</td>
</tr>
<tr>
<td valign="top" align="left">Sparse-DBN</td>
<td valign="top" align="center">169.23 &#x000B1; 0.93</td>
<td valign="top" align="center">0.0385 &#x000B1; 0.0043</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The potential future direction of this research includes: (i) real-time driver fatigue with the active transfer learning approach for new user adaptation (Wu et al., <xref ref-type="bibr" rid="B47">2014</xref>; Marathe et al., <xref ref-type="bibr" rid="B33">2016</xref>; Wu, <xref ref-type="bibr" rid="B46">2016</xref>), (ii) improvement of the classification result through an intelligent fusion algorithm, and (iii) testing the efficacy of hybrid driver fatigue detection systems using a combination of physiological measurement strategies known to be related to fatigue status, such as brain signal measurement using electroencephalography (EEG), eye movement and facial tracking systems using camera and electrooculography (EOG), and heart rate variability measurement using electrocardiography (ECG).</p>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusions</title>
<p>In this paper, the EEG-based classification of fatigue vs. alert states during a simulated driving task was applied with 43 participants. The AR was used for feature extractor and the sparse-DBN was used as a classifier. For comparison, the PSD feature extractor and ANN, BNN, original DBN were included.</p>
<p>Using the early stopping (hold-out cross validation) evaluation, the results showed that for a PSD feature extractor, the sparse-DBN classifier achieved a superior classification result (sensitivity at 87.9%, specificity at 81.7%, and accuracy at 84.8%) compared to the DBN classifier (sensitivity at 83.5%, specificity at 79.6%, and accuracy at 81.6%), BNN classifier (sensitivity at 77.2%, specificity at 75.6%, and accuracy at 76.4%), and ANN classifier (sensitivity at 74.8%, specificity at 69.9%, and accuracy at 72.3%). Further, using an AR feature extractor and the sparse-DBN achieves a significantly superior classification result (sensitivity at 93.9%, specificity at 92.3%, and accuracy at 93.1% with AUROC at 0.96) compared to DBN classifier (sensitivity at 90.8%, specificity at 90.4%, and accuracy at 90.6% with AUROC at 0.94), BNN classifier (sensitivity at 84.3%, specificity at 83%, and accuracy at 83.6% with AUROC at 0.87) and ANN classifier (sensitivity at 80.8%, specificity at 77.8%, and accuracy at 79.3% with AUROC at 0.83).</p>
<p>Overall the findings strongly suggest that a combination of the AR feature extractor and sparse-DBN provides a superior performance of fatigue classification, especially in terms of overall sensitivity, specificity and accuracy for classifying the fatigue vs. alert states. The <italic>k</italic>-fold cross-validation (<italic>k</italic> &#x0003D; 3) also validated that the sparse-DBN with the AR features extractor is the best algorithm compared to the other classifiers (ANN, BNN, and DBN), confirmed by a significance of a <italic>p</italic> &#x0003C; 0.05.</p>
<p>It is hoped these results provide a foundation for the development of real-time sensitive fatigue countermeasure algorithms that can be applied in on-road settings where fatigue is a major contributor to traffic injury and mortality (Craig et al., <xref ref-type="bibr" rid="B11">2006</xref>; Wijesuriya et al., <xref ref-type="bibr" rid="B45">2007</xref>). The challenge for this type of technology to be implemented will involve valid assessment of EEG and fatigue based on classification strategies discussed in this paper, while using an optimal number of EEG channels (i.e., the minimum number that will result in valid EEG signals from relevant cortical sites) that can be easily applied. These remain the challenges for detecting fatigue using brain signal classification.</p>
</sec>
<sec id="s6">
<title>Author contributions</title>
<p>RC performed all data analysis and wrote the manuscript. SL, PS, GN, and TN advised the analysis and edited the manuscript. YT and AC conceptualized the experiment and edited the manuscript. HN supervised the study, advised the analysis, and edited the manuscript.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This study is funded by &#x0201C;non-invasive prediction of adverse neural events using brain wave activity&#x0201D; from Australian Research Council (DP150102493).</p>
</sec>
<sec id="s8">
<title>Ethics statement</title>
<p>Human subjects were given a consent form, which described the experimental procedure and any risks involved (which were minimal). After reading the form, human subjects were asked if they had any questions. Next, human subjects signed the consent form, and then the investigator signed the consent form. The consent forms were stored in a secure filling cabinet in the laboratory.</p>
<sec>
<title>Conflict of interest statement</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The reviewer SL and handling Editor declared their shared affiliation, and the handling Editor states that the process nevertheless met the standards of a fair and objective review.</p>
</sec>
</sec>
</body>
<back>
<ack><p>The authors would like to thank Dr. Nirupama Wijesuriya for her contribution to the work for collecting the data in this study.</p>
</ack>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anderson</surname> <given-names>N. R.</given-names></name> <name><surname>Wisneski</surname> <given-names>K.</given-names></name> <name><surname>Eisenman</surname> <given-names>L.</given-names></name> <name><surname>Moran</surname> <given-names>D. W.</given-names></name> <name><surname>Leuthardt</surname> <given-names>E. C.</given-names></name> <name><surname>Krusienski</surname> <given-names>D. J.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>An offline evaluation of the autoregressive spectrum for electrocorticography</article-title>. <source>IEEE Trans. Biomed. Eng.</source> <volume>56</volume>, <fpage>913</fpage>&#x02013;<lpage>916</lpage>. <pub-id pub-id-type="doi">10.1109/TBME.2009.2009767</pub-id></citation>
</ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bashashati</surname> <given-names>A.</given-names></name> <name><surname>Fatourechi</surname> <given-names>M.</given-names></name> <name><surname>Ward</surname> <given-names>R. K.</given-names></name> <name><surname>Birch</surname> <given-names>G. E.</given-names></name></person-group> (<year>2006</year>). <article-title>User customization of the feature generator of an asynchronous brain interface</article-title>. <source>Ann. Biomed. Eng.</source> <volume>34</volume>, <fpage>1051</fpage>&#x02013;<lpage>1060</lpage>. <pub-id pub-id-type="doi">10.1007/s10439-006-9097-5</pub-id><pub-id pub-id-type="pmid">16783660</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bengio</surname> <given-names>Y.</given-names></name></person-group> (<year>2009</year>). <article-title>Learning deep architectures for AI</article-title>. <source>Found. Trends Mach. Learn.</source> <volume>2</volume>, <fpage>1</fpage>&#x02013;<lpage>127</lpage>. <pub-id pub-id-type="doi">10.1561/2200000006</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Billinger</surname> <given-names>M.</given-names></name> <name><surname>Daly</surname> <given-names>I.</given-names></name> <name><surname>Kaiser</surname> <given-names>V.</given-names></name> <name><surname>Jin</surname> <given-names>J.</given-names></name> <name><surname>Allison</surname> <given-names>B. Z.</given-names></name> <name><surname>M&#x000FC;ller-Putz</surname> <given-names>G. R.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Is it significant? Guidelines for reporting BCI performance</article-title>, in <source>Towards Practical Brain-Computer Interfaces</source>, eds <person-group person-group-type="editor"><name><surname>Allison</surname> <given-names>B. Z.</given-names></name> <name><surname>Dunne</surname> <given-names>S.</given-names></name> <name><surname>Leeb</surname> <given-names>R.</given-names></name> <name><surname>Mill&#x000E1;n</surname> <given-names>J. D. R.</given-names></name> <name><surname>Nijholt</surname> <given-names>A.</given-names></name></person-group> (<publisher-loc>Berlin; Heidelberg</publisher-loc>: <publisher-name>Springer Verlag</publisher-name>), <fpage>333</fpage>&#x02013;<lpage>354</lpage>.</citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Borghini</surname> <given-names>G.</given-names></name> <name><surname>Astolfi</surname> <given-names>L.</given-names></name> <name><surname>Vecchiato</surname> <given-names>G.</given-names></name> <name><surname>Mattia</surname> <given-names>D.</given-names></name> <name><surname>Babiloni</surname> <given-names>F.</given-names></name></person-group> (<year>2014</year>). <article-title>Measuring neurophysiological signals in aircraft pilots and car drivers for the assessment of mental workload, fatigue and drowsiness</article-title>. <source>Neurosci. Biobehav. Rev.</source> <volume>44</volume>, <fpage>58</fpage>&#x02013;<lpage>75</lpage>. <pub-id pub-id-type="doi">10.1016/j.neubiorev.2012.10.003</pub-id><pub-id pub-id-type="pmid">23116991</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brunner</surname> <given-names>C.</given-names></name> <name><surname>Billinger</surname> <given-names>M.</given-names></name> <name><surname>Vidaurre</surname> <given-names>C.</given-names></name> <name><surname>Neuper</surname> <given-names>C.</given-names></name></person-group> (<year>2011</year>). <article-title>A comparison of univariate, vector, bilinear autoregressive, and band power features for brain&#x02013;computer interfaces</article-title>. <source>Med. Biol. Eng. Comput.</source> <volume>49</volume>, <fpage>1337</fpage>&#x02013;<lpage>1346</lpage>. <pub-id pub-id-type="doi">10.1007/s11517-011-0828-x</pub-id><pub-id pub-id-type="pmid">21947797</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Casson</surname> <given-names>A. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Artificial Neural Network classification of operator workload with an assessment of time variation and noise-enhancement to increase performance</article-title>. <source>Front. Neurosci.</source> <volume>8</volume>:<fpage>372</fpage>. <pub-id pub-id-type="doi">10.3389/fnins.2014.00372</pub-id><pub-id pub-id-type="pmid">25520608</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Castanho</surname> <given-names>M. J.</given-names></name> <name><surname>Barros</surname> <given-names>L. C.</given-names></name> <name><surname>Yamakami</surname> <given-names>A.</given-names></name> <name><surname>Vendite</surname> <given-names>L. L.</given-names></name></person-group> (<year>2007</year>). <article-title>Fuzzy receiver operating characteristic curve: an option to evaluate diagnostic tests</article-title>. <source>IEEE Trans. Inf. Technol. Biomed.</source> <volume>11</volume>, <fpage>244</fpage>&#x02013;<lpage>250</lpage>. <pub-id pub-id-type="doi">10.1109/TITB.2006.879593</pub-id><pub-id pub-id-type="pmid">17521074</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chai</surname> <given-names>R.</given-names></name> <name><surname>Naik</surname> <given-names>G. R.</given-names></name> <name><surname>Nguyen</surname> <given-names>T. N.</given-names></name> <name><surname>Ling</surname> <given-names>S. H.</given-names></name> <name><surname>Tran</surname> <given-names>Y.</given-names></name> <name><surname>Craig</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Driver Fatigue Classification with Independent Component by Entropy Rate Bound Minimization Analysis in an EEG-based System</article-title>. <source>IEEE J. Biomed. Health Informat</source>. <volume>PP</volume>, <fpage>1</fpage>&#x02013;<lpage>1</lpage>. <pub-id pub-id-type="doi">10.1109/JBHI.2016.2532354</pub-id></citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chuang</surname> <given-names>C.-H.</given-names></name> <name><surname>Huang</surname> <given-names>C.-S.</given-names></name> <name><surname>Ko</surname> <given-names>L.-W.</given-names></name> <name><surname>Lin</surname> <given-names>C.-T.</given-names></name></person-group> (<year>2015</year>). <article-title>An EEG-based perceptual function integration network for application to drowsy driving</article-title>. <source>Knowl. Based Syst.</source> <volume>80</volume>, <fpage>143</fpage>&#x02013;<lpage>152</lpage>. <pub-id pub-id-type="doi">10.1016/j.knosys.2015.01.007</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Craig</surname> <given-names>A.</given-names></name> <name><surname>Tran</surname> <given-names>Y.</given-names></name> <name><surname>Wijesuriya</surname> <given-names>N.</given-names></name> <name><surname>Boord</surname> <given-names>P.</given-names></name></person-group> (<year>2006</year>). <article-title>A controlled investigation into the psychological determinants of fatigue</article-title>. <source>Biol. Psychol.</source> <volume>72</volume>, <fpage>78</fpage>&#x02013;<lpage>87</lpage>. <pub-id pub-id-type="doi">10.1016/j.biopsycho.2005.07.005</pub-id><pub-id pub-id-type="pmid">16137817</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Craig</surname> <given-names>A.</given-names></name> <name><surname>Tran</surname> <given-names>Y.</given-names></name> <name><surname>Wijesuriya</surname> <given-names>N.</given-names></name> <name><surname>Nguyen</surname> <given-names>H.</given-names></name></person-group> (<year>2012</year>). <article-title>Regional brain wave activity changes associated with fatigue</article-title>. <source>Psychophysiology</source> <volume>49</volume>, <fpage>574</fpage>&#x02013;<lpage>582</lpage>. <pub-id pub-id-type="doi">10.1111/j.1469-8986.2011.01329.x</pub-id><pub-id pub-id-type="pmid">22324302</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Demandt</surname> <given-names>E.</given-names></name> <name><surname>Mehring</surname> <given-names>C.</given-names></name> <name><surname>Vogt</surname> <given-names>K.</given-names></name> <name><surname>Schulze-Bonhage</surname> <given-names>A.</given-names></name> <name><surname>Aertsen</surname> <given-names>A.</given-names></name> <name><surname>Ball</surname> <given-names>T.</given-names></name></person-group> (<year>2012</year>). <article-title>Reaching movement onset- and end-related characteristics of EEG spectral power modulations</article-title>. <source>Front. Neurosci.</source> <volume>6</volume>:<fpage>65</fpage>. <pub-id pub-id-type="doi">10.3389/fnins.2012.00065</pub-id><pub-id pub-id-type="pmid">22586364</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>He</surname> <given-names>Q.</given-names></name> <name><surname>Li</surname> <given-names>W.</given-names></name> <name><surname>Fan</surname> <given-names>X.</given-names></name> <name><surname>Fei</surname> <given-names>Z.</given-names></name></person-group> (<year>2015</year>). <article-title>Driver fatigue evaluation model with integration of multi-indicators based on dynamic Bayesian network</article-title>. <source>IET Intell. Trans. Syst.</source> <volume>9</volume>, <fpage>547</fpage>&#x02013;<lpage>554</lpage>. <pub-id pub-id-type="doi">10.1049/iet-its.2014.0103</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hinton</surname> <given-names>G.</given-names></name> <name><surname>Deng</surname> <given-names>L.</given-names></name> <name><surname>Yu</surname> <given-names>D.</given-names></name> <name><surname>Dahl</surname> <given-names>G. E.</given-names></name> <name><surname>Mohamed</surname> <given-names>A. R.</given-names></name> <name><surname>Jaitly</surname> <given-names>N.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups</article-title>. <source>IEEE Signal Process. Mag.</source> <volume>29</volume>, <fpage>82</fpage>&#x02013;<lpage>97</lpage>. <pub-id pub-id-type="doi">10.1109/MSP.2012.2205597</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hinton</surname> <given-names>G. E.</given-names></name> <name><surname>Osindero</surname> <given-names>S.</given-names></name> <name><surname>Teh</surname> <given-names>Y.-W.</given-names></name></person-group> (<year>2006</year>). <article-title>A fast learning algorithm for deep belief nets</article-title>. <source>Neural Comput.</source> <volume>18</volume>, <fpage>1527</fpage>&#x02013;<lpage>1554</lpage>. <pub-id pub-id-type="doi">10.1162/neco.2006.18.7.1527</pub-id><pub-id pub-id-type="pmid">16764513</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hinton</surname> <given-names>G. E.</given-names></name> <name><surname>Salakhutdinov</surname> <given-names>R. R.</given-names></name></person-group> (<year>2006</year>). <article-title>Reducing the dimensionality of data with neural networks</article-title>. <source>Science</source> <volume>313</volume>, <fpage>504</fpage>&#x02013;<lpage>507</lpage>. <pub-id pub-id-type="doi">10.1126/science.1127647</pub-id><pub-id pub-id-type="pmid">16873662</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hsieh</surname> <given-names>C.-S.</given-names></name> <name><surname>Tai</surname> <given-names>C.-C.</given-names></name></person-group> (<year>2013</year>). <article-title>An improved and portable eye-blink duration detection system to warn of driver fatigue</article-title>. <source>Instrum. Sci. Technol.</source> <volume>41</volume>, <fpage>429</fpage>&#x02013;<lpage>444</lpage>. <pub-id pub-id-type="doi">10.1080/10739149.2013.796560</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Ling</surname> <given-names>C. X.</given-names></name></person-group> (<year>2005</year>). <article-title>Using AUC and accuracy in evaluating learning algorithms</article-title>. <source>IEEE Trans. Knowl. Data Eng.</source> <volume>17</volume>, <fpage>299</fpage>&#x02013;<lpage>310</lpage>. <pub-id pub-id-type="doi">10.1109/TKDE.2005.50</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ji</surname> <given-names>N.-N.</given-names></name> <name><surname>Zhang</surname> <given-names>J.-S.</given-names></name> <name><surname>Zhang</surname> <given-names>C.-X.</given-names></name></person-group> (<year>2014</year>). <article-title>A sparse-response deep belief network based on rate distortion theory</article-title>. <source>Pattern Recognit.</source> <volume>47</volume>, <fpage>3179</fpage>&#x02013;<lpage>3191</lpage>. <pub-id pub-id-type="doi">10.1016/j.patcog.2014.03.025</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jung</surname> <given-names>S. J.</given-names></name> <name><surname>Shin</surname> <given-names>H. J.</given-names></name> <name><surname>Chung</surname> <given-names>W. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Driver fatigue and drowsiness monitoring system with embedded electrocardiogram sensor on steering wheel</article-title>. <source>IET Intell. Transp. Syst.</source> <volume>8</volume>, <fpage>43</fpage>&#x02013;<lpage>50</lpage>. <pub-id pub-id-type="doi">10.1049/iet-its.2012.0032</pub-id></citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jurecki</surname> <given-names>R. S.</given-names></name> <name><surname>Sta&#x00144;czyk</surname> <given-names>T. L.</given-names></name></person-group> (<year>2014</year>). <article-title>Driver reaction time to lateral entering pedestrian in a simulated crash traffic situation</article-title>. <source>Transp. Res. Part F Traffic Psychol. Behav.</source> <volume>27</volume>, <fpage>22</fpage>&#x02013;<lpage>36</lpage>. <pub-id pub-id-type="doi">10.1016/j.trf.2014.08.006</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Krizhevsky</surname> <given-names>A.</given-names></name> <name><surname>Sutskever</surname> <given-names>I.</given-names></name> <name><surname>Hinton</surname> <given-names>G. E.</given-names></name></person-group> (<year>2012</year>). <article-title>Imagenet classification with deep convolutional neural networks</article-title>, in <source>Advances in Neural Information Processing Systems</source> (<publisher-loc>Lake Tahoe</publisher-loc>), <fpage>1097</fpage>&#x02013;<lpage>1105</lpage>.</citation></ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lai</surname> <given-names>J.-S.</given-names></name> <name><surname>Cella</surname> <given-names>D.</given-names></name> <name><surname>Choi</surname> <given-names>S.</given-names></name> <name><surname>Junghaenel</surname> <given-names>D. U.</given-names></name> <name><surname>Christodoulou</surname> <given-names>C.</given-names></name> <name><surname>Gershon</surname> <given-names>R.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>How item banks and their application can influence measurement practice in rehabilitation medicine: a PROMIS fatigue item bank example</article-title>. <source>Arch. Phys. Med. Rehabil.</source> <volume>92</volume>, <fpage>S20</fpage>&#x02013;<lpage>S27</lpage>. <pub-id pub-id-type="doi">10.1016/j.apmr.2010.08.033</pub-id><pub-id pub-id-type="pmid">21958919</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lal</surname> <given-names>S. K.</given-names></name> <name><surname>Craig</surname> <given-names>A.</given-names></name></person-group> (<year>2001</year>). <article-title>A critical review of the psychophysiology of driver fatigue</article-title>. <source>Biol. Psychol.</source> <volume>55</volume>, <fpage>173</fpage>&#x02013;<lpage>194</lpage>. <pub-id pub-id-type="doi">10.1016/S0301-0511(00)00085-5</pub-id><pub-id pub-id-type="pmid">11240213</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lal</surname> <given-names>S. K.</given-names></name> <name><surname>Craig</surname> <given-names>A.</given-names></name> <name><surname>Boord</surname> <given-names>P.</given-names></name> <name><surname>Kirkup</surname> <given-names>L.</given-names></name> <name><surname>Nguyen</surname> <given-names>H.</given-names></name></person-group> (<year>2003</year>). <article-title>Development of an algorithm for an EEG-based driver fatigue countermeasure</article-title>. <source>J. Saf. Res.</source> <volume>34</volume>, <fpage>321</fpage>&#x02013;<lpage>328</lpage>. <pub-id pub-id-type="doi">10.1016/S0022-4375(03)00027-6</pub-id><pub-id pub-id-type="pmid">12963079</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>LeCun</surname> <given-names>Y.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name> <name><surname>Hinton</surname> <given-names>G.</given-names></name></person-group> (<year>2015</year>). <article-title>Deep learning</article-title>. <source>Nature</source> <volume>521</volume>, <fpage>436</fpage>&#x02013;<lpage>444</lpage>. <pub-id pub-id-type="doi">10.1038/nature14539</pub-id><pub-id pub-id-type="pmid">26017442</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>B. G.</given-names></name> <name><surname>Chung</surname> <given-names>W. Y.</given-names></name></person-group> (<year>2012</year>). <article-title>Driver alertness monitoring using fusion of facial features and bio-signals</article-title>. <source>IEEE Sensors J.</source> <volume>12</volume>, <fpage>2416</fpage>&#x02013;<lpage>2422</lpage>. <pub-id pub-id-type="doi">10.1109/JSEN.2012.2190505</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>H.</given-names></name> <name><surname>Ekanadham</surname> <given-names>C.</given-names></name> <name><surname>Ng</surname> <given-names>A. Y.</given-names></name></person-group> (<year>2008</year>). <article-title>Sparse deep belief net model for visual area V2</article-title>, in <source>Advances in Neural Information Processing Systems</source> (<publisher-loc>Vancouver</publisher-loc>), <fpage>873</fpage>&#x02013;<lpage>880</lpage>.</citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>C. T.</given-names></name> <name><surname>Chang</surname> <given-names>C. J.</given-names></name> <name><surname>Lin</surname> <given-names>B. S.</given-names></name> <name><surname>Hung</surname> <given-names>S. H.</given-names></name> <name><surname>Chao</surname> <given-names>C. F.</given-names></name> <name><surname>Wang</surname> <given-names>I. J.</given-names></name></person-group> (<year>2010</year>). <article-title>A real-time wireless brain-computer interface system for drowsiness detection</article-title>. <source>IEEE Trans. Biomed. Circuits Syst.</source> <volume>4</volume>, <fpage>214</fpage>&#x02013;<lpage>222</lpage>. <pub-id pub-id-type="doi">10.1109/TBCAS.2010.2046415</pub-id><pub-id pub-id-type="pmid">23853367</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>C. T.</given-names></name> <name><surname>Chuang</surname> <given-names>C. H.</given-names></name> <name><surname>Huang</surname> <given-names>C. S.</given-names></name> <name><surname>Tsai</surname> <given-names>S. F.</given-names></name> <name><surname>Lu</surname> <given-names>S. W.</given-names></name> <name><surname>Chen</surname> <given-names>Y. H.</given-names></name> <etal/></person-group>. (<year>2014</year>). <article-title>Wireless and wearable EEG system for evaluating driver vigilance</article-title>. <source>IEEE Trans. Biomed. Circuits Syst.</source> <volume>8</volume>, <fpage>165</fpage>&#x02013;<lpage>176</lpage>. <pub-id pub-id-type="doi">10.1109/TBCAS.2014.2316224</pub-id><pub-id pub-id-type="pmid">24860041</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>C.-T.</given-names></name> <name><surname>Wu</surname> <given-names>R.-C.</given-names></name> <name><surname>Jung</surname> <given-names>T.-P.</given-names></name> <name><surname>Liang</surname> <given-names>S.-F.</given-names></name> <name><surname>Huang</surname> <given-names>T.-Y.</given-names></name></person-group> (<year>2005</year>). <article-title>Estimating driving performance based on EEG spectrum analysis</article-title>. <source>EURASIP J. Appl. Signal Process.</source> <volume>2005</volume>, <fpage>3165</fpage>&#x02013;<lpage>3174</lpage>. <pub-id pub-id-type="doi">10.1155/ASP.2005.3165</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marathe</surname> <given-names>A. R.</given-names></name> <name><surname>Lawhern</surname> <given-names>V. J.</given-names></name> <name><surname>Wu</surname> <given-names>D.</given-names></name> <name><surname>Slayback</surname> <given-names>D.</given-names></name> <name><surname>Lance</surname> <given-names>B. J.</given-names></name></person-group> (<year>2016</year>). <article-title>Improved neural signal classification in a rapid serial visual presentation task using active learning</article-title>. <source>IEEE Trans. Neural Syst. Rehab. Eng.</source> <volume>24</volume>, <fpage>333</fpage>&#x02013;<lpage>343</lpage>. <pub-id pub-id-type="doi">10.1109/TNSRE.2015.2502323</pub-id><pub-id pub-id-type="pmid">26600162</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Matthews</surname> <given-names>G.</given-names></name> <name><surname>Desmond</surname> <given-names>P. A.</given-names></name> <name><surname>Neubauer</surname> <given-names>C.</given-names></name> <name><surname>Hancock</surname> <given-names>P. A.</given-names></name></person-group> (<year>2012</year>). <source>The Handbook of Operator Fatigue</source>. <publisher-loc>Surrey</publisher-loc>: <publisher-name>Ashgate Publishing, Ltd</publisher-name>.</citation></ref>
<ref id="B35">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>McFarland</surname> <given-names>D. J.</given-names></name> <name><surname>Wolpaw</surname> <given-names>J. R.</given-names></name></person-group> (<year>2008</year>). <article-title>Sensorimotor rhythm-based brain&#x02013;computer interface (BCI): model order selection for autoregressive spectral analysis</article-title>. <source>J. Neural Eng.</source> <volume>5</volume>, <fpage>155</fpage>. <pub-id pub-id-type="doi">10.1088/1741-2560/5/2/006</pub-id><pub-id pub-id-type="pmid">18430974</pub-id></citation></ref>
<ref id="B36">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mohamed</surname> <given-names>A.-R.</given-names></name> <name><surname>Yu</surname> <given-names>D.</given-names></name> <name><surname>Deng</surname> <given-names>L.</given-names></name></person-group> (<year>2010</year>). <article-title>Investigation of full-sequence training of deep belief networks for speech recognition</article-title>, in <source>INTERSPEECH</source>, <fpage>2846</fpage>&#x02013;<lpage>2849</lpage>.</citation></ref>
<ref id="B37">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nguyen</surname> <given-names>H. T.</given-names></name></person-group> (<year>2008</year>). <article-title>Intelligent technologies for real-time biomedical engineering applications</article-title>. <source>Int. J. Autom. Control</source> <volume>2</volume>, <fpage>274</fpage>&#x02013;<lpage>285</lpage>. <pub-id pub-id-type="doi">10.1504/IJAAC.2008.022181</pub-id></citation></ref>
<ref id="B38">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x00027;Connor</surname> <given-names>P.</given-names></name> <name><surname>Neil</surname> <given-names>D.</given-names></name> <name><surname>Liu</surname> <given-names>S.-C.</given-names></name> <name><surname>Delbruck</surname> <given-names>T.</given-names></name> <name><surname>Pfeiffer</surname> <given-names>M.</given-names></name></person-group> (<year>2013</year>). <article-title>Real-time classification and sensor fusion with a spiking deep belief network</article-title>. <source>Front. Neurosci.</source> <volume>7</volume>:<fpage>178</fpage>. <pub-id pub-id-type="doi">10.3389/fnins.2013.00178</pub-id><pub-id pub-id-type="pmid">24115919</pub-id></citation></ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stromatias</surname> <given-names>E.</given-names></name> <name><surname>Neil</surname> <given-names>D.</given-names></name> <name><surname>Pfeiffer</surname> <given-names>M.</given-names></name> <name><surname>Galluppi</surname> <given-names>F.</given-names></name> <name><surname>Furber</surname> <given-names>S. B.</given-names></name> <name><surname>Liu</surname> <given-names>S.-C.</given-names></name></person-group> (<year>2015</year>). <article-title>Robustness of spiking Deep Belief Networks to noise and reduced bit precision of neuro-inspired hardware platforms</article-title>. <source>Front. Neurosci.</source> <volume>9</volume>:<fpage>222</fpage>. <pub-id pub-id-type="doi">10.3389/fnins.2015.00222</pub-id><pub-id pub-id-type="pmid">26217169</pub-id></citation></ref>
<ref id="B40">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Touryan</surname> <given-names>J.</given-names></name> <name><surname>Apker</surname> <given-names>G.</given-names></name> <name><surname>Kerick</surname> <given-names>S.</given-names></name> <name><surname>Lance</surname> <given-names>B.</given-names></name> <name><surname>Ries</surname> <given-names>A. J.</given-names></name> <name><surname>McDowell</surname> <given-names>K.</given-names></name></person-group> (<year>2013</year>). <article-title>Translation of EEG-based performance prediction models to rapid serial visual presentation tasks</article-title>, in <source>International Conference on Augmented Cognition</source> (<publisher-loc>Las Vegas</publisher-loc>), <fpage>521</fpage>&#x02013;<lpage>530</lpage>.</citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Touryan</surname> <given-names>J.</given-names></name> <name><surname>Apker</surname> <given-names>G.</given-names></name> <name><surname>Lance</surname> <given-names>B. J.</given-names></name> <name><surname>Kerick</surname> <given-names>S. E.</given-names></name> <name><surname>Ries</surname> <given-names>A. J.</given-names></name> <name><surname>McDowell</surname> <given-names>K.</given-names></name></person-group> (<year>2014</year>). <article-title>Estimating endogenous changes in task performance from EEG</article-title>. <source>Front. Neurosci.</source> <volume>8</volume>:<fpage>155</fpage>. <pub-id pub-id-type="doi">10.3389/fnins.2014.00155</pub-id><pub-id pub-id-type="pmid">24994968</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tran</surname> <given-names>Y.</given-names></name> <name><surname>Wijesuriya</surname> <given-names>N.</given-names></name> <name><surname>Tarvainen</surname> <given-names>M.</given-names></name> <name><surname>Karjalainen</surname> <given-names>P.</given-names></name> <name><surname>Craig</surname> <given-names>A.</given-names></name></person-group> (<year>2009</year>). <article-title>The relationship between spectral changes in heart rate variability and fatigue</article-title>. <source>J. Psychophysiol.</source> <volume>23</volume>, <fpage>143</fpage>&#x02013;<lpage>151</lpage>. <pub-id pub-id-type="doi">10.1027/0269-8803.23.3.143</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vanlaar</surname> <given-names>W.</given-names></name> <name><surname>Simpson</surname> <given-names>H.</given-names></name> <name><surname>Mayhew</surname> <given-names>D.</given-names></name> <name><surname>Robertson</surname> <given-names>R.</given-names></name></person-group> (<year>2008</year>). <article-title>Fatigued and drowsy driving: a survey of attitudes, opinions and behaviors</article-title>. <source>J. Saf. Res.</source> <volume>39</volume>, <fpage>303</fpage>&#x02013;<lpage>309</lpage>. <pub-id pub-id-type="doi">10.1016/j.jsr.2007.12.007</pub-id><pub-id pub-id-type="pmid">18571572</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>G.</given-names></name> <name><surname>Sun</surname> <given-names>Z.</given-names></name> <name><surname>Tao</surname> <given-names>R.</given-names></name> <name><surname>Li</surname> <given-names>K.</given-names></name> <name><surname>Bao</surname> <given-names>G.</given-names></name> <name><surname>Yan</surname> <given-names>X.</given-names></name></person-group> (<year>2016</year>). <article-title>Epileptic seizure detection based on partial directed coherence analysis</article-title>. <source>IEEE J. Biomed. Health Informat.</source> <volume>20</volume>, <fpage>873</fpage>&#x02013;<lpage>879</lpage>. <pub-id pub-id-type="doi">10.1109/JBHI.2015.2424074</pub-id><pub-id pub-id-type="pmid">25898286</pub-id></citation></ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wijesuriya</surname> <given-names>N.</given-names></name> <name><surname>Tran</surname> <given-names>Y.</given-names></name> <name><surname>Craig</surname> <given-names>A.</given-names></name></person-group> (<year>2007</year>). <article-title>The psychophysiological determinants of fatigue</article-title>. <source>Int. J. Psychophysiol.</source> <volume>63</volume>, <fpage>77</fpage>&#x02013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijpsycho.2006.08.005</pub-id><pub-id pub-id-type="pmid">17007946</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>D.</given-names></name></person-group> (<year>2016</year>). <article-title>Online and offline domain adaptation for reducing BCI calibration effort</article-title>. <source>IEEE Trans. Hum. Mach. Syst.</source> <volume>PP</volume>, <fpage>1</fpage>&#x02013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1109/THMS.2016.2608931</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>D.</given-names></name> <name><surname>Lance</surname> <given-names>B.</given-names></name> <name><surname>Lawhern</surname> <given-names>V.</given-names></name></person-group> (<year>2014</year>). <article-title>Transfer learning and active transfer learning for reducing calibration data in single-trial classification of visually-evoked potentials</article-title>, in <source>2014 IEEE International Conference on Systems, Man, and Cybernetics</source> (<publisher-loc>San Diego, CA</publisher-loc>: <publisher-name>SMC</publisher-name>), <fpage>2801</fpage>&#x02013;<lpage>2807</lpage>. <pub-id pub-id-type="doi">10.1109/smc.2014.6974353</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Xiong</surname> <given-names>Y.</given-names></name> <name><surname>Gao</surname> <given-names>J.</given-names></name> <name><surname>Yang</surname> <given-names>Y.</given-names></name> <name><surname>Yu</surname> <given-names>X.</given-names></name> <name><surname>Huang</surname> <given-names>W.</given-names></name></person-group> (<year>2016</year>). <article-title>Classifying driving fatigue based on combined entropy measure using EEG signals</article-title>. <source>Int. J. Control Automat.</source> <volume>9</volume>, <fpage>329</fpage>&#x02013;<lpage>338</lpage>. <pub-id pub-id-type="doi">10.14257/ijca.2016.9.3.30</pub-id></citation></ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>C.</given-names></name> <name><surname>Wang</surname> <given-names>H.</given-names></name> <name><surname>Fu</surname> <given-names>R.</given-names></name></person-group> (<year>2014</year>). <article-title>Automated detection of driver fatigue based on entropy and complexity measures</article-title>. <source>IEEE Trans. Intell. Transp. Syst.</source> <volume>15</volume>, <fpage>168</fpage>&#x02013;<lpage>177</lpage>. <pub-id pub-id-type="doi">10.1109/TITS.2013.2275192</pub-id></citation></ref>
</ref-list>
</back>
</article>
