<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Sens.</journal-id>
<journal-title>Frontiers in Sensors</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Sens.</abbrev-journal-title>
<issn pub-type="epub">2673-5067</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">1662060</article-id>
<article-id pub-id-type="doi">10.3389/fsens.2025.1662060</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Sensors</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Machine learning pipeline for microparticle size classification in self-mixing interferometric signals for flow cytometry</article-title>
<alt-title alt-title-type="left-running-head">Sierra-Alarc&#xf3;n et al.</alt-title>
<alt-title alt-title-type="right-running-head">
<ext-link ext-link-type="uri" xlink:href="https://doi.org/10.3389/fsens.2025.1662060">10.3389/fsens.2025.1662060</ext-link>
</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Sierra-Alarc&#xf3;n</surname>
<given-names>Sebasti&#xe1;n</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/3127116/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/data-curation/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/methodology/"/>
<role content-type="https://credit.niso.org/contributor-roles/validation/"/>
<role content-type="https://credit.niso.org/contributor-roles/visualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/Writing - review &#x26; editing/"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Perchoux</surname>
<given-names>Julien</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/funding-acquisition/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/resources/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
<role content-type="https://credit.niso.org/contributor-roles/Writing - review &#x26; editing/"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Tronche</surname>
<given-names>Cl&#xe9;ment</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/resources/"/>
<role content-type="https://credit.niso.org/contributor-roles/Writing - review &#x26; editing/"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Jayat</surname>
<given-names>Francis</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/resources/"/>
<role content-type="https://credit.niso.org/contributor-roles/Writing - review &#x26; editing/"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Quotb</surname>
<given-names>Adam</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/"/>
<role content-type="https://credit.niso.org/contributor-roles/formal-analysis/"/>
<role content-type="https://credit.niso.org/contributor-roles/funding-acquisition/"/>
<role content-type="https://credit.niso.org/contributor-roles/investigation/"/>
<role content-type="https://credit.niso.org/contributor-roles/project-administration/"/>
<role content-type="https://credit.niso.org/contributor-roles/resources/"/>
<role content-type="https://credit.niso.org/contributor-roles/supervision/"/>
<role content-type="https://credit.niso.org/contributor-roles/validation/"/>
<role content-type="https://credit.niso.org/contributor-roles/Writing - review &#x26; editing/"/>
</contrib>
</contrib-group>
<aff>
<institution>INP, CNRS, LAAS-CNRS, Universit&#xe9; de Toulouse</institution>, <addr-line>Toulouse</addr-line>, <country>France</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/2309309/overview">Li-Peng Sun</ext-link>, Jinan University, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/3162465/overview">Yanzhen Tan</ext-link>, Dongguan University of Technology, China</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/3164837/overview">Fei Xie</ext-link>, Handan University, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Adam Quotb, <email>adam.quotb@laas.fr</email>
</corresp>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>09</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<volume>6</volume>
<elocation-id>1662060</elocation-id>
<history>
<date date-type="received">
<day>08</day>
<month>07</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>18</day>
<month>08</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2025 Sierra-Alarc&#xf3;n, Perchoux, Tronche, Jayat and Quotb.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Sierra-Alarc&#xf3;n, Perchoux, Tronche, Jayat and Quotb</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Self-mixing interferometry (SMI) is an emerging optical sensing technique for detecting and classifying microparticles in non-contact and label-free flowmetry applications. High precision and reliability are essential for its integration into medical diagnostics, such as blood analysis, and quality control in chemical manufacturing processes. While theoretical models describe SMI-induced signal modulations caused by particle passage, challenges persist due to signal noise, variability, and interpretability under experimental conditions. This study enhances SMI-based particle size classification by integrating machine learning (ML) models to improve feature extraction and classification accuracy. Three ML pipelines are evaluated, achieving 98% classification accuracy in distinguishing particles of different sizes (2, 4, and 10&#xa0;&#xb5;m). The high classification accuracy demonstrates the scalability of our approach, ensuring its applicability across diverse particle analysis scenarios.</p>
</abstract>
<kwd-group>
<kwd>self-mixing interferometry</kwd>
<kwd>micro-particle size classification</kwd>
<kwd>machine learning</kwd>
<kwd>flow citometry</kwd>
<kwd>signal processing</kwd>
</kwd-group>
<counts>
<page-count count="12"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Optoelectronic and Photonic Sensors</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Self-mixing interferometry (SMI), also known as optical feedback interferometry (OFI), has gained significant attention due to its versatility and cost-effectiveness in sensing applications (<xref ref-type="bibr" rid="B16">Perchoux et al., 2016</xref>; <xref ref-type="bibr" rid="B9">Donati and Norgia, 2014</xref>; <xref ref-type="bibr" rid="B17">Quotb et al., 2021</xref>; <xref ref-type="bibr" rid="B20">Taimre et al., 2015</xref>). This laser-based technique relies on the interference between emitted laser light and backscattered light from an external target, enabling the development of compact, low-cost, and high-resolution optical sensors. One of the key research areas in SMI is its application in a microfluidic context, particularly for single-particle analysis, with the goal of establishing an SMI-based label-free flow cytometry system for medical sensing.</p>
<p>Since the initial demonstrations of detecting submicron and micron particles using SMI sensing, substantial progress has been made in understanding the signal modulation induced by single-particle interactions with the laser beam (<xref ref-type="bibr" rid="B8">Da Costa Moreira et al., 2017</xref>; <xref ref-type="bibr" rid="B12">Herbert et al., 2018</xref>). These advances have enabled detection of particles as small as 100&#xa0;nm (<xref ref-type="bibr" rid="B26">Zhao et al., 2023a</xref>) and led to the development of analytical models that enhance our understanding of SMI signals. This progress has also paved the way for the first SMI-based flow cytometers, capable of detecting polystyrene beads and even classifying cancer cells (<xref ref-type="bibr" rid="B25">Zhao Y. et al., 2020</xref>; <xref ref-type="bibr" rid="B23">Zhao et al., 2019</xref>; <xref ref-type="bibr" rid="B26">Zhao et al., 2023a</xref>). While these studies demonstrated SMI&#x2019;s potential for microparticle identification, they have predominantly focused on particle detection rather than classification, revealing persistent challenges in isolating signal bursts and defining particle signatures that are clear and distinct enough to enable reliable classification.</p>
<p>To address these challenges, different signal processing techniques have been explored, including bandpass filtering, fringe counting based on the Hilbert transform, and frequency/amplitude modulation analysis (<xref ref-type="bibr" rid="B23">Zhao et al., 2019</xref>; <xref ref-type="bibr" rid="B12">Herbert et al., 2018</xref>). However, SMI signals are often noisy, especially for smaller particles, where the signal-to-noise ratio is low. Additionally, the modulation strength of backscattered light, which is critical for particle identification, varies significantly depending on factors such as particle size, speed, refractive index, and surface characteristics. These challenges become more pronounced in complex environments containing heterogeneous particle mixtures, where signal features often overlap, making particle classification more difficult.</p>
<p>Previous efforts have explored the relationship between the temporal and frequency domains of SMI signals, utilizing features such as Doppler frequency peaks (which correlate with particle speed) and fringe amplitude and duration (which relate to particle size). To improve particle passage detection and feature extraction, advanced techniques such as wavelet transforms and spectrogram analysis have been introduced (<xref ref-type="bibr" rid="B27">Zhao et al., 2023b</xref>; <xref ref-type="bibr" rid="B19">Sierra-Alarc&#xf3;n et al., 2024</xref>). Despite these advances, classical signal processing methods alone are often insufficient to address the full range of classification challenges. Machine learning techniques provide a promising alternative, as they can extract complex patterns from noisy SMI signals (<xref ref-type="bibr" rid="B5">Barland and Gustave, 2021</xref>; <xref ref-type="bibr" rid="B15">Novac et al., 2024</xref>; <xref ref-type="bibr" rid="B3">An and Liu, 2022</xref>; <xref ref-type="bibr" rid="B7">Chen et al., 2024</xref>), potentially improving SMI-based particle identification and the classification accuracy. However, the application of ML models to SMI single-particle analysis remains in its initial stages due to the lack of comprehensive datasets for single-particle transit modulation and diverse particle types.</p>
<p>This study aims to enhance the reliability of SMI-based particle classification by integrating ML models into the SMI signal processing pipeline. Building on prior work focused on understanding signal characteristics, the study transitions from feature exploration to predictive classification. As shown in <xref ref-type="fig" rid="F1">Figure 1</xref>, the proposed workflow begins with accurate signal acquisition from polystyrene particles of 2, 4, and 10&#xa0;&#x3bc;m, followed by preprocessing steps that include both online and offline filtering. Three data representations were evaluated for ML-based classification: (i) handcrafted features extracted from the time-domain and frequency signal, (ii) spectrograms to capture time-frequency correlations, commonly used in audio and biomedical signal classification tasks (<xref ref-type="bibr" rid="B11">Ha et al., 2023</xref>; <xref ref-type="bibr" rid="B24">Zhao K. et al., 2020</xref>; <xref ref-type="bibr" rid="B10">Gourisaria et al., 2024</xref>), and (iii) the temporal SMI sensor signal waveform. To improve generalization and balance the dataset, data augmentation techniques were applied. Finally, different ML models were trained and compared to determine their effectiveness in accurately classifying particle size.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Overview of the ML pipeline developed for micro-particle size classification from SMI-sensor signals.</p>
</caption>
<graphic xlink:href="fsens-06-1662060-g001.tif">
<alt-text content-type="machine-generated">Flowchart depicting a process starting with Signal Acquisition, followed by Signal Preprocessing, then ML Classifier, and ending with Particle Classification. Each stage includes subcomponents: Signal Acquisition involves detection algorithm and dataset cleaning, Signal Preprocessing includes feature extraction and data augmentation, ML Classifier involves models testing and performance calibration, and Particle Classification involves size measurement (2 micrometers, 4 micrometers, 10 micrometers) and results analysis.</alt-text>
</graphic>
</fig>
<p>The paper is organized as follows: <xref ref-type="sec" rid="s2-1">Section 2.1</xref> explains the induced modulation due to single-particle transit. <xref ref-type="sec" rid="s2-2">Section 2.2</xref> describes the experimental setup for the SMI flow cytometer. <xref ref-type="sec" rid="s2-3">Section 2.3</xref> outlines the data acquisition and classification pipeline. Finally, <xref ref-type="sec" rid="s3">Section 3</xref> presents and discusses the classification results obtained from the different approaches.</p>
</sec>
<sec sec-type="materials|methods" id="s2">
<title>2 Materials and methods</title>
<sec id="s2-1">
<title>2.1 Theory</title>
<p>The self-mixing interferometry phenomenon arises from the interaction between the internal light wave propagating within the laser cavity and the portion of light backscattered by an external target that re-enters into the cavity, causing a modulation in the laser output power. In the context of this study, we focus exclusively on the single-particle case, where only one particle at a time crosses the laser beam, thus scattering light back into the laser cavity (<xref ref-type="bibr" rid="B22">Zhao et al., 2016</xref>). Under this condition, each photon is assumed to be scattered solely by that individual particle during its round-trip propagation. The case involving multiple scatterers has been addressed in various studies (<xref ref-type="bibr" rid="B6">Campagnolo, 2013</xref>; <xref ref-type="bibr" rid="B4">Atashkhooei et al., 2018</xref>). A general schematic of the effect involved is presented in <xref ref-type="fig" rid="F2">Figure 2</xref>.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Schematic representation of an SMI sensor detecting the backscattered light from a single particle suspended in a fluid, moving at velocity V as it traverses the laser beam&#x2019;s sensing volume.</p>
</caption>
<graphic xlink:href="fsens-06-1662060-g002.tif">
<alt-text content-type="machine-generated">Diagram illustrating a laser Doppler anemometry setup. A focusing lens directs an incident laser beam into a flow channel, creating a sensing volume. Backscattered light from particles is analyzed to measure velocity, indicated as V1, V3, V4. An inset shows the angle &#x3B8; between light and particle velocity vector V within the sensing volume.</alt-text>
</graphic>
</fig>
<p>Due to the Doppler effect, when a particle moves through the laser beam with a constant velocity <inline-formula id="inf1">
<mml:math id="m1">
<mml:mrow>
<mml:mi>V</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, the output power signal exhibits periodic modulation at the Doppler frequency shift <inline-formula id="inf2">
<mml:math id="m2">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> (<xref ref-type="bibr" rid="B2">Albrecht et al., 2003</xref>). The value of <inline-formula id="inf3">
<mml:math id="m3">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> depends on the incidence angle <inline-formula id="inf4">
<mml:math id="m4">
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> between the laser beam axis and the flow direction and the laser wavelength <inline-formula id="inf5">
<mml:math id="m5">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, as given by <xref ref-type="disp-formula" rid="e1">Equation 1</xref>:<disp-formula id="e1">
<mml:math id="m6">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>V</mml:mi>
<mml:mo>&#x2061;</mml:mo>
<mml:mi>sin</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>
</p>
<p>The initial modulation in the laser output power <inline-formula id="inf6">
<mml:math id="m7">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> due to the self-mixing effect is expressed in <xref ref-type="disp-formula" rid="e2">Equation 2</xref>, where <inline-formula id="inf7">
<mml:math id="m8">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> represents the initial laser output power, <inline-formula id="inf8">
<mml:math id="m9">
<mml:mrow>
<mml:mi>m</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the modulation index, indicating the feedback strength, and <inline-formula id="inf9">
<mml:math id="m10">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>&#x3d5;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> denotes the phase variation due to the Doppler frequency <inline-formula id="inf10">
<mml:math id="m11">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, carrying information about the uniquely scattered particle.<disp-formula id="e2">
<mml:math id="m12">
<mml:mrow>
<mml:mi>P</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mfenced open="[" close="]">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>m</mml:mi>
<mml:mo>&#x2061;</mml:mo>
<mml:mi>cos</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>&#x3c0;</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>
</p>
<p>As a particle passes through the laser sensing volume, defined as the spatial region where sufficient light is scattered back from the particle to the laser and produces detectable modulation in the laser output power within our acquisition system, it experiences a Gaussian spatial intensity profile consistent with Gaussian beam theory. The modulation amplitude reaches its peak when the particle&#x2019;s center crosses the central axis of the laser beam <inline-formula id="inf11">
<mml:math id="m13">
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> at <inline-formula id="inf12">
<mml:math id="m14">
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, gradually decreasing as the particle exits the interrogation zone (<xref ref-type="bibr" rid="B27">Zhao et al., 2023b</xref>). The final expression for the output power modulation resulting from particle transit is:<disp-formula id="e3">
<mml:math id="m15">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>F</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mfenced open="[" close="]">
<mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>m</mml:mi>
<mml:mo>&#x2061;</mml:mo>
<mml:mi>cos</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:mi>&#x3c0;</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:msup>
<mml:mrow>
<mml:mi>&#x3c4;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(3)</label>
</disp-formula>
</p>
<p>Here, <inline-formula id="inf13">
<mml:math id="m16">
<mml:mrow>
<mml:mi>&#x3c4;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> represents the particle&#x2019;s transit time inside the laser beam, which can be estimated using the laser spot size <inline-formula id="inf14">
<mml:math id="m17">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, the particle diameter <inline-formula id="inf15">
<mml:math id="m18">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, and its velocity <inline-formula id="inf16">
<mml:math id="m19">
<mml:mrow>
<mml:mi>V</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> as presented in <xref ref-type="disp-formula" rid="e4">Equation 4</xref>:<disp-formula id="e4">
<mml:math id="m20">
<mml:mrow>
<mml:mi>&#x3c4;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>L</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mo>&#x2061;</mml:mo>
<mml:mi>sin</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>(4)</label>
</disp-formula>
</p>
<p>
<xref ref-type="fig" rid="F3">Figure 3A</xref> illustrates an example of the modulation induced in the laser output power by the passage of two 4&#xa0;&#xb5;m diameter spheres, each transiting through the laser sensing volume at different times. <xref ref-type="fig" rid="F3">Figure 3B</xref> shows a filtered SMI signal corresponding to a single particle crossing the sensing volume, highlighting the Gaussian-shaped envelope that characterizes the amplitude burst. Finally, <xref ref-type="fig" rid="F3">Figure 3C</xref> presents the characteristic Doppler frequency peak extracted from the filtered signal, which is directly related to the particle&#x2019;s velocity.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Experimental acquisition of the SMI signal during particle transit. <bold>(A)</bold> Raw SMI signal highlighting the passage of two particles at different times. <bold>(B)</bold> Filtered SMI signal corresponding to a 4&#xa0;&#xb5;m polystyrene sphere. <bold>(C)</bold> Frequency spectrum (FFT) of the filtered signal.</p>
</caption>
<graphic xlink:href="fsens-06-1662060-g003.tif">
<alt-text content-type="machine-generated">Chart A shows a waveform in millivolts over 140 milliseconds with labeled particle transit events. Chart B features a waveform in millivolts over 10 milliseconds inside a Gaussian envelope marked as 4 micrometers. Chart C presents a frequency spectrum with a Doppler peak labeled, in amplitude over kilohertz.</alt-text>
</graphic>
</fig>
</sec>
<sec id="s2-2">
<title>2.2 System overview</title>
<p>The schematic in <xref ref-type="fig" rid="F4">Figure 4</xref> illustrates the setup assembled for single-particle detection using the SMI sensing scheme. The system consists of two main subsystems: an optoelectronic system, responsible for enhancing and acquiring the SMI signal, and a microfluidic system, designed to control a consistent particle flow.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>SMI flow cytometer experimental setup assembled, highlighting the main components of the system, including the microfluidic chip designed for single-particle isolation through the hydrodynamic focusing effect.</p>
</caption>
<graphic xlink:href="fsens-06-1662060-g004.tif">
<alt-text content-type="machine-generated">Microscope setup with labeled components includes a pump and flow rate sensors, high-speed camera, SMI sensor, 3D-Zaber, and DAQ. An inverted microscope is connected to a custom-made microfluidic chip, shown in a detailed inset.</alt-text>
</graphic>
</fig>
<sec id="s2-2-1">
<title>2.2.1 Optoelectronic subsystem</title>
<p>The optical setup employs a 1,550&#xa0;nm single-mode distributed feedback (DFB) laser diode (ThorLabs-L1550P5DFB) equipped with a package-integrated monitoring photodiode. To ensure sufficient power and signal enhancement during particle passage, the laser beam is focused using a doublet lens (AC254-030-C), achieving a measured spot diameter of 80&#xa0;&#x3bc;m at its waist with an initial power of 4.7&#xa0;mW. The propagation axis of the laser is set at an angle of 80<inline-formula id="inf17">
<mml:math id="m21">
<mml:mrow>
<mml:mo>&#xb0;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> relative to the channel flow. The laser is mounted on a 3-axis linear stage (ZaberTech T-LSM050A) to allow precise micrometer-scale alignment. The SMI signal is acquired by monitoring variations in the photodiode current using a custom-made transimpedance amplifier and recorded at a sampling rate of 2&#xa0;MHz using an acquisition card (DAQ NI-6361).</p>
</sec>
<sec id="s2-2-2">
<title>2.2.2 Microfluidic subsystem</title>
<p>To achieve single-particle alignment and ensure a constant flow of individual particles through the laser sensing volume, a custom-made PDMS microfluidic chip was fabricated, specifically designed to perform hydrodynamic focusing (HF) for particle alignment. The channel structure was created using photolithography, and its dimensions were verified using a profilometer, confirming a consistent height of 70&#xa0;&#xb5;m and a width of 80&#xa0;&#xb5;m. For particle isolation via HF, the flow rates are set at 5&#xa0;&#x3bc;L/min for the sheath flow and 10&#xa0;&#x3bc;L/min for the sample flow. The velocity profile inside the chip was estimated through simulations in COMSOL to determine the range of particle speeds within the chip. To verify the correct operation of the HF system, the microfluidic chip is mounted on an inverted microscope, allowing real-time monitoring of the channel and flow using a high-speed camera.</p>
<p>Inspired to simulate human blood cells for flow cytometry experiments, synthetic 2, 4, and 10&#xa0;&#xb5;m monodisperse polystyrene particles are used, each with a coefficient of variation of 1.8% in its diameter. For each particle size, a 4% concentration was prepared in 1&#xa0;mL of deionized water (DI) and introduced into the channels using a microfluidic control system (Fluigent MFCS-EZ) equipped with multiple flow rate sensors (Fluigent Flow Unit M&#x2b;) with a precision of <inline-formula id="inf18">
<mml:math id="m22">
<mml:mrow>
<mml:mo>&#xb1;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> 0.2&#xa0;mL/min, allowing precise adjustment of the flow rate during experiments.</p>
</sec>
</sec>
<sec id="s2-3">
<title>2.3 Pipeline</title>
<sec id="s2-3-1">
<title>2.3.1 Signal acquisition</title>
<p>To construct a robust database containing the induced modulation caused by particle transit, all potential particle events were recorded in real-time using a Python-based acquisition routine. The collected signals were then analyzed to segment the time intervals corresponding to each particle&#x2019;s passage for further analysis. Additionally, to increase the size of the dataset and improve model generalization, multiple augmentation techniques were applied.</p>
<p>The data acquisition system (DAQ) was configured with a sampling frequency of 2&#xa0;MHz, chosen to cover the expected Doppler frequency peaks range while retaining higher-order harmonics and transient components, and to provide a comprehensive dataset for both algorithm development and later decimation analysis. Each acquisition window captured 8.192&#xa0;ms (16,384 samples) to ensure full coverage of the slowest particle transits while limiting unrelated signal content. A real-valued FFT of the full segment with a rectangular window was applied to identify the characteristic Doppler peaks. This configuration balanced detection reliability, computational efficiency, and preservation of amplitude information in low-SNR conditions (<xref ref-type="bibr" rid="B18">Rapuano and Harris, 2008</xref>). To define a broad frequency range of interest, the expected particle velocity was estimated through numerical simulations. Based on <xref ref-type="disp-formula" rid="e1">Equation 1</xref>, a detection range from 5&#xa0;kHz to 100&#xa0;kHz was established, broad enough to avoid missing Doppler peaks outside the expected range while still allowing effective filtering of irrelevant frequencies. A threshold level for peak detection was determined experimentally by analyzing the noise level when only deionized (DI) water was flowing. Segments that met the detection criteria were stored for further processing.</p>
</sec>
<sec id="s2-3-2">
<title>2.3.2 Offline validation</title>
<p>Following the approach described in <xref ref-type="bibr" rid="B19">Sierra-Alarc&#xf3;n et al. (2024)</xref>, an offline adaptive spectrogram algorithm was employed to extract only the signal segments corresponding to particle transits. For each detected event, the spectrogram parameters were adjusted based on the estimated particle velocity and transit duration. A Gaussian fit was then applied to verify whether the observed modulation matched the expected signal shape defined in <xref ref-type="disp-formula" rid="e3">Equation 3</xref>.</p>
<p>To support this validation, the signal-to-noise ratio (SNR) was evaluated using <xref ref-type="disp-formula" rid="e7">Equation 7</xref>, providing initial evidence that the signal amplitude decreases as particle size decreases (<xref ref-type="fig" rid="F5">Figure 5</xref>). The SNR was estimated by comparing the average power of segments containing particle-induced modulation, <inline-formula id="inf19">
<mml:math id="m23">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>p</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, with those containing only noise signal, <inline-formula id="inf20">
<mml:math id="m24">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>np</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Representation of signal-to-noise ratio for different particle sizes. The red line represents the mean, the box indicates the standard deviation, and the blue lines show the maximum and minimum values.</p>
</caption>
<graphic xlink:href="fsens-06-1662060-g005.tif">
<alt-text content-type="machine-generated">Box plot showing Signal-to-Noise Ratio (SNR) in decibels against particle size in micrometers. Three data points are plotted at particle sizes of ten, four, and two micrometers. The highest median SNR is at ten micrometers, while the lowest is at two micrometers.</alt-text>
</graphic>
</fig>
<p>The final dataset included 700 labeled samples for each particle size (2, 4, and 10&#xa0;&#xb5;m), with each sample spanning 1.25&#xa0;ms (2,500 data points), capturing the complete transit event while excluding irrelevant portions of the signal.<disp-formula id="e5">
<mml:math id="m25">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>signal</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:munderover>
</mml:mstyle>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>p</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(5)</label>
</disp-formula>
<disp-formula id="e6">
<mml:math id="m26">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>noise</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:munderover>
</mml:mstyle>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>np</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(6)</label>
</disp-formula>
<disp-formula id="e7">
<mml:math id="m27">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mi>N</mml:mi>
<mml:mi>R</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi mathvariant="normal">d</mml:mi>
<mml:mi mathvariant="normal">B</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>10</mml:mn>
<mml:mo>&#x22c5;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>log</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>10</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>signal</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>noise</mml:mtext>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>noise</mml:mtext>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(7)</label>
</disp-formula>
</p>
</sec>
<sec id="s2-3-3">
<title>2.3.3 Data augmentation</title>
<p>A combination of the following data augmentation techniques was randomly applied to represent possible variation in the real raw signals while preserving the essential characteristics of the modulations.<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf21">
<mml:math id="m28">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Additive Noise: Gaussian noise is added to the signal to reduce the SNR in each sample. The noisy signal is given by <xref ref-type="disp-formula" rid="e8">Equation 8</xref>:</p>
</list-item>
</list>
<disp-formula id="e8">
<mml:math id="m29">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>noisy</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>x</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="script">N</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi>&#x3c3;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(8)</label>
</disp-formula>where <inline-formula id="inf22">
<mml:math id="m30">
<mml:mrow>
<mml:mi mathvariant="script">N</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi>&#x3c3;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> represents Gaussian noise with zero mean and variance <inline-formula id="inf23">
<mml:math id="m31">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>&#x3c3;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>.<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf24">
<mml:math id="m32">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Quantization: This technique reduces the resolution of the signal by constraining each sample to a fixed number of possible values. For a given resolution <inline-formula id="inf25">
<mml:math id="m33">
<mml:mrow>
<mml:mi>R</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, each sample <inline-formula id="inf26">
<mml:math id="m34">
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is transformed as <xref ref-type="disp-formula" rid="e9">Equation 9</xref>:</p>
</list-item>
</list>
<disp-formula id="e9">
<mml:math id="m35">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>qt</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mrow>
<mml:mo>&#x230a;</mml:mo>
<mml:mrow>
<mml:mi>R</mml:mi>
<mml:mo>&#x22c5;</mml:mo>
<mml:mi>x</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mo>&#x230b;</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mi>R</mml:mi>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>(9)</label>
</disp-formula>where <inline-formula id="inf27">
<mml:math id="m36">
<mml:mrow>
<mml:mo>&#x230a;</mml:mo>
<mml:mrow>
<mml:mo>&#x22c5;</mml:mo>
</mml:mrow>
<mml:mo>&#x230b;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> represents the floor operation. Here, <inline-formula id="inf28">
<mml:math id="m37">
<mml:mrow>
<mml:mi>R</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is a random integer selected between 40 and 100, for quantizing the signal and reducing its precision.<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf29">
<mml:math id="m38">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Downsampling: In this method, the temporal resolution of the signal is reduced by selecting a downsampling factor <inline-formula id="inf30">
<mml:math id="m39">
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> (randomly chosen between 2 and 9). For every <inline-formula id="inf31">
<mml:math id="m40">
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>-th sample <inline-formula id="inf32">
<mml:math id="m41">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, the next <inline-formula id="inf33">
<mml:math id="m42">
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> samples are overwritten with <inline-formula id="inf34">
<mml:math id="m43">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, maintaining the original length of the signal. Mathematically, this can be as expressed in <xref ref-type="disp-formula" rid="e10">Equation 10</xref>:</p>
</list-item>
</list>
<disp-formula id="e10">
<mml:math id="m44">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>ds</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>j</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>x</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>,</mml:mo>
<mml:mspace width="1em"/>
<mml:mo>&#x2200;</mml:mo>
<mml:mi>j</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mfenced open="[" close="]">
<mml:mrow>
<mml:mn>0</mml:mn>
<mml:mo>,</mml:mo>
<mml:mi>k</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(10)</label>
</disp-formula>
<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf35">
<mml:math id="m45">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Amplitude Inversion: The signal is inverted to simulate phase changes (<xref ref-type="disp-formula" rid="e11">Equation 11</xref>), achieved by multiplying the amplitude of the signal by <inline-formula id="inf36">
<mml:math id="m46">
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>:</p>
</list-item>
</list>
<disp-formula id="e11">
<mml:math id="m47">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>inverted</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>x</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(11)</label>
</disp-formula>
<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf37">
<mml:math id="m48">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Random Interpolation: A random subset of the signal is replaced by interpolated values to simulate missing or corrupted data following <xref ref-type="disp-formula" rid="e12">Equation 12</xref>. For a randomly chosen set of indices <inline-formula id="inf38">
<mml:math id="m49">
<mml:mrow>
<mml:mo stretchy="false">{</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:mo>&#x2026;</mml:mo>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">}</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, the interpolated values are calculated as:</p>
</list-item>
</list>
<disp-formula id="e12">
<mml:math id="m50">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>it</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:mtext>inter</mml:mtext>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi mathvariant="bold">x</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(12)</label>
</disp-formula>where <inline-formula id="inf39">
<mml:math id="m51">
<mml:mrow>
<mml:mtext>inter</mml:mtext>
</mml:mrow>
</mml:math>
</inline-formula> denotes a linear interpolation function.<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf40">
<mml:math id="m52">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Shifting: To simulate variations in timing, the signal is circularly shifted by a factor <inline-formula id="inf41">
<mml:math id="m53">
<mml:mrow>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:mspace width="0.3333em"/>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, determined as a percentage of the signal length using <xref ref-type="disp-formula" rid="e13">Equation 13</xref>:</p>
</list-item>
</list>
<disp-formula id="e13">
<mml:math id="m54">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>shift</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>x</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mi>mod</mml:mi>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(13)</label>
</disp-formula>where <inline-formula id="inf42">
<mml:math id="m55">
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the total number of data points.</p>
</sec>
</sec>
<sec id="s2-4">
<title>2.4 Signal data preprocessing</title>
<p>To evaluate multiple approaches for the classification task and after the different augmentation techniques, three different data representations were explored: the use of the SMI temporal signal, an optimized spectrogram, and the classification based on specific features extracted from both the temporal and frequency spectrum of the signal, as illustrated in <xref ref-type="fig" rid="F6">Figure 6</xref>.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Data representations explored for the classification task. <bold>(A)</bold> Filtered SMI temporal signal modulation. <bold>(B)</bold> SMI signal spectrogram. <bold>(C)</bold> Handcrafted temporal and frequency features.</p>
</caption>
<graphic xlink:href="fsens-06-1662060-g006.tif">
<alt-text content-type="machine-generated">Panel A shows a waveform plot with time in milliseconds and voltage in millivolts. Panel B displays a spectrogram with frequency in kilohertz and time in milliseconds. Panel C illustrates a combination of temporal and frequency domain features, highlighting their complementary use.</alt-text>
</graphic>
</fig>
<sec id="s2-4-1">
<title>2.4.1 SMI temporal signal enhancement</title>
<p>To reduce signal dimensionality and suppress embedded noise, all samples were processed using a band-pass filter based on the previously defined frequency ranges. A decimation step was then applied, reducing the sampling rate by a factor of 4, to 500&#xa0;kHz. This reduction aimed to decrease data size without significantly altering the signal characteristics. Additionally, the filtered signals were scaled by a factor of 10, selected after testing different values (1, 5, 10, 20) for its ability to accelerate convergence by increasing gradient magnitudes, without affecting classification accuracy or altering the relative shape of the signals (<xref ref-type="bibr" rid="B13">LeCun et al., 2012</xref>). This formatted signal was then used for the next data representation approaches.</p>
</sec>
<sec id="s2-4-2">
<title>2.4.2 Spectrogram-based features</title>
<p>Time-domain spectral analysis is essential for capturing the dynamic behavior of non-stationary signals by revealing how their frequency content evolves over time. Spectrograms were employed due to their effectiveness in detecting transient events and frequency variations. The spectrogram is computed using the Short-Time Fourier Transform (STFT), as defined in <xref ref-type="disp-formula" rid="e14">Equation 14</xref>
<disp-formula id="e14">
<mml:math id="m56">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mo>&#x222b;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x221e;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x221e;</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mi>x</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>&#x3c4;</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mi>w</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>&#x3c4;</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:msup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>j</mml:mi>
<mml:mn>2</mml:mn>
<mml:mi>&#x3c0;</mml:mi>
<mml:mi>f</mml:mi>
<mml:mi>&#x3c4;</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mi>d</mml:mi>
<mml:mi>&#x3c4;</mml:mi>
</mml:mrow>
</mml:math>
<label>(14)</label>
</disp-formula>where <inline-formula id="inf43">
<mml:math id="m57">
<mml:mrow>
<mml:mi>x</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>&#x3c4;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> represents the signal, <inline-formula id="inf44">
<mml:math id="m58">
<mml:mrow>
<mml:mi>w</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>&#x3c4;</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> is the Hamming window centered at time <inline-formula id="inf45">
<mml:math id="m59">
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf46">
<mml:math id="m60">
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the frequency. The computation of <inline-formula id="inf47">
<mml:math id="m61">
<mml:mrow>
<mml:mi>S</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>f</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>t</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> involves three key parameters: <inline-formula id="inf48">
<mml:math id="m62">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>perseg</mml:mtext>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, which defines the length of the window function <inline-formula id="inf49">
<mml:math id="m63">
<mml:mrow>
<mml:mi>w</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>&#x3c4;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>; <inline-formula id="inf50">
<mml:math id="m64">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>overlap</mml:mtext>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, which specifies the overlap between consecutive windows (typically set to half of <inline-formula id="inf51">
<mml:math id="m65">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>perseg</mml:mtext>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula> to ensure effective detection of transient events); and <inline-formula id="inf52">
<mml:math id="m66">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>range</mml:mtext>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
</inline-formula>, which determines the frequency range under consideration. The selection of spectrogram parameters was guided by the Doppler frequency range of detected peaks in real particle samples, refining the analysis to focus specifically on the Doppler frequency component and its decay over time (5&#x2013;40&#xa0;kHz). This was done while considering the passage duration of the smallest, fastest particles in the dataset. The frequency resolution is given by <xref ref-type="disp-formula" rid="e15">Equation 15</xref>:<disp-formula id="e15">
<mml:math id="m67">
<mml:mrow>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:mi>f</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>perseg</mml:mtext>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:math>
<label>(15)</label>
</disp-formula>where <inline-formula id="inf53">
<mml:math id="m68">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>s</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>500</mml:mn>
<mml:mtext>&#x2009;kHz</mml:mtext>
</mml:mrow>
</mml:math>
</inline-formula> is the sampling rate. To achieve a target frequency resolution of 1&#xa0;kHz, the required window length is <inline-formula id="inf54">
<mml:math id="m69">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>perseg</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>500</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> samples. This corresponds to a temporal resolution of approximately 1&#xa0;ms.</p>
<p>This configuration ensures that short-duration events, such as those caused by 2 <inline-formula id="inf55">
<mml:math id="m70">
<mml:mrow>
<mml:mi>&#x3bc;</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula>m particles lasting approximately 1.6&#xa0;ms, remain visible while preserving spectral integrity.</p>
<p>The choice of spectrograms over alternative representations, such as Mel-Frequency Cepstral Coefficients (MFCCs), was based on their ability to preserve raw time-frequency relationships (<xref ref-type="bibr" rid="B24">Zhao K. et al., 2020</xref>; <xref ref-type="bibr" rid="B10">Gourisaria et al., 2024</xref>). While MFCCs are effective for auditory perception tasks, they involve dimensionality reduction and feature decorrelation, which can lead to information loss and increased noise sensitivity in non-speech signals. In contrast, spectrograms provide a richer representation, facilitating the extraction of meaningful patterns while maintaining correlated spectral features.</p>
</sec>
<sec id="s2-4-3">
<title>2.4.3 Specific features</title>
<p>To extract valuable information from both the temporal and frequency domains and to improve differentiation where particle modulation is embedded in noise, the following features were defined:<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf56">
<mml:math id="m71">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Signal Amplitude: The amplitude of the temporal signal correlates with particle size, as larger particles induce higher voltage variations. To ensure robustness against noise-induced peaks, the amplitude is extracted using the envelope of the absolute signal. The envelope is computed using the Hilbert transform (<xref ref-type="bibr" rid="B23">Zhao et al., 2019</xref>), providing a smooth upper bound that mitigates the impact of noise peaks.</p>
</list-item>
<list-item>
<p>
<inline-formula id="inf57">
<mml:math id="m72">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Passage Time: quantifies the duration for which a particle remains within the laser beam&#x2019;s sensing volume. This feature is extracted by analyzing the parabolic modulation of the signal, modeled using a Gaussian fit applied to the envelope (<xref ref-type="bibr" rid="B19">Sierra-Alarc&#xf3;n et al., 2024</xref>). The passage interval is defined as the period during which the Gaussian fit remains above 10% of its peak amplitude, accounting for the SNR in smaller particles such as those with a 2&#xa0;&#xb5;m diameter.</p>
</list-item>
<list-item>
<p>
<inline-formula id="inf58">
<mml:math id="m73">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Average Signal Power: Reflecting the overall signal intensity, this feature may correlate with particle size since larger particles induce stronger modulations (<xref ref-type="disp-formula" rid="e16">Equation 16</xref>). It is computed following the same approach as the SNR:</p>
</list-item>
</list>
<disp-formula id="e16">
<mml:math id="m74">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>avg</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:munderover>
</mml:mstyle>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>p</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:mstyle displaystyle="true">
<mml:munderover>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:munderover>
</mml:mstyle>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>np</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:msup>
<mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(16)</label>
</disp-formula>where <inline-formula id="inf59">
<mml:math id="m75">
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:math>
</inline-formula> is the total of data points in a sample.<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf60">
<mml:math id="m76">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Frequency Spectrum Power: This feature quantifies the total signal energy distributed over the time-frequency domain, estimated from the STFT. It reflects the overall energy content of the signal across all time and frequency bins according to <xref ref-type="disp-formula" rid="e17">Equation 17</xref>.</p>
</list-item>
</list>
<disp-formula id="e17">
<mml:math id="m77">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>spec</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:munder>
</mml:mstyle>
<mml:mstyle displaystyle="true">
<mml:munder>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:munder>
</mml:mstyle>
<mml:mo stretchy="false">&#x7c;</mml:mo>
<mml:mi>S</mml:mi>
<mml:mi>T</mml:mi>
<mml:mi>F</mml:mi>
<mml:mi>T</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="false">&#x7c;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
<label>(17)</label>
</disp-formula>
<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf61">
<mml:math id="m78">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Peak Spectral Amplitude: This feature captures the highest spectral amplitude observed in the STFT magnitude, corresponding to the strongest frequency component. It provides insight into the most dominant spectral peak and can be useful for identifying particles that produce sharp localized energy bursts following <xref ref-type="disp-formula" rid="e18">Equation 18</xref>.</p>
</list-item>
</list>
<disp-formula id="e18">
<mml:math id="m79">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>F</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>peak</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:munder>
<mml:mrow>
<mml:mi>max</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mfenced open="{" close="}">
<mml:mrow>
<mml:mo stretchy="false">&#x7c;</mml:mo>
<mml:mi>S</mml:mi>
<mml:mi>T</mml:mi>
<mml:mi>F</mml:mi>
<mml:mi>T</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:mfenced>
<mml:mo stretchy="false">&#x7c;</mml:mo>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(18)</label>
</disp-formula>
<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf62">
<mml:math id="m80">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Doppler Frequency: Extracted by identifying the highest peak in the frequency spectrum after applying the Fourier Transform (<xref ref-type="disp-formula" rid="e19">Equation 19</xref>).</p>
</list-item>
</list>
<disp-formula id="e19">
<mml:math id="m81">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>Doppler</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>arg</mml:mi>
<mml:munder>
<mml:mrow>
<mml:mi>max</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mfenced open="{" close="}">
<mml:mrow>
<mml:mi>X</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:math>
<label>(19)</label>
</disp-formula>
</p>
</sec>
<sec id="s2-4-4">
<title>2.4.4 T-SNE analysis for feature space visualization</title>
<p>To qualitatively assess the discriminative capacity of the extracted features, a t-SNE (t-distributed Stochastic Neighbor Embedding) projection was applied to both the handcrafted and spectrogram-based feature sets. This dimensionality reduction technique maps high-dimensional data into a two-dimensional space while preserving local structure, allowing for visual inspection of class separability and the clustering behavior of the features (<xref ref-type="bibr" rid="B14">Maaten and Hinton, 2008</xref>). <xref ref-type="fig" rid="F7">Figure 7</xref> presents the resulting t-SNE plots, where each point corresponds to a sample, and colors indicate particle size classes. The resulting spatial distribution suggests that the extracted features contain sufficient information to support particle size classification.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>T-SNE visualization of feature representations for the three different particle sizes. <bold>(A)</bold> Handcrafted features. <bold>(B)</bold> Spectrogram-based features.</p>
</caption>
<graphic xlink:href="fsens-06-1662060-g007.tif">
<alt-text content-type="machine-generated">Scatter plot comparison in two panels, A and B, showing three sizes of data points: 2 micrometers (blue circles), 4 micrometers (orange triangles), and 10 micrometers (green squares). Both plots display points based on two components, with differing clustering patterns across the panels.</alt-text>
</graphic>
</fig>
</sec>
</sec>
<sec id="s2-5">
<title>2.5 ML classifier models</title>
<p>Machine learning models, specifically deep learning architectures, were evaluated using different input representations, with hyperparameters optimized via grid search based on their impact on model performance (<xref ref-type="bibr" rid="B21">Yang and Shami, 2020</xref>). The dataset was randomly shuffled prior to data augmentation, with 30% allocated for testing using real particle signals and the remaining 70% used for training and validation. This training portion was subsequently augmented and split into 80% for training and 20% for validation. Model performance was evaluated in terms of classification accuracy and computational efficiency, as detailed in the <xref ref-type="app" rid="app1">Appendix</xref>.</p>
<sec id="s2-5-1">
<title>2.5.1 Spectrogram-based model</title>
<p>This model processes spectrograms resized to dimensions <inline-formula id="inf63">
<mml:math id="m82">
<mml:mrow>
<mml:mn>63</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>65</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, which are then passed through a fully connected neural network. The architecture includes a Flatten layer, followed by two dense layers, each using ReLU activation, batch normalization, and L2 regularization with a coefficient <inline-formula id="inf64">
<mml:math id="m83">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>2.5</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
<mml:msup>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> to improve generalization (<xref ref-type="bibr" rid="B21">Yang and Shami, 2020</xref>; <xref ref-type="bibr" rid="B1">Agrawal, 2021</xref>). Dropout is applied after each dense layer to prevent overfitting. The output layer consists of three neurons with softmax activation, corresponding to the three particle size classes. Training was conducted using a batch size of 32 and the Adam optimizer with a learning rate decay initialized at <inline-formula id="inf65">
<mml:math id="m84">
<mml:mrow>
<mml:mn>6</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
<mml:msup>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, using categorical cross-entropy as the loss function. Early stopping was implemented to further reduce overfitting by monitoring validation loss.</p>
<p>All model hyperparameters were optimized via grid search. This included the STFT parameter <inline-formula id="inf66">
<mml:math id="m85">
<mml:mrow>
<mml:mtext>nperseg</mml:mtext>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, with <inline-formula id="inf67">
<mml:math id="m86">
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mn>7,9</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, dropout rates <inline-formula id="inf68">
<mml:math id="m87">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>dropout</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mn>0.1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.2</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and dense layer sizes dense1, dense2 <inline-formula id="inf69">
<mml:math id="m88">
<mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, with <inline-formula id="inf70">
<mml:math id="m89">
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mn>3,8</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>. The optimal configuration, which also reflects the most effective spectrogram resolution, was found to be dense1 &#x3d; 64, dense2 &#x3d; 8, <inline-formula id="inf71">
<mml:math id="m90">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>dropout</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf72">
<mml:math id="m91">
<mml:mrow>
<mml:mtext>nperseg</mml:mtext>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>128</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. This configuration achieved a test accuracy of 97.7% and a validation accuracy of 98.6%.</p>
</sec>
<sec id="s2-5-2">
<title>2.5.2 Feature-based model</title>
<p>Following a similar topology to the spectrogram-based model, this version replaces the spectrogram inputs with engineered statistical and frequency-domain features, which are normalized using Z-score scaling. The training methodology remains unchanged, with the L2 regularization coefficient adjusted to <inline-formula id="inf73">
<mml:math id="m92">
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
<mml:msup>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>. Hyperparameter tuning explored dense layer sizes defined as <inline-formula id="inf74">
<mml:math id="m93">
<mml:mrow>
<mml:mtext>dense</mml:mtext>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mtext>dense</mml:mtext>
<mml:mn>2</mml:mn>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, with <inline-formula id="inf75">
<mml:math id="m94">
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mn>5,6,7,8,9</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>, and dropout rates <inline-formula id="inf76">
<mml:math id="m95">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>dropout</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mn>0.1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>0.2</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula>.</p>
<p>The best configuration was achieved with <inline-formula id="inf77">
<mml:math id="m96">
<mml:mrow>
<mml:mtext>dense</mml:mtext>
<mml:mn>1</mml:mn>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>128</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, <inline-formula id="inf78">
<mml:math id="m97">
<mml:mrow>
<mml:mtext>dense</mml:mtext>
<mml:mn>2</mml:mn>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>256</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>, and <inline-formula id="inf79">
<mml:math id="m98">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>dropout</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.1</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula>. This feature-based model achieved a test accuracy of 97.0% and a validation accuracy of 97.8%.</p>
</sec>
<sec id="s2-5-3">
<title>2.5.3 SMI temporal signal-based model</title>
<p>The 1D Convolutional Neural Network (CNN) model was developed for temporal signal classification, following the band-pass filtering and decimation steps and employing a hierarchical feature extraction strategy. The architecture is composed of three convolutional layers with an increasing number of filters <inline-formula id="inf80">
<mml:math id="m99">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>n</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula> with <inline-formula id="inf81">
<mml:math id="m100">
<mml:mrow>
<mml:mi>n</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">[</mml:mo>
<mml:mrow>
<mml:mn>6</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>8</mml:mn>
</mml:mrow>
<mml:mo stretchy="false">]</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:math>
</inline-formula> and a kernel size of 5. Each layer uses ReLU activation to capture temporal patterns within the waveform. Batch normalization is applied after each convolutional layer to stabilize the training process, followed by MaxPooling1D (pool size &#x3d; 2) to reduce dimensionality while preserving relevant temporal features. To prevent overfitting, a dropout rate of <inline-formula id="inf82">
<mml:math id="m101">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mtext>dropout</mml:mtext>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0.2</mml:mn>
</mml:mrow>
</mml:math>
</inline-formula> is applied after each pooling layer.</p>
<p>The extracted features were then flattened and passed through a fully connected dense layer with 256 neurons and L2 regularization <inline-formula id="inf83">
<mml:math id="m102">
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>&#x3bb;</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
<mml:msup>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, before reaching the softmax output layer with three neurons corresponding to the particle size classes. The model is trained using the Adam optimizer with a learning rate of <inline-formula id="inf84">
<mml:math id="m103">
<mml:mrow>
<mml:mn>6</mml:mn>
<mml:mo>&#xd7;</mml:mo>
<mml:mn>1</mml:mn>
<mml:msup>
<mml:mrow>
<mml:mn>0</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>4</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mrow>
</mml:math>
</inline-formula>, employing categorical cross-entropy loss. All hyperparameters were tuned via grid search. This CNN model achieved a test accuracy of 98.9% and a validation accuracy of 98.3%.</p>
</sec>
</sec>
</sec>
<sec sec-type="results|discussion" id="s3">
<title>3 Results and discussion</title>
<p>
<xref ref-type="fig" rid="F8">Figure 8</xref> presents the classification performance achieved across the different ML models, showing a high level of accuracy in correctly predicting each particle category. To complement the accuracy results and provide a more complete evaluation, <xref ref-type="table" rid="T1">Table 1</xref> details the precision (P), recall (R), and F1-score (F1) for each particle size and model.</p>
<fig id="F8" position="float">
<label>FIGURE 8</label>
<caption>
<p>Confusion matrix showing the classification performance across different ML models: the spectrogram-based model, the feature-engineered model, and the SMI temporal signal-based model.</p>
</caption>
<graphic xlink:href="fsens-06-1662060-g008.tif">
<alt-text content-type="machine-generated">Three confusion matrices compare model performance: Spectrogram-based, Feature-based, and Temporal signal-based, each with true labels (rows) and predicted labels (columns) for 2, 4, and 10 micrometers. Each matrix shows high accuracy along the diagonal, indicating strong model performance.</alt-text>
</graphic>
</fig>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Detailed classification results for each particle type based on different data representations, including the model&#x2019;s precision (P%), recall (R%), and F1-score (F1%).</p>
</caption>
<table>
<thead valign="top">
<tr>
<th colspan="4" align="center">Spectrogram-based model</th>
</tr>
<tr>
<th align="center">Size</th>
<th align="center">P (%)</th>
<th align="center">R (%)</th>
<th align="center">F1 (%)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">2&#xa0;&#xb5;m</td>
<td align="center">0.97</td>
<td align="center">0.97</td>
<td align="center">0.97</td>
</tr>
<tr>
<td align="center">4&#xa0;&#xb5;m</td>
<td align="center">0.97</td>
<td align="center">0.97</td>
<td align="center">0.97</td>
</tr>
<tr>
<td align="center">10&#xa0;&#xb5;m</td>
<td align="center">0.94</td>
<td align="center">0.95</td>
<td align="center">0.95</td>
</tr>
</tbody>
</table>
<table>
<thead valign="top">
<tr>
<th colspan="4" align="center">Featured-based model</th>
</tr>
<tr>
<th align="center">Size</th>
<th align="center">P (%)</th>
<th align="center">R (%)</th>
<th align="center">F1 (%)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">2&#xa0;&#xb5;m</td>
<td align="center">0.97</td>
<td align="center">0.95</td>
<td align="center">0.96</td>
</tr>
<tr>
<td align="center">4&#xa0;&#xb5;m</td>
<td align="center">0.99</td>
<td align="center">0.97</td>
<td align="center">0.99</td>
</tr>
<tr>
<td align="center">10&#xa0;&#xb5;m</td>
<td align="center">0.94</td>
<td align="center">0.98</td>
<td align="center">0.96</td>
</tr>
</tbody>
</table>
<table>
<thead valign="top">
<tr>
<th colspan="4" align="center">Temporal signal-based model</th>
</tr>
<tr>
<th align="center">Size</th>
<th align="center">P (%)</th>
<th align="center">R (%)</th>
<th align="center">F1 (%)</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">2&#xa0;&#xb5;m</td>
<td align="center">0.99</td>
<td align="center">0.99</td>
<td align="center">0.99</td>
</tr>
<tr>
<td align="center">4&#xa0;&#xb5;m</td>
<td align="center">0.99</td>
<td align="center">0.98</td>
<td align="center">0.99</td>
</tr>
<tr>
<td align="center">10&#xa0;&#xb5;m</td>
<td align="center">0.98</td>
<td align="center">0.98</td>
<td align="center">0.98</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The results confirm that the proposed signal analysis pipeline enables a reliable and consistent particle classification system, as all data representations achieved accuracy values close to 98% and maintained precision, recall, and F1-scores above 94% across all classes. The temporal signal-based model achieved the most balanced performance, with all three metrics in the 0.98&#x2013;0.99 range, indicating consistent detection with minimal false positives and false negatives. The spectrogram-based and feature-engineered models also yielded strong results, although slightly lower recall for 2&#xa0;&#xb5;m particles (0.95) suggest occasional misclassification for the smaller particles sizes.</p>
<p>These findings align with the dimensionality reduction analysis, where particle sizes were well separated in the 2D feature space, indicating that the classification task is not overly complex. Additionally, the relatively simple architectures used in the spectrogram-based and feature-engineered models, consisting of only two dense layers, reinforce that the chosen data representations provided sufficient discriminative information for accurate classification. Even for the most challenging case (2&#xa0;&#xb5;m particles), the system maintained a 97% of accuracy, indicating strong classification performance.</p>
<p>On the other hand, the temporal SMI signal-based model demonstrated effective classification without significantly increasing model complexity. Despite having only three convolutional layers and a single dense layer, this model achieved comparable performance, demonstrating that even a small deep learning model can successfully classify particles. This is particularly relevant for real-time implementation, as the raw signal model eliminates the need for explicit feature extraction steps, showcasing the powerful feature learning and generalization capabilities of SMI signals.</p>
<p>To further analyze the computational performance of the models, quantization techniques were applied by reducing data precision from floating-point (FP64) to integer 8-bit (Int8). The results indicate that classification accuracy remained unaffected, confirming that the quantization process did not degrade model performance, likely due to the model&#x2019;s small size, allowing quantization to reduce storage size, RAM usage, and inference latency without significant loss of accuracy.</p>
<p>
<xref ref-type="table" rid="T2">Table 2</xref> presents the storage, inference latency, and peak RAM usage of each model before and after quantization. Notably, the raw signal model, despite handling unprocessed data, required only 2.16&#xa0;MB of storage and achieved a theoretical inference time of 1.8&#xa0;ms. The spectrogram-based and feature-engineered models exhibited lower storage and RAM consumption but with slightly higher inference latency. These computational metrics were validated using TensorFlow Lite profiling, confirming that the models are suitable for deployment on low-power and resource-constrained devices. However, further evaluation on embedded hardware remains necessary to validate real-world performance. These findings demonstrate that high-performance classification is achievable without requiring large-scale models.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Computational efficiency metrics for different data representations and precisions across ML models, including storage size, inference latency, and RAM usage.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th colspan="1" align="center">Criteria</th>
<th colspan="2" align="center">Spectrogram</th>
<th colspan="2" align="center">Features</th>
<th colspan="2" align="center">Temporal</th>
</tr>
<tr>
<th align="center"/>
<th align="center">FP64</th>
<th align="center">Int8</th>
<th align="center">FP64</th>
<th align="center">Int8</th>
<th align="center">FP64</th>
<th align="center">Int8</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="center">Storage [MB]</td>
<td align="center">0.44</td>
<td align="center">0.03</td>
<td align="center">0.45</td>
<td align="center">0.04</td>
<td align="center">25.7</td>
<td align="center">2.16</td>
</tr>
<tr>
<td align="center">Latency [ms]</td>
<td align="center">18.4</td>
<td align="center">0.01</td>
<td align="center">17.5</td>
<td align="center">0.01</td>
<td align="center">23.4</td>
<td align="center">0.18</td>
</tr>
<tr>
<td align="center">Peak RAM [kB]</td>
<td align="center">120</td>
<td align="center">1.0</td>
<td align="center">120</td>
<td align="center">1.0</td>
<td align="center">130</td>
<td align="center">1.0</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The present evaluation employed monodisperse polystyrene particles, providing a controlled and repeatable test case for assessing the system&#x2019;s baseline performance. Future work will focus on extending the analysis to heterogeneous mixtures containing particles of different sizes, shapes, and materials, including biological cells. This will allow for a more comprehensive assessment of the model&#x2019;s robustness in complex, application-relevant scenarios, while expanding the dataset of SMI signals and validating performance under conditions representative of practical flow cytometry tasks.</p>
<p>Additionally, enhancements in both the optical and microfluidic components of the system are anticipated. Optimizing the received laser feedback, exploring alternative light sources such as VCSELs with more uniform light distribution, and refining hydrodynamic focusing, potentially by incorporating 3D hydrodynamic effects to ensure more consistent particle alignment and velocity, will be key to further improving the system&#x2019;s performance.</p>
</sec>
<sec sec-type="conclusion" id="s4">
<title>4 Conclusion</title>
<p>This study proposed a machine learning pipeline for classifying particles in self-mixing interferometry signals, enhancing the accuracy of real-time particle analysis. The approach integrated data acquisition, filtering, data augmentation, and three data representations: spectrogram-based, feature-engineered, and temporal signal-based models. The results demonstrated that both fully connected neural networks and 1D convolutional networks achieved high classification accuracy, reaching up to 98% for particle size classification. These findings validate the effectiveness of the proposed pipeline in distinguishing particle sizes across varying signal-to-noise ratios. Moreover, the model architectures were computationally efficient, with low inference times, making them suitable for deployment on low-power embedded systems. This research highlights the potential of machine learning in improving the robustness and reliability of SMI-based particle classification and contributes to the advancement of real-time, label-free SMI particle analysis, with direct applications in medical sensing and flow cytometry. Future work will focus on handling more complex particle mixtures, integrating models into embedded systems such as microcontrollers or FPGAs, and further optimizing real-time classification applications.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="s5">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p>
</sec>
<sec sec-type="author-contributions" id="s6">
<title>Author contributions</title>
<p>SS: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Validation, Visualization, Writing &#x2013; original draft, Writing &#x2013; review and editing. JP: Conceptualization, Formal Analysis, Funding acquisition, Investigation, Resources, Supervision, Writing &#x2013; original draft, Writing &#x2013; review and editing. CT: Conceptualization, Resources, Writing &#x2013; review and editing. FJ: Conceptualization, Resources, Writing &#x2013; review and editing. AQ: Conceptualization, Formal Analysis, Funding acquisition, Investigation, Project administration, Resources, Supervision, Validation, Writing &#x2013; review and editing.</p>
</sec>
<sec sec-type="funding-information" id="s7">
<title>Funding</title>
<p>The author(s) declare that financial support was received for the research and/or publication of this article. The author(s) declare that this work received financial support from the LAAS-CNRS Micro and Nanotechnologies Platform, a member of the French Renatech network, for the research and/or publication of this article.</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="ai-statement" id="s9">
<title>Generative AI statement</title>
<p>The author(s) declare that no Generative AI was used in the creation of this manuscript.</p>
<p>Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Agrawal</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2021</year>). <source>Hyperparameter optimization in machine learning: make your machine learning and deep learning models more efficient</source>. <publisher-name>Springer</publisher-name>.</citation>
</ref>
<ref id="B2">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Albrecht</surname>
<given-names>H.-E.</given-names>
</name>
<name>
<surname>Borys</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Damaschke</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Tropea</surname>
<given-names>C.</given-names>
</name>
</person-group> (<year>2003</year>). <source>Laser Doppler and phase Doppler measurement techniques</source>. <publisher-name>Springer</publisher-name>, <fpage>9</fpage>&#x2013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-662-05165-8_2</pub-id>
</citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>An</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2022</year>). <article-title>Measuring parameters of laser self-mixing interferometry sensor based on back propagation neural network</article-title>. <source>Opt. Express</source> <volume>30</volume>, <fpage>19134</fpage>&#x2013;<lpage>19144</lpage>. <pub-id pub-id-type="doi">10.1364/OE.460625</pub-id>
<pub-id pub-id-type="pmid">36221698</pub-id>
</citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Atashkhooei</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Ram&#xed;rez-Miquet</surname>
<given-names>E. E.</given-names>
</name>
<name>
<surname>Moreira</surname>
<given-names>R. d. C.</given-names>
</name>
<name>
<surname>Quotb</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Royo</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Perchoux</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Optical feedback flowmetry: impact of particle concentration on the signal processing method</article-title>. <source>IEEE Sensors J.</source> <volume>18</volume>, <fpage>1457</fpage>&#x2013;<lpage>1463</lpage>. <pub-id pub-id-type="doi">10.1109/JSEN.2017.2781902</pub-id>
</citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barland</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Gustave</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2021</year>). <article-title>Convolutional neural network for self-mixing interferometric displacement sensing</article-title>. <source>Opt. Express</source> <volume>29</volume>, <fpage>11433</fpage>&#x2013;<lpage>11444</lpage>. <pub-id pub-id-type="doi">10.1364/OE.419844</pub-id>
<pub-id pub-id-type="pmid">33984922</pub-id>
</citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Campagnolo</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Nikoli&#x107;</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Perchoux</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Lim</surname>
<given-names>Y. L.</given-names>
</name>
<name>
<surname>Bertling</surname>
<given-names>K.</given-names>
</name>
<name>
<surname> Loubiere</surname>
<given-names>K.</given-names>
</name>
<etal/>
</person-group> (<year>2013</year>). <article-title>Flow profile measurement in microchannel using the optical feedback interferometry sensing technique</article-title>. <source>Microfluid. Nanofluid.</source> <volume>14</volume>, <fpage>113</fpage>&#x2013;<lpage>119</lpage>.</citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2024</year>). <article-title>Optical shaping self-mixing interferometry with a neural network for displacement measurement</article-title>. <source>J. Opt. Soc. Am. B</source> <volume>41</volume>, <fpage>1947</fpage>&#x2013;<lpage>1952</lpage>. <pub-id pub-id-type="doi">10.1364/JOSAB.533685</pub-id>
</citation>
</ref>
<ref id="B8">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Da Costa Moreira</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Perchoux</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Tronche</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Jayat</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Bosch</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2017</year>). &#x201c;<article-title>Single nano-particle flow detection and velocimetry using optical feedback interferometry</article-title>,&#x201d; in <conf-name>2017 IEEE Sensors</conf-name>, <conf-loc>Glasgow, UK</conf-loc>, <conf-date>29 October 2017 - 01 November 2017</conf-date> (<publisher-name>IEEE</publisher-name>). <pub-id pub-id-type="doi">10.1109/icsens.2017.8234105</pub-id>
</citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Donati</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Norgia</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Self-mixing interferometry for biomedical signals sensing</article-title>. <source>IEEE J. Sel. Top. Quantum Electron.</source> <volume>20</volume>, <fpage>104</fpage>&#x2013;<lpage>111</lpage>. <pub-id pub-id-type="doi">10.1109/JSTQE.2013.2270279</pub-id>
</citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gourisaria</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Agrawal</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Sahni</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2024</year>). <article-title>Comparative analysis of audio classification with mfcc and stft features using machine learning techniques</article-title>. <source>Discov. Internet Things</source> <volume>4</volume>, <fpage>1</fpage>. <pub-id pub-id-type="doi">10.1007/s43926-023-00049-y</pub-id>
</citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ha</surname>
<given-names>M.-K.</given-names>
</name>
<name>
<surname>Phan</surname>
<given-names>T.-L.</given-names>
</name>
<name>
<surname>Nguyen</surname>
<given-names>D. H. H.</given-names>
</name>
<name>
<surname>Quan</surname>
<given-names>N. H.</given-names>
</name>
<name>
<surname>Ha-Phan</surname>
<given-names>N.-Q.</given-names>
</name>
<name>
<surname>Ching</surname>
<given-names>C. T. S.</given-names>
</name>
<etal/>
</person-group> (<year>2023</year>). <article-title>Comparative analysis of audio processing techniques on doppler radar signature of human walking motion using cnn models</article-title>. <source>Sensors</source> <volume>23</volume>, <fpage>8743</fpage>. <pub-id pub-id-type="doi">10.3390/s23218743</pub-id>
<pub-id pub-id-type="pmid">37960447</pub-id>
</citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Herbert</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Bertling</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Taimre</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Raki&#x107;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Wilson</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Microparticle discrimination using laser feedback interferometry</article-title>. <source>Opt. Express</source> <volume>26</volume>, <fpage>25778</fpage>. <pub-id pub-id-type="doi">10.1364/oe.26.025778</pub-id>
<pub-id pub-id-type="pmid">30469674</pub-id>
</citation>
</ref>
<ref id="B13">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>LeCun</surname>
<given-names>Y. A.</given-names>
</name>
<name>
<surname>Bottou</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Orr</surname>
<given-names>G. B.</given-names>
</name>
<name>
<surname>M&#xfc;ller</surname>
<given-names>K.-R.</given-names>
</name>
</person-group> (<year>2012</year>). <source>Efficient BackProp</source>. <publisher-loc>Berlin, Heidelberg</publisher-loc>: <publisher-name>Springer Berlin Heidelberg</publisher-name>, <fpage>9</fpage>&#x2013;<lpage>48</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-642-35289-8_3</pub-id>
</citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Maaten</surname>
<given-names>V. D. L.</given-names>
</name>
<name>
<surname>Hinton</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>Visualizing Data using t-SNE</article-title>. <source>J. Mach. Learn. Res.</source> <volume>9</volume>, <fpage>2579</fpage>&#x2013;<lpage>2605</lpage>. <comment>Available online at: <ext-link ext-link-type="uri" xlink:href="http://jmlr.org/papers/v9/vandermaaten08a.html">http://jmlr.org/papers/v9/vandermaaten08a.html</ext-link>
</comment>
</citation>
</ref>
<ref id="B15">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Novac</surname>
<given-names>P.-E.</given-names>
</name>
<name>
<surname>Rodriguez</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Barland</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2024</year>). &#x201c;<article-title>Integrating embedded neural networks and self-mixing interferometry for smart sensors design</article-title>,&#x201d; in <conf-name>2024 IEEE Sensors Applications Symposium (SAS)</conf-name>, <conf-loc>Naples, Italy</conf-loc>, <conf-date>23-25 July 2024</conf-date> (<publisher-name>IEEE</publisher-name>).</citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perchoux</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Quotb</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Atashkhooei</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Azcona</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Ram&#xed;rez-Miquet</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Bernal</surname>
<given-names>O.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <article-title>Current developments on optical feedback interferometry as an all-optical sensor for biomedical applications</article-title>. <source>Sensors</source> <volume>16</volume>, <fpage>694</fpage>. <pub-id pub-id-type="doi">10.3390/s16050694</pub-id>
<pub-id pub-id-type="pmid">27187406</pub-id>
</citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Quotb</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Atashkhooei</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Magaletti</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Jayat</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Tronche</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Goechnahts</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2021</year>). <article-title>Methods and limits for micro scale blood vessel flow imaging in scattering media by optical feedback interferometry: application to human skin</article-title>. <source>Sensors</source> <volume>21</volume>, <fpage>1300</fpage>. <pub-id pub-id-type="doi">10.3390/s21041300</pub-id>
<pub-id pub-id-type="pmid">33670276</pub-id>
</citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rapuano</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Harris</surname>
<given-names>F. J.</given-names>
</name>
</person-group> (<year>2008</year>). <article-title>An introduction to fft and time domain windows</article-title>. <source>IEEE Instrum. Meas. Mag.</source> <volume>10</volume>, <fpage>32</fpage>&#x2013;<lpage>44</lpage>. <pub-id pub-id-type="doi">10.1109/mim.2007.4428580</pub-id>
</citation>
</ref>
<ref id="B19">
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Sierra-Alarc&#xf3;n</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Perchoux</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jayat</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Tronche</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>P&#xe9;rez</surname>
<given-names>S. S.</given-names>
</name>
<name>
<surname>Quotb</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2024</year>). &#x201c;<article-title>Adaptive single micro-particle detection and segmentation in self-mixing interferometry signals</article-title>,&#x201d; in <conf-name>2025 IEEE Applied Sensing Conference (APSCON)</conf-name> (<publisher-name>IEEE</publisher-name>).</citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Taimre</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Nikoli&#x107;</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bertling</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Lim</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Bosch</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Raki&#x107;</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Laser feedback interferometry: a tutorial on the self-mixing effect for coherent sensing</article-title>. <source>Adv. Opt. Photonics</source> <volume>7</volume>, <fpage>570</fpage>. <pub-id pub-id-type="doi">10.1364/aop.7.000570</pub-id>
</citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Shami</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>On hyperparameter optimization of machine learning algorithms: theory and practice</article-title>. <source>Neurocomputing</source> <volume>415</volume>, <fpage>295</fpage>&#x2013;<lpage>316</lpage>. <pub-id pub-id-type="doi">10.1016/j.neucom.2020.07.061</pub-id>
</citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Perchoux</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Campagnolo</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Camps</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Atashkhooei</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Bardinal</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Optical feedback interferometry for microscale-flow sensing study: numerical simulation and experimental validation</article-title>. <source>Opt. Express</source> <volume>24</volume>, <fpage>23849</fpage>&#x2013;<lpage>23862</lpage>. <pub-id pub-id-type="doi">10.1364/OE.24.023849</pub-id>
<pub-id pub-id-type="pmid">27828220</pub-id>
</citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Perchoux</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Micro particle sizing using hilbert transform time domain signal analysis method in self-mixing interferometry</article-title>. <source>Appl. Sci.</source> <volume>9</volume>, <fpage>5563</fpage>. <pub-id pub-id-type="doi">10.3390/app9245563</pub-id>
</citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Duan</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2020a</year>). <article-title>Long-term bowel sound monitoring and segmentation by wearable devices and convolutional neural networks</article-title>. <source>IEEE Trans. Biomed. Circuits Syst.</source> <volume>14</volume>, <fpage>985</fpage>&#x2013;<lpage>996</lpage>. <pub-id pub-id-type="doi">10.1109/TBCAS.2020.3018711</pub-id>
<pub-id pub-id-type="pmid">32833642</pub-id>
</citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X.</given-names>
</name>
<etal/>
</person-group> (<year>2020b</year>). <article-title>Self-mixing interferometry-based micro flow cytometry system for label-free cells classification</article-title>. <source>Appl. Sci.</source> <volume>10</volume>, <fpage>478</fpage>. <pub-id pub-id-type="doi">10.3390/app10020478</pub-id>
</citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2023a</year>). <article-title>Investigation of the multiple characteristics of the self-mixing effect subject to a single particle</article-title>. <source>Opt. Express</source> <volume>31</volume>, <fpage>5458</fpage>. <pub-id pub-id-type="doi">10.1364/oe.478821</pub-id>
<pub-id pub-id-type="pmid">36823825</pub-id>
</citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2023b</year>). <article-title>Phase-unwrapping algorithm combined with wavelet transform and hilbert transform in self-mixing interference for individual microscale particle detection</article-title>. <source>Chin. Opt. Lett.</source> <volume>21</volume>, <fpage>041204</fpage>. <pub-id pub-id-type="doi">10.3788/col202321.041204</pub-id>
</citation>
</ref>
</ref-list>
<app-group>
<app id="app1">
<title>Appendix: classification metrics</title>
<p>Classification Accuracy Metrics<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf85">
<mml:math id="m104">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Accuracy: The proportion of correctly classified instances relative to the total number of samples.</p>
</list-item>
<list-item>
<p>
<inline-formula id="inf86">
<mml:math id="m105">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Confusion Matrix: A detailed comparison of predicted classifications versus actual labels, highlighting classifications and misclassifications.</p>
</list-item>
<list-item>
<p>
<inline-formula id="inf87">
<mml:math id="m106">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Precision (P): The fraction of correctly classified particles out of all predicted positive instances, minimizing false positives.</p>
</list-item>
<list-item>
<p>
<inline-formula id="inf88">
<mml:math id="m107">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Recall (R): The model&#x2019;s ability to identify all actual particle instances, reducing false negatives.</p>
</list-item>
<list-item>
<p>
<inline-formula id="inf89">
<mml:math id="m108">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> F1-score (F1): The harmonic mean of precision and recall, providing a balanced evaluation of classification performance.</p>
</list-item>
</list>
</p>
<p>Computational Efficiency Metrics<list list-type="simple">
<list-item>
<p>
<inline-formula id="inf90">
<mml:math id="m109">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Storage Size: The total memory required to store the trained model, impacting its deployment on embedded platforms.</p>
</list-item>
<list-item>
<p>
<inline-formula id="inf91">
<mml:math id="m110">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> Inference Latency: The theoretical time required for the model to make a prediction, determining its suitability for real-time applications.</p>
</list-item>
<list-item>
<p>
<inline-formula id="inf92">
<mml:math id="m111">
<mml:mrow>
<mml:mo>&#x2022;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> RAM Consumption: The peak dynamic memory usage during inference, critical deployment on low-memory devices.</p>
</list-item>
</list>
</p>
</app>
</app-group>
</back>
</article>