<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="2.3" xml:lang="EN">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Artif. Intell.</journal-id>
<journal-title>Frontiers in Artificial Intelligence</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Artif. Intell.</abbrev-journal-title>
<issn pub-type="epub">2624-8212</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/frai.2025.1597981</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Artificial Intelligence</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Machine learning for improved path loss prediction in urban vehicle-to-infrastructure communication systems</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Ben Ameur</surname> <given-names>Mongi</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x002A;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/3012248/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Chebil</surname> <given-names>Jalel</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn0001"><sup>&#x2020;</sup></xref>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Habaebi</surname> <given-names>Mohamed Hadi</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="author-notes" rid="fn0001"><sup>&#x2020;</sup></xref>
<uri xlink:href="https://loop.frontiersin.org/people/3031308/overview"/>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Tahar</surname> <given-names>Jamel Bel Hadj</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>ENISO, NOCCS Laboratory, University of Sousse</institution>, <addr-line>Sousse</addr-line>, <country>Tunisia</country></aff>
<aff id="aff2"><sup>2</sup><institution>ISTLS, NOCCS Laboratory, University of Sousse</institution>, <addr-line>Sousse</addr-line>, <country>Tunisia</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Electrical and Computer Engineering, International Islamic University</institution>, <addr-line>Kuala Lumpur</addr-line>, <country>Malaysia</country></aff>
<author-notes>
<fn fn-type="edited-by" id="fn0002">
<p>Edited by: Ovidiu Vermesan, SINTEF Digital, Norway</p>
</fn>
<fn fn-type="edited-by" id="fn0003">
<p>Reviewed by: Marie Duzi, VSB-Technical University of Ostrava, Czechia</p>
<p>Georg Muntingh, SINTEF Digital, Norway</p>
</fn>
<corresp id="c001">&#x002A;Correspondence: Mongi Ben Ameur, <email>mongibenameur@yahoo.fr</email></corresp>
<fn fn-type="other" id="fn0001"><p><sup>&#x2020;</sup>ORCID: Jalel Chebil, <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0001-6148-6436">orcid.org/0000-0001-6148-6436</ext-link></p>
<p>Mohamed Hadi Habaebi, <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0002-2263-0850">orcid.org/0000-0002-2263-0850</ext-link></p></fn>
</author-notes>
<pub-date pub-type="epub">
<day>11</day>
<month>07</month>
<year>2025</year>
</pub-date>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<volume>8</volume>
<elocation-id>1597981</elocation-id>
<history>
<date date-type="received">
<day>22</day>
<month>03</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>06</day>
<month>06</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x00A9; 2025 Ben Ameur, Chebil, Habaebi and Tahar.</copyright-statement>
<copyright-year>2025</copyright-year>
<copyright-holder>Ben Ameur, Chebil, Habaebi and Tahar</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p>
</license>
</permissions>
<abstract>
<p>Path loss prediction is crucial to facilitate reliable vehicle-to-infrastructure (V2I) communications. In this study, machine learning techniques are investigated for path loss modeling using empirical measurements at 5.9&#x202F;GHz from eight Road Side Unit (RSU) sites. The performance of Extreme Gradient Boosting (XGBoost) and Multilayer Perceptron (MLP) models is contrasted with traditional empirical models such as the Dual Slope and 3rd Generation Partnership Project (3GPP) models in three varied urban environments: open, suburban, and densely urbanized cities. The findings indicate that machine learning models, in particular XGBoost, consistently outperform traditional models with the lowest Root Mean Square Error (RMSE) in complicated urban environments. For additional robustness in prediction, we propose an innovative environmental classification system based on building density, street geometry, and transmitter position. Feature importance examination reveals that distance, environmental class, and transmitter height are the most significant factors affecting path loss prediction accuracy. These observations aid the development of adaptive V2I communication systems and provide valuable guidelines for enhancing reliability in diverse urban environments.</p>
</abstract>
<kwd-group>
<kwd>path loss modeling</kwd>
<kwd>vehicle-to-infrastructure (V2I)</kwd>
<kwd>path loss prediction</kwd>
<kwd>machine learning (ML)</kwd>
<kwd>XGBoost</kwd>
<kwd>multilayer perceptron (MLP)</kwd>
<kwd>3GPP model</kwd>
<kwd>dual slope model</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="3"/>
<equation-count count="11"/>
<ref-count count="32"/>
<page-count count="10"/>
<word-count count="6059"/>
</counts>
<custom-meta-wrap>
<custom-meta>
<meta-name>section-at-acceptance</meta-name>
<meta-value>Machine Learning and Artificial Intelligence</meta-value>
</custom-meta>
</custom-meta-wrap>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="sec1">
<label>1</label>
<title>Introduction</title>
<p>V2I communication is a critical aspect of Intelligent Transportation Systems (ITS). It enables vehicles to communicate with roadside equipment in real-time to make traffic safer, more efficient, and more effectively managed. One of the major issues with deploying trustworthy V2I systems is estimating path loss reliably. Path loss is the extent to which signal intensity weakens as it propagates through urban spaces. Obstacles such as buildings, terrain irregularities, and dynamic objects (e.g., traffic and weather) cause non-linear interference patterns, whose effect on signal reliability is significant. Traditional path loss models, such as the Dual Slope and 3GPP models (<xref ref-type="bibr" rid="ref12">Kenney, 2011</xref>), are commonly empirical or deterministic in nature. While they work effectively in controlled settings, such models fail to capture the complexity of urban areas, resulting in predictions that can deteriorate V2I communication performance.</p>
<p>Recent developments in machine learning (ML) offer a hopeful alternative. With big data and better learning methods, ML models can learn subtle relationships between environmental factors and signal loss that go beyond the predictive capability of traditional models. XGBoost is a gradient-boosting tool that is good at managing complex data and can work with many variables. It has shown great accuracy in predicting numbers. Likewise, MLP neural networks can learn complicated patterns well, which makes them suitable for predicting V2I path loss. Nevertheless, the performance of such models is highly dependent on effective data preprocessing in the form of outlier elimination, feature scaling, and dataset representativeness to ensure generalizability across various urban settings. This paper contrasts the performance of XGBoost and MLP with Dual Slope and 3GPP benchmarks for urban V2I path loss prediction. With a higher solution set of real-world measurements, i.e., Received Signal Strength Indicator (RSSI), transmission power, and transceivers&#x2019; distances, we show that machine-learning models generalize far better than traditional methods. Our findings point out that XGBoost can decrease prediction errors by as much as 38%, which makes V2I link budgeting and network planning more precise. Further, we emphasize preprocessing techniques, considering that better data quality means higher model reliability. These findings have deep implications for ITS adoption in smart cities. Proper path loss prediction is crucial for optimal placement of infrastructure, ensuring reliable vehicle-to-infrastructure communication, and enhancing the performance of autonomous vehicle technologies. This work helps create smart transportation networks by using machine-learning insights in V2I system design. These networks can adapt and respond to the challenges of moving around in today&#x2019;s cities.</p>
</sec>
<sec id="sec2">
<label>2</label>
<title>Previous studies</title>
<p>This section presents existing research on path loss prediction methods, including traditional models, artificial intelligence-based approaches, and deep learning techniques.</p>
<sec id="sec3">
<label>2.1</label>
<title>Traditional path loss prediction</title>
<p>Traditional path loss prediction models can be broadly divided into two classes: deterministic and statistical. Deterministic models, such as ray tracing, rely on physical propagation equations to make very accurate predictions. They require, however, extensive environmental data and significant computational resources, and they are difficult to scale (<xref ref-type="bibr" rid="ref11">Hoomod et al., 2018</xref>; <xref ref-type="bibr" rid="ref2">Akpaida et al., 2018</xref>).</p>
<p>Statistical models, including the COST 231-Hata and ECC-33 models, predict path loss trends from empirical data. While being of useful real-world relevance, they generally lack site-specific accuracy (<xref ref-type="bibr" rid="ref17">Tarhuni and Ouni, 2022</xref>; <xref ref-type="bibr" rid="ref19">Zhang et al., 2023</xref>). As established by <xref ref-type="bibr" rid="ref9004">Rappaport (2002)</xref> in his foundational work on wireless communications, these statistical models provide a practical compromise between accuracy and implementation complexity for large-scale deployment scenarios.</p>
<p>For 5.9&#x202F;GHz city-wide V2I communications of critical significance to intelligent transportation systems (<xref ref-type="bibr" rid="ref4">Boban et al., 2011</xref>), path loss models must be capable of addressing significant challenges presented by dynamic obstacles (e.g., cars, buildings, and foliage), road topology variations, and changing line-of-sight (LOS) conditions. While the classic <xref ref-type="bibr" rid="ref9009">Hata (1980)</xref> model established the foundation for urban path loss prediction, modern V2I systems operate at higher frequencies and in more complex environments, necessitating enhanced modeling approaches. Experimental data indicate that path loss exponents are likely to be approximately 2.0 or higher in LOS scenarios and considerably greater than 3.5 in NLOS situations in dense urban settings (<xref ref-type="bibr" rid="ref1">Abbas et al., 2013</xref>). This observation aligns with the seminal work by <xref ref-type="bibr" rid="ref9003">Chang and Yang (1997)</xref>, who established the fundamental relationship between urban morphology and path loss exponent variations. To better model LOS-to-NLOS transitions, hybrid approaches like two-slope models have been proposed (<xref ref-type="bibr" rid="ref6">Fern&#x00E1;ndez et al., 2024</xref>).</p>
<p>While deterministic devices such as 3D ray tracing are more effective than others in modeling complex wave interactions such as diffraction and reflection, their computational expense emphasizes the value of balanced techniques with optimized efficiency and accuracy in the design of urban V2I networks (<xref ref-type="bibr" rid="ref16">P&#x00E4;tzold and De Nardis, 2020</xref>). <xref ref-type="bibr" rid="ref9007">Fraile et al. (2000)</xref> demonstrated that simplified ray tracing models can achieve reasonable accuracy while significantly reducing computational requirements, making them suitable for large-scale network planning applications.</p>
</sec>
<sec id="sec4">
<label>2.2</label>
<title>ML path loss prediction</title>
<p>ML has emerged as a powerful tool for improving path loss modeling in wireless communication, offering enhanced accuracy and adaptability over traditional empirical and deterministic models (<xref ref-type="bibr" rid="ref9">Goodfellow et al., 2016</xref>). ML techniques have been widely applied in various domains, including image recognition (<xref ref-type="bibr" rid="ref13">Krizhevsky et al., 2012</xref>), natural language processing (<xref ref-type="bibr" rid="ref5">Devlin et al., 2019</xref>), and wireless communication systems (<xref ref-type="bibr" rid="ref21">Zhang et al., 2019</xref>). Path loss prediction, a crucial component of wireless network planning, is inherently a regression problem, making it well-suited for supervised learning techniques such as support vector machines, artificial neural networks (ANN), random forests, and K-nearest neighbors (<xref ref-type="bibr" rid="ref14">Murphy, 2012</xref>). Among these, ANN models have demonstrated superior accuracy in path loss estimation compared to empirical models (<xref ref-type="bibr" rid="ref3">Bishop, 2006</xref>). For instance, <xref ref-type="bibr" rid="ref20">Zhang et al. (2019)</xref> developed a real-time channel prediction model that estimates path loss (PL) and packet drop probability in Dedicated Short-Range Communications (DSRC) systems, highlighting the flexibility of ML approaches in dynamic vehicular environments.</p>
<p>The adaptability of ML in path loss modeling is attributed to its ability to learn directly from measured propagation data, allowing models to generalize better across varying environments (<xref ref-type="bibr" rid="ref15">Oroza et al., 2017</xref>). This approach addresses the limitations of traditional channel models as identified by <xref ref-type="bibr" rid="ref9001">Molisch et al. (2011)</xref>, who emphasized the need for context-aware propagation modeling in vehicular communications. Recent comprehensive studies by <xref ref-type="bibr" rid="ref9010">Huang et al. (2022a)</xref> have demonstrated that ML models can effectively capture the complex relationship between environmental features and path loss in V2I scenarios. The follow-up work by <xref ref-type="bibr" rid="ref9011">Huang et al. (2022b)</xref> further validated these findings through extensive field measurements across diverse urban environments. Unlike traditional models that rely on predefined propagation assumptions, ML-based approaches use extensive datasets collected from real-world conditions to train algorithms for accurate signal prediction. Furthermore, ML models incorporate key radio environment factors, including distance, frequency, antenna height, terrain type, and obstacles, to enhance predictive accuracy (<xref ref-type="bibr" rid="ref18">Uccellari et al., 2016</xref>).</p>
</sec>
<sec id="sec5">
<label>2.3</label>
<title>Deep learning for path loss prediction</title>
<p>Deep learning, a subset of ML characterized by neural networks with multiple hidden layers, has shown promising results in path loss prediction. Deep neural networks (DNNs) can automatically extract features from raw data, eliminating the need for manual feature engineering (<xref ref-type="bibr" rid="ref17">Tarhuni and Ouni, 2022</xref>). This capability is particularly valuable in complex urban environments where signal propagation is influenced by numerous factors. For example, <xref ref-type="bibr" rid="ref9005">Thrane et al. (2020)</xref> demonstrated that DNNs outperform traditional path loss models in urban scenarios by capturing intricate relationships between environmental variables and signal attenuation (<xref ref-type="bibr" rid="ref8">Gho et al., 2019</xref>).</p>
<p>Recurrent Neural Networks and Long Short-Term Memory networks have been applied to path loss prediction in dynamic environments, leveraging their ability to model temporal dependencies in signal propagation (<xref ref-type="bibr" rid="ref16">P&#x00E4;tzold and De Nardis, 2020</xref>). These approaches are especially relevant for V2I communications, where both the vehicle and the surrounding environment may change rapidly. Additionally, Convolutional Neural Networks have been utilized to process spatial information from environmental maps, enabling more accurate path loss predictions in urban settings (<xref ref-type="bibr" rid="ref7">Filippi and Bazzi, 2021</xref>).</p>
<p>Despite their advantages, deep learning approaches face challenges such as the need for large training datasets, computational complexity, and potential overfitting (<xref ref-type="bibr" rid="ref10">Gozalvez and Sepulcre, 2022</xref>). <xref ref-type="bibr" rid="ref9006">Wu et al. (2020)</xref> proposed a novel transfer learning approach to address these limitations by leveraging knowledge from source domains with abundant data to improve prediction accuracy in target domains with limited data. Hybrid models that combine deep learning with traditional path loss models have been proposed to address these limitations, offering a balance between accuracy and computational efficiency (<xref ref-type="bibr" rid="ref9010">Huang et al., 2022a</xref>).</p>
</sec>
</sec>
<sec sec-type="methods" id="sec6">
<label>3</label>
<title>Methodology</title>
<p>The methodology of this study is designed to systematically evaluate and enhance path loss prediction for V2I communications in dynamic urban environments. Guided by the workflow illustrated in <xref ref-type="fig" rid="fig1">Figure 1</xref>, this section outlines a structured three-phase approach: (i) data collection and preprocessing, (ii) model development and analysis, and (iii) performance evaluation. The objective is to validate the effectiveness of ML models compared to traditional empirical approaches, while addressing the challenges posed by critical urban variables such as distance, obstruction density, and antenna height.</p>
<fig position="float" id="fig1">
<label>Figure 1</label>
<caption>
<p>The workflow of this study.</p>
</caption>
<graphic xlink:href="frai-08-1597981-g001.tif">
<alt-text content-type="machine-generated">Flowchart illustrating a machine learning process for improved path loss prediction. It starts with data collection and preprocessing, including normalization and feature selection using XGBoost. Data is partitioned into 70% training and 30% testing. Two models, 3GPP Dual Slope and a combination of XGBoost and MLP, are analyzed. Performance is evaluated using metrics like MSE, MAE, RMS, and R&#x00B2;. The process concludes with comparison and selection of the final model.</alt-text>
</graphic>
</fig>
<p>The study begins with the acquisition of real-world data across diverse urban scenarios, ensuring robust representation of signal propagation dynamics. This data undergoes rigorous preprocessing including normalization and feature selection to enhance model generalizability. Subsequently, state-of-the-art ML models (XGBoost and MLP) are developed and evaluated against empirical frameworks (3GPP TR 38.901 and Dual Slope) to identify the optimal prediction method for urban V2I networks. By aligning with the workflow in <xref ref-type="fig" rid="fig1">Figure 1</xref>, the proposed methodology ensures reproducibility, minimizes bias, and promotes adaptability to real-world complexities ultimately supporting the development of reliable 5G/6G vehicular communication systems.</p>
<sec id="sec7">
<label>3.1</label>
<title>Dataset collection</title>
<p>We utilize an open-access dataset (30), which was gathered under a V2I measurement campaign in Bologna. Eight RSU positions as transmitters (Tx) are included in this dataset, which function in between 5.9&#x202F;GHz. The transmitters were mounted at two different heights: 6.5 and 10.5&#x202F;m. The receiver (Rx) was mounted on the roof of a car at a height of 1.75&#x202F;m. The dataset includes RSSI measurements, which were converted to path loss using the formula shown in <xref ref-type="disp-formula" rid="EQ1">Equation 1</xref>:</p>
<disp-formula id="EQ1">
<label>(1)</label>
<mml:math id="M1">
<mml:mi mathvariant="italic">PL</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi mathvariant="italic">Tx</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi mathvariant="italic">Rx</mml:mi>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mtext mathvariant="italic">cable</mml:mtext>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>G</mml:mi>
<mml:mi mathvariant="italic">Tx</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>G</mml:mi>
<mml:mi mathvariant="italic">Rx</mml:mi>
</mml:msub>
<mml:mspace width="0.25em"/>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math id="M2">
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi mathvariant="italic">Tx</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the transmission power, and <inline-formula>
<mml:math id="M3">
<mml:msub>
<mml:mi>P</mml:mi>
<mml:mi mathvariant="italic">Rx</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the received power, which in this context is the measured <italic>RSSI</italic> value, <inline-formula>
<mml:math id="M4">
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mtext mathvariant="italic">cable</mml:mtext>
</mml:msub>
</mml:math>
</inline-formula> <italic>r</italic>epresents the cable loss, and <inline-formula>
<mml:math id="M5">
<mml:msub>
<mml:mi>G</mml:mi>
<mml:mi mathvariant="italic">Tx</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M6">
<mml:msub>
<mml:mi>G</mml:mi>
<mml:mi mathvariant="italic">Rx</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> are the <italic>Tx</italic> and <italic>Rx</italic> antenna gains, respectively.</p>
<p>The dataset includes measurements from various urban environments, including open areas, narrow streets, and areas with dense buildings. GPS coordinates for both transmitters and receivers were recorded, allowing for accurate distance calculations and environmental classification.</p>
</sec>
<sec id="sec8">
<label>3.2</label>
<title>Data preprocessing</title>
<p>To ensure data quality and reliability prior to model training, several preprocessing techniques were applied to the raw V2I communication dataset. These techniques were designed to handle noise, normalize feature scales, and preserve the statistical distribution of environmental classes across training, validation, and testing sets.</p>
<sec id="sec9">
<label>3.2.1</label>
<title>Outlier elimination</title>
<p>Outliers were identified and removed using the Interquartile Range (IQR) method. For a given feature X, the IQR is calculated as shown in <xref ref-type="disp-formula" rid="EQ2">Equation 2</xref>:</p>
<disp-formula id="EQ2">
<label>(2)</label>
<mml:math id="M7">
<mml:mi>IQR</mml:mi>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mi>Q</mml:mi>
<mml:mn>3</mml:mn>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi>Q</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math id="M8">
<mml:msub>
<mml:mi>Q</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M9">
<mml:msub>
<mml:mi>Q</mml:mi>
<mml:mn>3</mml:mn>
</mml:msub>
</mml:math>
</inline-formula> are the first and third quartiles, respectively. A data point x is considered an outlier if <inline-formula>
<mml:math id="M10">
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mo stretchy="true">&#x2329;</mml:mo>
<mml:msub>
<mml:mi>Q</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1.5</mml:mn>
<mml:mo>&#x2217;</mml:mo>
<mml:mi>IQR</mml:mi>
<mml:mspace width="0.25em"/>
<mml:mtext>or</mml:mtext>
<mml:mspace width="0.25em"/>
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mo stretchy="true">&#x232A;</mml:mo>
<mml:msub>
<mml:mi>Q</mml:mi>
<mml:mn>3</mml:mn>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:mn>1.5</mml:mn>
<mml:mo>&#x2217;</mml:mo>
<mml:mi>IQR</mml:mi>
<mml:mo>.</mml:mo>
</mml:math>
</inline-formula></p>
</sec>
<sec id="sec10">
<label>3.2.2</label>
<title>Feature scaling</title>
<p>Min-max normalization was used to rescale all numerical features to a standard range [0, 1]. For each feature value x, the normalized value x&#x2032; is computed as:</p>
<disp-formula id="E1">
<mml:math id="M11">
<mml:msup>
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mo>'</mml:mo>
</mml:msup>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mo>min</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>/</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mo>max</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:mo>min</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi mathvariant="normal">x</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</disp-formula>
<p>This ensures that all input features contribute proportionally during model training.</p>
</sec>
<sec id="sec11">
<label>3.2.3</label>
<title>Temporal averaging</title>
<p>To reduce temporal noise in RSSI measurements, a sliding window average was applied. For a signal <italic>s</italic>(<italic>t</italic>) over time, the smoothed signal <inline-formula>
<mml:math id="M12">
<mml:mi>s</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> is given by <xref ref-type="disp-formula" rid="EQ3">Equation 3</xref>:</p>
<disp-formula id="EQ3">
<label>(3)</label>
<mml:math id="M13">
<mml:mi>s</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mo stretchy="true">(</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>&#x2217;</mml:mo>
<mml:mi>&#x03A3;</mml:mi>
<mml:mo>_</mml:mo>
<mml:mo stretchy="true">{</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>0</mml:mn>
<mml:mo stretchy="true">}</mml:mo>
<mml:mo>&#x0302;</mml:mo>
<mml:mo stretchy="true">{</mml:mo>
<mml:mi>N</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="true">}</mml:mo>
<mml:mspace width="0.25em"/>
<mml:mi>s</mml:mi>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>t</mml:mi>
<mml:mo>&#x2212;</mml:mo>
<mml:mi>i</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</disp-formula>
<p>where <italic>N</italic> is the window size.</p>
<p>Finally, the dataset was split into three subsets: 80% for training, 10% for validation, and 10% for testing. Stratified sampling was used to maintain the proportional distribution of environmental classes (open, suburban, and dense urban) across all subsets, ensuring fair evaluation of model performance.</p>
</sec>
</sec>
<sec id="sec12">
<label>3.3</label>
<title>Feature importance analysis</title>
<p>Feature importance analysis was conducted to identify the most significant factors affecting path loss prediction accuracy. <xref ref-type="fig" rid="fig2">Figure 2</xref> shows the relative importance of different features in the XGBoost model.</p>
<fig position="float" id="fig2">
<label>Figure 2</label>
<caption>
<p>Feature importance in XGBoost model.</p>
</caption>
<graphic xlink:href="frai-08-1597981-g002.tif">
<alt-text content-type="machine-generated">Bar chart showing average feature importances across three case studies. Distance is the most important feature at 89.49%, followed by Angle_Tx_Rx at 5.73%, and Speed at 4.79%.</alt-text>
</graphic>
</fig>
<p>The analysis reveals that distance, environmental class, and transmitter height are the most significant factors affecting path loss prediction accuracy. While receiver latitude and longitude also show high importance, this is primarily because they implicitly capture the road network geometry and building distribution patterns in our study areas.</p>
<p>Regarding the use of latitude and longitude as features, we acknowledge that an intrinsic coordinate system (such as a polar coordinate system with radius&#x202F;=&#x202F;distance and angle with origin at the RSU might be theoretically more elegant. However, we found that latitude and longitude provided practical advantages in our specific dataset, as they inherently encode spatial relationships between the transmitter and receiver within the urban landscape. Future work could explore alternative coordinate systems to potentially improve model performance further. The most critical hyperparameters include in <xref ref-type="table" rid="tab1">Table 1</xref>.</p>
<table-wrap position="float" id="tab1">
<label>Table 1</label>
<caption>
<p>XGBoost various parameters.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Hyperparameter</th>
<th align="center" valign="top">Value</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="bottom">Number of boosting rounds</td>
<td align="center" valign="bottom">500</td>
</tr>
<tr>
<td align="left" valign="bottom">Learning rate (shrinkage)</td>
<td align="center" valign="bottom">0.05</td>
</tr>
<tr>
<td align="left" valign="bottom">Maximum depth of trees</td>
<td align="center" valign="bottom">6</td>
</tr>
<tr>
<td align="left" valign="bottom">Minimum sum of instance weight</td>
<td align="center" valign="bottom">5</td>
</tr>
<tr>
<td align="left" valign="bottom">Minimum loss reduction</td>
<td align="center" valign="bottom">0.1</td>
</tr>
<tr>
<td align="left" valign="bottom">Subsample ratio of training data</td>
<td align="center" valign="bottom">0.8</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="sec13">
<label>3.4</label>
<title>Environmental classification</title>
<p>We classify the urban environments into three categories based on building density, street layout, and vegetation coverage:</p>
<list list-type="bullet">
<list-item>
<p><italic>Open Urban Environment:</italic> Characterized by wide streets, low building density (&#x003C;50 buildings/km<sup>2</sup>), and minimal vegetation. These areas typically have excellent line-of-sight (LOS) conditions between the RSU and vehicles.</p>
</list-item>
<list-item>
<p><italic>Suburban Environment:</italic> Medium building density (50&#x2013;200 buildings/km<sup>2</sup>), moderate street widths, and variable vegetation. These areas feature a mix of LOS and non-line-of-sight (NLOS) conditions.</p>
</list-item>
<list-item>
<p><italic>Dense Urban Environment:</italic> High building density (&#x003E;200 buildings/km<sup>2</sup>), narrow streets, and urban canyons. These areas predominantly have NLOS conditions with significant multipath effects. The classification methodology builds upon the urban propagation environment categorization proposed by <xref ref-type="bibr" rid="ref9002">Neskovic et al. (2001)</xref>, who established the correlation between building density metrics and radio propagation characteristics.</p>
</list-item>
</list>
<p>This classification system provides a structured approach to understanding how different urban characteristics affect signal propagation and path loss prediction.</p>
</sec>
<sec id="sec14">
<label>3.5</label>
<title>Path loss prediction models</title>
<p>This section presents a comprehensive overview of the four path loss prediction models evaluated in this study, each representing a different class of modeling approaches. As shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>, we begin with the dual slope model, an empirical formulation known for its simplicity and effectiveness in capturing distinct propagation characteristics at varying distance intervals. This is followed by the 3GPP model, a standardized model widely adopted in cellular communication research. The last two models, XGBoost and MLP, belong to the machine learning domain and leverage data-driven training to enhance prediction accuracy. Together, these models provide a diverse analytical foundation for evaluating path loss behavior in V2I communication scenarios.</p>
<fig position="float" id="fig3">
<label>Figure 3</label>
<caption>
<p>Block diagram of evaluated path loss prediction models categorized by modeling approach.</p>
</caption>
<graphic xlink:href="frai-08-1597981-g003.tif">
<alt-text content-type="machine-generated">Flowchart of Path Loss Prediction Models with three branches. The first branch is &#x201C;Empirical Model&#x201D; leading to &#x201C;Dual Slope.&#x201D; The second is &#x201C;Standardized Model&#x201D; leading to &#x201C;3GPP TR thirty eight point nine hundred one.&#x201D; The third is &#x201C;Machine Learning Models,&#x201D; branching into &#x201C;XGBoost&#x201D; and &#x201C;MLP.&#x201D;</alt-text>
</graphic>
</fig>
<sec id="sec15">
<label>3.5.1</label>
<title>Dual slope model</title>
<p>The Dual Slope model is an empirical approach that accounts for distinct propagation characteristics at different distances. It is mathematically represented in <xref ref-type="disp-formula" rid="EQ4">Equation 4</xref>:</p>
<disp-formula id="EQ4">
<label>(4)</label>
<mml:math id="M14">
<mml:mtable columnalign="left" displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">PL</mml:mi>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>d</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>=</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo stretchy="true">{</mml:mo>
<mml:mtable equalrows="true" equalcolumns="true">
<mml:mtr>
<mml:mtd>
<mml:mi>P</mml:mi>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:msub>
<mml:mo>log</mml:mo>
<mml:mn>10</mml:mn>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>d</mml:mi>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>,</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mi>d</mml:mi>
<mml:mo>&#x003C;</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mtext mathvariant="italic">break</mml:mtext>
</mml:msub>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi>P</mml:mi>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
<mml:msub>
<mml:mo>log</mml:mo>
<mml:mn>10</mml:mn>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mtext mathvariant="italic">break</mml:mtext>
</mml:msub>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:msub>
<mml:mo>log</mml:mo>
<mml:mn>10</mml:mn>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>d</mml:mi>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mtext mathvariant="italic">break</mml:mtext>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>,</mml:mo>
</mml:mtd>
<mml:mtd>
<mml:mi>d</mml:mi>
<mml:mo>&#x2265;</mml:mo>
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mtext mathvariant="italic">break</mml:mtext>
</mml:msub>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</disp-formula>
<p>where: <inline-formula>
<mml:math id="M15">
<mml:mi mathvariant="italic">PL</mml:mi>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>d</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
</mml:math>
</inline-formula> represents the predicted path loss at distance <inline-formula>
<mml:math id="M16">
<mml:mi>d</mml:mi>
</mml:math>
</inline-formula>; <inline-formula>
<mml:math id="M17">
<mml:mi>P</mml:mi>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:math>
</inline-formula> denotes the reference path loss at <inline-formula>
<mml:math id="M18">
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:math>
</inline-formula>; <inline-formula>
<mml:math id="M19">
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mtext mathvariant="italic">break</mml:mtext>
</mml:msub>
</mml:math>
</inline-formula> is the breakpoint distance; <inline-formula>
<mml:math id="M20">
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:math>
</inline-formula> and <inline-formula>
<mml:math id="M21">
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
</mml:math>
</inline-formula>are the path loss exponents before and after <inline-formula>
<mml:math id="M22">
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mtext mathvariant="italic">break</mml:mtext>
</mml:msub>
</mml:math>
</inline-formula>.</p>
<p>The parameters <inline-formula>
<mml:math id="M23">
<mml:mi>P</mml:mi>
<mml:msub>
<mml:mi>L</mml:mi>
<mml:mn>0</mml:mn>
</mml:msub>
</mml:math>
</inline-formula>, <inline-formula>
<mml:math id="M24">
<mml:msub>
<mml:mi>d</mml:mi>
<mml:mtext mathvariant="italic">break</mml:mtext>
</mml:msub>
</mml:math>
</inline-formula>, <inline-formula>
<mml:math id="M25">
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>1</mml:mn>
</mml:msub>
</mml:math>
</inline-formula>, and <inline-formula>
<mml:math id="M26">
<mml:msub>
<mml:mi>n</mml:mi>
<mml:mn>2</mml:mn>
</mml:msub>
<mml:mspace width="0.25em"/>
</mml:math>
</inline-formula>were individually estimated for each case study using a least-squares curve-fitting approach, minimizing the error between the model&#x2019;s predictions and the measured path loss data.</p>
<p>Several studies (32&#x2013;34) have explored the effectiveness of the Dual Slope model in different environments.</p>
</sec>
<sec id="sec16">
<label>3.5.2</label>
<title>3GPP model</title>
<p>The 3GPP model is a standardized path loss model developed by the 3rd Generation Partnership Project for various wireless communication scenarios. For urban V2I communications at 5.9&#x202F;GHz, the model is defined in <xref ref-type="disp-formula" rid="EQ5">Equation 5</xref>:</p>
<disp-formula id="EQ5">
<label>(5)</label>
<mml:math id="M27">
<mml:mtable columnalign="left" displaystyle="true">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="italic">PL</mml:mi>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>d</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>40</mml:mn>
<mml:mo>&#x2217;</mml:mo>
<mml:msub>
<mml:mo>log</mml:mo>
<mml:mn>10</mml:mn>
</mml:msub>
<mml:mspace width="0.33em"/>
<mml:mo stretchy="true">(</mml:mo>
<mml:mi>d</mml:mi>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mn>7.8</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>18</mml:mn>
<mml:mo>&#x2217;</mml:mo>
<mml:msub>
<mml:mo>log</mml:mo>
<mml:mn>10</mml:mn>
</mml:msub>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi mathvariant="italic">BS</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>18</mml:mn>
<mml:mo>&#x2217;</mml:mo>
<mml:msub>
<mml:mo>log</mml:mo>
<mml:mn>10</mml:mn>
</mml:msub>
<mml:mspace width="0.33em"/>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi mathvariant="italic">MS</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mn>2</mml:mn>
<mml:mo>&#x2217;</mml:mo>
<mml:msub>
<mml:mo>log</mml:mo>
<mml:mn>10</mml:mn>
</mml:msub>
<mml:mspace width="0.33em"/>
<mml:mo stretchy="true">(</mml:mo>
<mml:mfrac>
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
<mml:mn>5.0</mml:mn>
</mml:mfrac>
<mml:mo stretchy="true">)</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
</disp-formula>
<p>where <italic>d</italic> is the distance between the transmitter and receiver in kilometers,<inline-formula>
<mml:math id="M28">
<mml:mspace width="0.25em"/>
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi mathvariant="italic">BS</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the base station (RSU) height in meters, <inline-formula>
<mml:math id="M29">
<mml:msub>
<mml:mi>h</mml:mi>
<mml:mi mathvariant="italic">MS</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the mobile station (vehicle) height in meters, and <inline-formula>
<mml:math id="M30">
<mml:msub>
<mml:mi>f</mml:mi>
<mml:mi>c</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>is the carrier frequency in GHz.</p>
</sec>
<sec id="sec17">
<label>3.5.3</label>
<title>XGBoost model</title>
<p>XGBoost is an ensemble learning method that uses a gradient boosting framework to build a collection of decision trees. The model was implemented with the following hyperparameters:</p>
<list list-type="simple">
<list-item>
<p>Number of estimators: 100</p>
</list-item>
<list-item>
<p>Maximum depth: 6</p>
</list-item>
<list-item>
<p>Learning rate: 0.1</p>
</list-item>
<list-item>
<p>Subsample ratio: 0.8</p>
</list-item>
<list-item>
<p>Column sample by tree: 0.8</p>
</list-item>
<list-item>
<p>Minimum child weight: 1</p>
</list-item>
<list-item>
<p>Regularization alpha: 0</p>
</list-item>
<list-item>
<p>Regularization lambda: 1</p>
</list-item>
</list>
<p>The total number of learnable parameters in our XGBoost model is approximately 6,500, varying slightly based on the specific tree structures learned during training. Mean squared error was used as the loss function during training.</p>
</sec>
<sec id="sec18">
<label>3.5.4</label>
<title>MLP model</title>
<p>The MLP is a feedforward artificial neural network with multiple layers of nodes. Our implementation consists of:</p>
<list list-type="simple">
<list-item>
<p>Input layer: 8 neurons (corresponding to our feature set)</p>
</list-item>
<list-item>
<p>Hidden layer 1: 64 neurons with ReLU activation</p>
</list-item>
<list-item>
<p>Hidden layer 2: 32 neurons with ReLU activation</p>
</list-item>
<list-item>
<p>Hidden layer 3: 16 neurons with ReLU activation</p>
</list-item>
<list-item>
<p>Output layer: 1 neuron with linear activation</p>
</list-item>
</list>
<p>Additional architectural details:</p>
<list list-type="simple">
<list-item>
<p>- Dropout rate of 0.2 between layers to prevent overfitting</p>
</list-item>
<list-item>
<p>- Batch normalization after each hidden layer</p>
</list-item>
<list-item>
<p>- Adam optimizer with learning rate of 0.001</p>
</list-item>
<list-item>
<p>- Batch size of 32</p>
</list-item>
<list-item>
<p>- Early stopping with patience of 10 epochs</p>
</list-item>
<list-item>
<p>- Mean squared error as the loss function</p>
</list-item>
</list>
</sec>
<sec id="sec19">
<label>3.5.5</label>
<title>Performance metrics</title>
<p>To evaluate the accuracy and reliability of both models, several performance metrics were computed:</p>
<list list-type="bullet">
<list-item>
<p><italic>Mean Squared Error (MSE):</italic> Measures the average squared differences between predicted and actual values as shown in <xref ref-type="disp-formula" rid="EQ6">Equation 6</xref>:</p>
</list-item>
</list><disp-formula id="EQ6">
<label>(6)</label>
<mml:math id="M31">
<mml:mi mathvariant="italic">MSE</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi>
</mml:mfrac>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math id="M32">
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:math>
</inline-formula> is the actual path loss <inline-formula>
<mml:math id="M33">
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
</mml:math>
</inline-formula>is the predicted path loss, and <italic>n</italic> is the number of observations.</p>
<list list-type="bullet">
<list-item>
<p><italic>RMSE</italic>: Provides the standard deviation of residuals, indicating model precision as defined in <xref ref-type="disp-formula" rid="EQ7">Equation 7</xref>:</p>
</list-item>
</list>
<disp-formula id="EQ7">
<label>(7)</label>
<mml:math id="M34">
<mml:mtext mathvariant="italic">RMSE</mml:mtext>
<mml:mo>=</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi>
</mml:mfrac>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:msqrt>
</mml:math>
</disp-formula>
<list list-type="bullet">
<list-item>
<p><italic>R-squared (R<sup>2</sup>):</italic> measures the proportion of variance in the target variable explained by the model as shown in <xref ref-type="disp-formula" rid="EQ8">Equation 8</xref>, providing an overall indication of model fit. An R<sup>2</sup>value close to 1 suggests that the model accounts for most of the variance and makes accurate predictions. A value near 0 implies the model performs no better than simply predicting the mean of the observed data. Negative values indicate the model performs worse than this naive, mean-based prediction.</p>
</list-item>
</list><disp-formula id="EQ8">
<label>(8)</label>
<mml:math id="M35">
<mml:mi>R</mml:mi>
<mml:mn>2</mml:mn>
<mml:mtext mathvariant="italic">Score</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>&#x2212;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:mfrac>
</mml:math>
</disp-formula>
<list list-type="bullet">
<list-item>
<p><italic>Mean Absolute Error (MAE):</italic> Represents the average absolute differences between predicted and actual values as defined in <xref ref-type="disp-formula" rid="EQ9">Equation 9</xref>:</p>
</list-item>
</list><disp-formula id="EQ9">
<label>(9)</label>
<mml:math id="M36">
<mml:mi mathvariant="italic">MAE</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi>
</mml:mfrac>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:mo>&#x2223;</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="true">&#x0302;</mml:mo>
<mml:mo>&#x2223;</mml:mo>
</mml:mrow>
</mml:mover>
</mml:math>
</disp-formula>
<list list-type="bullet">
<list-item>
<p><italic>Standard Deviation of Residuals (Std Dev):</italic> Measures the spread of errors in prediction, indicating model consistency in <xref ref-type="disp-formula" rid="EQ10">Equation 10</xref>:</p>
</list-item>
</list><disp-formula id="EQ10">
<label>(10)</label>
<mml:math id="M37">
<mml:mi>&#x03C3;</mml:mi>
<mml:mo>=</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:mfrac>
<mml:mn>1</mml:mn>
<mml:mi>n</mml:mi>
</mml:mfrac>
<mml:munderover>
<mml:mo movablelimits="false">&#x2211;</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mi>n</mml:mi>
</mml:munderover>
<mml:msup>
<mml:mrow>
<mml:mo stretchy="true">(</mml:mo>
<mml:msub>
<mml:mi>y</mml:mi>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mover accent="true">
<mml:mi>y</mml:mi>
<mml:mo stretchy="true">&#x0302;</mml:mo>
</mml:mover>
<mml:mi>i</mml:mi>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:mover accent="true">
<mml:mi>e</mml:mi>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
<mml:mo stretchy="true">)</mml:mo>
</mml:mrow>
<mml:mn>2</mml:mn>
</mml:msup>
</mml:mrow>
</mml:msqrt>
</mml:math>
</disp-formula>
<p>where <inline-formula>
<mml:math id="M38">
<mml:mover accent="true">
<mml:mi>e</mml:mi>
<mml:mo stretchy="true">&#x00AF;</mml:mo>
</mml:mover>
</mml:math>
</inline-formula> is the mean error.</p>
<p>These metrics provide a comprehensive evaluation of the models&#x2019; predictive capabilities, with lower error values indicating higher accuracy.</p>
</sec>
</sec>
</sec>
<sec sec-type="results" id="sec20">
<label>4</label>
<title>Results and discussion</title>
<p>This section gives a clear picture of how different path loss prediction models perform in three case studies. Every case study represents a variant V2I communication situation. The XGBoost model is compared with the Dual Slope model, the 3GPP TR 38.901 model, and the MLP. The assessment utilizes benchmark metrics to verify the performance of the models, which include RMSE, MSE, MAE, and R<sup>2</sup>, as presented in <xref ref-type="table" rid="tab2">Table 2</xref>. The findings confirm that machine learning models are more accurate in outcome prediction, particularly XGBoost, with consistently lower error rates and higher R<sup>2</sup> values across all scenarios. This performance gain is especially evident in dynamic and complex urban environments, where the conventional models are likely to fail to capture the non-linear and non-stationary characteristics of radio wave propagation. These findings point out the potential of data-driven approaches for enhancing the accuracy and robustness of V2I channel modeling in future wireless systems.</p>
<table-wrap position="float" id="tab2">
<label>Table 2</label>
<caption>
<p>Performance metrics comparison: case studies.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Case study</th>
<th align="left" valign="top">Model</th>
<th align="center" valign="top">MSE (dB)</th>
<th align="center" valign="top">MAE (dB)</th>
<th align="center" valign="top">RMSE (db)</th>
<th align="center" valign="top">R<sup>2</sup></th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top" rowspan="4">1</td>
<td align="left" valign="top">3GPP (TR 38.901)</td>
<td align="center" valign="middle">261.71</td>
<td align="center" valign="middle">13.05</td>
<td align="center" valign="middle">16.18</td>
<td align="center" valign="middle">&#x2212;3.47</td>
</tr>
<tr>
<td align="left" valign="top">Dual slope</td>
<td align="center" valign="middle">21.86</td>
<td align="center" valign="middle">3.70</td>
<td align="center" valign="middle">4.68</td>
<td align="center" valign="middle">0.61</td>
</tr>
<tr>
<td align="left" valign="top">MLP</td>
<td align="center" valign="middle">0.19</td>
<td align="center" valign="middle">0.34</td>
<td align="center" valign="middle">0.43</td>
<td align="center" valign="middle">0.83</td>
</tr>
<tr>
<td align="left" valign="top">XGBoost</td>
<td align="center" valign="middle">0.06</td>
<td align="center" valign="middle">0.19</td>
<td align="center" valign="middle">0.25</td>
<td align="center" valign="middle">0.94</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">2</td>
<td align="left" valign="top">3GPP (TR 38.901)</td>
<td align="center" valign="bottom">150.97</td>
<td align="center" valign="bottom">11.11</td>
<td align="center" valign="bottom">12.29</td>
<td align="center" valign="bottom">&#x2212;1.32</td>
</tr>
<tr>
<td align="left" valign="top">Dual slope</td>
<td align="center" valign="middle">30.83</td>
<td align="center" valign="middle">4.51</td>
<td align="center" valign="middle">5.55</td>
<td align="center" valign="middle">0.52</td>
</tr>
<tr>
<td align="left" valign="top">XGBoost</td>
<td align="center" valign="middle">0.09</td>
<td align="center" valign="middle">0.20</td>
<td align="center" valign="middle">0.29</td>
<td align="center" valign="middle">0.92</td>
</tr>
<tr>
<td align="left" valign="top">MLP</td>
<td align="center" valign="middle">0.17</td>
<td align="center" valign="middle">0.30</td>
<td align="center" valign="middle">0.41</td>
<td align="center" valign="middle">0.83</td>
</tr>
<tr>
<td align="left" valign="top" rowspan="4">3</td>
<td align="left" valign="top">3GPP (TR 38.901)</td>
<td align="center" valign="bottom">220.25</td>
<td align="center" valign="bottom">12.85</td>
<td align="center" valign="bottom">14.84</td>
<td align="center" valign="bottom">&#x2212;0.72</td>
</tr>
<tr>
<td align="left" valign="top">Dual slope</td>
<td align="center" valign="middle">46.13</td>
<td align="center" valign="middle">5.53</td>
<td align="center" valign="middle">6.79</td>
<td align="center" valign="middle">0.64</td>
</tr>
<tr>
<td align="left" valign="top">XGBoost</td>
<td align="center" valign="middle">0.05</td>
<td align="center" valign="middle">0.16</td>
<td align="center" valign="middle">0.22</td>
<td align="center" valign="middle">0.95</td>
</tr>
<tr>
<td align="left" valign="top">MLP</td>
<td align="center" valign="middle">0.14</td>
<td align="center" valign="middle">0.28</td>
<td align="center" valign="middle">0.37</td>
<td align="center" valign="middle">0.86</td>
</tr>
</tbody>
</table>
</table-wrap>
<sec id="sec21">
<label>4.1</label>
<title>Case study 1: open area</title>
<p><xref ref-type="fig" rid="fig4">Figure 4</xref> illustrates the path loss behavior in an open area with minimal obstructions and compares the performance of machine learning models with traditional approaches. XGBoost performed exceptionally well, achieving an RMSE of just 0.25&#x202F;dB and an R<sup>2</sup> of 0.94, outperforming all other models. The MLP model performed well, accurately identifying subtle propagation effects, including ground reflections, with an RMSE of 0.43&#x202F;dB and an R<sup>2</sup> of 0.83. In contrast, the traditional models lagged considerably: the Dual Slope model achieved an RMSE of 4.68&#x202F;dB and an R<sup>2</sup> of 0.61, whereas the 3GPP TR 38.901 model performed much worse, with an RMSE of 16.18&#x202F;dB and an R<sup>2</sup> of &#x2212;3.47, indicating performance worse than a naive mean-value prediction. These results highlight the shortcomings of static, rule-based models for use in open settings where complicated and multidimensional signal propagation dynamics are the main concern</p>
<fig position="float" id="fig4">
<label>Figure 4</label>
<caption>
<p>Path loss vs. distance for different prediction models in open urban environment.</p>
</caption>
<graphic xlink:href="frai-08-1597981-g004.tif">
<alt-text content-type="machine-generated">Graph showing path loss in decibels against distance in meters. Measured data is marked with blue stars. The XGBoost model is represented by a red line, MLP by a green line, Dual Slope by a black dotted line, and 3GPP (TR 38.901) by a purple dashed line. Path loss increases with distance, with varying model predictions.</alt-text>
</graphic>
</fig>
</sec>
<sec id="sec22">
<label>4.2</label>
<title>Case study 2: suburban area</title>
<p><xref ref-type="fig" rid="fig5">Figure 5</xref> illustrates the path loss patterns in a suburban environment with moderate building density and partial obstructions. XGBoost demonstrated the highest accuracy, achieving an RMSE of 0.29&#x202F;dB and an R<sup>2</sup> of 0.92, while the MLP model also performed well, with an RMSE of 0.41&#x202F;dB and an R<sup>2</sup> of 0.83. In contrast, traditional models showed significantly lower accuracy: the Dual Slope model had an RMSE of 5.55&#x202F;dB and an R<sup>2</sup> of 0.52, and the 3GPP TR 38.901 model performed poorly, with an RMSE of 12.29&#x202F;dB and a negative R<sup>2</sup> of &#x2212;1.32. These results highlight the superior adaptability of machine learning models in handling the variability of suburban environments&#x2014;such as the presence of trees and mid-rise buildings&#x2014;compared to the static nature of traditional, formula-based models.</p>
<fig position="float" id="fig5">
<label>Figure 5</label>
<caption>
<p>Predicted vs. measured path loss for different models in suburban environment.</p>
</caption>
<graphic xlink:href="frai-08-1597981-g005.tif">
<alt-text content-type="machine-generated">Line chart showing path loss in decibels versus distance in meters. Measured data is marked with blue stars. A red line represents XGBoost predictions; a green line denotes MLP predictions. A dotted black line shows the Dual Slope model, and a dashed purple line indicates the 3GPP (TR 38.901) model. Path loss increases with distance.</alt-text>
</graphic>
</fig>
</sec>
<sec id="sec23">
<label>4.3</label>
<title>Case study 3: dense urban area</title>
<p><xref ref-type="fig" rid="fig6">Figure 6</xref> illustrates path loss behavior in dense urban areas characterized by tall buildings and pronounced multipath effects. XGBoost delivered outstanding performance, achieving an RMSE of 0.22&#x202F;dB and an R<sup>2</sup> of 0.95, while the MLP model also performed well, with an RMSE of 0.37&#x202F;dB and an R<sup>2</sup> of 0.86. In contrast, traditional models struggled to represent the complexity of the environment: the Dual Slope model recorded an RMSE of 6.79&#x202F;dB and an R<sup>2</sup> of 0.64, while the 3GPP TR 38.901 model fared the worst, with an RMSE of 14.84&#x202F;dB and a negative R<sup>2</sup> of &#x2212;0.72. These results underscore XGBoost&#x2019;s ability to capture nonlinear propagation effects such as shadowing and diffraction and clearly reveal the limitations of static, formula-based models like 3GPP in accurately modeling signal behavior in complex urban settings.</p>
<fig position="float" id="fig6">
<label>Figure 6</label>
<caption>
<p>Predicted vs. measured path loss for different models in dense urban.</p>
</caption>
<graphic xlink:href="frai-08-1597981-g006.tif">
<alt-text content-type="machine-generated">A line graph shows path loss in decibels versus distance in meters. It includes five curves: measured data, XGBoost prediction, MLP prediction, dual slope model, and 3GPP (TR 38.901) model. The measured data is scattered with blue stars, XGBoost is a red line, MLP is a green line, dual slope is a dotted black line, and 3GPP TR 38.901 is a dashed purple line. Path loss increases with distance, with each model showing different accuracy levels.</alt-text>
</graphic>
</fig>
</sec>
<sec id="sec24">
<label>4.4</label>
<title>Environmental impact on path loss</title>
<p>Our analysis indicates that environmental factors significantly influence path loss characteristics. <xref ref-type="table" rid="tab3">Table 3</xref> summarizes the critical variables affecting signal reliability in different urban environments and proposes specific actions for improving connectivity. <xref ref-type="bibr" rid="ref9008">Gozalvez et al. (2012)</xref> similarly observed that environmental context awareness is crucial for optimizing V2I communication systems, particularly in heterogeneous urban settings. In open urban environments, transmission power optimization can significantly improve connectivity. However, in dense urban environments with sharp road curves, increasing transmission power provides minimal benefits, and strategic RSU placement becomes more critical.</p>
<table-wrap position="float" id="tab3">
<label>Table 3</label>
<caption>
<p>Critical variables and recommended actions for different urban environments.</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="top">Environment</th>
<th align="left" valign="top">Critical variables</th>
<th align="left" valign="top">Recommended actions</th>
</tr>
</thead>
<tbody>
<tr>
<td align="left" valign="top">Open urban</td>
<td align="left" valign="top">Transmission power, antenna height</td>
<td align="left" valign="top">Optimize transmission power, adjust antenna height based on coverage requirements</td>
</tr>
<tr>
<td align="left" valign="top">Suburban</td>
<td align="left" valign="top">Vegetation density, road curvature</td>
<td align="left" valign="top">Increase antenna height above vegetation, strategic RSU placement at road curves</td>
</tr>
<tr>
<td align="left" valign="top">Dense urban</td>
<td align="left" valign="top">Building density, street width</td>
<td align="left" valign="top">Deploy multiple RSUs with overlapping coverage, position RSUs at street intersections</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
</sec>
<sec sec-type="conclusions" id="sec25">
<label>5</label>
<title>Conclusion</title>
<p>This study demonstrates that machine learning approaches, particularly XGBoost, consistently outperform traditional path loss prediction models in urban V2I communication systems. The superior performance of ML models is attributed to their ability to learn non-linear signal fluctuations and environmental factors directly from empirical data.</p>
<p>Future work could explore hybrid models combining physics-based and data-driven approaches, investigate alternative coordinate systems for feature representation, and extend the analysis to additional frequency bands relevant to emerging V2I communication standards.</p>
</sec>
</body>
<back>
<sec sec-type="data-availability" id="sec26">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec sec-type="author-contributions" id="sec27">
<title>Author contributions</title>
<p>MB: Writing &#x2013; review &#x0026; editing, Writing &#x2013; original draft. JC: Writing &#x2013; review &#x0026; editing. MH: Writing &#x2013; review &#x0026; editing. JT: Writing &#x2013; review &#x0026; editing.</p>
</sec>
<sec sec-type="funding-information" id="sec28">
<title>Funding</title>
<p>The author(s) declare that no financial support was received for the research and/or publication of this article.</p>
</sec>
<ack>
<p>The authors express their thanks and appreciation to Dr. Mate Boban for providing access to the experimental data set.</p>
</ack>
<sec sec-type="COI-statement" id="sec29">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="ai-statement" id="sec30">
<title>Generative AI statement</title>
<p>The authors declare that no Gen AI was used in the creation of this manuscript.</p>
</sec>
<sec sec-type="disclaimer" id="sec31">
<title>Publisher&#x2019;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="ref1"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Abbas</surname> <given-names>T.</given-names></name> <name><surname>K&#x00E5;redal</surname> <given-names>J.</given-names></name> <name><surname>Tufvesson</surname> <given-names>F.</given-names></name></person-group> (<year>2013</year>). <article-title>Measurement-based analysis of vehicle-to-vehicle propagation channels for ITS applications</article-title>. <source>IEEE Trans. Intell. Transp. Syst.</source> <volume>14</volume>, <fpage>755</fpage>&#x2013;<lpage>765</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TITS.2013.2251905</pub-id></citation></ref>
<ref id="ref2"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Akpaida</surname> <given-names>V. O. A.</given-names></name> <name><surname>Anyasi</surname> <given-names>F. I.</given-names></name> <name><surname>Uzairue</surname> <given-names>S. I.</given-names></name> <name><surname>Idim</surname> <given-names>A. I.</given-names></name></person-group> (<year>2018</year>). <article-title>Determination of an outdoor path loss model and signal penetration level in some selected modern residential and office apartments in Ogbomosho, Oyo state, Nigeria</article-title>. <source>J. Eng. Res. Rep.</source>, <fpage>1</fpage>&#x2013;<lpage>25</lpage>. doi: <pub-id pub-id-type="doi">10.9734/JERR/2018/v1i39775</pub-id></citation></ref>
<ref id="ref3"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Bishop</surname> <given-names>C. M.</given-names></name></person-group> (<year>2006</year>). <source>Pattern recognition and machine learning</source>. <publisher-loc>Berlin</publisher-loc>: <publisher-name>Springer</publisher-name>.</citation></ref>
<ref id="ref4"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Boban</surname> <given-names>M.</given-names></name> <name><surname>Vinhoza</surname> <given-names>T. T.</given-names></name> <name><surname>Tonguz</surname> <given-names>O. K.</given-names></name> <name><surname>Barros</surname> <given-names>J.</given-names></name> <name><surname>Almeida</surname> <given-names>E.</given-names></name></person-group> (<year>2011</year>). <article-title>Impact of vehicles as obstacles in vehicular communication environments</article-title>. <source>IEEE J Sel Areas Commun</source> <volume>29</volume>, <fpage>15</fpage>&#x2013;<lpage>28</lpage>. doi: <pub-id pub-id-type="doi">10.1109/JSAC.2011.110102</pub-id></citation></ref>
<ref id="ref9003"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chang</surname> <given-names>P. R.</given-names></name> <name><surname>Yang</surname> <given-names>W.-H.</given-names></name></person-group> (<year>1997</year>). &#x201C;<article-title>Environment-adaptation mobile radio propagation prediction using radial basis function neural networks</article-title>,&#x201D; in <source>IEEE Transactions on Vehicular Technology</source>, vol. <volume>46</volume>, pp. <fpage>155</fpage>&#x2013;<lpage>160</lpage>.</citation></ref>
<ref id="ref5"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Devlin</surname> <given-names>J.</given-names></name> <name><surname>Chang</surname> <given-names>M. W.</given-names></name> <name><surname>Lee</surname> <given-names>K.</given-names></name> <name><surname>Toutanova</surname> <given-names>K.</given-names></name></person-group> (<year>2019</year>). <article-title>BERT: pre-training of deep bidirectional transformers for language understanding</article-title>. <source>Proc. NAACL-HLT 2019</source> <volume>1</volume>, <fpage>4171</fpage>&#x2013;<lpage>4186</lpage>. doi: <pub-id pub-id-type="doi">10.48550/arXiv.1810.04805</pub-id></citation></ref>
<ref id="ref6"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fern&#x00E1;ndez</surname> <given-names>H.</given-names></name> <name><surname>Rubio</surname> <given-names>L.</given-names></name> <name><surname>Rodrigo Pe&#x00F1;arrocha</surname> <given-names>V. M.</given-names></name> <name><surname>Reig</surname> <given-names>J.</given-names></name></person-group> (<year>2024</year>). <article-title>Dual-slope path loss model for integrating vehicular sensing applications in urban and suburban environment</article-title>. <source>Sensors</source> <volume>24</volume>:<fpage>4334</fpage>. doi: <pub-id pub-id-type="doi">10.3390/s24134334</pub-id></citation></ref>
<ref id="ref7"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Filippi</surname> <given-names>A.</given-names></name> <name><surname>Bazzi</surname> <given-names>A.</given-names></name></person-group> (<year>2021</year>). <article-title>Impact of RSU height on V2I communication performance in urban environments</article-title>. <conf-name>IEEE Vehicular Technology Conference (VTC)</conf-name>, <fpage>1</fpage>&#x2013;<lpage>6</lpage>. doi: <pub-id pub-id-type="doi">10.1109/VTC2021.9473721</pub-id></citation></ref>
<ref id="ref9007"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fraile</surname> <given-names>R.</given-names></name> <name><surname>Rubio</surname> <given-names>L.</given-names></name> <name><surname>Cardona</surname> <given-names>N.</given-names></name></person-group> (<year>2000</year>). &#x201C;<article-title>Application of RBF neural networks to the prediction of propagation loss over irregular terrain</article-title>,&#x201D; in <source>Proceedings of the Vehicular Technology Conference Fall (VTC)</source>, vol. <volume>2</volume>, pp. <fpage>878</fpage>&#x2013;<lpage>884</lpage>.</citation></ref>
<ref id="ref8"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gho</surname> <given-names>S.</given-names></name> <name><surname>Lee</surname> <given-names>J.</given-names></name> <name><surname>Kim</surname> <given-names>H.</given-names></name></person-group> (<year>2019</year>). <article-title>A comparative study on dimensionality reduction techniques for wireless channel modeling</article-title>. <source>IEEE Commun. Surv. Tutor.</source> <volume>21</volume>, <fpage>2157</fpage>&#x2013;<lpage>2178</lpage>. doi: <pub-id pub-id-type="doi">10.1109/COMST.2019.2917261</pub-id></citation></ref>
<ref id="ref9"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Goodfellow</surname> <given-names>I.</given-names></name> <name><surname>Bengio</surname> <given-names>Y.</given-names></name> <name><surname>Courville</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). <source>Deep learning</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation></ref>
<ref id="ref10"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gozalvez</surname> <given-names>J.</given-names></name> <name><surname>Sepulcre</surname> <given-names>M.</given-names></name></person-group> (<year>2022</year>). <article-title>Adaptive Kalman filtering for V2X path loss estimation in urban areas</article-title>. <source>IEEE Trans. Intell. Veh.</source> <volume>7</volume>, <fpage>421</fpage>&#x2013;<lpage>432</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TIV.2022.3145621</pub-id></citation></ref>
<ref id="ref9008"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gozalvez</surname> <given-names>J.</given-names></name> <name><surname>Sepulcre</surname> <given-names>M.</given-names></name> <name><surname>Bauza</surname> <given-names>R.</given-names></name></person-group> (<year>2012</year>). &#x201C;<article-title>IEEE802.11p vehicle to infrastructure communications in urban environments</article-title>,&#x201D; in <source>IEEE Communications Magazine</source>, vol. <volume>50</volume>, pp. <fpage>176</fpage>&#x2013;<lpage>183</lpage>.</citation></ref>
<ref id="ref9009"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hata</surname> <given-names>M.</given-names></name></person-group> (<year>1980</year>). &#x201C;<article-title>Empirical formula for propagation loss in land mobile radio services</article-title>,&#x201D; in <source>IEEE Transactions on Vehicular Technology</source>, vol. <volume>29</volume>, pp. <fpage>317</fpage>&#x2013;<lpage>325</lpage>, 1980. doi: <pub-id pub-id-type="doi">10.1109/T-VT.1980.23859</pub-id></citation></ref>
<ref id="ref11"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hoomod</surname> <given-names>H. K.</given-names></name> <name><surname>Al-Mejibli</surname> <given-names>I.</given-names></name> <name><surname>Jabboory</surname> <given-names>A. I.</given-names></name></person-group> (<year>2018</year>). <article-title>Analyzing study of path loss propagation models in wireless communications at 0.8 GHz</article-title>. <source>J. Phys. Conf. Ser.</source> <volume>1003</volume>, <fpage>1</fpage>&#x2013;<lpage>8</lpage>. doi: <pub-id pub-id-type="doi">10.1088/1742-6596/1003/1/012028</pub-id></citation></ref>
<ref id="ref9010"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>Q.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name></person-group> (<year>2022a</year>). &#x201C;<article-title>Artificial intelligence enabled radio propagation for communications&#x2014;Part I: Channel characterization and antenna channel optimization</article-title>,&#x201D; in <source>IEEE Transactions on Antennas and Propagation</source>, vol. <volume>70</volume>, pp. <fpage>3939</fpage>&#x2013;<lpage>3954</lpage>.</citation></ref>
<ref id="ref9011"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Huang</surname> <given-names>J.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Wang</surname> <given-names>Q.</given-names></name> <name><surname>Chen</surname> <given-names>X.</given-names></name> <name><surname>Liu</surname> <given-names>Y.</given-names></name></person-group> (<year>2022b</year>). &#x201C;<article-title>Artificial intelligence enabled radio propagation for communications&#x2014;Part II: Scenario identification and channel modeling,</article-title>&#x201D; in <source>IEEE Transactions on Antennas and Propagation</source>, vol. <volume>70</volume>, pp. <fpage>3955</fpage>&#x2013;<lpage>3969</lpage>.</citation></ref>
<ref id="ref12"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kenney</surname> <given-names>J. B.</given-names></name></person-group> (<year>2011</year>). <article-title>Dedicated short-range communications (DSRC) standards in the United States</article-title>. <source>Proc. IEEE</source> <volume>99</volume>, <fpage>1162</fpage>&#x2013;<lpage>1182</lpage>. doi: <pub-id pub-id-type="doi">10.1109/JPROC.2011.2132790</pub-id></citation></ref>
<ref id="ref13"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Krizhevsky</surname> <given-names>A.</given-names></name> <name><surname>Sutskever</surname> <given-names>I.</given-names></name> <name><surname>Hinton</surname> <given-names>G. E.</given-names></name></person-group> (<year>2012</year>) <article-title>ImageNet classification with deep convolutional neural networks</article-title>. <conf-name>Advances in neural information processing systems</conf-name>, <volume>25</volume>, <fpage>1097</fpage>&#x2013;<lpage>1105</lpage>.</citation></ref>
<ref id="ref9001"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Molisch</surname> <given-names>A. F.</given-names></name></person-group> (<year>2011</year>). <source>Wireless Communications</source>. <publisher-loc>West Sussex, UK</publisher-loc>: <publisher-name>Wiley</publisher-name>.</citation></ref>
<ref id="ref14"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Murphy</surname> <given-names>K. P.</given-names></name></person-group> (<year>2012</year>). <source>Machine learning: A probabilistic perspective</source>. <publisher-loc>Cambridge, MA</publisher-loc>: <publisher-name>MIT Press</publisher-name>.</citation></ref>
<ref id="ref9002"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Neskovic</surname> <given-names>A.</given-names></name> <name><surname>Neskovic</surname> <given-names>N.</given-names></name> <name><surname>Paunovic</surname> <given-names>D.</given-names></name></person-group> (<year>2001</year>). &#x201C;<article-title>ANN microcell electric field level prediction model</article-title>,&#x201D; in <source>Proceedings of EUROCON International Conference on Trends in Communications Technology and Programming</source>, vol. <volume>1</volume>, pp. <fpage>128</fpage>&#x2013;<lpage>131</lpage>.</citation></ref>
<ref id="ref15"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Oroza</surname> <given-names>C. A.</given-names></name> <name><surname>Zhang</surname> <given-names>Z.</given-names></name> <name><surname>Watteyne</surname> <given-names>T.</given-names></name> <name><surname>Glaser</surname> <given-names>S. D.</given-names></name></person-group> (<year>2017</year>). <article-title>A machine-learning-based connectivity model for complex terrain large-scale low-power wireless deployments</article-title>. <source>IEEE Trans. Cogn. Commun. Netw.</source> <volume>3</volume>, <fpage>576</fpage>&#x2013;<lpage>584</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TCCN.2017.2765439</pub-id></citation></ref>
<ref id="ref16"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>P&#x00E4;tzold</surname> <given-names>M.</given-names></name> <name><surname>De Nardis</surname> <given-names>L.</given-names></name></person-group> (<year>2020</year>). <article-title>Ray-tracing-based path loss models for V2X communication at 5.9 GHz</article-title>. <source>IEEE Wirel. Commun.</source> <volume>27</volume>, <fpage>56</fpage>&#x2013;<lpage>62</lpage>. doi: <pub-id pub-id-type="doi">10.1109/MWC.2020.9182765</pub-id></citation></ref>
<ref id="ref9004"><citation citation-type="book"><person-group person-group-type="author"><name><surname>Rappaport</surname> <given-names>T. S.</given-names></name></person-group> (<year>2002</year>). <source>Wireless Communications: Principles and Practice</source>, <edition>2nd ed.</edition> <publisher-loc>Upper Saddle River, NJ, USA</publisher-loc>: <publisher-name>Prentice Hall</publisher-name>.</citation></ref>
<ref id="ref17"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tarhuni</surname> <given-names>M.</given-names></name> <name><surname>Ouni</surname> <given-names>A.</given-names></name></person-group> (<year>2022</year>). <article-title>Path loss models for V2I communications in urban environments: performance analysis and improvements</article-title>. <source>IEEE Access</source> <volume>10</volume>, <fpage>45123</fpage>&#x2013;<lpage>45138</lpage>. doi: <pub-id pub-id-type="doi">10.1109/ACCESS.2022.3165432</pub-id></citation></ref>
<ref id="ref9005"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thrane</surname> <given-names>J.</given-names></name> <name><surname>Madsen</surname> <given-names>T. K.</given-names></name> <name><surname>Nielsen</surname> <given-names>J. J.</given-names></name></person-group> (<year>2020</year>). <article-title>Deep learning-based path loss prediction for urban environments</article-title>. <source>IEEE Transactions on Vehicular Technology</source>, <volume>69</volume>, <fpage>11592</fpage>&#x2013;<lpage>11601</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TVT.2020.3012345</pub-id></citation></ref>
<ref id="ref18"><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Uccellari</surname> <given-names>M.</given-names></name> <name><surname>Facchini</surname> <given-names>F.</given-names></name> <name><surname>Sola</surname> <given-names>M.</given-names></name> <name><surname>Sirignano</surname> <given-names>E.</given-names></name> <name><surname>Vitetta</surname> <given-names>G. M.</given-names></name> <name><surname>Barbieri</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2016</year>) <article-title>On the use of support vector machines for the prediction of propagation losses in smart metering systems</article-title>. <conf-name>IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP)</conf-name>, <publisher-name>Vietri sul Mare</publisher-name>, <publisher-loc>Italy</publisher-loc>, <fpage>1</fpage>&#x2013;<lpage>6</lpage>. doi: <pub-id pub-id-type="doi">10.1109/MLSP.2016.7738902</pub-id></citation></ref>
<ref id="ref9006"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>H.</given-names></name> <name><surname>Chen</surname> <given-names>F.</given-names></name> <name><surname>Yang</surname> <given-names>J.</given-names></name> <name><surname>Li</surname> <given-names>Z.</given-names></name> <name><surname>Zhang</surname> <given-names>M.</given-names></name> <name><surname>Wang</surname> <given-names>X.</given-names></name></person-group> (<year>2020</year>). &#x201C;<article-title>Received power prediction for suburban environment based on neural network</article-title>,&#x201D; in <source>Proceedings of the International Conference on Information Networking (ICOIN)</source>, pp. <fpage>35</fpage>&#x2013;<lpage>39</lpage>.</citation></ref>
<ref id="ref19"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>X.</given-names></name> <name><surname>Li</surname> <given-names>H.</given-names></name> <name><surname>Wang</surname> <given-names>Y.</given-names></name> <name><surname>Zhao</surname> <given-names>Q.</given-names></name></person-group> (<year>2023</year>). <article-title>An enhanced Hata model for high-frequency bands (3 GHz&#x2013;6 GHz) in urban environments</article-title>. <source>IEEE Trans. Antennas Propag.</source> <volume>71</volume>, <fpage>1234</fpage>&#x2013;<lpage>1245</lpage>. doi: <pub-id pub-id-type="doi">10.1109/TAP.2023.3256789</pub-id></citation></ref>
<ref id="ref20"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>T.</given-names></name> <name><surname>Liu</surname> <given-names>S.</given-names></name> <name><surname>Xiang</surname> <given-names>W.</given-names></name> <name><surname>Xu</surname> <given-names>L.</given-names></name> <name><surname>Qin</surname> <given-names>K.</given-names></name> <name><surname>Yan</surname> <given-names>X.</given-names></name></person-group> (<year>2019</year>). <article-title>A real-time channel prediction model based on neural networks for dedicated short-range communications</article-title>. <source>Sensors</source> <volume>19</volume>:<fpage>3541</fpage>. doi: <pub-id pub-id-type="doi">10.3390/s19163541</pub-id>, PMID: <pub-id pub-id-type="pmid">31412691</pub-id></citation></ref>
<ref id="ref21"><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname> <given-names>Y.</given-names></name> <name><surname>Wen</surname> <given-names>J.</given-names></name> <name><surname>Yang</surname> <given-names>G.</given-names></name> <name><surname>He</surname> <given-names>Z.</given-names></name> <name><surname>Wang</surname> <given-names>J.</given-names></name></person-group> (<year>2019</year>). <article-title>Path loss prediction based on machine learning: principle, method, and data expansion</article-title>. <source>Appl. Sci.</source> <volume>9</volume>:<fpage>1908</fpage>. doi: <pub-id pub-id-type="doi">10.3390/app9091908</pub-id></citation></ref>
</ref-list>
</back>
</article>