<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Phys.</journal-id>
<journal-title>Frontiers in Physics</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Phys.</abbrev-journal-title>
<issn pub-type="epub">2296-424X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">787185</article-id>
<article-id pub-id-type="doi">10.3389/fphy.2021.787185</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Physics</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>HetInf: Social Influence Prediction With Heterogeneous Graph Neural Network</article-title>
<alt-title alt-title-type="left-running-head">Gao et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">HetInf</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname>Gao</surname>
<given-names>Liqun</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1197699/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wang</surname>
<given-names>Haiyang</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhang</surname>
<given-names>Zhouran</given-names>
</name>
<xref ref-type="aff" rid="aff2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Zhuang</surname>
<given-names>Hongwu</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Zhou</surname>
<given-names>Bin</given-names>
</name>
<xref ref-type="aff" rid="aff1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
</contrib>
</contrib-group>
<aff id="aff1">
<sup>1</sup>
<institution>Key Lab of Software Engineering for Complex Systems, School of Computer, National University of Defense Technology</institution>, <addr-line>ChangSha</addr-line>, <country>China</country>
</aff>
<aff id="aff2">
<sup>2</sup>
<institution>Department of Materials Science and Engineering, National University of Defense Technology</institution>, <addr-line>ChangSha</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/101109/overview">Chengyi Xia</ext-link>, Tianjin University of Technology, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1505314/overview">Li Pan</ext-link>, Shanghai Jiao Tong University, China</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/817752/overview">Di Jin</ext-link>, Tianjin University, China</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Bin Zhou&#x2009;, <email>binzhou@nudt.edu.cn</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Social Physics, a section of the journal Frontiers in Physics</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>01</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>9</volume>
<elocation-id>787185</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>09</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>17</day>
<month>11</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Gao, Wang, Zhang, Zhuang and Zhou.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Gao, Wang, Zhang, Zhuang and Zhou</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>With the continuous enrichment of social network applications, such as TikTok, Weibo, Twitter, and others, social media have become an indispensable part of our lives. Web users can participate in their favorite events or pay attention to people they like. The &#x201c;heterogeneous&#x201d; influence between events and users can be effectively modeled, and users&#x2019; potential future behaviors can be predicted, so as to facilitate applications such as recommendations and online advertising. For example, a user&#x2019;s favorite live streaming host (user) recommends certain products (event), can we predict whether the user will buy these products in the future? The majority of studies are based on a homogeneous graph neural network to model the influence between users. However, these studies ignore the impact of events on users in reality. For instance, when users purchase commodities through live streaming channels, in addition to the factors of the host, the commodity is also a key factor that influences the behavior of users. This study designs an influence prediction model based on a heterogeneous neural network HetInf. Specifically, we first constructed the heterogeneous social influence network according to the relationship between event nodes and user nodes, then sampled the user heterogeneous subgraph for each user, extracted the relevant node features, and finally predicted the probability of user behavior through the heterogeneous neural network model. We conducted comprehensive experiments on two large social network datasets. Furthermore, the experimental results show that HetInf is significantly superior to the previous homogeneous neural network methods.</p>
</abstract>
<kwd-group>
<kwd>social network analysis</kwd>
<kwd>social influence analysis</kwd>
<kwd>heterogeneous neural network</kwd>
<kwd>user behavior prediction</kwd>
<kwd>deep learning</kwd>
</kwd-group>
<contract-num rid="cn002">61732022 61732004&#x20;62072131 61672020</contract-num>
<contract-sponsor id="cn001">National Key Research and Development Program of China<named-content content-type="fundref-id">10.13039/501100012166</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">National Natural Science Foundation of China<named-content content-type="fundref-id">10.13039/501100001809</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>1 Introduction</title>
<p>Nowadays, social networks are everywhere around us in our daily lives. Social influence occurs when we get information from social networks, which means that network events (such as network news, trending topics, publishing papers, or other events) or network users we are interested in constantly influence us through social media, and both of them can induce us to engage in social action (including retweet, comment, like, publish, and purchase). For example, live commerce is very popular nowadays, and we will choose our favorite live streaming host to buy necessary commodities. From another perspective, both the live streaming host (user) and the commodities (event) have a substantial impact on the target user&#x2019;s behavior. Similar to the definition of &#x201c;event&#x201d; in the study mentioned in reference <xref ref-type="bibr" rid="B1">1</xref>, social events can be regarded as a complete semantic unit in which network users participate and understand. One of the key computational problems is to predict the user&#x2019;s social behavior in social influence analysis. How to model the influence relationship to predict the behavior of network users on events is one of the key computational problems in user-level social influence prediction. This problem is applied to many fields, including but not limited to election [<xref ref-type="bibr" rid="B2">2</xref>], network marketing [<xref ref-type="bibr" rid="B3">3</xref>,<xref ref-type="bibr" rid="B4">4</xref>], recommendation [<xref ref-type="bibr" rid="B5">5</xref>], rumor detection [<xref ref-type="bibr" rid="B6">6</xref>], and information dissemination[<xref ref-type="bibr" rid="B7">7</xref>,<xref ref-type="bibr" rid="B8">8</xref>].</p>
<p>There are a large number of research studies on the role of the heterogeneity of nodes in social networks in social influence [<xref ref-type="bibr" rid="B9">9</xref>&#x2013;<xref ref-type="bibr" rid="B12">12</xref>]. This kind of study mainly focuses on user nodes&#x2019; interest in event nodes and predicts user behavior by capturing the influence of event nodes&#x2019; topic level. The study in reference <xref ref-type="bibr" rid="B10">10</xref> improves the traditional cascade propagation mode and applies the topic distribution methods to an independent cascade model and linear threshold model. The study in reference <xref ref-type="bibr" rid="B9">9</xref> uses a graph generation method to predict user behavior through the relationship between event topics and network users. These methods use traditional machine learning models to predict users&#x2019; social behavior through the manual feature representation of learning nodes. However, they do not consider the association between different types of network nodes in heterogeneous social networks, such as the dual impact of users and events on target users, which leads to the limited ability to capture the incentives that really affect user behavior.</p>
<p>Due to the progress of the graph neural network [<xref ref-type="bibr" rid="B13">13</xref>], the nodes of network have stronger representation ability. Many studies use graph neural network to model the problem of social influence prediction and make plenty of progress. The study in reference <xref ref-type="bibr" rid="B14">14</xref> uses the user&#x2019;s local network as the input of the graph neural network to learn the user&#x2019;s potential social representation and uses both network structures and user-specific features in convolutional neural and attention networks. Based on DeepInf, the study mentioned in reference <xref ref-type="bibr" rid="B15">15</xref> applies the multi-view impact prediction network to solve the social influence prediction problem. However, these methods are based on assumptions that users are only affected by other users, in order to model the relationship between users (homogeneous network), which lacks the analysis of the influence between heterogeneous nodes. Real social networks (such as Twitter, Digg, and Citation network) are heterogeneous and contain different types of entities [<xref ref-type="bibr" rid="B16">16</xref>], for example, user nodes and event nodes (stories, tweets, papers, and other objects), which inevitably interact with each other. For example, in <xref ref-type="fig" rid="F1">Figure&#x20;1A</xref>, Bob may forward the concert event because he is affected by the user Jerry (because he is not interested in music), and Tom may forward the concert event because he is affected by the event (he likes music).</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Problem illustration of mining user&#x2013;event influence in heterogeneous networks and predicting user behaviors. <xref ref-type="fig" rid="F1">Figure&#x20;1A</xref> shows an instantiated prediction case, and the goal is to predict whether Smith (ego-user) will forward the concert blog (whether the red line will occur in the future). <xref ref-type="fig" rid="F1">Figure&#x20;1B</xref> is obtained by abstracting <xref ref-type="fig" rid="F1">Figure&#x20;1A</xref>, given 1) the relationship between the user and the event within the observable time (the connection in b includes three relationships: u-u, u-e, and e-e); 2) the embedding representation of the attributes of different nodes in the observable time (the rectangle next to the node in b); 3) the activation state of neighbor nodes (active or inactive); and 4) the embedding representation of each node in the network, then we predicted whether ego-user will participate in the target event.</p>
</caption>
<graphic xlink:href="fphy-09-787185-g001.tif"/>
</fig>
<p>To tackle these challenges, we focused on user-level behavior prediction in a heterogeneous network. This network contains two types of nodes: user and event. It aims to construct a heterogeneous influence network of event nodes and user nodes based on attributes, and we use graph neural networks to model the influence between these nodes so as to better mine the inducement of user behavior. Inspired by the latest research on heterogeneous neural network [<xref ref-type="bibr" rid="B13">13</xref>], the local modeling of heterogeneous networks can capture both structure and content heterogeneity and provide more reliable heterogeneous node representation ability for downstream tasks. Therefore, we combine the benefits of heterogeneous neural networks and semantic representation methods to model the influence network of local neighbor nodes based on heterogeneous network graph. For example, in <xref ref-type="fig" rid="F1">Figure&#x20;1A</xref>, with the aim of learning the influence of different types of nodes on him through historical semantic information and influence relationship, we input the heterogeneous neighbor local graph network with Smith as the ego-user, so as to predict his future behavior (whether to participate in the discussion of concert events).</p>
<p>Specifically, we proposed HetInf, a heterogeneous network influence prediction model based on two types of nodes. First, based on the influence relationship of network nodes, we constructed a heterogeneous relationship graph composed of them and hoped to build a more accurate influence model. Second, we sampled ego-user neighbor subgraphs. Specifically, an innovative heterogeneous network sampling strategy, based on restart random walk (RWR) [<xref ref-type="bibr" rid="B17">17</xref>], is used to sample the topology features and the semantic features (including event topics and user interests) of the heterogeneous nodes in ego-user neighbor subgraphs. Subsequently, an end-to-end heterogeneous neural network influence model is built, the historical topic features of events and the historical interest features of users based on semantics is embedded using Word2Vec [<xref ref-type="bibr" rid="B18">18</xref>], the node representation through the node semantics is aggregated, and the heterogeneous graph neural network model is used to learn the node relationship of event&#x2013;user heterogeneous network. Finally, we learned the influence of different neighbor nodes on ego-user node through the graph attention networks [<xref ref-type="bibr" rid="B19">19</xref>], so as to predict whether users will have the social behavior of participating in events in the future.</p>
<p>Summarizing, our contributions are given as follows:<list list-type="simple">
<list-item>
<p>(1) We applied the heterogeneous graph neural network method to predict the influence of users at the micro-level in social networks. Specifically, we extend the deep learning method of homogeneity social influence networks and analyze the dynamic propagation mode of heterogeneous networks to infer more accurate influence network.</p>
</list-item>
<list-item>
<p>(2) As respect to heterogeneous networks, we design a local sampling method in line with time sequence process, established the influence relationship between events and users, and applied an innovative end-to-end heterogeneous graph neural network model to more accurately predict users&#x2019; social behavior.</p>
</list-item>
<list-item>
<p>(3) Therefore, we tested two real large-scale social network data: Digg and Weibo. The experimental results show that HetInf exhibit significant improved accuracy when constructing a heterogeneous network compared with several state-of-the-art baselines.</p>
</list-item>
</list>
</p>
<p>The rest of this article is organized as follows: <xref ref-type="sec" rid="s2">Section 2</xref> formulates social influence locality problem. <xref ref-type="sec" rid="s3">Section 3</xref> introduces the proposed framework in detail. <xref ref-type="sec" rid="s4">Section 4</xref> describes extensive experiments with two datasets, <xref ref-type="sec" rid="s5">Section 5</xref> summarizes related study, and <xref ref-type="sec" rid="s6">Section 6</xref> concludes this&#x20;study.</p>
</sec>
<sec id="s2">
<title>2 Problem Formulation</title>
<p>In this section, we introduced several related definition and then formally formulated the problem of heterogeneous social network influence locality.</p>
<sec id="s2-1">
<title>2.1 Definition 1: R-Neighbors and r-Heterogeneous Neighbor Subgraph</title>
<p>A heterogeneous network <inline-formula id="inf1">
<mml:math id="m1">
<mml:mi mathvariant="script">G</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>V</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>E</mml:mi>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>O</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>V</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>E</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>,</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>V</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> with two types of nodes V, three types of links E, and node attributes <italic>A</italic>
<sub>
<italic>V</italic>
</sub> is defined. In <xref ref-type="fig" rid="F1">Figure&#x20;1B</xref>, <italic>O</italic>
<sub>
<italic>V</italic>
</sub> includes user node U (round), event nodes E (square), and <italic>V</italic>&#x20;&#x3d; <italic>U</italic>&#x20;&#x222a; <italic>E</italic>; <italic>R</italic>
<sub>
<italic>E</italic>
</sub> includes user&#x2013;event relation <italic>R</italic>
<sub>
<italic>ue</italic>
</sub>, event&#x2013;event relation <italic>R</italic>
<sub>
<italic>ee</italic>
</sub>, user&#x2013;user relation <italic>R</italic>
<sub>
<italic>ue</italic>
</sub>, and <italic>E</italic>&#x20;&#x3d; <italic>R</italic>
<sub>
<italic>ue</italic>
</sub> &#x222a; <italic>R</italic>
<sub>
<italic>ee</italic>
</sub> &#x222a; <italic>R</italic>
<sub>
<italic>uu</italic>
</sub>. <italic>A</italic>
<sub>
<italic>V</italic>
</sub> is the attribute feature of the node, including the semantic attribute <inline-formula id="inf2">
<mml:math id="m2">
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>(rectangle) and topology attribute <inline-formula id="inf3">
<mml:math id="m3">
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>V</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>. For user <italic>u</italic>, its r-neighbors are defined to be <inline-formula id="inf4">
<mml:math id="m4">
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">&#x393;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mrow>
<mml:mo stretchy="false">{</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mo>:</mml:mo>
<mml:mi>d</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mo>,</mml:mo>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3c;</mml:mo>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>r</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">}</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>, where <italic>d</italic> (<italic>v</italic>, <italic>v</italic>
<sub>
<italic>u</italic>
</sub>) is the distance (number of hops) from <italic>v</italic> to <italic>u</italic>, and <italic>v</italic> is different types of nodes in heterogeneous subgraph <italic>G</italic>. The r-ego heterogeneous subgraph of user <italic>u</italic> is the local heterogeneous subgraph induced by <inline-formula id="inf5">
<mml:math id="m5">
<mml:msubsup>
<mml:mrow>
<mml:mi mathvariant="normal">&#x393;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> and denoted by&#x20;<inline-formula id="inf6">
<mml:math id="m6">
<mml:msubsup>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>.</p>
</sec>
<sec id="s2-2">
<title>2.2 Definition 2: Social Action</title>
<p>Social action refers to the behavior of users in the social network events, such as social network users retweet tweets (events) or publish papers (events). Formally, a social action can be regarded as the action of user <italic>u</italic> on event <italic>e</italic> at time <italic>t</italic> in the heterogeneous graph <italic>G</italic>
<sub>
<italic>u</italic>
</sub>. We define social action as a binary problem. Action status <inline-formula id="inf7">
<mml:math id="m7">
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> belongs to (0,1). When <inline-formula id="inf8">
<mml:math id="m8">
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:math>
</inline-formula>, it means that user <italic>u</italic> has social action for an event <italic>e</italic> after time <italic>t</italic>; when <inline-formula id="inf9">
<mml:math id="m9">
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>0</mml:mn>
</mml:math>
</inline-formula>, it means that no social action has occurred.</p>
</sec>
<sec id="s2-3">
<title>2.3 Problem 1: Heterogeneous Social Network Influence Locality</title>
<p>Social influence locality models the probability of social action when ego-user <italic>u</italic>
<sub>
<italic>i</italic>
</sub> is influenced by neighbor nodes on his r-ego network <inline-formula id="inf10">
<mml:math id="m10">
<mml:msubsup>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>; formally, given a 6-tuple {<italic>u</italic>, <italic>e</italic>, <italic>t</italic>, <italic>G</italic>, <italic>A</italic>, <italic>S</italic>}, social influence locality aims to quantify the activation probability of user <italic>u</italic>
<sub>
<italic>i</italic>
</sub>&#x2019;s social action in response to event e after a given time <italic>t</italic> in <italic>G</italic> with attribute feature <italic>A</italic> and action status <italic>S</italic> as follows:<disp-formula id="e1">
<mml:math id="m11">
<mml:mi>p</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>b</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo stretchy="false">&#x7c;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>,</mml:mo>
</mml:math>
<label>(1)</label>
</disp-formula>where <italic>u</italic>
<sub>
<italic>i</italic>
</sub> represents the ego-user, <inline-formula id="inf11">
<mml:math id="m12">
<mml:msubsup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> represents the event <italic>e</italic>
<sub>
<italic>j</italic>
</sub> at time <italic>t</italic>, <inline-formula id="inf12">
<mml:math id="m13">
<mml:msubsup>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> represents r-heterogeneous neighbor subgraph, <inline-formula id="inf13">
<mml:math id="m14">
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> represents the node attributes by time <italic>t</italic>, including semantic attribute <italic>A</italic>
<sup>
<italic>S</italic>
</sup>, topology attribute <italic>A</italic>
<sup>
<italic>T</italic>
</sup>, and <inline-formula id="inf14">
<mml:math id="m15">
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> represents the action state of the subgraph node before timestamp&#x20;<italic>t</italic>.</p>
<p>After determining the problem, we sample <italic>N</italic> 6-tuple samples through preprocessing data. Similar to the definition of DeepInf [<xref ref-type="bibr" rid="B14">14</xref>], we regard social influence locality as a binary classification problem and calculate the model parameters by minimizing the negative log likelihood objective method. We use the following objective with parameters <italic>&#x3b8;</italic>:<disp-formula id="e2">
<mml:math id="m16">
<mml:mi>L</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mo>&#x2212;</mml:mo>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>N</mml:mi>
</mml:mrow>
</mml:munderover>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo stretchy="false">&#x7c;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>e</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>j</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mi>r</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>.</mml:mo>
</mml:math>
<label>(2)</label>
</disp-formula>
</p>
<p>Similarly, we assume that time &#x394;<italic>t</italic> is infinite, that is, we only predict user action outside the observation time window.</p>
</sec>
</sec>
<sec id="s3">
<title>3 Model Framework</title>
<p>Our goal is to design a heterogeneous graph network model based on the interaction between events and users, named HetInf, which aims to learn the dynamic preferences of individuals and the influence of heterogeneous neighbors in detail. Building the HetInf model needs three steps: 1) constructing heterogeneous relational networks with attributes; 2) sampling r-heterogeneous neighbor subgraph; and 3) building heterogeneous graph neural network model. <xref ref-type="fig" rid="F2">Figure&#x20;2</xref> shows the framework of HetInf.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Model framework of HetInf: It first <bold>(A)</bold> constructs heterogeneous relational networks with attributes), next <bold>(B)</bold> samples fix sized heterogeneous neighbors for each node, then aggregates the attribute features and topological features of the sampled heterogeneous neighbors <italic>via</italic> (C-1) and (C-2), and finally optimizes the model <italic>via</italic> (C-3). NN-1: Bi-LSTM, used to aggregate semantic features based on time series; NN-2: GCN network, used to aggregate heterogeneous node attributes; and NN-3: GAT network, used to learn the weights of interaction between&#x20;nodes.</p>
</caption>
<graphic xlink:href="fphy-09-787185-g002.tif"/>
</fig>
<sec id="s3-1">
<title>3.1 Construct Heterogeneous Graph</title>
<sec id="s3-1-1">
<title>3.1.1 Constructing Heterogeneous Relational Networks</title>
<p>Considering the heterogeneous network based on influence prediction user action, an intuitive way is to construct heterogeneous influence graph [<xref ref-type="bibr" rid="B20">20</xref>] to obtain the influence between nodes from the heterogeneous graph. We first obtain two types of nodes, including users and events. The event node can be regarded as a specific network event, such as a hashtag in social network dataset or a &#x201c;story&#x201d; in Digg dataset.</p>
<p>Then we establish the relationship between nodes according to the data characteristics, including the relationship between user&#x2013;user, user&#x2013;event, and event&#x2013;event. Specifically, we use follow relationship and interact relationship as the user&#x2013;user relationship; the user&#x2013;event relationship between users and events can be determined by the user&#x2019;s historical behavior, for example, the user has participated in the event; and the event&#x2013;event relationship can use co-occurrence association [<xref ref-type="bibr" rid="B21">21</xref>] or semantic association [<xref ref-type="bibr" rid="B22">22</xref>]. In this way, we construct the global heterogeneous relationship network <inline-formula id="inf15">
<mml:math id="m17">
<mml:mi mathvariant="script">G</mml:mi>
</mml:math>
</inline-formula> (Definition 2.1), as shown in <xref ref-type="fig" rid="F2">Figure&#x20;2A</xref>.</p>
</sec>
<sec id="s3-1-2">
<title>3.1.2 Extract Node Attributes</title>
<p>For different types of nodes, we select two features as the initial node representation of the heterogeneous relationship network:</p>
<p>Semantic features: Since the user&#x2019;s behavior is more influenced by the semantic information of the social event [<xref ref-type="bibr" rid="B9">9</xref>], the semantic feature <inline-formula id="inf16">
<mml:math id="m18">
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> of each node was used to indicate the &#x201c;bias&#x201d; in the semantic level. For the event nodes, TF-IDF [<xref ref-type="bibr" rid="B23">23</xref>] was used to sample <italic>K</italic> keywords <italic>Ew</italic>
<sub>
<italic>i</italic>
</sub>, <italic>i</italic>&#x20;&#x2208; <italic>K</italic> in each <italic>e</italic> and to distinguish between different events. For the user nodes, the stopwords (non-meaningful) was first removed, then the most frequent <italic>I</italic> keywords <italic>Uw</italic>
<sub>
<italic>i</italic>
</sub> (<italic>i</italic>&#x20;&#x2208; <italic>I</italic>) of a user was sampled to represent the user&#x2019;s interest, and finally, these keywords with timestamp was sorted to represent the semantic evolution of nodes over time. Formalization is given as follows:<disp-formula id="e3">
<mml:math id="m19">
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x3d;</mml:mo>
<mml:mfenced open="{" close="">
<mml:mrow>
<mml:mtable class="cases">
<mml:mtr>
<mml:mtd columnalign="left">
<mml:mi>S</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>t</mml:mi>
<mml:mtext>_</mml:mtext>
<mml:mi>b</mml:mi>
<mml:mi>y</mml:mi>
<mml:mtext>_</mml:mtext>
<mml:mi>t</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>E</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd columnalign="left">
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">f</mml:mi>
<mml:mi>v</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>E</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="left">
<mml:mi>S</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>r</mml:mi>
<mml:mi>t</mml:mi>
<mml:mtext>_</mml:mtext>
<mml:mi>b</mml:mi>
<mml:mi>y</mml:mi>
<mml:mtext>_</mml:mtext>
<mml:mi>t</mml:mi>
<mml:mi>i</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>e</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>U</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>w</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mspace width="1em"/>
</mml:mtd>
<mml:mtd columnalign="left">
<mml:mi mathvariant="normal">i</mml:mi>
<mml:mi mathvariant="normal">f</mml:mi>
<mml:mi>v</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:mi>U</mml:mi>
<mml:mo>.</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:mrow>
</mml:mfenced>
</mml:math>
<label>(3)</label>
</disp-formula>
</p>
<p>Topology features: In addition, inspired by the study in reference <xref ref-type="bibr" rid="B14">14</xref>, the DeepWalk [<xref ref-type="bibr" rid="B24">24</xref>] algorithm was used to obtain the topology feature <inline-formula id="inf17">
<mml:math id="m20">
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>,</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo>&#x2208;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi>R</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>d</mml:mi>
</mml:mrow>
</mml:msup>
</mml:math>
</inline-formula>, where <italic>d</italic> represents the embedding dimension, <italic>v</italic>&#x20;&#x2208;&#x20;<italic>V</italic>.</p>
</sec>
</sec>
<sec id="s3-2">
<title>3.2 Sample r-Heterogeneous Neighbor Subgraph</title>
<p>Generally, the graph neural network uses the feature information of the node&#x2019;s n-order neighbor (e.g., first older) for the aggregation process of node features, such as GAT [<xref ref-type="bibr" rid="B19">19</xref>]. Therefore, the breadth first search (BFS) strategy [<xref ref-type="bibr" rid="B25">25</xref>] used in the graph localization process will make the weight between users and events too large in hot events. For example, in popular events, due to a large number of related events, most of the neighbor nodes sampled by a user are event type nodes, which will lead to the event type nodes becoming the dominant influence, and ignore the influence between users. In order to solve this problem, an improved random walk strategy was used to comply with the law of information dissemination. The sampling strategy includes two steps:<list list-type="simple">
<list-item>
<p>1. Sample a fixed-length random walk sequence. We took each user <italic>u</italic>&#x20;&#x2208; <italic>U</italic> as the starting point, utilizing the RWR [<xref ref-type="bibr" rid="B17">17</xref>] method to sample a fixed number of <italic>N</italic>
<sub>
<italic>r</italic>
</sub> neighbor&#x20;nodes.</p>
</list-item>
<list-item>
<p>2. Use the meta-path method to perform sub-sampling in the sequence of step 1. We used random walk probability and u-e-u (user publishes an event, which is then forwarded by other users) and U-U (users directly forward through other users) meta-paths, sampled neighbor nodes with a fixed length of <italic>N</italic>&#x20;&#x3c; <italic>N</italic>
<sub>
<italic>R</italic>
</sub>, and then used these neighbor nodes to construct a subgraph&#x20;<italic>G</italic>
<sub>
<italic>u</italic>
</sub>.</p>
</list-item>
</list>
</p>
<p>This strategy satisfies the law of information dissemination and helps avoid the problem of too little sampling of some types of nodes. Similar to the definition in DeepInf [<xref ref-type="bibr" rid="B14">14</xref>], a positive instance of a local heterogeneous subgraph was generated if a user has social action with an event after the timestamp <italic>t</italic>, and a negative instance was generated if the user is not observed in the watch window to be associated with the&#x20;event.</p>
</sec>
<sec id="s3-3">
<title>3.3 Build Heterogeneous Graph Neural Network Model</title>
<p>In this way, 6-tuple (2.3) was used as a set of examples and a deep learning model was designed to predict the action state <inline-formula id="inf18">
<mml:math id="m21">
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
<mml:mo>&#x2b;</mml:mo>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> of the ego-user. Our neural network model consists of three parts: a semantic feature aggregation module (shown in <xref ref-type="fig" rid="F2">Figure 2c-1</xref>), topological feature aggregation module (shown in <xref ref-type="fig" rid="F2">Figure 2c-2</xref>), and heterogeneous multi-attribute hidden layer aggregation module (shown in <xref ref-type="fig" rid="F2">Figure 2c-3</xref>). In this section, the different modules are introduced to express the process of model building.</p>
<sec id="s3-3-1">
<title>3.3.1 Semantic Feature Aggregation Module</title>
<p>A neural network module was designed to learn the deep association between user and event semantics. The module uses the node semantic attribute <inline-formula id="inf19">
<mml:math id="m22">
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> (obtained in 3.1.2) and the local heterogeneous subgraph <italic>G</italic>
<sub>
<italic>u</italic>
</sub> as input and realizes the aggregation function of semantic features through a neural network <inline-formula id="inf20">
<mml:math id="m23">
<mml:msubsup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>. Specifically, we denoted the semantic feature as <inline-formula id="inf21">
<mml:math id="m24">
<mml:msubsup>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>, indicating the <italic>i</italic>th semantic feature of node <italic>v</italic>, and utilized Word2Vec [<xref ref-type="bibr" rid="B18">18</xref>] to pre-train <inline-formula id="inf22">
<mml:math id="m25">
<mml:msubsup>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> as <italic>x</italic>
<sub>
<italic>i</italic>
</sub>. Inspired by the study in reference <xref ref-type="bibr" rid="B13">13</xref>, the neural network structure of Bi-LSTM was used to capture the association between semantic features of nodes at a deeper level, and the average value of all hidden states was used to represent the general aggregation embedding as follows:<disp-formula id="e4">
<mml:math id="m26">
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mo movablelimits="false" form="prefix">&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mrow>
</mml:msub>
<mml:mfenced open="[" close="]">
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mi>S</mml:mi>
<mml:mi>T</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>&#x20d7;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
<mml:mo stretchy="false">&#x2016;</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mi>S</mml:mi>
<mml:mi>T</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>M</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>&#x3b8;</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>&#x2190;</mml:mo>
</mml:mover>
</mml:mrow>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>x</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:mfenced open="|" close="|">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfrac>
<mml:mo>,</mml:mo>
</mml:math>
<label>(4)</label>
</disp-formula>where <italic>H</italic>
<sub>
<italic>S</italic>
</sub>(<italic>v</italic>) &#x2208; <italic>R</italic>
<sup>
<italic>d</italic>&#xd7;1</sup> (semantic feature embedding dimension), <italic>v</italic> represents a node in subgraph <italic>G</italic>
<sub>
<italic>u</italic>
</sub>, and <italic>x</italic>
<sub>
<italic>i</italic>
</sub> represents the <italic>i</italic>th feature word of node <italic>v</italic> (refer to [<xref ref-type="bibr" rid="B13">13</xref>]). <inline-formula id="inf23">
<mml:math id="m27">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mi>S</mml:mi>
<mml:mi>T</mml:mi>
<mml:mi>M</mml:mi>
</mml:mrow>
<mml:mo>&#x20d7;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> represents the forward LSTM network, <inline-formula id="inf24">
<mml:math id="m28">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>L</mml:mi>
<mml:mi>S</mml:mi>
<mml:mi>T</mml:mi>
<mml:mi>M</mml:mi>
</mml:mrow>
<mml:mo>&#x20d6;</mml:mo>
</mml:mover>
</mml:mrow>
</mml:math>
</inline-formula> represents the backward LSTM network, <italic>&#x3b8;</italic> represents the neural network parameter, and operator &#x2016; represents concatenation. Bi-LSTM can learn the potential evolution process of node semantics, leading to a strong expression capability&#x20;[<xref ref-type="bibr" rid="B26">26</xref>].</p>
<p>Then the GCN [<xref ref-type="bibr" rid="B27">27</xref>] framework was used to aggregate semantic nodes of <italic>H</italic>
<sub>
<italic>S</italic>
</sub>(<italic>5</italic>) to learn the influence relationship between different nodes, which is formally expressed as follows:<disp-formula id="e5">
<mml:math id="m29">
<mml:mtable class="aligned">
<mml:mtr>
<mml:mtd columnalign="right"/>
<mml:mtd columnalign="left">
<mml:msubsup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>G</mml:mi>
<mml:mi>C</mml:mi>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="right"/>
<mml:mtd columnalign="left">
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mi>A</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>,</mml:mo>
</mml:math>
<label>(5)</label>
</disp-formula>where <italic>W</italic>&#x20;&#x2208; <italic>R</italic>
<sup>
<italic>d</italic>&#xd7;<italic>d</italic>
</sup>, <italic>b</italic>&#x20;&#x2208; <italic>R</italic>
<sup>
<italic>d</italic>
</sup> are model parameters, <italic>g</italic> is a non-linear activation function, <italic>A</italic> is the adjacency matrix of <italic>G</italic>(<italic>u</italic>), and <italic>D</italic> represents <italic>diag</italic>(<italic>A</italic>). Since the number of subgraph nodes is fixed, <italic>A</italic> (<italic>G</italic>
<sub>
<italic>u</italic>
</sub>) can be calculated efficiently.</p>
</sec>
<sec id="s3-3-2">
<title>3.3.2 Topological Feature Aggregation Module</title>
<p>A topological feature can represent the importance of nodes in the network [<xref ref-type="bibr" rid="B28">28</xref>]. To aggregate topological feature embeddings of heterogeneous neighbors for each node, a layer of the GCN model was used for feature aggregation; in particular, the input vector consists of topological features <inline-formula id="inf25">
<mml:math id="m30">
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula> and node state <inline-formula id="inf26">
<mml:math id="m31">
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:math>
</inline-formula>, which is inspired by the study in reference <xref ref-type="bibr" rid="B14">14</xref>. Then the concatenated vector into the GCN layer was input to generate the node topological features hidden layer vector <inline-formula id="inf27">
<mml:math id="m32">
<mml:msubsup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>. The formalization is given as follows:<disp-formula id="e6">
<mml:math id="m33">
<mml:mtable class="aligned">
<mml:mtr>
<mml:mtd columnalign="right"/>
<mml:mtd columnalign="left">
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>A</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:mo stretchy="false">&#x2016;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>t</mml:mi>
</mml:mrow>
</mml:msubsup>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="right"/>
<mml:mtd columnalign="left">
<mml:msubsup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>G</mml:mi>
<mml:mi>C</mml:mi>
<mml:mi>N</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>g</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mo>&#x2b;</mml:mo>
<mml:mi>b</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="right"/>
<mml:mtd columnalign="left">
<mml:mi>A</mml:mi>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
<mml:mi>A</mml:mi>
<mml:msup>
<mml:mrow>
<mml:mi>D</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2212;</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>/</mml:mo>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:mtd>
</mml:mtr>
</mml:mtable>
<mml:mo>.</mml:mo>
</mml:math>
<label>(6)</label>
</disp-formula>
</p>
<p>
<xref ref-type="disp-formula" rid="e6">Eq. 6</xref> is the same as <xref ref-type="disp-formula" rid="e5">Eq. 5</xref>, except for the input and model parameters. We used the GCN to aggregate the topological embeddedness of all heterogeneous neighbors. Obviously, GCN has excellent performance in relation aggregation capability&#x20;[<xref ref-type="bibr" rid="B27">27</xref>].</p>
</sec>
<sec id="s3-3-3">
<title>3.3.3 Heterogeneous Multi-Attribute Hidden Layer Aggregation Module</title>
<p>We can obtain the semantic feature embedding <inline-formula id="inf28">
<mml:math id="m34">
<mml:msubsup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> and structural feature embedding <inline-formula id="inf29">
<mml:math id="m35">
<mml:msubsup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> of each node in the heterogeneous subgraph <italic>G</italic>
<sub>
<italic>u</italic>
</sub>. To combine these features based on neural network module for each node <italic>v</italic> of subgraph <italic>G</italic>, the graph attention network [<xref ref-type="bibr" rid="B19">19</xref>] was employed. The advantage of this is that since different nodes have different influence contributions to the results, the multi-head GAT learns the influence between different attributes of heterogeneous&#x20;nodes.</p>
<p>First, we concatenated the hidden layer results of the previous steps, then, following the study in reference <xref ref-type="bibr" rid="B14">14</xref>, we used multi-head GAT and calculated the normalized attention coefficients <inline-formula id="inf30">
<mml:math id="m36">
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msup>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula> as follows:<disp-formula id="e7">
<mml:math id="m37">
<mml:mtable class="aligned">
<mml:mtr>
<mml:mtd columnalign="right"/>
<mml:mtd columnalign="left">
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo stretchy="false">&#x2016;</mml:mo>
<mml:msubsup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>S</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="right"/>
<mml:mtd columnalign="left">
<mml:msubsup>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mo>&#x2032;</mml:mo>
</mml:mrow>
</mml:msubsup>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo>&#x3d;</mml:mo>
<mml:mi>&#x3c3;</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mi>K</mml:mi>
</mml:mrow>
</mml:mfrac>
<mml:munderover accentunder="false" accent="false">
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
<mml:mo>&#x3d;</mml:mo>
<mml:mn>1</mml:mn>
</mml:mrow>
<mml:mrow>
<mml:mn>1</mml:mn>
</mml:mrow>
</mml:munderover>
<mml:munder>
<mml:mrow>
<mml:mo>&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:munder>
<mml:msubsup>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msubsup>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">W</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msup>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd columnalign="right"/>
<mml:mtd columnalign="left">
<mml:msub>
<mml:mrow>
<mml:mi>&#x3b1;</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
<mml:mi>v</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mi>exp</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mtext>LeakyReLU&#x2009;</mml:mtext>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">a</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mfenced open="[" close="]">
<mml:mrow>
<mml:mi mathvariant="bold">W</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo stretchy="false">&#x2016;</mml:mo>
<mml:mi mathvariant="bold">W</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
<mml:mrow>
<mml:munder>
<mml:mrow>
<mml:mo movablelimits="false" form="prefix">&#x2211;</mml:mo>
</mml:mrow>
<mml:mrow>
<mml:mi>v</mml:mi>
<mml:mo>&#x2208;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>G</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>u</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:munder>
<mml:mo>&#x2061;</mml:mo>
<mml:mi>exp</mml:mi>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:mtext>LeakyReLU&#x2009;</mml:mtext>
<mml:mfenced open="(" close=")">
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi mathvariant="bold">a</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>T</mml:mi>
</mml:mrow>
</mml:msup>
<mml:mfenced open="[" close="]">
<mml:mrow>
<mml:mi mathvariant="bold">W</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
<mml:mo stretchy="false">&#x2016;</mml:mo>
<mml:mi mathvariant="bold">W</mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>H</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mi>v</mml:mi>
</mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mfrac>
<mml:mo>,</mml:mo>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
<label>(7)</label>
</disp-formula>where <bold>a</bold> is the attention parameter, <bold>a</bold> &#x2208; <italic>R</italic>
<sup>2<italic>d</italic>
</sup>, <bold>W</bold> &#x2208; <italic>R</italic>
<sup>
<italic>d</italic>&#xd7;<italic>d</italic>
</sup> is model parameters, &#x22c5;<sup>
<italic>T</italic>
</sup> represents transposition, &#x2016; is the concatenation operation, <italic>&#x3b1;</italic>
<sub>
<italic>iv</italic>
</sub> indicates the importance of node <italic>i</italic> to node <italic>v</italic>, and <italic>K</italic> represents the number of&#x20;heads.</p>
</sec>
<sec id="s3-3-4">
<title>3.3.4 Output Layer and Loss Function</title>
<p>As shown in <xref ref-type="fig" rid="F2">Figure 2c-3</xref>, the full connection layer (FC layer [<xref ref-type="bibr" rid="B27">27</xref>]) was used to output the two-dimensional representation of each node, then the current ego-user result was taken out, the ground truth was compared with, and <xref ref-type="disp-formula" rid="e2">formula 2</xref> was optimized as the loss function used in our&#x20;study.</p>
</sec>
</sec>
</sec>
<sec id="s4">
<title>4 Experiments</title>
<p>In this section, extensive experiments were conducted with the aim of answering the following research questions:<list list-type="simple">
<list-item>
<p>&#x2022; (RQ1) How does HetInf perform vs state-of-the-art baselines for influence prediction tasks?</p>
</list-item>
<list-item>
<p>&#x2022; (RQ2) How do different components, for example, heterogeneous multi-attribute hidden layer aggregation module or semantic feature aggregation module, affect the model performance? How much performance gain is added to these modules?</p>
</list-item>
<list-item>
<p>&#x2022; (RQ3) How do various hyper-parameters, for example, embedding dimension of keywords or the size of sampled heterogeneous neighbors set, impact the model performance?</p>
</list-item>
</list>
</p>
<sec id="s4-1">
<title>4.1 Datasets</title>
<p>Following the previous studies [<xref ref-type="bibr" rid="B14">14</xref>], experiments on two public datasets were conducted to quantitatively evaluate our proposed model. The detailed statistics are presented in <xref ref-type="table" rid="T1">Table&#x20;1</xref>.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Statistics of the datasets.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Statistics</th>
<th align="center">Weibo</th>
<th align="center">Digg</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Num of users</td>
<td align="center">410764</td>
<td align="center">279631</td>
</tr>
<tr>
<td align="left">Num of events</td>
<td align="center">7124</td>
<td align="center">3155</td>
</tr>
<tr>
<td align="left">User&#x2013;user</td>
<td align="center">92819</td>
<td align="center">1731657</td>
</tr>
<tr>
<td align="left">User&#x2013;event</td>
<td align="center">1196489</td>
<td align="center">3018196</td>
</tr>
<tr>
<td align="left">Event&#x2013;event</td>
<td align="center">22518</td>
<td align="center">&#x2014;</td>
</tr>
<tr>
<td align="left">Num/dim of user interest topics</td>
<td align="center">20/32</td>
<td align="center">&#x2014;</td>
</tr>
<tr>
<td align="left">Num/dim of event historical topics</td>
<td align="center">20/32</td>
<td align="center">&#x2014;</td>
</tr>
<tr>
<td align="left">Dim of node topology embedding</td>
<td align="center">128</td>
<td align="center">128</td>
</tr>
<tr>
<td align="left">Num of samples</td>
<td align="center">52130</td>
<td align="center">34113</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>Digg [<xref ref-type="bibr" rid="B29">29</xref>]: The Digg dataset is a story collector, which contains the data of stories that were promoted to Digg home page within 1&#xa0;month in 2009. For each story, this dataset collects the voting lists of all users, the voters&#x2019; friendship links, and the timestamp of each vote. This dataset comes from the study in reference <xref ref-type="bibr" rid="B30">30</xref>. In our experiment, we took the story as the &#x201c;event&#x201d; node and &#x201c;voting&#x201d; as the user action to build a heterogeneity graph. Due to the lack of text data in the dataset, the deep framework (<xref ref-type="fig" rid="F2">Figure 2c-2</xref>) of semantic information was not used in this dataset.</p>
<p>Weibo [<xref ref-type="bibr" rid="B31">31</xref>]: Weibo is the most popular social networking platform in China. This dataset contains 3,000,000 original tweets and retweets and comments of the original tweets from September 28, 2012 to October 29, 2012. At the same time, the dataset&#x20;also contains the follow relationship between users who participate in these tweets. The dataset comes from in the study in reference <xref ref-type="bibr" rid="B32">32</xref>. In our experiment, we extracted hashtags as events and built the heterogeneous graph of users and events, and the behavior of users participating in events (comment or retweet) is regarded as user action.</p>
</sec>
<sec id="s4-2">
<title>4.2 Data Preparation</title>
<p>In view of the imbalance in the number of active neighbors proposed by DeepInf, we set a threshold <italic>n</italic>&#x20;&#x3e; 5 (n is the sum of the number of active users and active events).Therefore, less active observation samples are removed, and thus, the sample characteristics involved in training are significantly related to social influence [<xref ref-type="bibr" rid="B31">31</xref>]. In order to solve the problem of data skew, the down-sampling method was used to control the positive and negative ratio of samples at&#x20;1:3.</p>
<p>Compared with the previous study, the preparation has the following differences:<list list-type="simple">
<list-item>
<p>(1) For the choice of events, due to the shortcomings of the number of participants lack significance, we excluded some events with fewer participants and set the threshold of the number of participants to the top 30% of the distribution of the number of events so as to extract the total number of events in <xref ref-type="table" rid="T1">Table&#x20;1</xref>.</p>
</list-item>
<list-item>
<p>(2) In the Weibo dataset, we established the event&#x2013;event relationship (edge) through the semantic correlation of events. Specifically, the historical text of each event was collected, the tweets text collection was sampled in the time window t, and then the semantic vectorization representation of the event was obtained by the par2vec [<xref ref-type="bibr" rid="B33">33</xref>] method. Then we calculated the cosine similarity between events; if the correlation threshold <italic>n</italic>&#x20;&#x3e; 0.7, the two events are semantically strongly correlated, and then we established the relation (edge) between events.</p>
</list-item>
<list-item>
<p>(3) For the extraction of node semantic features (<xref ref-type="fig" rid="F2">Figure 2c-2</xref>), we fixed the number of keywords of each node (for example, <italic>n</italic>&#x20;&#x3d; 20). If the keyword samples of some nodes are insufficient (<italic>n</italic>&#x20;&#x3c; 20), we filled zeros to complete the vector to ensure the consistency of the input of the neural network.</p>
</list-item>
</list>
</p>
</sec>
<sec id="s4-3">
<title>4.3 Baselines</title>
<p>Support Vector Machine (SVM) [<xref ref-type="bibr" rid="B34">34</xref>]: A support vector machine (SVM) with linear kernel was used as the classification model. Specifically, the splicing of three features (including semantic features, topology features, and action status) was used as the input vector, and the problem was defined as two classification method.</p>
<p>DeepInf [<xref ref-type="bibr" rid="B14">14</xref>]: Our framework was compared with the influence network model based on the graph neural network, which constructs homogeneity subgraph based on user relationship and predicts user node action in the future.</p>
<p>MvInf [<xref ref-type="bibr" rid="B15">15</xref>]: Our framework was compared with the state-of-the-art graph neural network model MvInf, which introduces a multi-view structure based on DeepInf and uses the complementarity and consistency between different views to enhance learning performance. The difference is that our proposed model is based on the common influence of events and&#x20;users.</p>
<p>HetInf and Its Variants: In the heterogeneous multi-attribute hidden layer aggregation module, different graph neural network frameworks were used to distinguish the two methods: HetInf&#x2013;GCN and HetInf&#x2013;GAT. Separately, HetInf&#x2013;GAT uses the GAT [<xref ref-type="bibr" rid="B19">19</xref>] method as a method to fuse node features, mainly using attention mechanism to obtain the importance between nodes, while the HetInf&#x2013;GCN method uses the GCN [<xref ref-type="bibr" rid="B27">27</xref>] framework to aggregate the node features and calculate the node influence by learning the node relationship of the subgraph.</p>
</sec>
<sec id="s4-4">
<title>4.4&#x20;Hyper-Parameter</title>
<p>In our proposed method, we used DeepWalk [<xref ref-type="bibr" rid="B24">24</xref>] to embed the node topology features; the restart probability of this method is 0.8, and the output vector length is 64 dimensions. We used the ReLU [<xref ref-type="bibr" rid="B35">35</xref>] as the activation function sigma (<xref ref-type="disp-formula" rid="e5">Eq. 5</xref>, <xref ref-type="disp-formula" rid="e6">6</xref>) and used the Adam [<xref ref-type="bibr" rid="B36">36</xref>] optimizer for training, with a learning rate of 0.005, and we set dropout &#x3d; 0.5. We used 50, 25, and 25% of the instances for training, validation, and testing respectively; the batch size of all datasets was set to 256. In order to accommodate more nodes, we set the total number of nodes in the subgraph to 100 (including two different types of nodes).</p>
</sec>
<sec id="s4-5">
<title>4.5 Result and Analysis</title>
<sec id="s4-5-1">
<title>4.5.1 (RQ1) Performance Analysis</title>
<p>How does HetInf perform vs state-of-the-art baselines of influence prediction methods? Will users take actions on the events in the future? What are the advantages of the proposed model compared with baseline? In order to answer question RQ1, we applied four indicators to compare with the previous state-of-the-art model (the same evaluation metrics as MvInf).</p>
<p>It should be noted that there are the following differences from the baseline method: in the semantic feature aggregation module, we used Word2Vec to embed each feature word into a vector with a dimension of 32. Specifically, the number of keywords for each node is 20. The output dimension of Bi-LSTM hidden layer is 128 and was used as GCN input (as shown in <xref ref-type="fig" rid="F2">Figure 2c-2</xref>). The final output dimension of the GCN module is 128. In the topological feature aggregation module, the output dimension of DeepWalk is 128 and the state feature dimension was 2 (including action state and ego-state), so the GCN&#x2019;s input dimension is 130, and the output dimension of this module is 128 (similar to the DeepInf method). In the multi-attribute hidden layer aggregation module, for HetInf&#x2013;GCN, we used two layers of GCN as the aggregation function of the module, in which the input layer of the first layer dimension is 256 and the output dimension of the second layer is 128. For HetInf&#x2013;GAT, we used the GAT method, the input dimension is 256 and the output dimension is 128. Performance report of all models in <xref ref-type="table" rid="T2">Table&#x20;2</xref> and <xref ref-type="table" rid="T3">Table&#x20;3</xref> in which the best results are highlighted in bold.<list list-type="simple">
<list-item>
<p>(1) It can be seen from the results that in most cases, our proposed model is better than the baseline, especially in the accuracy and F1 value of microblog dataset (F1: 17.9%, Prec.: 35.7%), which proves that we have obtained the gain of accuracy after introducing heterogeneous networks and establishing event influence relations and verified the effectiveness of the proposed framework.</p>
</list-item>
<list-item>
<p>(2) From the results of the Digg dataset, it can be seen that the heterogeneous graph neural network model with two types of nodes can also bring performance gain (F1: 0.6%, Prec.: 0.3%) (lack of semantic information of heterogeneous nodes), which proves that our proposed model can improve the prediction ability of user behavior only through heterogeneous social networks.</p>
</list-item>
</list>
</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>Prediction performance of different methods on the two datasets (%).</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Dataset</th>
<th align="center">Model</th>
<th align="center">AUC</th>
<th align="center">Prec</th>
<th align="center">Rec</th>
<th align="center">F1</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Weibo</td>
<td align="left">SVM</td>
<td align="char" char=".">77.11</td>
<td align="char" char=".">43.27</td>
<td align="char" char=".">70.79</td>
<td align="char" char=".">53.71</td>
</tr>
<tr>
<td align="left"/>
<td align="left">DeepInf</td>
<td align="char" char=".">82.75</td>
<td align="char" char=".">48.86</td>
<td align="char" char=".">74.13</td>
<td align="char" char=".">58.90</td>
</tr>
<tr>
<td align="left"/>
<td align="left">MvInf</td>
<td align="char" char=".">83.75</td>
<td align="char" char=".">50.18</td>
<td align="char" char=".">75.02</td>
<td align="char" char=".">60.13</td>
</tr>
<tr>
<td align="left"/>
<td align="left">HetInf-GCN</td>
<td align="char" char=".">
<bold>85.10</bold>
</td>
<td align="char" char=".">67.88</td>
<td align="char" char=".">73.60</td>
<td align="char" char=".">70.62</td>
</tr>
<tr>
<td align="left"/>
<td align="left">HetInf-GAT</td>
<td align="char" char=".">85.02</td>
<td align="char" char=".">
<bold>68.12</bold>
</td>
<td align="char" char=".">
<bold>74.03</bold>
</td>
<td align="char" char=".">
<bold>70.95</bold>
</td>
</tr>
<tr>
<td align="left">Digg</td>
<td align="left">SVM</td>
<td align="char" char=".">90.65</td>
<td align="char" char=".">66.82</td>
<td align="char" char=".">78.49</td>
<td align="char" char=".">72.19</td>
</tr>
<tr>
<td align="left"/>
<td align="left">DeepInf</td>
<td align="char" char=".">88.97</td>
<td align="char" char=".">68.80</td>
<td align="char" char=".">73.49</td>
<td align="char" char=".">71.21</td>
</tr>
<tr>
<td align="left"/>
<td align="left">MlvInf</td>
<td align="char" char=".">91.11</td>
<td align="char" char=".">
<bold>70.35</bold>
</td>
<td align="char" char=".">78.50</td>
<td align="char" char=".">74.20</td>
</tr>
<tr>
<td align="left"/>
<td align="left">HetInf-GCN</td>
<td align="char" char=".">90.74</td>
<td align="char" char=".">69.94</td>
<td align="char" char=".">
<bold>79.75</bold>
</td>
<td align="char" char=".">74.52</td>
</tr>
<tr>
<td align="left"/>
<td align="left">HetInf-GAT</td>
<td align="char" char=".">
<bold>92.03</bold>
</td>
<td align="char" char=".">70.12</td>
<td align="char" char=".">77.63</td>
<td align="char" char=".">
<bold>74.68</bold>
</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Relative gain in terms of F1 and Prec. against the best baseline (%).</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">&#xa0;F1</th>
<th align="center">Weibo</th>
<th align="center">Digg</th>
<th align="center">Prec</th>
<th align="center">Weibo</th>
<th align="center">Digg</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">MvInf</td>
<td align="char" char=".">60.13</td>
<td align="char" char=".">74.20</td>
<td align="left">MvInf</td>
<td align="char" char=".">50.18</td>
<td align="char" char=".">70.35</td>
</tr>
<tr>
<td align="left">HetInf</td>
<td align="char" char=".">70.95</td>
<td align="char" char=".">74.68</td>
<td align="left">HetInf</td>
<td align="char" char=".">68.12</td>
<td align="char" char=".">70.12</td>
</tr>
<tr>
<td align="left">Relative gain</td>
<td align="char" char=".">17.9%</td>
<td align="char" char=".">0.6%</td>
<td align="left">Relative gain</td>
<td align="char" char=".">35.7%</td>
<td align="char" char=".">&#x2212;0.3%</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec id="s4-5-2">
<title>4.5.2 (RQ2) Ablation Analysis</title>
<p>HetInf is a deep learning model combining different modules, which calculates the influence between different nodes and predicts user behavior by aggregating the embedding of different types of node attributes. To answer RQ2, we used Auc and F1 indicators as the standard for evaluating results, we conducted ablation studies to evaluate performances of several model variants which include:</p>
<p>1) No-NN-1 that cancels the LSTM method (NN-1) and then concatenates the vectors to embed the representation of the semantic feature to verify the impact of the semantic feature aggregation module on the results; 2) Only-Topology that uses heterogeneous topology encoding (C-2) to represent each node embedding (cancel C-1 module); 3) Only-Semantic that uses heterogeneous topology encoding (C-1) to represent each node embedding (cancel C-2 module); and 4) No-NN-3 that utilizes a FC layer to combine embeddings of different neighbor representation (replace NN-3). It should be noted that the Digg dataset lacks semantic information, so we only tested the results of 4) to verify the effectiveness of heterogeneous multi-attribute hidden layer aggregation module. The results of predicted AUC and F1 values are shown in <xref ref-type="table" rid="T4">Table&#x20;4</xref> and <xref ref-type="fig" rid="F3">Figure&#x20;3</xref>.<list list-type="simple">
<list-item>
<p>(1) The performance of Only-Topology is better than that of Only-Semantic, indicating that the position of key nodes in the network is more influential (such as opinion leaders).</p>
</list-item>
<list-item>
<p>(2) The performance of Only-Semantic is better than that of No-lstm, indicating that Bi-LSTM&#x2013;based content encoding is superior to &#x201c;shallow&#x201d; encoding like FC for capturing &#x201c;deep&#x201d; content feature interactions.</p>
</list-item>
<list-item>
<p>(3) HetInf (including GCN and GAT) is better than No-NN-3 shows that graph convolution network plays a role in capturing node type influence.</p>
</list-item>
<list-item>
<p>(4) HetInf&#x2013;GAT is superior to HetInF&#x2013;GCN, indicating that graph attention network can obtain more influential potential relationships than GCN method.</p>
</list-item>
</list>
</p>
<table-wrap id="T4" position="float">
<label>TABLE 4</label>
<caption>
<p>Statistics of the datasets.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">Dataset</th>
<th align="center">Model</th>
<th align="center">F1</th>
<th align="center">AUC</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">Weibo</td>
<td align="left">Only-Topology</td>
<td align="char" char=".">62.20</td>
<td align="char" char=".">81.17</td>
</tr>
<tr>
<td align="left"/>
<td align="left">Only-Semantic</td>
<td align="char" char=".">57.89</td>
<td align="char" char=".">74.33</td>
</tr>
<tr>
<td align="left"/>
<td align="left">No-LSTM</td>
<td align="char" char=".">54.33</td>
<td align="char" char=".">72.65</td>
</tr>
<tr>
<td align="left"/>
<td align="left">No-NN-3</td>
<td align="char" char=".">64.48</td>
<td align="char" char=".">83.31</td>
</tr>
<tr>
<td align="left"/>
<td align="left">HetInf-GAT</td>
<td align="char" char=".">70.95</td>
<td align="char" char=".">85.10</td>
</tr>
<tr>
<td align="left"/>
<td align="left">HetInf-GCN</td>
<td align="char" char=".">70.62</td>
<td align="char" char=".">85.02</td>
</tr>
<tr>
<td align="left">Digg</td>
<td align="left">No-NN-3</td>
<td align="char" char=".">66.28</td>
<td align="char" char=".">85.32</td>
</tr>
<tr>
<td align="left"/>
<td align="left">HetInf-GAT</td>
<td align="char" char=".">74.71</td>
<td align="char" char=".">92.03</td>
</tr>
<tr>
<td align="left"/>
<td align="left">HetInf-GCN</td>
<td align="char" char=".">67.72</td>
<td align="char" char=".">90.74</td>
</tr>
</tbody>
</table>
</table-wrap>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Performances of variant proposed models <bold>(A)</bold> eibo dataset ablation analysis. <bold>(B)</bold> Digg dataset ablation analysis.</p>
</caption>
<graphic xlink:href="fphy-09-787185-g003.tif"/>
</fig>
</sec>
<sec id="s4-5-3">
<title>4.5.3 (RQ3) Hyper-Parameter Analysis</title>
<p>To answer question RQ3, we investigated how hyper-parameters affect the predictive performance of the model. We conducted a parameter analysis on the Weibo dataset and used F1 value as an evaluation indicator. Specifically, we tested the impact of three key parameters: 1) semantic attribute embedding dimension; 2) head for multi-head attention; and 3) number of keywords. The experimental results are shown in <xref ref-type="fig" rid="F4">Figure&#x20;4</xref>.<list list-type="simple">
<list-item>
<p>(1) Semantic Attribute Embedding Dimension: As shown in <xref ref-type="fig" rid="F4">Figure&#x20;4A</xref>, when the semantic attribute dimension d varies from 16 to 256, the overall evaluation indicator is increasing because more dimensions contain more information. However, when the dimension reaches 128, the performance begins to decline, which is likely due to the result of overfitting.</p>
</list-item>
<list-item>
<p>(2) Head for Multi-head Attention: Like DeepInf, we are concerned about the number of GAT heads in heterogeneous multi-attribute hidden layer. As shown in <xref ref-type="fig" rid="F4">Figure&#x20;4B</xref>, the increase of heads brings benefits to performance, but after more than 8, the performance remains stable but it has a negative impact on efficiency.</p>
</list-item>
<list-item>
<p>(3) Number of Keywords: The feature words of network nodes represent the semantic bias of nodes, which directly affect the prediction results. As shown in <xref ref-type="fig" rid="F4">Figure&#x20;4C</xref>, when the numbers changes, it means that the amount of semantic information of network nodes increases and the evaluation improves at the same time. However, when the number of keywords exceeds a certain value, it will bring down performance, which is likely due to the noise caused by sampling too many non-significant keywords. As can be seen from the figure, it is best to control the feature words between 20 and&#x20;40.</p>
</list-item>
</list>
</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Parameter analysis <bold>(A)</bold> Semantic attribute embedding. <bold>(B)</bold> Head for multi-head attention. <bold>(C)</bold> Number of node Keywords.</p>
</caption>
<graphic xlink:href="fphy-09-787185-g004.tif"/>
</fig>
</sec>
</sec>
</sec>
<sec id="s5">
<title>5 Related Work</title>
<sec id="s5-1">
<title>5.1 Social Influence Prediction</title>
<p>Social influence prediction is a fundamental problem in a social network analysis, which supports downstream tasks. At the micro-perspective [<xref ref-type="bibr" rid="B37">37</xref>,<xref ref-type="bibr" rid="B38">38</xref>], this problem is mainly modeled by analyzing user relationships. There are many different research directions, such as a user interaction influence analysis [<xref ref-type="bibr" rid="B39">39</xref>,<xref ref-type="bibr" rid="B40">40</xref>], network structure diversity analysis [<xref ref-type="bibr" rid="B41">41</xref>,<xref ref-type="bibr" rid="B42">42</xref>], topic influence analysis [<xref ref-type="bibr" rid="B10">10</xref>], and influence maximization [<xref ref-type="bibr" rid="B43">43</xref>]. Specifically, in the study in reference <xref ref-type="bibr" rid="B44">44</xref>, the existence of social influence was proved by quantitative analysis of mutual influence. The study in reference <xref ref-type="bibr" rid="B31">31</xref> proposes social local network concepts, using user interaction and network structure to predict user behavior. The study in reference <xref ref-type="bibr" rid="B45">45</xref> uses topic level influence to model user influence. The study in reference <xref ref-type="bibr" rid="B46">46</xref> introduces a topic-level influence propagation and aggregation algorithm to derive the indirect influence between nodes. In recent years, with the continuous progress of deep learning, many studies have introduced deep learning into social influence prediction to improve the prediction performance. A popular deep learning method is [<xref ref-type="bibr" rid="B14">14</xref>], which provides an end-to-end framework to predict social influence by learning the potential features of users. The study in reference <xref ref-type="bibr" rid="B15">15</xref> has improved the study in reference <xref ref-type="bibr" rid="B14">14</xref> to enhance feature representation and result accuracy with a multi-view model. The study in reference <xref ref-type="bibr" rid="B47">47</xref> proposes a social influence prediction model NNMInf based on neural network multi-label classification. The study in reference <xref ref-type="bibr" rid="B48">48</xref> introduces a deep neural network framework witch simulate social influence and predict human behavior. Compared with traditional methods, these deep learning models show better learning performance.</p>
</sec>
<sec id="s5-2">
<title>5.2 Heterogeneous Graph Neural Network</title>
<p>In recent years, we have identified a huge development of the graph neural network in deep learning technology [<xref ref-type="bibr" rid="B27">27</xref>,<xref ref-type="bibr" rid="B49">49</xref>], and the state-of-the-art model GAT [<xref ref-type="bibr" rid="B19">19</xref>], which represents the method of depth learning-based graphical representation as the graph neural network (GNN), the main idea is as follows: the first step is to calculate the feature representation of neighbor nodes, and the second step is to aggregate neighbors through message passing mechanism to obtain the feature representation of nodes&#x20;[<xref ref-type="bibr" rid="B50">50</xref>].</p>
<p>Recently, the heterogeneous graph neural network has become the main branch of GNN. The main task is to learn the representation of heterogeneous nodes on the graph neural network, so as to adapt to the downstream tasks based on heterogeneous networks. The study in reference <xref ref-type="bibr" rid="B13">13</xref> realizes node representation of heterogeneous networks by aggregating features of different types of nodes in stages. The study in reference <xref ref-type="bibr" rid="B51">51</xref> proposes a heterogeneous graph neural network based on hierarchical attention, including node level attention and semantic level attention. Node level attention aims to learn the importance between nodes and their neighbors based on meta-paths, while semantic level attention can learn the importance of different meta-paths. The study in reference <xref ref-type="bibr" rid="B52">52</xref> proposes a heterogeneous graph neural network method for subgraphs, which trains a classifier to learn the neighbor average features of the random sampling graph of the relational &#x201c;metagraph.&#x201d; The MAGNN [<xref ref-type="bibr" rid="B53">53</xref>] model which contains the node content transformation to encapsulate input node attributes, the intra-meta-path aggregation to incorporate intermediate semantic nodes, and the inter-meta-path aggregation to combine messages from multiple meta-paths. GTN [<xref ref-type="bibr" rid="B54">54</xref>], which generates new graph structures by identifying useful connections between unconnected nodes on the original graph, can learn effective node embeddings on the new graphs in an end-to-end fashion. HGNN-AC [<xref ref-type="bibr" rid="B55">55</xref>] based on reference <xref ref-type="bibr" rid="B53">53</xref> proposed a general framework for heterogeneous graph neural network <italic>via</italic> Attribute Completion, including pre-learning of topological embedding and attribute completion with attention mechanism. These heterogeneous graph neural network representation methods enhance the representation ability of nodes and provide a more practical idea for downstream&#x20;tasks.</p>
</sec>
</sec>
<sec sec-type="conclusion" id="s6">
<title>6 Conclusion</title>
<p>In this study, we studied the problem of influence prediction based on a heterogeneous neural network, introduced a novel model HetInf that combines three neural network modules models to jointly infer the interaction between events and users in heterogeneous networks, and predicted the future behavior of network users. The local sampling method of heterogeneous networks was improved to capture the law of information dissemination, so as to obtain a more realistic user influence subgraph. Experimental results show that the influence prediction model can benefit from the heterogeneous network as well as joint learning embedding of users and events. In general, the empirical studies verify the effectiveness of our proposed model compared to the baseline methods.</p>
</sec>
</body>
<back>
<sec id="s7">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s8">
<title>Author Contributions</title>
<p>LG contributed to the core idea of the experiment design and analysis results under the guidance of BZ. HW assisted in experiment code and experiment analysis. HZ and ZZ analyzed the comparative experiment. BZ supervised the research, provided financial support, and provided the financial support and experimental equipment. BZ is the corresponding author. All authors discussed the results and contributed to the final manuscript.</p>
</sec>
<sec id="s9">
<title>Funding</title>
<p>This work is supported by the National Key Research and Development Program of China (2018YFC0831703) and the National Natural Science Foundation of China (Nos. 61732022, 61732004, 62072131, and 61672020).</p>
</sec>
<sec sec-type="COI-statement" id="s10">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s11">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gui</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Tao</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Norick</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Kaplan</surname>
<given-names>L</given-names>
</name>
<etal/>
</person-group> <article-title>Embedding Learning with Events in Heterogeneous Information Networks</article-title>. <source>IEEE Trans Knowl Data Eng</source> (<year>2017</year>) <volume>29</volume>:<fpage>2428</fpage>&#x2013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1109/TKDE.2017.2733530</pub-id> </citation>
</ref>
<ref id="B2">
<label>2.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Mehrizi</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Cor&#xf2;</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Cruciani</surname>
<given-names>E</given-names>
</name>
<name>
<surname>D&#x2019;Angelo</surname>
<given-names>G</given-names>
</name>
</person-group>. <article-title>Election Control through Social Influence with Unknown Preferences</article-title>. In: <conf-name>International Computing and Combinatorics Conference</conf-name>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Springer</publisher-name> (<year>2020</year>). p. <fpage>397</fpage>&#x2013;<lpage>410</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-030-58150-3_32</pub-id> </citation>
</ref>
<ref id="B3">
<label>3.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dholakia</surname>
<given-names>UM</given-names>
</name>
<name>
<surname>Bagozzi</surname>
<given-names>RP</given-names>
</name>
<name>
<surname>Pearo</surname>
<given-names>LK</given-names>
</name>
</person-group>. <article-title>A Social Influence Model of Consumer Participation in Network- and Small-Group-Based Virtual Communities</article-title>. <source>Int J&#x20;Res marketing</source> (<year>2004</year>) <volume>21</volume>:<fpage>241</fpage>&#x2013;<lpage>63</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijresmar.2003.12.004</pub-id> </citation>
</ref>
<ref id="B4">
<label>4.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Kim</surname>
<given-names>YA</given-names>
</name>
<name>
<surname>Srivastava</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Impact of Social Influence in E-Commerce Decision Making</article-title>. <comment>vol. 258 of ACM International Conference Proceeding Series</comment>. In: <person-group person-group-type="editor">
<name>
<surname>Gini</surname>
<given-names>ML</given-names>
</name>
<name>
<surname>Kauffman</surname>
<given-names>RJ</given-names>
</name>
<name>
<surname>Sarppo</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Dellarocas</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Dignum</surname>
<given-names>F</given-names>
</name>
</person-group>, editors. <conf-name>Proceedings of the 9th International Conference on Electronic Commerce: The Wireless World of Electronic Commerce, 2007</conf-name>; <conf-date>August 19-22, 2007</conf-date>; <conf-loc>Minneapolis, MN, USA</conf-loc>. <publisher-name>University of MinnesotaACM</publisher-name> (<year>2007</year>). p. <fpage>293</fpage>&#x2013;<lpage>302</lpage>. <pub-id pub-id-type="doi">10.1145/1282100.1282157</pub-id> </citation>
</ref>
<ref id="B5">
<label>5.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Ye</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>W</given-names>
</name>
</person-group>. <article-title>Exploring Social Influence for Recommendation: a Generative Model Approach</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Hersh</surname>
<given-names>WR</given-names>
</name>
<name>
<surname>Callan</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Maarek</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Sanderson</surname>
<given-names>M</given-names>
</name>
</person-group>, editors. <conf-name>The 35th International ACM SIGIR conference on research and development in Information Retrieval, SIGIR &#x2019;12</conf-name>; <conf-date>August 12-16, 2012</conf-date>; <conf-loc>Portland, OR, USA</conf-loc>. <publisher-name>ACM</publisher-name> (<year>2012</year>). p. <fpage>671</fpage>&#x2013;<lpage>80</lpage>. </citation>
</ref>
<ref id="B6">
<label>6.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hosni</surname>
<given-names>AIE</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Ahmad</surname>
<given-names>S</given-names>
</name>
</person-group>. <article-title>Minimizing Rumor Influence in Multiplex Online Social Networks Based on Human Individual and Social Behaviors</article-title>. <source>Inf Sci</source> (<year>2020</year>) <volume>512</volume>:<fpage>1458</fpage>&#x2013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.1016/j.ins.2019.10.063</pub-id> </citation>
</ref>
<ref id="B7">
<label>7.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Jin</surname>
<given-names>Q</given-names>
</name>
</person-group>. <article-title>User Role Identification Based on Social Behavior and Networking Analysis for Information Dissemination</article-title>. <source>Future Generation Comput Syst</source> (<year>2019</year>) <volume>96</volume>:<fpage>639</fpage>&#x2013;<lpage>48</lpage>. <pub-id pub-id-type="doi">10.1016/j.future.2017.04.043</pub-id> </citation>
</ref>
<ref id="B8">
<label>8.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z</given-names>
</name>
</person-group>. <article-title>Functional Immunization of Networks Based on Message Passing</article-title>. <source>Appl Math Comput</source> (<year>2020</year>) <volume>366</volume>:<fpage>124728</fpage>. <pub-id pub-id-type="doi">10.1016/j.amc.2019.124728</pub-id> </citation>
</ref>
<ref id="B9">
<label>9.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>S</given-names>
</name>
</person-group>. <article-title>Learning Influence from Heterogeneous Social Networks</article-title>. <source>Data Min Knowl Disc</source> (<year>2012</year>) <volume>25</volume>:<fpage>511</fpage>&#x2013;<lpage>44</lpage>. <pub-id pub-id-type="doi">10.1007/s10618-012-0252-3</pub-id> </citation>
</ref>
<ref id="B10">
<label>10.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Barbieri</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Bonchi</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Manco</surname>
<given-names>G</given-names>
</name>
</person-group>. <article-title>Topic-Aware Social Influence Propagation Models</article-title>. <source>Knowl Inf Syst</source> (<year>2013</year>) <volume>37</volume>:<fpage>555</fpage>&#x2013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.1007/s10115-013-0646-6</pub-id> </citation>
</ref>
<ref id="B11">
<label>11.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Z-K</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Zhan</surname>
<given-names>X-X</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>C-X</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y-C</given-names>
</name>
</person-group>. <article-title>Dynamics of Information Diffusion and its Applications on Complex Networks</article-title>. <source>Phys Rep</source> (<year>2016</year>) <volume>651</volume>:<fpage>1</fpage>&#x2013;<lpage>34</lpage>. <pub-id pub-id-type="doi">10.1016/j.physrep.2016.07.002</pub-id> </citation>
</ref>
<ref id="B12">
<label>12.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Quan</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Jia</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>S</given-names>
</name>
</person-group>. <article-title>Repost Prediction Incorporating Time-Sensitive Mutual Influence in Social Networks</article-title>. <source>J&#x20;Comput Sci</source> (<year>2018</year>) <volume>28</volume>:<fpage>217</fpage>&#x2013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.1016/j.jocs.2017.11.015</pub-id> </citation>
</ref>
<ref id="B13">
<label>13.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Song</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Swami</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Chawla</surname>
<given-names>NV</given-names>
</name>
</person-group>. <article-title>Heterogeneous Graph Neural Network</article-title>. In: <conf-name>Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery &#x26; Data Mining</conf-name>, <conf-loc>Anchorage, AK</conf-loc>, <conf-date>August 4&#x2013;8, 2019</conf-date> (<year>2019</year>) p. <fpage>793</fpage>&#x2013;<lpage>803</lpage>. <pub-id pub-id-type="doi">10.1145/3292500.3330961</pub-id> </citation>
</ref>
<ref id="B14">
<label>14.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Qiu</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Deepinf: Social Influence Prediction with Deep Learning</article-title>. In: <conf-name>Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery &#x26; Data Mining</conf-name>, <conf-loc>Halifax, NS</conf-loc>, <conf-date>August 13&#x2013;17, 2017</conf-date> (<year>2018</year>). p. <fpage>2110</fpage>&#x2013;<lpage>9</lpage>. </citation>
</ref>
<ref id="B15">
<label>15.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>C</given-names>
</name>
</person-group>. <article-title>Mvinf: Social Influence Prediction with Multi-View Graph Attention Learning</article-title>. <source>Cogn Comput</source> (<year>2021</year>) <fpage>1</fpage>&#x2013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1007/s12559-021-09822-z</pub-id> </citation>
</ref>
<ref id="B16">
<label>16.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Calais Guerra</surname>
<given-names>PH</given-names>
</name>
<name>
<surname>Veloso</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Meira</surname>
<given-names>W</given-names>
<suffix>Jr</suffix>
</name>
<name>
<surname>Almeida</surname>
<given-names>V</given-names>
</name>
</person-group>. <article-title>From Bias to Opinion: A Transfer-Learning Approach to Real-Time Sentiment Analysis</article-title>. In: <conf-name>Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining</conf-name>, <conf-loc>San Diego, CA</conf-loc>, <conf-date>August 21&#x2013;24, 2011</conf-date> (<year>2011</year>). p. <fpage>150</fpage>&#x2013;<lpage>8</lpage>. </citation>
</ref>
<ref id="B17">
<label>17.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Tong</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Faloutsos</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Pan</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Fast Random Walk with Restart and its Applications</article-title>. In: <conf-name>Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006)</conf-name>; <conf-date>18-22 December 2006</conf-date>; <conf-loc>Hong Kong, China</conf-loc>. <publisher-name>IEEE Computer Society</publisher-name> (<year>2006</year>) p. <fpage>613</fpage>&#x2013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1109/icdm.2006.70</pub-id> </citation>
</ref>
<ref id="B18">
<label>18.</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Goldberg</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Levy</surname>
<given-names>O</given-names>
</name>
</person-group>. <source>word2vec explained: deriving mikolov et&#x20;al.&#x2019;s negative-sampling word-embedding method</source>. <comment>CoRR abs/1402.3722</comment> (<year>2014</year>). </citation>
</ref>
<ref id="B19">
<label>19.</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Veli&#x10d;kovi&#x107;</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Cucurull</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Casanova</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Romero</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Lio</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Bengio</surname>
<given-names>Y</given-names>
</name>
</person-group>. <source>Graph Attention Networks</source>. <comment>arXiv preprint arXiv:1710.10903</comment> (<year>2017</year>). </citation>
</ref>
<ref id="B20">
<label>20.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Niepert</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Ahmed</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Kutzkov</surname>
<given-names>K</given-names>
</name>
</person-group>. <article-title>Learning Convolutional Neural Networks for Graphs</article-title>. <comment>vol. 48 of JMLR Workshop and Conference Proceedings</comment>. In: <person-group person-group-type="editor">
<name>
<surname>Balcan</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Weinberger</surname>
<given-names>KQ</given-names>
</name>
</person-group>, editors. <conf-name>Proceedings of the 33nd International Conference on Machine Learning, ICML 2016</conf-name>; <conf-date>June 19-24, 2016</conf-date>; <conf-loc>New York City, NY, USA</conf-loc>. <publisher-name>JMLR.org</publisher-name> (<year>2016</year>). p. <fpage>2014</fpage>&#x2013;<lpage>23</lpage>. </citation>
</ref>
<ref id="B21">
<label>21.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Duan</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Shum</surname>
<given-names>H</given-names>
</name>
</person-group>. <article-title>Twitter Topic Summarization by Ranking Tweets Using Social Influence and Content Quality</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Kay</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Boitet</surname>
<given-names>C</given-names>
</name>
</person-group>, editors. <conf-name>COLING 2012, 24th International Conference on Computational Linguistics, Proceedings of the Conference: Technical Papers</conf-name>; <conf-date>8-15 December 2012</conf-date>; <conf-loc>Mumbai, India</conf-loc>. <publisher-name>Indian Institute of Technology Bombay</publisher-name> (<year>2012</year>). p. <fpage>763</fpage>&#x2013;<lpage>80</lpage>. </citation>
</ref>
<ref id="B22">
<label>22.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Zheng</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>H</given-names>
</name>
</person-group>. <article-title>Incorporating Scenario Knowledge into A Unified Fine-Tuning Architecture for Event Representation</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Huang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Kamps</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Murdock</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Wen</surname>
<given-names>J</given-names>
</name>
<etal/>
</person-group> editors. <conf-name>Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, SIGIR 2020</conf-name>; <conf-date>July 25-30, 2020</conf-date>; <conf-loc>Virtual Event, China</conf-loc>. <publisher-name>ACM</publisher-name> (<year>2020</year>) p. <fpage>249</fpage>&#x2013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.1145/3397271.3401173</pub-id> </citation>
</ref>
<ref id="B23">
<label>23.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ramos</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Using Tf-Idf to Determine Word Relevance in Document Queries</article-title>. <source>Proc first instructional Conf machine Learn (Citeseer)</source> (<year>2003</year>) <volume>242</volume>:<fpage>29</fpage>&#x2013;<lpage>48</lpage>. </citation>
</ref>
<ref id="B24">
<label>24.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Perozzi</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Al-Rfou</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Skiena</surname>
<given-names>S</given-names>
</name>
</person-group>. <article-title>Deepwalk: Online Learning of Social Representations</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Macskassy</surname>
<given-names>SA</given-names>
</name>
<name>
<surname>Perlich</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Leskovec</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Ghani</surname>
<given-names>R</given-names>
</name>
</person-group>, editors. <conf-name>The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD &#x2019;14</conf-name>; <conf-date>August 24 - 27, 2014</conf-date>; <conf-loc>New York, NY, USA</conf-loc>. <publisher-name>ACM</publisher-name> (<year>2014</year>). p. <fpage>701</fpage>&#x2013;<lpage>10</lpage>. </citation>
</ref>
<ref id="B25">
<label>25.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Beamer</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Asanovic</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Patterson</surname>
<given-names>D</given-names>
</name>
</person-group>. <article-title>Direction-Optimizing Breadth-First Search</article-title>. In: <conf-name>SC&#x2019;12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis</conf-name>, <conf-loc>Salt Lake City, UT</conf-loc>, <conf-date>November 11&#x2013;15, 201</conf-date>. <publisher-name>IEEE</publisher-name> (<year>2012</year>). p. <fpage>1</fpage>&#x2013;<lpage>10</lpage>. <pub-id pub-id-type="doi">10.1109/sc.2012.50</pub-id> </citation>
</ref>
<ref id="B26">
<label>26.</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Huang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>K</given-names>
</name>
</person-group>. <source>Bidirectional Lstm-Crf Models for Sequence Tagging</source>. <comment>arXiv preprint arXiv:1508.01991</comment> (<year>2015</year>). </citation>
</ref>
<ref id="B27">
<label>27.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Kipf</surname>
<given-names>TN</given-names>
</name>
<name>
<surname>Welling</surname>
<given-names>M</given-names>
</name>
</person-group>. <article-title>Semi-Supervised Classification with Graph Convolutional Networks</article-title>. <comment>Conference Track Proceedings (OpenReview.net)</comment>. In: <conf-name>5th International Conference on Learning Representations</conf-name>; <conf-date>April 24-26, 2017</conf-date>; <conf-loc>Toulon, France</conf-loc>. <publisher-name>ICLR 2017</publisher-name> (<year>2017</year>). </citation>
</ref>
<ref id="B28">
<label>28.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ugander</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Backstrom</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Marlow</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Kleinberg</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Structural Diversity in Social Contagion</article-title>. <source>Proc Natl Acad Sci</source> (<year>2012</year>) <volume>109</volume>:<fpage>5962</fpage>&#x2013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1116502109</pub-id> </citation>
</ref>
<ref id="B29">
<label>29.</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Lerman</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Ghosh</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Surachawala</surname>
<given-names>T</given-names>
</name>
</person-group>. <source>Social Contagion: An Empirical Study of Information Spread on Digg and Twitter Follower Graphs</source>. <comment>CoRR abs/1202.3162</comment> (<year>2012</year>). </citation>
</ref>
<ref id="B30">
<label>30.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hogg</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Lerman</surname>
<given-names>K</given-names>
</name>
</person-group>. <article-title>Social Dynamics of Digg</article-title>. <source>EPJ&#x20;Data Sci</source> (<year>2012</year>) <volume>1</volume>:<fpage>5</fpage>. <pub-id pub-id-type="doi">10.1140/epjds5</pub-id> </citation>
</ref>
<ref id="B31">
<label>31.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Xing</surname>
<given-names>C</given-names>
</name>
</person-group>. <article-title>Who Influenced You? Predicting Retweet via Social Influence Locality</article-title>. <source>ACM Trans Knowl Discov Data</source> (<year>2015</year>) <volume>9</volume>:<fpage>1</fpage>&#x2013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1145/2700398</pub-id> </citation>
</ref>
<ref id="B32">
<label>32.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Social Influence Locality for Modeling Retweeting Behaviors</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Rossi</surname>
<given-names>F</given-names>
</name>
</person-group>, editor. <conf-name>IJCAI 2013, Proceedings of the 23rd International Joint Conference on Artificial Intelligence</conf-name>; <conf-date>August 3-9, 2013</conf-date>; <conf-loc>Beijing, China</conf-loc>. <publisher-name>IJCAI/AAAI</publisher-name> (<year>2013</year>). p. <fpage>2761</fpage>&#x2013;<lpage>7</lpage>. </citation>
</ref>
<ref id="B33">
<label>33.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Le</surname>
<given-names>QV</given-names>
</name>
<name>
<surname>Mikolov</surname>
<given-names>T</given-names>
</name>
</person-group>. <article-title>Distributed Representations of Sentences and Documents</article-title>. <comment>vol. 32 of JMLR Workshop and Conference Proceedings</comment>. In: <conf-name>Proceedings of the 31th International Conference on Machine Learning, ICML 2014</conf-name>; <conf-date>21-26 June 2014</conf-date>; <conf-loc>Beijing, China</conf-loc>. <publisher-name>JMLR.org</publisher-name> (<year>2014</year>). p. <fpage>1188</fpage>&#x2013;<lpage>96</lpage>. </citation>
</ref>
<ref id="B34">
<label>34.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fan</surname>
<given-names>R-E</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>K-W</given-names>
</name>
<name>
<surname>Hsieh</surname>
<given-names>C-J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X-R</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>C-J</given-names>
</name>
</person-group>. <article-title>Liblinear: A Library for Large Linear Classification</article-title>. <source>J&#x20;machine Learn Res</source> (<year>2008</year>) <volume>9</volume>:<fpage>1871</fpage>&#x2013;<lpage>4</lpage>. <pub-id pub-id-type="doi">10.1145/1390681.1442794</pub-id> </citation>
</ref>
<ref id="B35">
<label>35.</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Agarap</surname>
<given-names>AF</given-names>
</name>
</person-group>. <source>Deep Learning Using Rectified Linear Units (Relu)</source>. <comment>arXiv preprint arXiv:1803.08375</comment> (<year>2018</year>). </citation>
</ref>
<ref id="B36">
<label>36.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Z</given-names>
</name>
</person-group>. <article-title>Improved Adam Optimizer for Deep Neural Networks</article-title>. In: <conf-name>2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS)</conf-name>. <publisher-name>IEEE</publisher-name> (<year>2018</year>). p. <fpage>1</fpage>&#x2013;<lpage>2</lpage>. <pub-id pub-id-type="doi">10.1109/iwqos.2018.8624183</pub-id> </citation>
</ref>
<ref id="B37">
<label>37.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gao</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Zhuang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>A</given-names>
</name>
</person-group>. <article-title>Public Opinion Early Warning Agent Model: A Deep Learning cascade Virality Prediction Model Based on Multi-Feature Fusion</article-title>. <source>Front Neurorobot</source> (<year>2021</year>) <volume>15</volume>:<fpage>674322</fpage>. <pub-id pub-id-type="doi">10.3389/fnbot.2021.674322</pub-id> </citation>
</ref>
<ref id="B38">
<label>38.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>S</given-names>
</name>
</person-group>. <article-title>A Survey on Information Diffusion in Online Social Networks: Models and Methods</article-title>. <source>Information</source> (<year>2017</year>) <volume>8</volume>:<fpage>118</fpage>. <pub-id pub-id-type="doi">10.3390/info8040118</pub-id> </citation>
</ref>
<ref id="B39">
<label>39.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>RK-W</given-names>
</name>
<name>
<surname>Hoang</surname>
<given-names>T-A</given-names>
</name>
<name>
<surname>Lim</surname>
<given-names>E-P</given-names>
</name>
</person-group>. <article-title>Discovering Hidden Topical Hubs and Authorities in Online Social Networks</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Ester</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Pedreschi</surname>
<given-names>D</given-names>
</name>
</person-group>, editors. <conf-name>Proceedings of the 2018 SIAM International Conference on Data Mining, SDM 2018</conf-name>; <conf-date>May 3-5, 2018</conf-date>; <conf-loc>San Diego, CA, USA</conf-loc>. <publisher-name>San Diego Marriott Mission ValleySIAM</publisher-name> (<year>2018</year>). p. <fpage>378</fpage>&#x2013;<lpage>86</lpage>. <pub-id pub-id-type="doi">10.1137/1.9781611975321.43</pub-id> </citation>
</ref>
<ref id="B40">
<label>40.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Cha</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Haddadi</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Benevenuto</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Gummadi</surname>
<given-names>PK</given-names>
</name>
</person-group>. In: <person-group person-group-type="editor">
<name>
<surname>Cohen</surname>
<given-names>WW</given-names>
</name>
<name>
<surname>Gosling</surname>
<given-names>S</given-names>
</name>
</person-group>, editors. <conf-name>Measuring user influence in twitter: The million follower fallacy. In Proceedings of the Fourth International Conference on Weblogs and Social Media, ICWSM 2010</conf-name>; <conf-date>May 23-26, 2010</conf-date>; <conf-loc>Washington, DC, USA</conf-loc>. <publisher-name>The AAAI Press</publisher-name> (<year>2010</year>).</citation>
</ref>
<ref id="B41">
<label>41.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Qiu</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>M</given-names>
</name>
<etal/>
</person-group> <article-title>GCC: Graph Contrastive Coding for Graph Neural Network Pre-training</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Gupta</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Prakash</surname>
<given-names>BA</given-names>
</name>
</person-group>, editors. <conf-name>KDD &#x2019;20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining</conf-name>; <conf-date>August 23-27, 2020</conf-date>; <conf-loc>Virtual Event, CA, USA</conf-loc>. <publisher-name>ACM</publisher-name> (<year>2020</year>). p. <fpage>1150</fpage>&#x2013;<lpage>60</lpage>. </citation>
</ref>
<ref id="B42">
<label>42.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z</given-names>
</name>
</person-group>. <article-title>A Weighted Network Community Detection Algorithm Based on Deep Learning</article-title>. <source>Appl Maths Comput</source> (<year>2021</year>) <volume>401</volume>:<fpage>126012</fpage>. <pub-id pub-id-type="doi">10.1016/j.amc.2021.126012</pub-id> </citation>
</ref>
<ref id="B43">
<label>43.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tian</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Mo</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>Z</given-names>
</name>
</person-group>. <article-title>Deep Reinforcement Learning-Based Approach to Tackle Topic-Aware Influence Maximization</article-title>. <source>Data Sci Eng</source> (<year>2020</year>) <volume>5</volume>:<fpage>1</fpage>&#x2013;<lpage>11</lpage>. <pub-id pub-id-type="doi">10.1007/s41019-020-00117-1</pub-id> </citation>
</ref>
<ref id="B44">
<label>44.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Singla</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Richardson</surname>
<given-names>M</given-names>
</name>
</person-group>. <article-title>Yes, There Is a Correlation: - from Social Networks to Personal Behavior on the Web</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Huai</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Hon</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Tomkins</surname>
<given-names>A</given-names>
</name>
<etal/>
</person-group> editors. <conf-name>Proceedings of the 17th International Conference on World Wide Web, WWW 2008</conf-name>; <conf-date>April 21-25, 2008</conf-date>; <conf-loc>Beijing, China</conf-loc>. <publisher-name>ACM</publisher-name> (<year>2008</year>). p. <fpage>655</fpage>&#x2013;<lpage>64</lpage>. </citation>
</ref>
<ref id="B45">
<label>45.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hu</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Meng</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Analysis of Influence Maximization in Large-Scale Social Networks</article-title>. <source>SIGMETRICS Perform Eval Rev</source> (<year>2014</year>) <volume>41</volume>:<fpage>78</fpage>&#x2013;<lpage>81</lpage>. <pub-id pub-id-type="doi">10.1145/2627534.2627559</pub-id> </citation>
</ref>
<ref id="B46">
<label>46.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Han</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>S</given-names>
</name>
</person-group>. <article-title>Mining Topic-Level Influence in Heterogeneous Networks</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Huang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Koudas</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>GJF</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Collins-Thompson</surname>
<given-names>K</given-names>
</name>
<name>
<surname>An</surname>
<given-names>A</given-names>
</name>
</person-group>, editors. <conf-name>Proceedings of the 19th ACM Conference on Information and Knowledge Management, CIKM 2010</conf-name>; <conf-date>October 26-30, 2010</conf-date>; <conf-loc>Toronto, Ontario, Canada</conf-loc>. <publisher-name>ACM</publisher-name> (<year>2010</year>). p. <fpage>199</fpage>&#x2013;<lpage>208</lpage>. <pub-id pub-id-type="doi">10.1145/1871437.1871467</pub-id> </citation>
</ref>
<ref id="B47">
<label>47.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Jing</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
</person-group>. <article-title>Nnmlinf: Social Influence Prediction with Neural Network Multi-Label Classification</article-title>. In: <conf-name>Proceedings of the ACM Turing Celebration Conference-China</conf-name> (<year>2019</year>). p. <fpage>1</fpage>&#x2013;<lpage>5</lpage>. </citation>
</ref>
<ref id="B48">
<label>48.</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Luceri</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Braun</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Giordano</surname>
<given-names>S</given-names>
</name>
</person-group>. <article-title>Social Influence (Deep) Learning for Human Behavior Prediction</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Cornelius</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Coronges</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Gon&#xe7;alves</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Sinatra</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Vespignani</surname>
<given-names>A</given-names>
</name>
</person-group>, editors. <source>International Workshop on Complex Networks</source>. <publisher-loc>Cham</publisher-loc>: <publisher-name>Springer</publisher-name> (<year>2018</year>). p. <fpage>261</fpage>&#x2013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1007/978-3-319-73198-8_22</pub-id> </citation>
</ref>
<ref id="B49">
<label>49.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Hamilton</surname>
<given-names>WL</given-names>
</name>
<name>
<surname>Ying</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Leskovec</surname>
<given-names>J</given-names>
</name>
</person-group>. <article-title>Inductive Representation Learning on Large Graphs</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Guyon</surname>
<given-names>I</given-names>
</name>
<name>
<surname>von Luxburg</surname>
<given-names>U</given-names>
</name>
<name>
<surname>Bengio</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Wallach</surname>
<given-names>HM</given-names>
</name>
<name>
<surname>Fergus</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Vishwanathan</surname>
<given-names>SVN</given-names>
</name>
<etal/>
</person-group> editors. <conf-name>Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017</conf-name>; <conf-date>December 4-9, 2017</conf-date>; <conf-loc>Long Beach, CA, USA</conf-loc> (<year>2017</year>). p. <fpage>1024</fpage>&#x2013;<lpage>34</lpage>. </citation>
</ref>
<ref id="B50">
<label>50.</label>
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wu</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Pan</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Long</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>PS</given-names>
</name>
</person-group>. <article-title>A Comprehensive Survey on Graph Neural Networks</article-title>. <source>IEEE Trans Neural Netw Learn Syst.</source> (<year>2021</year>) <volume>32</volume>:<fpage>4</fpage>&#x2013;<lpage>24</lpage>. <pub-id pub-id-type="doi">10.1109/tnnls.2020.2978386</pub-id> </citation>
</ref>
<ref id="B51">
<label>51.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Ji</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Ye</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Cui</surname>
<given-names>P</given-names>
</name>
<etal/>
</person-group> <article-title>Heterogeneous Graph Attention Network</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Liu</surname>
<given-names>L</given-names>
</name>
<name>
<surname>White</surname>
<given-names>RW</given-names>
</name>
<name>
<surname>Mantrach</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Silvestri</surname>
<given-names>F</given-names>
</name>
<name>
<surname>McAuley</surname>
<given-names>JJ</given-names>
</name>
<name>
<surname>Baeza-Yates</surname>
<given-names>R</given-names>
</name>
<etal/>
</person-group> editors. <conf-name>The World Wide Web Conference, WWW 2019</conf-name>; <conf-date>May 13-17, 2019</conf-date>; <conf-loc>San Francisco, CA, USA</conf-loc>. <publisher-name>ACM</publisher-name> (<year>2019</year>). p. <fpage>2022</fpage>&#x2013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1145/3308558.3313562</pub-id> </citation>
</ref>
<ref id="B52">
<label>52.</label>
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Yu</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Lerer</surname>
<given-names>A</given-names>
</name>
</person-group>. <source>Scalable Graph Neural Networks for Heterogeneous Graphs</source>. <comment>CoRR abs/2011.09679</comment> (<year>2020</year>). </citation>
</ref>
<ref id="B53">
<label>53.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Fu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Meng</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>King</surname>
<given-names>I</given-names>
</name>
</person-group>. <article-title>MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Huang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>King</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>T</given-names>
</name>
<name>
<surname>van Steen</surname>
<given-names>M</given-names>
</name>
</person-group>, editors. <conf-name>WWW &#x2019;20: The Web Conference 2020</conf-name>; <conf-date>April 20-24, 2020</conf-date>; <conf-loc>Taipei, Taiwan</conf-loc>. <publisher-name>ACM/IW3C2</publisher-name> (<year>2020</year>). p. <fpage>2331</fpage>&#x2013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1145/3366423.3380297</pub-id> </citation>
</ref>
<ref id="B54">
<label>54.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Yun</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Jeong</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Kang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>HJ</given-names>
</name>
</person-group>. <article-title>Graph Transformer Networks</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Wallach</surname>
<given-names>HM</given-names>
</name>
<name>
<surname>Larochelle</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Beygelzimer</surname>
<given-names>A</given-names>
</name>
<name>
<surname>d&#x2019;Alch&#xe9;-Buc</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Fox</surname>
<given-names>EB</given-names>
</name>
<name>
<surname>Garnett</surname>
<given-names>R</given-names>
</name>
</person-group>, editors. <conf-name>Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019</conf-name>; <conf-date>December 8-14, 2019</conf-date>; <conf-loc>Vancouver, BC, Canada</conf-loc> (<year>2019</year>). p. <fpage>11960</fpage>&#x2013;<lpage>70</lpage>. </citation>
</ref>
<ref id="B55">
<label>55.</label>
<citation citation-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Jin</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Huo</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Liang</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>L</given-names>
</name>
</person-group>. <article-title>Heterogeneous Graph Neural Network via Attribute Completion</article-title>. In: <person-group person-group-type="editor">
<name>
<surname>Leskovec</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Grobelnik</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Najork</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zia</surname>
<given-names>L</given-names>
</name>
</person-group>, editors. <conf-name>WWW &#x2019;21: The Web Conference 2021</conf-name>; <conf-date>April 19-23, 2021</conf-date>; <conf-loc>Virtual Event/Ljubljana, Slovenia</conf-loc>. <publisher-name>ACM/IW3C2</publisher-name> (<year>2021</year>). p. <fpage>391</fpage>&#x2013;<lpage>400</lpage>. <pub-id pub-id-type="doi">10.1145/3442381.3449914</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>