SAMGAT: structure-aware multilevel graph attention networks for automatic rumor detection

View article
PeerJ Computer Science

Main article text

 

Introduction

  • We put forth an innovative constrained dynamic graph attention model to effectively prioritize critical posts, representing events more accurately while preserving cardinality information and modeling different relationships simultaneously.

  • We guide graph attention using conversation structure in a self-supervised manner, outperforming traditional graph attention models. Our model leverages trustworthy prior structural knowledge to provide robust performance even under perturbed graph structures.

  • We restrict posts in the graph pooling process based on constrained claim-guided attention, demonstrating superior performance in rumor detection.

Methodology

Problem definition

Graph attention networks

Weighted multi-head attention

DynamicGAT

Constrained local attention

Relation-guided attention

Claim-guided attention pooling

Post-level attention

Constrained event-level attention

Classification of rumor detection

 
__________________________________________________________________________________ 
 
Algorithm 1 The SAMGAT Algorithm 
     Input: Graph G(V,E), Event set C, Label set Y = N,F,T,U 
    Output: Classifier f : C → Y 
    for each layer l = 1 to L do 
 4:       for all nodes i ∈ V do 
            for all neighbors j ∈Ni  do 
                Compute unnormalized attention scores using Eq.  (5), Eq.  (6) 
     and Eq. (11) 
                   Apply  dropout  to  modify  unnormalized  attention  scores  with 
     probability d 
 8:            end for 
            Apply softmax to unnormalized attention scores, obtaining attention 
     scores 
              Apply TopK to attention scores 
              Update node representations using Eq.(16) 
12:       end for 
    end for 
    Apply post-level attention for each post i using Eq. (21) and (22) 
     Compute event-level representation using Eq. (23) to (27) 
16:  Make predictions ˆ y with classifier and softmax using Eq. (28) 
     Compute loss L(y, ˆ y ) and LE to update model parameters using Eq.  (20) 
     and Eq.(29) 
___________________________________________________________________________________    

Experiments and analysis

Experimental settings

Datasets

Compared methods

  • DTC (Castillo, Mendoza & Poblete, 2011): A decision tree based approach that combines various news features.

  • SVM-TS (Ma et al., 2015): A support vector machine classifier that models the temporal properties of social context during message propagation.

  • GRU-RNN (Ma et al., 2016): This article introduces an RNN based model to detect rumors, which learns from the temporal dynamics of social media to identify rumors more effectively than methods relying on static features.

  • BU-RVNN and TD-RVNN (Ma, Gao & Wong, 2018): Models that view rumor propagation as a tree structure and adopt bottom-up and top-down recursive neural networks for the rumor classification task.

  • PLAN (Khoo et al., 2020): A tree transformer based model capturing long-term interactions with token-level and post-level attention.

  • BiGCN (Bian et al., 2020): An approach using bidirectional graph convolutional models for social media rumor detection that analyzes both propagation and dispersion patterns.

  • ClaHi-GAT (Lin et al., 2021): A GAT model based on an undirected graph, which employs claims to enhance reply posts and incorporate sibling connections between relevant messages.

  • HDGCN (Yu et al., 2022): An approach to dynamic rumor detection using heterogeneous graph convolutional networks and an ordinary differential equation system.

  • TISN (Luo et al., 2022): This study combines text and propagation structure by employing BERT and GCN. TISN arranged tweets in chronological order to extract temporal features of rumors.

Results and analysis

Ablation experiments

Early rumor detection

Sensitive analysis of neighbor number

Conclusion

Additional Information and Declarations

Competing Interests

The authors declare there are no competing interests.

Author Contributions

Yafang Li conceived and designed the experiments, analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Zhihua Chu conceived and designed the experiments, performed the experiments, analyzed the data, performed the computation work, prepared figures and/or tables, authored or reviewed drafts of the article, and approved the final draft.

Caiyan Jia analyzed the data, authored or reviewed drafts of the article, and approved the final draft.

Baokai Zu conceived and designed the experiments, authored or reviewed drafts of the article, and approved the final draft.

Data Availability

The following information was supplied regarding data availability:

The code is available at GitHub and Zenodo:

- https://github.com/qwerdabc/SAMGAT

- qwerdabc. (2024). qwerdabc/SAMGAT: init (init). Zenodo. https://doi.org/10.5281/zenodo.12598037.

The data for Twitter15 and Twitter16 is available at Github and figshare:

- https://github.com/majingCUHK/Rumor_RvNN

- ma, jing (2017). rumdetect2017. figshare. Dataset. https://doi.org/10.6084/m9.figshare.25406389.v1

The data for PHEME is available at figshare:

Zubiaga, Arkaitz; Wong Sak Hoi, Geraldine; Liakata, Maria; Procter, Rob (2016). PHEME dataset of rumours and non-rumours. figshare. Dataset. https://doi.org/10.6084/m9.figshare.4010619.v1.

Funding

This work was supported by the National Natural Science Foundation of China under grant 62006009. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

846 Visitors 855 Views 40 Downloads