Unravelling key genes and molecular pathways in gastric inflammation-to-cancer transition through causal discovery: implications for early diagnosis and therapy.
Zhen Ren, Xiaochen Li, Pengyun Liu, Jinjuan Li, Shisan Bao
Abstract
Open AccessObjective: Gastric cancer remains a major global health concern. This study aimed to identify key genes involved in the inflammation-to-cancer transition in the stomach using an integrative framework combining graph neural networks and causal discovery. Methods: In retrospective study gene expression data from two gastric cancer-related datasets were categorised into two stages: gastritis to precancerous lesions and precancerous lesions to gastric cancer. Differentially expressed genes were identified and analysed for functional enrichment. A relevance network was constructed using Pearson's correlations. Graph sample and aggregate was then applied to the expression matrix, using this network for training. Node embeddings were generated via neighbourhood aggregation, and causal regulatory relationships were inferred using a constraint-based algorithm. Genes with the highest degrees in the causal network were assessed for prognostic relevance using Kaplan-Meier analysis. Results: A total of 857 differentially expressed genes were identified in the gastritis-to-precancerous transition and 337 in the precancerous-to-gastric cancer transition, with 83 differentially expressed genes shared. Enrichment analysis highlighted pathways linked to bacterial responses, especially Helicobacter pylori. Graph sample and aggregate enhanced gene representation for causal analysis. The Peter-Clark algorithm inferred 72 genes and 99 causal edges. Nine key genes-mucin 17, brain-expressed X-linked 2, BCL2/adenovirus E1B 19 kDa protein-interacting protein 3, Ras association domain family member 2, NLR family pyrin domain-containing 7, interferon regulatory factor 4, carbamoyl phosphate synthetase 1, nucleoporin 210 and neuronal differentiation 2-were identified, all of which were significantly associated with gastric cancer survival. Conclusion: This study integrates graph neural networks and causal inference to identify critical genes involved in gastric inflammation-cancer progression, providing novel insights into the pathogenesis of gastric cancer and potential biomarkers for validation in future studies.