Identification of lactylation-related biomarkers in HNSC by integrating machine learning and Spatial transcriptomics analysis.
Pu-Yu Wang, Yi-Ping Zheng, Yu-Lin Zhao
Abstract
Open AccessOBJECTIVE: Head and neck squamous cell carcinoma (HNSC) is a common malignant tumor with a 5-year survival rate of less than 60% and lactylation modification plays a key role in its occurrence and progression. This study aims to identify key lactylation-related genes (LRGs) in HNSC and their prognostic impact by combining machine learning methods and spatial transcriptomics analysis. METHODS: High-throughput gene expression data of HNSC were obtained from the TCGA database. The "Limma" R package was used to screen differentially expressed genes (DEGs), and their biological significance was investigated through GO and KEGG functional enrichment analyses. Subsequently, weighted gene co-expression network analysis (WGCNA) was employed to identify disease trait-associated modules. We used 12 machine learning methods to screen for key genes and validated them in the GSE6631 dataset. Survival curves were plotted to evaluate the prognostic ability of key genes. Transcriptional regulation analysis via the "RcisTarget" package identified transcription factors associated with motifs exceeding NES 3.5. Single-cell and spatial transcriptomic analysis was performed, which involved dimensionality reduction, clustering via UMAP, cell type annotation and the plotting of spatial expression distribution maps of key genes. Finally, for immune infiltration analysis, the CIBERSORT algorithm was used to calculate the proportions of 22 types of immune cells, and then the correlations between key genes and immune-related genes were analyzed. RESULTS: Our study identified three key LRGs (MSN, KIF2C, and RFC4), which exhibited strong diagnostic performance, with AUC values of 0.986 for KIF2C, 0.964 for RFC4, and 0.89 for MSN in the TCGA-HNSC cohort. These findings were validated in the GSE6631 dataset. Western blotting showed a significant upregulation of the expression of these genes in the HNSC group, which was consistent with the RT-qPCR results. Survival analysis showed that the high expression of key genes was closely associated with a poorer overall survival (p < 0.05). Spatial transcriptomics analysis revealed that KIF2C, RFC4, and MSN were preferentially expressed in tumor core regions, indicating their involvement in the malignant microenvironment. Additionally, we discovered that 10 types of immune cell infiltrations were significantly changed in HNSC and well correlated with key genes. CONCLUSION: MSN, KIF2C and RFC4 may serve as potential biomarkers for diagnosing and predicting the prognosis of HNSC, providing new insights into targeted therapy.