DCMF-PPI: a protein-protein interaction predictor based on dynamic condition and multi-feature fusion.
Siqi Chen, Anhong Zheng, Weichi Yu, Chao Zhan
Abstract
Open AccessBACKGROUND: The identification of protein-protein interaction (PPI) plays a crucial role in understanding the mechanisms of complex biological processes. Current research in predicting PPI has shown remarkable progress by integrating protein information with PPI topology structure. Nevertheless, these approaches frequently overlook the dynamic nature of protein and PPI structures during cellular processes, including conformational alterations and variations in binding affinities under diverse environmental circumstances. Additionally, the insufficient availability of comprehensive protein data hinders accurate protein representation. Consequently, these shortcomings restrict the model's generalizability and predictive precision. RESULTS: To address this, we introduce DCMF-PPI (Dynamic condition and multi-feature fusion framework for PPI), a novel hybrid framework that integrates dynamic modeling, multi-scale feature extraction, and probabilistic graph representation learning. DCMF-PPI comprises three core modules: (1) PortT5-GAT Module: The protein language model PortT5 is utilized to extract residue-level protein features, which are integrated with dynamic temporal dependencies. Graph attention networks are then employed to capture context-aware structural variations in protein interactions; (2) MPSWA Module: Employs parallel convolutional neural networks combined with wavelet transform to extract multi-scale features from diverse protein residue types, enhancing the representation of sequence and structural heterogeneity; (3) VGAE Module: Utilizes a Variational Graph Autoencoder to learn probabilistic latent representations, facilitating dynamic modeling of PPI graph structures and capturing uncertainty in interaction dynamics. CONCLUSION: We conducted comprehensive experiments on benchmark datasets demonstrating that DCMF-PPI outperforms state-of-the-art methods in PPI prediction, achieving significant improvements in accuracy, precision, and recall. The framework's ability to fuse dynamic conditions and multi-level features highlights its effectiveness in modeling real-world biological complexities, positioning it as a robust tool for advancing PPI research and downstream applications in systems biology and drug discovery.