Coexistence of virome-encoded health-associated genes and pathogenic genes in global habitats.
Min Qian, Dong Zhu, Ke-Yu Yao, Shu-Yue Liu, Meng-Ke Li, Mao Ye, Yong-Guan Zhu
Abstract
Open AccessViral remnants constitute approximately 8% of the human genome, reflecting extensive historical gene exchange between viruses and their hosts. Some viral genomes harbor genes acquired through horizontal gene transfer that are associated with potential benefits to human health, alongside genes associated with pathogenicity. However, their global distribution, functional characteristics, and coexistence patterns remain poorly understood. Here, using the Integrated Microbial Genomes and Virome (IMG/VR v4) database, we identified 4,556 viruses carrying gene segments associated with human health across eight habitat types spanning 13 regions and 76 countries worldwide. Among viruses with identifiable hosts, those distributed in humans (478) accounted for the highest proportion. The viral genes associated with human health included BCO1 (beta-carotene oxygenase 1), bioB (biotin synthase), COQ2 (4-hydroxybenzoate polyprenyltransferase), GPX1 (glutathione peroxidase 1), GSTs (glutathione transferases), GSTT1 (glutathione S-transferase theta 1), GULO (L-gulonolactone oxidase), and menA (1,4-dihydroxy-2-naphthoate polyprenyltransferase). These genes not only associate with human health but also function as auxiliary metabolic genes in viral genomes. Notably, four pathogenic genes were found in viral sequences carrying health-associated genes, with potential for transcription and expression, indicating functional interactions. Experimental transduction of the viral bioB gene into Escherichia coli altered the expression of host pathogenic genes GCH1 (GTP cyclohydrolase IA) and UGDH (UDP-glucose 6-dehydrogenase), supporting potential cross-regulatory interactions. Overall, this study incorporates health-associated genes into viral genomics, highlighting their coexistence with pathogenic genes, and provides new insights into virus-host coevolution and potential biotechnological applications. IMPORTANCE: Viruses are the most abundant biological entities on Earth and key drivers of microbial evolution through horizontal gene transfer. While often studied for their pathogenic effects, viruses can also carry genes that influence host metabolism and health. Genes associated with human health have been identified in viral genomes, yet their global distribution, functions, and coexistence with pathogenic genes remain largely unexplored. This study integrates datasets of health-associated genes into viral genomic analyses, revealing for the first time the coexistence of viral health-associated genes with those linked to pathogenicity. This dual genetic potential is observed across diverse habitats, highlighting viruses as multifaceted reservoirs of both beneficial and harmful genes. The study findings advance understanding of viral functional diversity and open new avenues for exploring viral roles in microbial ecology, biotechnology, and human health.