bioRxiv : the preprint server for biology
Machine Learning Analysis of the Human Initiator Region Reveals Key Features of Different Types of Core Promoters.
Torrey E Rhyne-Carrigg, Long Vo Ngoc, Claudia Medrano, Kassidy E Gillespie, James T Kadonaga
Published: 202610.1101/2025.11.21.689830
Abstract
Open AccessThe initiator (Inr) is the starting point for the transcription of many genes. Here, we generated highly predictive machine learning models of the human Inr region, and determined that the Inr is present in about 60% of natural promoters, identified a novel TATA-specific Inr, and detected the overlapping but functionally distinct TCT motif. Quantitative genome-wide analyses revealed a strict and synergistic interaction between the Inr and DPR, a duality between the TATA and DPR, a flexible and sometimes independent function of the TATA box in relation to the Inr, and different properties of the TCT motif in humans and Drosophila.