Machine Learning for ADHD Diagnosis: Feature Selection from Parent Reports, Self-Reports and Neuropsychological Measures.
Yun-Wei Dai, Chia-Fen Hsu
Abstract
Open AccessBackground: Attention-deficit/hyperactivity disorder (ADHD) is a heterogeneous neurodevelopmental condition that currently relies on subjective clinical judgment for diagnosis, emphasizing the need for objective, clinically applicable tools. Methods: We applied machine learning techniques to parent reports, self-reports, and performance-based measures in a sample of 255 Taiwanese children and adolescents (108 ADHD and 147 controls; mean age = 11.85 years). Models were trained under a nested cross-validation framework to avoid performance overestimation. Results: Most models achieved high classification accuracy (AUCs ≈ 0.886-0.906), while convergent feature importance across models highlighted parent-rated social problems, executive dysfunction, and self-regulation traits as robust predictors. Additionally, ex-Gaussian parameters derived from reaction time distributions on the Continuous Performance Test (CPT) proved more informative than raw scores. Conclusions: These findings support the utility of integrating multi-informant ratings and task-based measures in interpretable ML models to enhance ADHD diagnosis in clinical practice.