The BRAINTEASER Datasets: Clinical, Wearable and Environmental Data for ALS & MS Progression Modeling.
Guglielmo Faggioli, Laura Menotti, Stefano Marchesin, Isotta Trescato, Lara Ahmad, Helena Aidos, Anca Loredana Alungulese, Riccardo Bellazzi, Roberto Bergamaschi, Giovanni Birolo, Pietro Bosoni, Maria Fernanda Cabrera-Umpiérrez, Paola Cavalla, Adriano Chió, Arianna Dagliati
Abstract
Open AccessAmyotrophic lateral sclerosis (ALS) and multiple sclerosis (MS) are debilitating diseases with unpredictable progression. Artificial Intelligence-based tools for modelling disease progression could significantly improve the quality of life for patients and caregivers while supporting clinicians in delivering more personalized and timely care. However, the limited availability of data hinders the development, testing, and reproducibility of such predictive tools. To address this challenge, we curated, in the context of the H2020 BRAINTEASER project, four datasets containing clinical data from a total of 2,290 ALS patients and 723 MS patients. These datasets also include environmental data and information collected through wearable devices. Unlike most existing resources, the BRAINTEASER datasets are gathered from clinical practice, offering a more accurate representation of the data that an AI progression prediction tool would encounter in real-world scenarios. In addition to manual and automated data quality checks, the research community has validated the datasets through three editions of the intelligent Disease Progression Prediction challenges held within the Conference and Labs of the Evaluation Forum (CLEF).