Major Update and Improved Validation Functionality in the mwtab Python Library and the Metabolomics Workbench File Status Website.
P. Travis Thompson, Hunter N. B. Moseley
Abstract
The Metabolomics Workbench (MW) is a public scientific data repository consisting of experimental data and metadata from metabolomics studies collected with mass spectrometry (MS) and nuclear magnetic resonance (NMR) analyses. MW has steadily evolved, although not as rapidly as in the past, updating its mwTab and JSON deposition text file formats and its web-based infrastructure. However, the growth of MW has been exponential since its inception in 2013 and continues to be exponential, with the number of datasets hosted on the repository increasing by 50% since April 2024. The mwtab Python package has been updated, both as part of regular maintenance to keep up with changes to the mwTab file format and as part of an earnest effort to use MW datasets in meta-analyses. Updates include better error handling for batch processing, more robust parsing that reads more files without error, and extensive improvements to the validation capabilities of the package. These updates also required our mwFileStatusWebsite to be updated and improved. We used the enhanced validation features of the mwtab package to evaluate all available datasets in MW to facilitate improved curation, FAIRness of the repository, and reuse for meta-analyses. Version 2.0.0 of the mwtab Python package is now officially released and freely available on GitHub and the Python Package Index (PyPI) under a Clear Berkeley Software Distribution (BSD) license, with documentation available on GitHub. The updated mwFileStatusWebsite is also officially in its 2.0.0 version and remains available at https://moseleybioinformaticslab.github.io/mwFileStatusWebsite/.