Inter-machine harmonization of multicenter echocardiographic images for improvement of left ventricular ejection fraction prediction model.
Ren Iwasaki, Kenya Kusunose, Hidekazu Tanaka, Makoto Miyake, Kenji Moriuchi, Yasuharu Takeda, Hirotsugu Yamada, Akihiro Haga
Abstract
Open AccessOne of the common challenges in medical artificial intelligence (AI) applications using echocardiography is the lack of image data harmonization. This study aims to improve the prediction accuracy of left ventricular ejection fraction (LVEF) AI models by incorporating data augmentation (DA) techniques to address the variability in image data across different vendor machines. A database comprising 15,770 echocardiographic videos from 3154 patients across five different centers was utilized, with the data acquired using various vendor machines. We prepared datasets specific to GE healthcare (GE), philips (PH), and canon medical systems (CA) vendors, including 1911, 804, and 427 cohorts, respectively. A three-dimensional convolutional neural network (3D-CNN) was trained to predict LVEF, using videos consisting of 20 images per heartbeat as input, with training performed exclusively on data from GE machines (GE-based model). DA techniques, including gamma correction, scaling, median filtering, unsharp masking, translation, rotation, noise correction, and image conversion using a cycle generative adversarial network, were applied. A regression analysis using five different chamber views was performed on the GE test data. The accuracy of LVEF prediction using the GE-based model with DA-specifically gamma correction, scaling, and translation corrections-was excellent. The mean absolute error (root mean square error) in test cohorts was 4.33 (5.58) for GE, 4.42 (5.57) for PH, and 4.89 (6.57) for CA, which were comparable to the results of a model developed using data from all vendors, demonstrating the effectiveness of DA in harmonizing images. This study demonstrates that echocardiographic videos are transferable across vendors and that DA is highly effective in improving LVEF predictions from data acquired using different vendor machines.