Genomic prediction in a small barley population can benefit from training on related populations.
Cathrine Kiel Skovbjerg, Pernille Sarup, Ellen Margrethe Wahlström, Jens Due Jensen, Lotte Olesen, Jihad Orabi, Just Jensen, Guillaume P Ramstein, Ahmed Jahoor
Abstract
Open AccessGenomic prediction (GP) has shown to be a valuable tool for genetic improvement in breeding programs but requires large training populations in order to build robust models. This is difficult to obtain for newly established breeding programs. Here, we aimed to overcome this challenge by combining datasets from 4 different barley breeding programs, utilizing up to 12 years of data to increase prediction accuracy in a more recently established 6-rowed winter (6RW) barley breeding program. By allowing data to accumulate in a breeding program as the years progress, we investigated when GP accuracy in 6RW benefitted from external populations. To do this, we focused on several parameters: training population size, choice of model for multipopulation GP (univariate versus multivariate), the key trait under investigation (grain yield, plant height, or rust resistance), and genetic distance between populations. We found that in the early stages of a breeding program, prediction of the 6RW population could benefit from inclusion of an external population, but the advantage depended on the specific population and trait under investigation. However, when data from all 4 years were available, multipopulation GP generally performed similarly to within-population GP. Additionally, when comparing multivariate and univariate models for multipopulation GP, the multivariate model often performed significantly worse, despite strong genetic correlations between the populations involved. This was especially the case when data were sparse and the model required estimation of numerous parameters from a small number of observations. Altogether, our results suggest that multipopulation GP is beneficial only in the very early stages of new breeding programs, emphasizing its relevance for newly established breeding programs or new breeding goals, especially for related populations.