Comprehensive analytical and empirical evaluation of genomic prediction across diverse accessions in maize
Assigned to Associate Editor Rajeev Varshney.
Abstract
Efficiently exploiting natural genetic diversity captured by accessions stored in genebanks is crucial to genetic improvement of major crops. Selecting accessions of interest from genebanks has traditionally required information from extensive and expensive evaluation; however, low-cost genotyping combined with genomic prediction have enabled us to generate predicted genetic merits for the entire set with targeted phenotypic evaluation of representative subsets. To explore this general approach, analytical assessment and empirical validation of the maize (Zea mays L.) association population (MAP) as a training population were conducted in the present study. Cross-validation within the MAP revealed mostly modest to strong predictive ability for 36 traits, generally in parallel with the square root of heritability. The MAP was then used to train the prediction models to generate genomic estimated breeding values (GEBVs) for the Ames Diversity Panel. Empirical validation conducted for nine traits across two validation populations confirmed the accuracy level indicated by the cross-validation of the training population. An upper bound for reliability (U value) was calculated for the accessions in the prediction population using genotypic data. The group of accessions with high U values generally had high predictive ability, even though the range of observed trait values was similar to the group of accessions with low U values. Our comprehensive analysis validated the general approach of turbocharging genebanks with genomics and genomic prediction. In addition, breeders and researchers can consider both GEBVs and U values to balance the needs of improving specific traits and broadening genetic diversity when selecting accessions from genebanks.
Abbreviations
-
- AmesDP
-
- Ames Diversity Panel
-
- CV
-
- cross-validation
-
- EH
-
- EarHeight
-
- GBS
-
- genotyping by sequencing
-
- GEBV
-
- genomic estimated breeding value
-
- MAP
-
- Maize Association Population
-
- MLA
-
- MiddleLeafAngle
-
- PH
-
- PlantHeight
-
- SNP
-
- single nucleotide polymorphism
-
- ULA
-
- UpperLeafAngle.
1 INTRODUCTION
Modern plant breeding is like a double-edged sword: intensive selection coupled with modified management practices produces high-yielding cultivars but at the same time narrows the genetic bases of the elite breeding germplasm pools (Tanksley & McCouch, 1997). This reduction of genetic diversity in our major crops increases their vulnerability to the ever-changing climatic conditions and rapid evolution of diseases and insects. Private and public breeders have recognized the need to expand our elite germplasm's genetic diversity to safeguard against these future challenges for some time (Goodman, 2005; Mikel & Dudley, 2006), but we have only scratched the surface in integrating available diversity into our elite germplasm (Fox et al., 2015; Tanksley et al., 1996; Xiao et al., 1998). Genebank curators have excelled in conserving the rich natural genetic diversity by preserving millions of plant accessions in genebanks across the world (FAO, 2010). Programs such as the Latin American Maize Project (Salhuana et al., 1997) and Germplasm Enhancement of Maize (Pollak, 2001) pre-breed diverse genebank accessions to increase the genetic diversity of maize (Zea mays L.) germplasm available for breeders. Using any genetic materials from genebanks requires efficient determination of the accessions worth evaluating, which depends on a breeding or research program's objectives. Accomplishing this task begins with carefully curated passport, genotypic, and phenotypic data for the millions of genebank accessions (McCouch et al., 2012), yet obtaining this information has proven to be both expensive and time consuming (Houle et al., 2010).
Advances in genomic technologies, such as genotyping by sequencing (GBS) (Elshire et al., 2011), have helped overcome one of these obstacles by providing cheap and abundant single nucleotide polymorphisms (SNPs). Recently, the application of this technology in genebanks has provided large-scale genetic characterization of these collections (Crossa et al., 2016; Romay et al., 2013; Yu et al., 2016). Even with advancements in high-through phenotyping (e.g., Crain et al., 2018; Pugh et al., 2017; Salas Fernandez et al., 2017; Wang et al., 2021), large-scale phenotyping of genebank collections for important traits is still challenging. Instead, researchers have focused on extensive phenotypic characterization of manageable representative subsets of large genebank collections, commonly called “association panels” (e.g., Flint-Garcia et al., 2005), “mini-core collections” (e.g., Upadhyaya et al., 2009), or “diversity panels” (e.g., Zhao et al., 2011).
Genomic prediction (Bernardo & Yu, 2007; Crossa et al., 2017; Heffner et al., 2009; Meuwissen et al., 2001) exploits low-cost genotypic data to bridge the gap between phenotypic data collected from the training set to the unphenotyped population of potential selection candidates. When the genotypically and phenotypically characterized association panels are used to train a prediction model and generate genomic estimated breeding values (GEBVs) for the untested but genotyped genebank collections, this approach is termed “turbocharging genebanks” (Yu et al., 2016). Research to uncover the hidden potential in genebanks across the world has been reported in sorghum [Sorghum bicolor (L.) Moench] (Yu et al., 2016), soybean [Glycine max (L.) Merr.] (Jarquin et al., 2016), wheat (Triticum aestivum L.) (Crossa et al., 2016), and cauliflower (Brassica oleracea var. botrytis) (Thorwarth et al., 2018). Although genomic prediction involving the diverse maize accessions has been conducted through cross-validation (CV) for plant height and flowering time (Peiffer et al., 2013, 2014) and seedling root length (Pace et al., 2015), a large-scale genomic prediction analysis of the Ames Diversity Panel (AmesDP) with an extensively phenotyped association panel has not yet been conducted.
Core Ideas
- Genetic diversity captured in accessions stored in genebanks is critical to crop improvement.
- Genomics and genomic prediction can be leveraged for efficient mining of genebank germplasm.
- Maize Association Population and Ames Diversity Panel were used to assess predictive ability.
- Empirical validation confirmed the moderate to high predictive ability from cross-validation.
- Selection of accessions can consider both predicted trait values and reliability values.
The AmesDP is a collection of 2,537 unique accessions stored at the USDA-ARS North Central Regional Plant Introduction Station in Ames, IA (Romay et al., 2013). Within this collection are maize landraces developed by farmers and inbreds from public breeding programs from around the world. Additionally, it contains recently expired plant variety protection inbreds registered by private companies. The Maize Association Population (MAP, or Goodman association panel) is a collection of diverse inbreds from public breeding programs from around the world (Flint-Garcia et al., 2005). The accessions from the AmesDP and MAP were genotyped using GBS, and the MAP was found to represent 75% of the total allelic diversity of the AmesDP (Romay et al., 2013). Both panels also have extensive phenotypic data available because the MAP was previously phenotyped for numerous traits across a wide range of environments, and the AmesDP was phenotyped for seven traits related to height and flowering time (Hung et al., 2012; Peiffer et al., 2013, 2014; Romay et al., 2013). Together, these datasets provide an excellent opportunity to explore the prediction potential of the MAP across a range of traits and to predict and validate GEBVs for the AmesDP.
In this study, we first filtered and imputed previously obtained SNPs for the MAP and the AmesDP. The MAP was used to train the prediction model and generate GEBVs for the AmesDP across 36 traits (Figure 1a,b). In addition to conducting CV, we conducted empirical validation of predictive ability for four traits using 292 randomly selected accessions from the AmesDP (Figure 1c) and for seven traits previously phenotyped (Figure 1b) for a 2,043-accession set. Finally, we assessed the effect of genetic relationship on predictive ability by examining the upper bound for reliability (U value) for nine traits to gain insights into how to better leverage genomics and genomic prediction for trait improvement and genetic diversity exploration.

2 MATERIALS AND METHODS
2.1 Genetic materials
Accessions from the MAP (Flint-Garcia et al., 2005) and AmesDP (Romay et al., 2013) were used in the present study. The 281 MAP accessions constituted the training population, and the 2,537 AmesDP accessions, not overlapping with the MAP, constituted the prediction population (Figure 1a; Supplemental Table S1). Together, these two sets of accessions formed the 2,818-accession reference population (Figure 2). From the prediction population, two empirical validation populations were used: (a) 292 randomly selected accessions as the small empirical validation population and (b) 2,043 previously phenotyped accessions (Peiffer et al., 2013, 2014; Romay et al., 2013) as the large empirical validation population (Supplemental Table S1).

2.2 Genotypic data and imputation
The ZeaGBSv27 (Glaubitz et al., 2014; panzea.org) SNP set was previously generated by GBS and contains 954,882 SNPs (AGPv3). A detailed workflow of the scripts and options chosen was used to process the genotypic data. Briefly, the raw SNP file was converted to variant call format using the command line options in TASSEL (Bradbury et al., 2007). An in-house python script was used to combine samples that were genotyped multiple times (Supplemental Table S1) and to split the reference population into the 281-accesion training population and 2,537-accession prediction population.
Different filtering conditions were applied to both populations, and the overlapping SNPs were extracted from each population. The program ‘VCFtools v0.1.15’ (VCFtools) (Danecek et al., 2011) was used to split the two variant call format files containing the training and prediction genotypes into 10 maize chromosomes. Filtering in the training population excluded indels and SNPs that were monomorphic, had >80% missing data, and had a minor allele frequency ≤1%. For the prediction population, similar filtering criteria were used, except we did not filter for minor allele frequency. After filtering, the remaining 344,477 SNPs were imputed for missing data using ‘Beagle, v4.1’ with program defaults (Browning & Browning, 2007). The SNPs that overlapped between the two populations were extracted via VCFtools. The training and prediction populations were indexed using ‘tabix, v1.6’ (Li, 2011) and then merged with VCFtools.
2.3 Genetic diversity
Population structure of the reference population was estimated using principal component analysis and ‘ADMIXTURE, v1.3’ (ADMIXTURE) (Alexander et al., 2009). The 344,477 filtered SNPs were used in a principal component analysis via ‘plink v1.9’ (Chang et al., 2015; Purcell et al., 2007) to assist in identifying the number of subpopulations. To prepare the filtered SNPs for ADMIXTURE, ‘plink v1.9’ was used to prune SNPs in linkage disequilibrium with the following options: 50-SNP window, a step size of 10, and a r2 threshold of .20. The remaining 103,408 SNPs were used in a CV ADMIXTURE procedure (k = 1…10). Results from the CV, principal component analysis, and previous subpopulation classifications (Romay et al., 2013) were considered together to classify five subpopulations (Figure 2). Individual with membership coefficients ≥70% were assigned to the respective subpopulations, and the rest were assigned to a mixed group.
2.4 Phenotypes for training and empirical validation populations
The 292-accession small empirical validation population was phenotyped for “PlantHeight” (PH), “EarHeight” (EH), “UpperLeafAngle” (ULA), and ”MiddleLeafAngle” (MLA) (Figure 1c). Leaf angle measurements were taken from digital images (Dzievit et al., 2018) by measuring the angle between horizontal and the middle of the midrib. The trait ULA (°) is defined as the leaf immediately below the upper most leaf, or flag leaf, and MLA (°) is defined as the second leaf below the ear leaf. The trait PH (cm) was measured as the distance between the soil surface and base of the flag leaf, and EH (cm) was measured as the distance between the soil surface and the ear-bearing node of the uppermost ear. The small empirical population was planted in Boone, IA, in 2015, 2016, and 2017, with a single replication. Genotypic values for the 292-accession small empirical validation population and the combined training and empirical validation population were determined using the previously described method.
Trait data for the large 2,043-accession empirical validation population were obtained from panzea.org (accessed Sept. 2018) for seven traits: PH; EH; “DaysToSilk”; “GDDDaysToSilk,” where GDD is growing degree days; “DaysToTassel”; “GDDDaysToTassel”; and “GDDAnthesis-SilkingInterval” (Peiffer et al., 2013, 2014; Romay et al., 2013). In total, nine traits were validated across the two empirical validation populations, with PH and EH overlapping between the two.
2.5 Genomic prediction and upper bound for reliability
The GEBVs for 36 traits were generated for each accession in the prediction population. The R package ‘rr-BLUP v4.6’ (Endelman, 2011) was used to implement a ridge-regression best linear unbiased prediction (RR-BLUP) model (Endelman, 2011; Whittaker et al., 2000) using the 344,477 SNPs and phenotypic data for the MAP. The resulting model and genotypic data of the prediction population were combined to generate GEBVs for 36 traits. A reverse prediction was assessed following the same procedure, where data from the 292-accession small empirical validation population were used to estimate marker effects and generate GEBVs for ULA, MLA, PH, and EH for the 281-accession training population.
A k-fold CV method was applied to the MAP to evaluate predictive ability for all 36 traits. The Pearson correlation (r) of GEBVs and observed values were used to estimate predictive ability. Three fold sizes (k) were tested (k = 2, 5, and 10), and predictive ability was averaged across 50 iterations for each trait and k-fold combination. A single iteration split the dataset into approximately equal size k-folds, where one fold was to be predicted and the remaining folds were the training population for the prediction model. This process was repeated until every fold was predicted, and the correlation between observed values and predictions for that iteration was determined. The means of all 50 iterations were reported.
The U value (Karaman et al., 2016) was calculated for all accessions in the prediction population using an in-house R script and the training and prediction population's genotypic data. The U value was obtained as , where , M is a matrix of training set genotypes, and v is a vector of validation genotypes (Karaman et al., 2016). This assesses the representation of the prediction population within all linear combinations of the genotypes in the training population. The U values were ranked from highest to lowest. To compare predictive ability within each group, the small empirical validation population was separated into three groups (Top50, Middle, and Bottom50), and the large empirical validation population was separated into three groups (Top400, Middle, and Bottom400).
3 RESULTS
3.1 Genetic relationships from markers
The genotypic data for the 281-accesion training population and the 2,537-accession prediction population were imputed and filtered. The training and prediction populations were filtered first and then combined to obtain the set of 344,477 common SNPs that were used for imputation. Population structure was determined using multiple methods, including ancestry estimation, principal component analysis, and previous research. From the 344,477 common SNPs in the reference population, 30% (103,408) were retained after removing other SNPs in linkage disequilibrium. Examining findings from CV error analysis (Supplemental Figure S1), principal component analysis (Figure 2), and results from an earlier study (Romay et al., 2013), we determined five subpopulations: stiff stalk, non-stiff stalk, tropical, popcorn, and sweetcorn. A membership cutoff value of 70% resulted in 57% of the accessions in the reference population assigned to a single subpopulation; the rest were assigned to a mixed group (Supplemental Table S1).
The SNP data for the training and prediction populations were used to calculate a U value (Karaman et al., 2016) for each individual within the prediction population (Supplemental Table S1). The U value is a numerical estimator of the individual's genetic representation within the training population. The obtained U values ranged from 0.507 to 0.991, with an average value of 0.653 and a median value of 0.608 (Supplemental Table S2). Within subpopulations, average U value estimates ranged from 0.587 in the tropical population to 0.824 in the stiff stalk population. Distributions across and within each subpopulation were mostly right-skewed, reflecting the overall compositions of the prediction population and the training population (Supplemental Figure S2).
3.2 Heritability estimates
Broad-sense heritability and genotypic values were calculated for 36 traits in the MAP (Supplemental Table S3). Harmonic means for each trait were estimated to indicate the number of environments in which traits were phenotyped and ranged from 1.81 for “Spikelets-PrimaryBranch” to 9.07 for PH (Supplemental Table S4). Using these estimates and variance component estimates, we calculated broad-sense heritability on an entry-mean basis for the 36 traits that ranged from 0.18 for “TilleringIndex” to 0.97 for “DaysToTassel” (Supplemental Table S4).
For the 292-accession small validation population, phenotypic distributions for all four traits (ULA, MLA, PH, and EH) were normally distributed (Supplemental Figure S3). Pearson correlations between years ranged from 0.70 to 0.78 (Supplemental Figure S4). Heritability on an entry-mean basis ranged from 0.88 for ULA to 0.91 for PH (Supplemental Table S4). For the large empirical validation population, heritability ranged from 0.86 to 0.92. To the best of our knowledge, previous studies only reported heritability values for PH, EH, “DaysToTassel,” and “DaystoAnthesis” (Peiffer et al., 2013, 2014).
When we combined the small empirical validation population with the training population MAP, harmonic means in the 573-accession became 2.65 for MLA to 4.42 for PH (Supplemental Table S4). Heritability on an entry-mean basis was calculated again for these four traits and ranged from 0.83 for MLA to 0.91 for EH and PH (Supplemental Table S4).
3.3 Cross-validation of training data
Cross-validation with three fold sizes (2, 5, and 10) was used to assess the MAP's predictive ability across 36 traits. As the size of the training population increased, predictive ability increased, whereas its standard deviation decreased for all traits (Supplemental Figure 5; Supplemental Table S4). In the 10-fold CV, average predictive ability across the 50 iterations ranged from 0.08 for “StandCount” to 0.88 for “GDDDaystoTassel” (Figure 3). Overall, we observed that predictive ability is significantly correlated (r = .79; p < .01) (Supplemental Figures S6 and S7) with the square root of heritability.

To further assess the impact of sample size on predictive ability and data quality obtained for a small validation population, we conducted additional CV analysis with data from the small empirical population alone and the combined population of MAP and four traits (ULA, MLA, PH, and EH) within the small empirical population. The combined population had the highest average predictive ability (Supplemental Figure S8) and the lowest SD (Supplemental Table S4). With the 10-fold CV, average predictive ability for the small empirical population was higher for PH, lower for ULA and MLA, and similar for EH when compared with that of the MAP (Supplemental Figure S8).
3.4 Validating predictions and relationship with upper bound for reliability
With the MAP training population, predictions were made for the 2,537-accession prediction population (Supplemental Table S5). The experiment using a 292-accession small empirical validation panel was conducted to empirically validate predictions for ULA, MLA, PH, and EH. The predictive ability for the four traits measured in the empirical validation experiment was generally equal to or lower than the average predictive ability observed from the 10-fold CV of the MAP, the small validation population, and combined populations (Supplemental Figure S8). The predictive ability was moderate and mostly consistent for all four traits: 0.56 for ULA, 0.40 for MLA, 0.57 for PH, and 0.65 for EH (Figure 4). On the other hand, within-subpopulation predictive ability varied for all four traits. For ULA, it ranged from 0.13 for popcorn to 0.78 for stiff stalk; for MLA, from −0.66 for sweet corn to 0.45 for popcorn; for PH, from 0.30 for stiff stalk to 0.81 for popcorn; and for EH, from 0.19 for sweet corn to 0.72 for popcorn.

As an extra investigation, the 292-accession small empirical validation population was treated as the training population to generate GEBVs for the 281-accession MAP for ULA, MLA, PH, and EH to assess its predictive ability. Predictive ability for this reverse prediction was generally similar to the forward prediction (Supplemental Figure S9), except we observed a higher predictive ability for ULA (0.67). The number of individuals within a subpopulation was different in this training population versus the MAP, and within-subpopulation predictive ability varied as well.
For the 2,043-accession large prediction population, we compared the GEBVs generated from the MAP training population with the observed trait values reported in two previous studies (Peiffer et al., 2013, 2014) for seven traits: PH, EH, “DaystoSilk”; “GDDDaystoSilk,” “DaysToTassel,” “GDDDaystoTassel,” and “GDDAnthesis-SilkingInterval.” Predictive ability for these seven traits ranged from 0.04 for “GDDAnthesis-SilkingInterval,” which is a derived trait, to 0.86 for “GDDDaystoTassel” (Supplemental Figure S10). Within-subpopulation predictions also varied widely for these seven traits (Supplemental Figure S10). Across all traits except “GDDAnthesis-SilkingInterval,” the mixed subpopulation had the highest predictive ability. Excluding the mixed subpopulation, predictive ability within the non-stiff stalk population tended to be the highest across most of the seven traits, whereas the popcorn subpopulation tended to have the lowest predictive ability.
Using only the genotypic data, U value was calculated for the 2,537-accession prediction population. Most of the predicted values centering on the trait mean had lower U values, whereas those with extreme (lowest and highest) GEBVs had higher U values (Supplemental Figure S11). This formed a distinct “U” shape that was consistent across all 36 traits, indicating that for accessions with low U values and less relation to the training population, the model generated GEBVs close to the trait mean. In addition, accessions classified into different subpopulations had their unique positions on the plot of U value versus GEBV, which was a result of the genetic composition of the training population, observed trait values of the training population, and genetic composition of the prediction population. For the 292-accession empirical validation population, accessions were ranked by U value from highest to lowest and classified into three groups (Top50, Middle, and Bottom50) to compare predictive ability among the different groups. Accessions from the mixed and tropical subpopulation composed most of the accessions in the Bottom50 (Figure 5). Across all four traits (ULA, MLA, PH, and EH), the top 50 group had a higher predictive ability than the overall, and the Bottom50 group was lower. Predictive ability for accessions in the Middle group was similar to the overall predictive ability. For the 2,043-accession large empirical validation set, the difference in predictive ability between the Top400 group and Bottom400 group was observed for four of the seven traits (Supplemental Figure 12).

4 DISCUSSION
Expanding the diversity of our elite breeding germplasm will rely heavily on effective mining of useful alleles from accessions warehoused in genebanks across the world. Most genebanks have focused primarily on conserving accessions rather than genotypic and phenotypic characterization and effective usage of this material. Fortunately, high-throughput sequencing and genotyping combined with increased investments in extensive phenotyping of representative subsets provide an opportunity for leveraging genomic prediction to exploit the genetic diversity housed in genebanks (Mascher et al., 2019; McCouch et al., 2012; Yu et al., 2016, 2020). It is desirable to keep generating additional information to both demonstrate the utility of this approach and identify areas to improve.
In maize, extensive phenotyping of the MAP for numerous types of traits across multiple environments (Hung et al., 2012) and investments in dense genotypic data across the AmesDP (Romay et al., 2013) provided such an opportunity. The 36 traits investigated in this study include different components of maize development, such as plant architecture, flowering, tassel, ear, and kernel. Across all traits, predictive ability increased with training population size, consistent with previous findings (Crossa et al., 2016; Muleta et al., 2017; Yu et al., 2016). Furthermore, predictive ability was strongly correlated with the square root of heritability, highlighting the need for increasing phenotyping accuracies through improved experimental designs and balancing traditional versus precision-phenotyping methods (Cobb et al., 2013). For example, high-throughput phenotyping methods using unoccupied aerial vehicles (e.g., Crain et al., 2017; Pugh et al., 2017; Wang et al., 2021) could be deployed for quick and highly accurate phenotyping for traits like plant height with a single replication across locations, whereas traits like kernel volume may benefit from improved ear and kernel sampling and phenotyping methods to increase heritability and therefore predictive ability.
Cross-validation is a resampling procedure used to evaluate a training population's predictive ability for a given trait. Validating predictions with empirical experiments would provide additional support to help convince breeders or researchers to leverage predicted trait values in selecting material from genebanks. Accordingly, we conducted experiments to measure four traits (ULA, MLA, PH, and EH) across 292 accessions from the AmesDP, and the results are encouraging. Predictive ability obtained for empirical validation was generally lower than that from 10-fold cross-validation of the training population. Although phenotyping of the training population occurred across multiple geographical locations across the United States (Hung et al., 2012), the small empirical validation population was only evaluated in a single geographic location across 3 years. In addition, future research can consider incorporating environmental dimension to generate a series of specific prediction values for the environments used in validating experiments (Li et al., 2021).
Additionally, we investigated predictive ability with previously generated phenotypic data for seven traits (Peiffer et al., 2013, 2014; Romay et al., 2013), two of which overlap with our collected data (PH and EH). These seven traits were evaluated across multiple environments and include a larger number of the accessions. Predictive ability for these seven traits was close to the average predictive ability observed in the 10-fold cross validation of the training population. Together, these results support the predictions made with the MAP as a training population for the AmesDP.
It is well established that genetic relationship between training and prediction population affects predictive ability (Crossa et al., 2017). In this study, we investigated how U values affect predictive ability. Across nine traits, predictive ability for the group of accessions with high U values was generally higher than for the group of accessions with low U values. Our findings are consistent with the original concept and simulations (Karaman et al., 2016) and empirical studies (Yu et al., 2016, 2020). At the same time, the genetic composition of the training population and the prediction population affected both the GEBVs and the U values. These results suggest that designing training populations to optimally increase U values of the prediction population would improve predictive ability and that visualizing the genetic composition of the prediction population together with GEBVs and U values is helpful to understand the process and the outcome. For example, tropical accessions and sweetcorn accessions had more extreme observed and predicted trait values as a group for several traits directly related to maturity or influenced by maturity.
Phenotypically evaluating numerous traits for all accessions in a genebank is prohibitively expensive and time consuming, particularly considering geographical adaptation zones and changing biotic and abiotic conditions. In addition to GEBVs, U values can help select accession from genebanks. The U value is derived directly from genome-wide marker data to quantify the estimability of genomic make-up of a selection candidate from genomic data of the training set (Karaman et al., 2016). It was clear that certain maize subpopulations had higher representation in the training population, leading to generally higher U values for selection accessions from those subpopulations. By checking both predictions and U values, breeders and researchers can have more confidence in the predicted trait values for a given accession with high U values, particularly if the focus is to simply select accessions with high trait values. On the other hand, if expanding diversity beyond what is captured in the training population is needed, some accessions with low U values but modest prediction values may also be selected. This is to capture diversity that is not well represented in the training population as a step-wise expansion strategy (Yu et al., 2020).
For many crop species in a genebank, it may be desirable to establish multiple training populations with different genetic compositions so that they can be evaluated at different sets of environments and adaptation zones. Besides germplasm information, different methods in training set design (Akdemir et al., 2015; Guo et al., 2019; He et al., 2016; Rincent et al., 2012, 2017) can be applied to the genomic information obtained for the collections (Mascher et al., 2019; McCouch et al., 2012; Yu et al., 2016) to optimize the process of establishing the initial training populations. Then, plots of reliability versus prediction for different traits should be investigated together with the diversity analysis to determine the validation set design. The whole process should follow the well-established understanding of performance evaluation, genetic diversity, and environmental heterogeneity as well as the concept of target population of genotypes and target population environments. Similar to many other genomic-assisted breeding research (Varshney et al., 2021), the emphasis of design will be critical, and coordinated research efforts toward turbocharging genebanks are needed to further improve the overall system to realize the potential of the genetic treasures.
ACKNOWLEDGMENTS
This work was supported by the Agriculture and Food Research Initiative competitive grant (2017-67007-25942) from the USDA National Institute of Food and Agriculture, the Iowa State University Raymond F. Baker Center for Plant Breeding, the DuPont Pioneer Graduate Assistantship, and the Iowa State University Plant Sciences Institute.
AUTHOR CONTRIBUTIONS
Matthew J. Dzievit: Conceptualization; Data curation; Formal analysis; Investigation; Methodology; Visualization; Writing-original draft; and Writing-review & editing. Tingting Guo: Methodology; Resources; and Writing-review & editing. Xianran Li: Methodology; Resources; and Writing-review & editing. JY: Conceptualization; Funding acquisition; Methodology; Project administration; and Writing-review & editing.
CONFLICT OF INTEREST
The authors declare no conflict of interest.
Open Research
DATA AVAILABILITY STATEMENT
Data and code used in this study is uploaded in an online public repository: https://github.com/mdzievit/Genomic_Prediction