INTRODUCTION
Genomic estimated breeding values (GEBVs) of selected individuals are frequently used to improve the genetics of economically important traits in livestock species. These values incorporate the results of evaluations using genomic data, pedigree records and the phenotypic performance of individuals using the best linear unbiased prediction (BLUP) or genomic method [
1,
2]. The accuracy of estimated breeding value (EBV) is an important factor that could influence the selection accuracy of breeding animals [
3]. Alternatively, accuracy based on genomic selection (GS) could increase the predictive ability of the GEBV by applying single nucleotide polymorphism (SNP) information. Previous studies have reported improved GEBV accuracy, genetic gain, selection accuracy and a reduction in the generation interval for economic traits [
2,
4,
5]. Calus [
6] reported that the efficiency of GS in livestock depends on the prediction accuracy of GEBVs. However, the prediction accuracy of GEBVs can be influenced by several factors, such as methods of prediction [
7], the training population (TP) size [
6], the h2 of the trait [
8] and the marker density [
9]. An effect of errors in pedigree on the accuracy of the EBV has been reported in Korean Hanwoo cattle [
10], but the use of genomic information could improve the accuracy of GEBVs.
Several authors have reported genomic prediction evaluation methods that could increase the accuracy of GEBVs. Hayes et al [
2] and VanRaden [
11] suggested the use of GBLUP, which employs genomic information in the form of a genomic relationship matrix. It also describes additive genetic covariance between individuals. GBLUP calculates direct genomic values (DGV) for genotyped individuals and has advantages over BLUP because marker density captures the Mendelian sampling across the genome. GBLUP is a straightforward procedure with low computational requirements, and has been used for genomic evaluations in cattle [
12]. In contrast, single-step GBLUP (ssGBLUP) predicts how non-genotyped individuals can benefit from genomic information. The pedigree record and marker (SNPs) relationship matrices are combined in ssGBLUP, permitting the blending of genotyped and non-genotyped individuals in the genetic evaluation [
13]. Weight (
w) has been added to the genomic relationship matrix (G) [
11], and such an adjustment may be interpreted as relative weight on the polygenic effect [
14]. To facilitate inversion, w values between 0.90 and 1.0 indicate variations in prediction accuracy [
11], but insignificant differences in EBVs have been reported when w ranges between 0.95 and 0.98 [
14]. Recent studies have reported improved accuracies of GEBVs obtained using ssGBLUP compared with those from GBLUP in simulated beef cattle [
15].
Korean Hanwoo cattle possess good meat flavour, tenderness and taste, and efforts have been made to improve the quantity and quality of the carcass [
16]. Applying GS is a potential approach to improve the genetic gains in economically important traits. However, Onogi et al [
17] pointed out that the use of ssGBLUP for genomic evaluations is still emerging in beef cattle due to the greater complexity of their records compared with those of other livestock species, such as the existence of pedigree errors (PEs) and fewer full or half-sib families [
10,
18]. On the other hand, a simulation study allows the testing of several theories, permitting an unravelling of the complex evolutionary patterns that are otherwise difficult to comprehend. For example, the history of human migration provides significant insight into the present patterns of DNA variation in humans [
19]. Simulation studies in beef cattle and other livestock have provided information on their potential for genomic evaluation. They have also been used in studies of predictions of total genetic value [
8], genomic prediction of simulated multi-breed and purebred cattle [
20], GS accuracy in simulated populations [
21] and a comparison between single- and two-step GBLUP methods in simulated beef cattle [
15]. These authors reported that GS increases the accuracy of the selection and economic benefits of the breeding objective during beef production. Therefore, the present study assessed the prediction accuracy of GEBV based on different selection methods, evaluation procedures, TP sizes, h
2 levels, marker densities and PE rates using a simulated Korean beef cattle population.
DISCUSSION
In our study, we investigated the prediction accuracy of GEBV under different selection scenarios, evaluation procedures, TP sizes, heritability levels, marker densities and PE rates in a simulated Korean beef cattle population. Phenotypes are frequently used for selecting superior individuals in a population. However, the statistical method of evaluation used is one factors that could influence the prediction accuracy of GEBV (
Table 2). With phenotypic selection, there was a higher prediction accuracy for ssGBLUP_0.95 than for GBLUP, indicating that ssGBLUP_0.95 had advantages over GBLUP, possibly due to the combination of both genotyped and non-genotyped individuals. The combination could also help genomic markers capture any QTL effect or polygenic effect through EBVs [
2,
4]. Our results agree with those of Gowane et al [
29], who obtained a higher prediction accuracy using ssGBLUP than GBLUP in a simulated population.
The accuracy of genomic prediction improved with greater numbers of individuals, as ssGBLUP showed a higher prediction accuracy than GBLUP at TP sizes of 1,000 to 3,000. However, when the TP size was 5,000, the results of both methods were comparable. The results further revealed that increasing the TP size across different scenarios improved the prediction accuracies. Several authors have reported improved accuracy for GEBVs when increasing the TP size in genotyped Holstein bulls [
4] and simulated beef cattle [
21].
The accuracy of the genomic predictions was affected by increased heritability. ssGBLUP showed a higher accuracy than GBLUP at all h
2 levels. Nwogwugwu et al [
10] stated that the higher the h
2, the better the accuracy because h
2 represents the strength of the association between the phenotype and breeding values. This implies that there is an association between h
2 and accuracy, as we observed; Kolbehdari et al [
30] reported similar results. Numerous studies have demonstrated increased accuracy with increasing h
2 values, which agrees with our study [
21,
29].
The impact of marker density on the accuracy of genomic predictions has been examined in previous study [
9]. With increases in marker density of 50K and 777K, the accuracies of the genomic evaluations did not improve. Zhu et al [
31] reported limited prediction accuracy of a genomic evaluation with an increase in marker density from 0.5K to 20K in live weight, carcass weight and average daily gain. However, an increase in the marker density had a conflicting effect on prediction accuracy due to co-linearity between the effects of the markers in a simulated population [
32]. Some authors have reported slightly improved accuracy of GEBVs with an increase in the marker density [
21]. Nevertheless, these differences in results may be due to the genetic architecture or population structure.
The use of individual EBVs has greatly aided in animal genetic improvement. Therefore, selecting individuals based on the EBV could increase the accuracy of genomic predictions (
Tables 3,
4, and
5). This study examined the prediction accuracies of GEBVs across multiple scenarios. The prediction accuracies of GBLUP and ssGBLUP_0.95 were higher when using EBV selection than when using phenotypic selection. This could be attributed to the impact of the pedigree relationship among individuals, which facilitates accurate sire selection decisions. Our results agree with Amari [
33], who previously stated that the EBV provides the most dependable information on the breeding results for a particular animal.
The performance of the ssGBLUP_0.95 method of prediction was superior to that of GBLUP in all scenarios. Therefore, combining genomic and pedigree data to predict traits improves accuracy, which leads to improved genetic gain in beef cattle breeding. However, GBLUP has been broadly utilised for genomic assessments in dairy cattle [
7]. This assumes that the GBLUP method is mainly based on the LD between markers and QTL. On the other hand, Meuwissen et al [
8] proposed that an evaluation based on a combination of models improves the accuracy of prediction compared to methods that assume all SNPs have predictive value. Three different weights were added to ssGBLUP to solve the collinearity problem between variables and the low rank of the matrix, which could make inversion of the matrix difficult or impossible. ssGBLUP_0.95 had the highest prediction accuracy compared with weights 0.90 and 0.85. The prediction accuracy of ssGBLUP_0.90 was comparable with that of GBLUP in some scenarios. Less bias and a high prediction accuracy were reported by Vitezica et al [
34] when the G matrix was adjusted with a weight factor using the ssGBLUP method. Similar observations have been reported in turkey [
26]. The present findings further indicated significant differences in prediction accuracies among the weights used in this study and revealed that a weighting factor of 0.95 could be an optimal choice for genetic improvement. A higher accuracy of GEBV with ssGBLUP has been reported in Japanese black cattle [
17], a simulated cattle population [
29] and Hanwoo beef cattle [
35], indicating that the ssGBLUP method could be effectively used to improve traits with low heritability as well as traits that are difficult to measure.
The present results indicate that the prediction accuracies of traits with a higher h
2 are more precise than those for traits with a lower h
2. This implies that the amount of additive genetic variance explained by markers is small with a low h
2, thereby reducing the prediction accuracy [
36]. The present study further investigated the effect of TP size on the prediction accuracy. The findings showed that the prediction accuracy of genomic evaluations improved as the TP size increased, suggesting that the prediction accuracy tends to increase as information from an increasing number of individuals is added. The results also indicate that the TP size is important for successful genomic prediction. Previous study has shown increased prediction accuracy with increasing TP size [
4]. As shown in
Tables 3,
4 and
5, that ssGBLUP_0.95 resulted in a higher accuracy at TP sizes of 1,000 to 2,000 individuals compared with GBLUP, and an even higher with a TP size of 3,000; however, both methods were comparable above a TP size of 4,000. Our results fully agree with those of VanRaden et al [
4], who observed that genomic gains increase almost linearly with an increase in TP size in Holstein bulls.
The effect of marker density on prediction accuracies was similar to that found for phenotypic selection. However, with an h
2 of 0.3, the prediction accuracies improved with an increase in marker density of 50K, whereas the accuracy of prediction declined with the 777K marker. Genomic predictions did not improve at 800K or in transcriptome panels over 50K in a pure-breed population [
37]. Wang et al [
38] also reported a similar result after increasing the marker density from 0.05K to 3.2K, which greatly improved the genomic prediction accuracy, but there was less improvement when the marker density increased further. The present findings reveal high accuracies with a 10K marker density and a heritability of 0.1 and a TP size of 5,000; however, a heritability of 0.5 with a TP size of 5,000 produced the highest prediction accuracies in both models. The findings of the present study differ slightly from previous studies possibly due to variations in the genetic structure, marker density, TP size and method of evaluation.
Several authors have reported the effect of PEs on the EBV, the accuracy of the EBV and the genetic gain in livestock species [
10,
39]. Their findings indicate that PEs greatly reduce the accuracy of the EBV in beef and dairy cattle. However, introducing genomic information may resolve this reduction in the accuracy of the EBV or GEBV in livestock breeding. The results shown in
Table 6 demonstrate the accuracy of predictions under PEs using a 50K marker density, an h
2 of 0.3, and the use of ssGBLUP with three different weight values (0.95, 0.90, and 0.85) across TP sizes. In this study, the prediction accuracy was only moderately influenced by different weights and TP sizes. The findings further reveal that the prediction accuracy decreased consistently as more PEs were introduced into the data. This suggests that PEs have a negative relationship with prediction accuracy. Nwogwugwu et al [
10] reported that the accuracy of the EBV decreased by 0.02 with a 40% PE from that generated with an estimate at 0% PE; however, with a 40% PE combined with genomic information, the prediction accuracy of the GEBV declined by only 0.003 from that obtained using 0% PE and ssGBLUP_0.95. This indicates that the accuracy of prediction based on PE combined with genomic information is more reliable than the accuracy of EBV. With increasing TP size, the effect of PE on prediction accuracy was lower or negligible. This implies that additional information from relatives or increasing the TP size may improve the prediction accuracy, even if the pedigree is erroneous.