Variances components, heritabilities, and correlations
Estimates of variances and covariances for MY and FY using genomic-polygenic models with five sets of SNP are shown in
Table 2 for additive genetic effects,
Table 3 for environmental effects, and in
Table 4 for phenotypic effects. Similar estimates of additive genetic, environmental, and phenotypic variances and covariances for MY and FY were obtained for the five sets of SNP. Estimates of additive genetic variances ranged from 146,440.00±20,231.00 kg
2 (SNP5) to 178,510.00±26,620.00 kg
2 (SNP50) for MY and from 201.54±60.91 kg
2 (SNP100) to 231.49±57.71 kg
2 (SNP25) for FY, and additive genetic covariances ranged from 4,041.40±813.25 kg×kg (SNP5) to 4,758.10±984.95 kg×kg (SNP25). Estimates of environmental variances ranged from 467,000.00±22,641.00 kg
2 (SNP25) to 488,910.00±19,276.00 kg
2 (SNP5) for MY and from 1,033.30 ±57.18 kg
2 (SNP25) to 1,065.90±60.37 kg
2 (SNP100) for FY, and environmental covariances ranged from 14,780.00±945.30 kg×kg (SNP25) to 15,590.00±1,046.00 kg×kg (SNP100). Estimates of phenotypic variances ranged from 635,580.00± 13,307.00 kg
2 (SNP5) to 650,520.00±13,794.00 kg
2 (SNP100) for MY and from 1,257.00±33.25 kg
2 (SNP5) to 1,267.20± 33.61 kg
2 (SNP75) for FY, and phenotypic covariances ranged from 19,286.00±566.53 kg×kg (SNP5) to 19,678±580.44 kg×kg (SNP75).
Estimates of additive genetic variances and covariances for MY and FY were higher for models with SNP75 (variances: 5.39% for MY, 5.96% for FY; covariances: 8.37%), SNP50 (variances: 9.30% for MY, 12.97% for FY; covariances: 14.83%), SNP25 (variances: 8.43% for MY, 14.86% for FY; covariances: 16.38%) than those estimated using SNP100. However, the model with SNP5 yielded lower estimates of additive genetic variance for MY (−10.34%) and covariance between MY and FY (−1.15%), and higher estimate of additive genetic variance for FY (5.37%) than the model with SNP100. Conversely, environmental variances and covariances for MY and FY were lower for the model with SNP75 (variances: −1.88% for MY, −1.08% for FY; covariance: −2.17%), SNP50 (variances: −3.53% for MY, −2.44% for FY; covariance: −4.11%), SNP25 (variances: −4.10% for MY, −3.06% for FY; covariance: −5.20%) than those estimated using SNP100. The model with SNP5 produced a higher environmental variance for MY (0.40%), but a lower environmental variance for FY (−1.92%) and covariance between MY and FY (−2.19%) than the corresponding estimates from the model with SNP100. Lastly, estimates of phenotypic variances and covariances for MY and FY from models with SNP75, SNP50, and SNP25 were nearly identical (differences were near zero or below one percent) to the corresponding values from model with SNP100, whereas the corresponding differences for the model with SNP5 were all negative and mostly higher than one percent (variances: −2.30% for MY, −0.77% for FY; covariance: −1.97%).
The slightly higher additive genetic variances and covari ances but lower environmental variances and covariances for MY and FY obtained with SNP75, SNP50, and SNP25 than with the complete SNP set indicated that these SNP subsets may have been able to more accurately accounted for MY and FY additive variability in this population than the complete SNP set. This may have occurred because the SNP markers in these subsets were on the average more closely associated with quantitative trait locus (QTL) affecting MY and FY than the complete set of SNP [
18].
Heritabilities and additive genetic, environmental, and phenotypic correlations between MY and FY are presented in
Table 5. Heritability estimates ranged from 0.231±0.030 (SNP5) to 0.276±0.039 (SNP25) for MY and from 0.159±0.047 (SNP100) to 0.183±0.044 (SNP25) for FY. The SNP25 MY and FY heritability estimates were slightly higher (0.4% to 20%) than those from SNP100, SNP75, SNP50, and SNP5. These differences among heritability estimates across SNP sets may be related to differences in linkage disequilibria between the SNP in each set and QTL affecting MY and FY determined by number of SNP and proximity of SNP in each set to MY and FY QTL [
18]. Higher SNP25 heritability estimates for MY and FY indicated that faster selection responses for these traits could be expected with SNP25 than with SNP100, SNP75, SNP50, and SNP5 in the Thai dairy population.
Heritability estimates for MY and FY across the five SNP sets were similar to values estimated in previous studies in the Thai dairy population using various SNP sets (0.19 to 0.26 for MY; 0.15 to 0.18 for FY [
4]). Heritabilities for MY in the Thai dairy population were within the range of heritability estimated for MY in various Holstein populations in temperate regions (0.25 to 0.30 [
6,
19,
20]), but somewhat higher than an estimate in Holstein under tropical conditions in Brazil (0.13 [
21]). Conversely, heritability estimates for FY were somewhat lower than values obtained for Holstein in temperate regions (0.25 to 0.30 [
19,
20,
22])
Estimates of additive genetic, environmental and pheno typic correlations between MY and FY across the five sets of SNP were virtually identical (
Table 5). Correlation estimates between MY and FY ranged from 0.718±0.115 (SNP100) to 0.748±0.087 (SNP25) for additive genetic, from 0.673±0.023 (SNP25) to 0.684±0.023 (SNP100) for environmental, and from 0.682±0.009 (SNP5) to 0.686±0.010 (SNP75) for phenotypic. The positive genetic correlations between MY and FY obtained here were similar to values previously reported for this Thai dairy population (0.66 to 0.79 [
4]), and in agreement with estimates for Holstein in other tropical (0.70 to 0.75 [
23,
24]), and in temperate regions (0.70 to 0.88 [
25,
26]).
The comparable or slightly higher additive genetic vari ances and heritabilities for MY and FY from SNP25, SNP50, and SNP75 than from SNP100 indicated that selecting a subset of SNP genotypes with the approach used here would be a reasonable alternative to increase the effectiveness of genomic-polygenic evaluation and selection in the Thai dairy population. However, reducing SNP genotypes to 3,826 SNP (SNP5) or 5% of the SNP in the GGP80K chip would yield lower variance component and heritability estimates than with the SNP25, SNP50, and SNP75 subsets, or with the complete SNP set. Further, the highest genetic variance component and heritability estimates for MY and FY obtained with SNP25 indicated that higher EBV prediction accuracies and selection responses for these traits would be achieved using a genomic-polygenic model with SNP25 than with SNP100, SNP75, SNP50, and SNP5.
Accuracy of genomic-polygenic estimated breeding values and animal rankings with five single nucleotide polymorphism sets
The accuracies genomic-polygenic EBV for MY and FY with the five sets of SNP genotypes (SNP100, SNP75, SNP50, SNP25, and SNP5) are shown in
Figure 1. The SNP25 had the highest mean EBV accuracy for all animals (39.76% for MY and 33.82% for FY), sires (37.30% for MY and 31.85% for FY), and cows (39.98% for MY and 33.99% for FY). Conversely, SNP100 yielded the lowest mean EBV accuracies for all animals (35.18% for MY and 28.36% for FY), sires (33.12% for MY and 26.94% for FY), and cows (35.36% for MY and 28.49% for FY). Further, the mean EBV accuracies from the four SNP subsets (SNP75, SNP50, SNP25, SNP5) were mostly higher than the mean EBV accuracy from the complete SNP set (SNP100) for all animals, sires, and cows. The percentage superiority of the mean EBV accuracies of SNP75, SNP50, SNP25, and SNP5 over SNP100 for MY were 0.58%, 5.21%, 5.34%, and 4.18% for all animals, 0.54%, 4.92%, 5.01%, and 3.76% for sires, and 0.58%, 5.24%, 5.36%, and 4.21% for cows. Similarly, the percentage superiority of the mean EBV accuracies of SNP75, SNP50, and SNP25 over SNP100 for FY were 0.98%, 1.77%, and 2.21% for all animals, 0.81%, 1.48%, and 1.84% for sires, and 0.99%, 1.80%, and 2.34% for cows. However, the mean EBV accuracies of SNP5 for FY were slightly lower (−1.91% for all animals, −2.18% for sires, and −1.89% for cows) than those of SNP100. The mostly higher mean EBV accuracies of the four SNP subsets were largely due to the higher MY and FY additive genetic variances explained by these SNP subsets than by the complete SNP set. Further, the fact that SNP25 yielded the highest mean EBV accuracy indicated that choosing the top 25% of SNP from GeneSeek GGP80K based on percent of additive genetic variance explained for MY and FY (19,130 SNP) would be a suitable alternative to the complete SNP set for genomic-polygenic evaluation and selection in the Thai dairy multibreed population.
The higher mean EBV accuracies obtained with four GG P80K subsets than with the complete SNP set supported the findings from previous research in dairy [
5,
27–
29] and in beef cattle [
30] that SNP subsets can yield comparable or higher levels of EBV accuracy than complete SNP sets while lowering genotyping costs.
Pairwise Spearman rank correlations between MY and FY EBV from of the complete SNP set and each of the four SNP subsets are shown in
Table 6. All rank correlations between SNP100 and the four SNP subsets were above 0.98 (p<0.0001) for both traits, except for the correlation between SNP100 and SNP5 (MY, 0.93; FY, 0.92; p<0.0001). Rank correlations between SNP75 and SNP100 and between SNP50 and SNP100 for MY and FY were above 0.99 (p<0.0001), followed closely by rank correlations between SNP25 and SNP100 for MY (0.98; p<0.0001). Rank correlations indicated a high degree of agreement between EBV from genomic-polygenic evaluations with the four SNP subsets and the complete SNP set. The high SNP25 estimates of genetic variances, heritabilities, EBV accuracies, and rank correlations between SNP100 and SNP25 for MY and FY indicated that SNP25 would be expected to produce higher selection responses for MY and FY than any of the other SNP subsets and the complete GeneSeek 80K set. This indicates that a strategy to keep genotyping costs reasonably low while speeding up genetic progress for MY and FY would be to genotype animals in the Thai multibreed dairy population with a dedicated chip constructed with the subset of SNP markers in the SNP25 set. Thai dairy producers could decrease their genotyping costs before the utilization of a dedicated chip likely without reducing their ability to select replacement animals based on genomic-polygenic EBV by utilizing lower-density commercial genotyping chips.