Evaluation of Single Nucleotide Polymorphisms (SNPs) Genotyped by the Illumina Bovine SNP50K in Cattle Focusing on Hanwoo Breed
Article information
Abstract
In the present study, we evaluated the informativeness of SNPs genotyped by the Illumina Bovine SNP50K assay in different cattle breeds. To investigate these on a genome-wide scale, we considered 52,678 SNPs spanning the whole autosomal and X chromosomes in cattle. Our study samples consists of six different cattle breeds. Across the breeds approximately 72 and 6% SNPs were found polymorphic and fixed or close to fix in all the breeds, respectively. The variations in the average minor allele frequency (MAF) were significantly different between the breeds studied. The level of average MAF observed in Hanwoo was significantly lower than the other breeds. Hanwoo breed also displayed the lowest number of polymorphic SNPs across all the chromosomes. More importantly, this study indicated that the Bovine SNP50K assay will have reduced power for genome-wide association studies in Hanwoo as compared to other cattle breeds. Overall, the Bovine SNP50K assay described in this study offer a useful genotyping platform for mapping quantitative trait loci (QTLs) in the cattle breeds. The assay data represent a vast and generally untapped resource to assist the investigation of the complex production traits and the development of marker-assisted selection programs.
INTRODUCTION
The application of high density throughput genotyping array opens an opportunity for the discovery of genetic variants, known as SNPs. SNPs are nucleotide variations in the DNA sequence of individuals in a population and constitute the most abundant molecular markers in the genome. There are several millions of SNPs embedded in the genome of domestic livestock (Bovine Genome Sequencing and Analysis Consortium, 2009; Zimin et al., 2009). However, only a very small subset of these have been developed into genetic markers for population genetics, genomic selection, linkage disequilibrium pattern and genome-wide association studies.
SNPs are very important due to their quantity and their association to QTLs or in reshaping the proteins produced by QTLs that determine cellular functions (Falconer and Mackay, 1997; Graur and Li, 2000; Micklos et al., 2003). With a close proximity to QTLs, SNPs are known either as a single or possibly as multiple in combination to reflect a QTL (Graur and Li, 2000; Micklos et al., 2003). Identification of SNP markers linked to QTLs, and implementing marker assisted selection become a contemporary cost-effective breeding programme. The prediction of accurate genomic breeding values from dense SNP markers dramatically changed the landscape of genetic improvement strategy (Goddard, 2008). For instance, genomic selection is underway in at least four dairy breeding programs around the world (Hayes et al., 2009). Hence, it seems that SNPs are increasingly becoming the dominant marker of choice in the field of genetics (Gupta et al., 2001).
We assume that at some future time genomic selection will become practical in several breeds around the world. Therefore, it appears that understanding the extent of informativeness of SNPs in different cattle breeds is an entry to animal breeding. For example, it was suggested that the Bovine SNP50 assay will have reduced power in indicine and African cattle breeds in revealing genome-wide association studies (Matukumalli et al., 2009). Hence, we evaluated the informativeness of SNPs genotyped by the Bovine SNP50K in different cattle breeds, focusing on the Korean Hanwoo breed.
MATERIALS AND METHODS
Blood samples were obtained from Hanwoo and Holstein Nonghyup Animal Genetic Improvement Programs. For each breed, 50 samples were collected and DNA was extracted according to the laboratory procedure. Technical details of the experiment were previously described by Melka et al. (2011). Additionally, the available SNPs frequency data of Charolais, Angus, Simmental and Limousine breeds were obtained from Decker et al. (2009).
Data analysis
The Normalized bead intensity data for each sample were loaded into the Beadstudio 3.0 software (Illumina Inc., San Diego, CA, USA), which converted fluorescent intensities into SNP genotypes. SNPs clusters for genotype calling were examined for all SNPs using Beadstudio 3.0 software. SNP frequency variations were examined by using SPSS package version 17.0 program. Breed average, rare, intermediate and common allele frequencies were computed and significance was tested by Chi-square.
RESULTS
Fixed and polymorphic SNPs in cattle
The rate of polymorphic and fixed SNPs observed in six cattle breeds are summarized in Figure 1a and b. On the average, 79% of the SNPs were polymorphic (≥0.05-≤0.95) for six breeds. Among the polymorphic, 72% (37,370) SNPs were found to be polymorphic across the whole breeds with variable or similar allele frequencies for SNPs. On the other hand, an average of 21% of the SNPs were fixed or close to fix (<0.05 or >0.95) in the studied breeds; among fixed SNPs, 6% (2,990) was common to all breeds. About, 22% (11,454) of SNPs were variable for breeds, which mean that some SNPs were fixed in some breeds but polymorphic in the other breeds. Figure 2 depicts chromosomal level variations of SNPs, Hanwoo breed displayed the lowest number of polymorphic SNPs, but had the highest proportion of fixed SNPs (data not shown). These fixed SNPs accounted for the reduced percentage of informative markers within Hanwoo. The pattern of polymorphic SNPs distribution on chromosomes was homogenous across the breeds.

(a) average polymorphic and fixed SNPs in the breeds, (b) degree of polymorphic and fixed SNPs sharing among the six cattle breeds; SNP with any missing frequency was excluded from the analysis.
Minor allele frequencies (MAFs)
The lowest number of polymorphic SNPs observed across the chromosomes in Hanwoo was further investigated to determine if the Bovine SNP50K assay had low MAF within Hanwoo breed. The estimated values for different MAF classes for different breeds are presented in Table 1. The level of average MAF observed in Hanwoo was significantly lower than the other breeds. Both average MAF and the number of informative SNP markers were reduced in Hanwoo breed. The proportion of SNPs with zero MAF was highest in Hanwoo, reflecting the presence of single fixed alleles. On the other hand, all the breeds had almost similar values for intermediate and common MAF classes.
The major alleles on the average accounted for 35% of the total SNPs, ranged from 32% in Hanwoo to 36% in Charolais and Simmental (Figure 3). The average allele frequency for Hanwoo was 0.725, with the average of 0.71 across the breeds. The proportion of major alleles in Hanwoo was the lowest, but allele frequency was slightly higher. Several markers tend to had higher allele frequencies in Hanwoo.

Proportion of major alleles and their corresponding average allele frequencies for each cattle breed.
In comparison with each breed, the number of SNPs observed only in Hanwoo is depicted in Table 2. In total, about 530 SNPs were found only in Hanwoo when compared to each breed. The number of these markers varied from 15 for Simmental to 207 for Charolais. The number of Hanwoo specific SNPs detected from this assay data was very small (ARS-BFGL-NGS-28865, Hapmap43053-BTA-79375, ARS-BFGL-NGS-3599, ARS-BFGL-NGS-68165, Hapmap51464-BTA-51997, ARS-BFGL-NGS-48933, DIAS-188). Indeed, it needs more investigation in large sample sizes for robust inference. These variants were found at frequencies ranged from 1.1 to 21.3% in Hanwoo.
DISCUSSION
The Illumina’s chips are rich source of SNPs for different applications in livestock, plants and humans. The most promising applications in livestock could be genome-wide association studies and genomic selection for improvement of meat and milk production. The discoveries of many thousands of SNPs (DNA markers) lead to the accurate computation of genomic breeding values for genomic selection in the livestock industry (Meuwissen et al., 2001; VanRaden et al., 2009). Several SNPs were genotyped by the Illumina Bovine SNP50K chip in New Zealand, United States and Australian for breeds improvement (Hayes et al., 2009). The efforts of these countries can offer important lessons, and holds great promise for improving production and saving money and time in livestock industry.
In this study, we observed differences between the cattle breeds for MAFs. The difference in allele frequencies among the breeds may attributable to recent genetic drift and those arising from more ancient divergence of the breeds. A higher proportion of the SNPs that were polymorphic in Hanwoo breed had lower allele frequencies. Significantly, lowest average MAF was noted within Hanwoo compared to other breeds. The implication is that the utility of the Bovine SNP50K assay will have reduced power for genome-wide association studies within Hanwoo breed. Despite that the Bovine SNP50K assay will be useful for implementing genome wide association studies to identify QTL regions and improve production traits in all breeds studied. In another study, relatively higher average MAFs for Holstein, Limousine, Angus, Simmental and Charolais were reported by Matukumalli et al. (2009). Major allele frequencies are also essentially equivalent to MAFs in information content for different applications. However, markers with allele frequency >0.8 have reduced information content (Kruglyak, 1997).
Genetic variation within or among breeds is usually explained in terms of allele frequencies. Most SNP markers have alleles which were shared among the breeds, or have breed specific alleles at low frequencies as we have seen in this study. With regard to Hanwoo specific SNPs, seven SNPs were found in Hanwoo, not detected in other breeds. A number of other alleles were found only in Hanwoo in comparison to each breed. Markers private to Hanwoo most likely arose following breeds divergence and that all SNPs common to all breeds arose before the divergence of the breeds. The influence of rare alleles on most quantitative traits has not been systematically evaluated in livestock species. Even so, the significant influences of rare alleles on economically important traits have been well reported in livestock species (Grobet et al., 1997; McPherron and Lee, 1997; Freking et al., 2002; Smit et al., 2003). In the case of human disease, rare alleles carrying significant disease risks have been also reported. For example, there is evidence that rare alleles contribute significantly to low plasma levels of HDL-C, a common quantitative trait in human (Cohen et al., 2004). To summarize, the Bovine SNP50K described in this study offer a vital genotyping platform for mapping QTL in cattle breeds. The SNPs data represent a vast and largely untapped resource to assist the investigation of several complex QTLs in cattle, and also useful for cattle genetic improvement programs. The pattern of allele frequency variability observed among the breeds signal the genetic imprint of past and presumably ongoing episodes of natural and artificial selection.
ACKNOWLEDGEMENT
This research was supported by grants titled “ Development of a DNA kit for selection on growth and meat quality at early stages in Hanwoo” and “Development of production technologies for high quality & nutritional values of beef in Hanwoo” from the Technology Development Program for Agriculture and Forestry, Ministry for Food, Agriculture, Forestry and Fisheries, Republic of Korea.