Genome wide association study of fatty acid composition in Duroc swine
Article information
Abstract
Objective
Genome wide association study was conducted to identify and validate candidate genes associated with fatty acid composition of pork.
Methods
A total of 480 purebreed Duroc pigs were genotyped using IlluminaPorcine60k bead chips while the association test was implemented following genome-wide rapid association using Mixed Model and Regression-Genomic Control (GRAMMAR-GC) approach.
Results
A total of 25, 29, and 16 single nucleotide polymorphisms (SNPs) were significantly associated with stearic (18:0), oleic (18:1) and saturated fatty acids (SFA), respectively. Genome wide significant variants were located on the same region of swine chromosome 14 (SSC14) that spanned from 120 to 124 Mb. Top SNP ALGA008191 was located at 5 kb near the stearoyl-CoA desaturase (SCD) gene. This gene is directly involved in desaturation of stearic acid into oleic acid. General relationship of significant SNPs showed high linkage disequilibrium thus genome-wide signals was attributed to SCD gene. However, understanding the role of other genes like elongation of very long chain fatty acids-3 (ELOVL3) located on this chromosomal segment might help in further understanding of metabolism and biosynthesis of fatty acids.
Conclusion
Overall, this study provides evidence that validates SCD gene as strong candidate gene associated with fatty acid composition in Duroc pigs. Moreover, this study confirms significant SNPs near ELOVL3 gene.
INTRODUCTION
Quality of lipids, fats and fatty acid compositions are important factors that influence both the organoleptic quality and nutritive value of meat. Fatty acid composition showed strong correlation with meat firmness and oiliness. These traits are believed to be associated with the rate of meat oxidation after slaughtering and during meat processing [1]. Investigation of fatty acid composition in Duroc pigs showed that saturated fatty acid (SFA) and monounsaturated fatty acids (MUFA) were highly heritable with genetic estimates of 0.50 to 0.57 while polyunsaturated fatty acids (PUFA) moderately with 0.25 to 0.46 [2]. These heritability estimates showed strong potential for improvement of meat quality in pigs. To date, inclusion of fatty acids in genomic breeding selection has been limited; however, with the robust advancement of technology to identify detailed quantitative trait loci (QTL), the importance of fatty acid to meat quality improvement will be soon elucidated.
Genome-wide association studies (GWAS) use high-throughput genotyping technologies such as Illumina60K bead chip to assay hundreds of thousands of single nucleotide polymorphism (SNP) and to associate them with diseases or quantitative traits [3]. Implementation of GWAS has resulted in the discovery of many genes directly involved in a variety of disease related traits, genes associated with economically important traits, and genes involved in many metabolic and physiological activities within cells and tissues. The cost of genotyping and complex analysis of data are the challenges researchers are facing today in performing GWAS, with low replications using independent populations and lower sample size leading to erroneous results. Thus, this study was conducted to identify genetic variants associated with fatty acid and to provide verification on previously reported significant QTL and identify novel genes influencing fatty acid composition in Duroc pigs.
MATERIALS AND METHODS
Animals and phenotypic data
A total of 480 purebred female Duroc pigs were selected for fatty acid composition analysis. Animals were raised under commercial conditions and fed ad libitum during the entire fattening period. After reaching 120 to 130 kg, pigs were slaughtered in batches following the standards set by the Korea Institute for Animal Production Quality Evaluation (KAPE).
Tissues from longissimus dorsi muscles were collected immediately after slaughter and kept frozen at −20°C. The total lipids were quantified following a protocol conducted by previous studies [4]. Briefly, extracted fats were saponified with potassium hydrate-ethanol solution and then methyl sterified using boron triflouride-methanol solution. Individual fatty acid was subsequently determined quantitatively by gas chromatography (Model 6890, Hewlett-Packard Co., Wilmington, DE, USA). The temperature of the inlet was 180°C, the oven was warmed from 180°C to 200°C for 25 minutes, and the temperature of the detector sensor was 260°C. Nitrogen gas was used as the carrier, and an omega wax 320 (30 m×0.32 mm ×0.25 μm, Supelco, Bellefonte, PA, USA) was used as capillary column.
Genotyping and genomic quality control
Genomic DNA was extracted from ear tissue of each animal using the phenol/chloroform method. Four hundred eighty experimental pigs were genotyped for 61,565 SNPs using Illumina PorcineSNP60K bead chips according to the manufacturer’s protocols. Quality control measures were implemented using threshold levels: genotyping call rate per individual and per marker >90%, minor allele frequency >0.05, and divergence from Hardy-Weinberg equilibrium (HWE) with p-value of 0.001. Individuals and SNP markers that did not conform to the threshold levels were eliminated prior to the association test using PLINK v. 1.09 [5].
Statistical analysis
Genome-wide association analysis was conducted following genome-wide rapid association using a mixed model and regression–genomic control (GRAMMAR–GC) approach [6,7]. Fatty acid composition was corrected for fixed effects and polygenic covariation effects based on the following model:
Where, γ1 is the phenotype of the ith individual, cij is the value of the jth covariate or fixed effect for the individual i, βj is an estimate of the jth fixed effect or covariate, and Gi and ei are random additive polygenic and residual effects, respectively. Pedigrees of the animals were retrieved back up to second generation and explored for possible relatedness and were used to create numerator relationship matrix. Fixed effects consisted of four farms and slaughter batches, while the covariate was the slaughter age. Commercial farms consist of Nunghyop, Samsung, Wonsan, and Chuksan while slaughtering was done on 37 batches. Based on this model, derivation of residuals was conducted using the blupf90 programs [8]. Residuals were then used as dependent traits in a simple linear regression analysis for each SNP marker.
Where, μ is the mean, g is the vector of genotypes, k is the regression coefficient and e is the vector of random residuals. The empirical p-values based on Wald statistics were adjusted using the genomic control (GC) method by estimation of the inflation factor (λCG) based on median chi-square statistics. Corrected population stratifications were illustrated in the quantile-quantile plot constructed within the R environment. The genome-wide significant threshold was determined based on Bonferroni adjustment at a significance of 0.05 (p-values = 1.29×10−6).
The linkage disequilibrium coefficient (r2) was calculated between SNP markers located in the same chromosomal segment that showed significant association with fatty acid composition. Haploblocks were constructed and visualized using Haploview v.4.2 [9].
SNP and candidate gene annotation were retrieved using Ensembl annotation of Sscrofa 10.2 genome version in NCBI data base [10] while functional classification of candidate genes with the target chromosomal segment was investigated using DAVID online annotation tools [11].
RESULTS AND DISCUSSION
Phenotype and genotype
The descriptive statistics of the fatty acid composition of longissimus dorsi muscles are presented in Table 1. Fatty acid composition was dominated by oleic acid (18:1) with 41.83% which are comparable with previous investigations [12]. In this study, ten genotyped animals with less than 90% call rate, and 30,426 low quality SNP markers were eliminated prior to the association test. Most of these SNP markers were removed due to minor allele frequency of less than 5%, with the total number of 20,928 SNP. Thus, a total of 470 genotyped animals and a subset of 39,919 SNPs were used in the association test. Number of low quality SNP in this study is slightly higher compared with previous studies [13]. Incidence of high number of low quality SNPs narrows possibility of false positive association, and at the same time limiting probabilities of a true positive association. This scenario might suggest the reason for low positive result in other GWAS test. Therefore, quality control in data handling and data collection improving high quality SNPs with strengthen genome-wide association results.
Population stratification
Population stratification was accounted by GC approach by correcting test statistics. The quantile-quantile (Q-Q) plot illustrates the observed chi-square statistics against expected chi-square statistics for association test of stearic (18:0), oleic (18:1) and SFA shown in Figure 1. Results revealed that estimates of genomic inflation factor for stearic, oleic and SFA were 1.009, 1.03, and 0.995, respectively. These values conclude that population stratification was effectively accounted in this study, thus relevant associations are not due to population stratification but instead to a true positive association.
Association study
In this study, genomic segment on same region of swine chromosome 14 (SSC14) about 120 to 124 Mb showed significant association with stearic (18:0), oleic (18:1) and SFA. Details of significant SNPs are given in Table 2. These associations were visualized in the Manhattan plot illustrated in Figure 2. Strong SNPs for these traits were located on the same chromosomal region which implied that candidate gene on this region is involved in the desaturation process of stearic acid into oleic acid, thus total SFA is also altered. Desaturation of fatty acids or addition of double band in fatty acid chain is affected by a complex relationship of specific enzymes and by various factors. To date, intricate mechanism of fatty acid synthesis and metabolism is not yet fully understood. Moreover, conversion of stearic acid into oleic acid connotes negative correlations.

Manhattan plots of genome wide association study (GWAS) for stearic (A), oleic (B) and saturated fatty acid (SFA) (C). Chromosomes are represented on the X-axis while p-values are on the Y-axis. Bonferroni threshold p-values at 5% and 1% are indicated by the blue and red horizontal lines, respectively.
One candidate gene that is directly involved in fatty acid biosynthesis and lipid synthesis was identified. The top SNP ALGA0081091 was located at about 5 kb from the stearoyl-CoA desaturase (SCD) gene. The same chromosomal segment was previously identified; however candidate genes associated with fatty acid composition that are situated on the other chromosomes were not elucidated in this study [14,15]. Results showed that a high number of low quality SNPs reduces probability of detecting previously identified genes. Although a high number of SNPs was removed, the chromosomal segment where the SCD gene was located remains significant.
The SCD gene is located at 120.9 Mb on SSC14 with a total length of 16,231 base pairs (bp). This gene is responsible for desaturation of SFA by the addition of single double band between carbon 9 and 10 on the acyl chain and catalyzing the synthesis of oleic and palmitoleic acids. Oleic and palmitoleic acids are then utilized in the synthesis of various kinds of lipids, confirming the importance of SCD on biological properties of fatty acids and complex lipids [16,17]. Expression of SCD gene at different levels in lipogenesis, lipolysis and fatty acid transport showed genetic variability of this gene between Wujin and Landrace pigs [18]; A meta-analysis of GWAS on five populations that includes crossbreed of Duroc, Landrace and Yorkshire (DLY) showed strong evidence of association between SCD with palmitic, palmitoleic, stearic, and arachidonic acids [19]. In addition, haplotypes of SCD gene showed strong influence on melting point of fatty acids in Duroc population [20].
Furthermore, SCD polymorphism is a relevant tool to increase MUFA and MUFA/PUFA ratio of dry cured hams in crossbreed Duroc and Iberian pigs [13]. In this study, allele A at ALGA0081091 SNP increases oleic acid while decreasing stearic acid which compliments the elongation of stearic acid to oleic acid by SCD enzyme. Therefore, this study validates SCD as a strong candidate gene influencing SFA and MUFA in Duroc pigs.
Alternatively, the signal detected in this study spanned from 120 Mb up to 124 Mb. This chromosomal segment harbors other genes involved in lipid synthesis. The SNP ASGA0066174 is located at downstream untranslated variant of elongation of very long chain fatty acids-3 (ELOVL3) gene. This gene interacts with various co-factors, controlling synthesis of saturated and MUFAs in specific tissues [21,16]. Investigation of the phenotypic diversity that arises from intensive genomic selection identified a highly differentiated region in chromosome 14 at about 123.08 to 123.41 Mb that is unique in Duroc swine, where the ELOVL3 is located [22]. Moreover, peroxisome proliferator-activated receptor gamma gene, a candidate gene that influences fatty acid composition in swine, through the coregulation with ELOVL3 and very long chain fatty acid induces adipogenesis in mouse adipocytic 3T3-L1 cells, suggesting relevant participation of ELOVL3 in biosynthesis of lipids [23]. Conversely, general relationship of significant SNP (Figure 3) illustrates high linkage disequilibrium between significant SNP proposing coincident association between fatty acids and ELOVL3 since ASGA0066174 is not the top SNP, while very limited studies support the significant influence of ELOVL3 in fatty acid composition of pork.

Linkage disequilibrium (r2) diagram of single nucleotide polymorphism (SNP) markers in chromosome 14 associated with fatty acids. The numbers outside the blocks represent SNP order in this region and the numbers inside the blocks indicate linkage disequilibrium (LD) coefficients (r2). The SNP grouped in each triangle box are grouped in one block based on LD information. The red mark indicates the position of stearoyl-CoA desaturase (SCD) gene.
Recently, the same chromosomal segment was reported in a GWAS in Duroc pigs which agrees not to discriminate candidate causative SNP around SCD gene [24,19]. Strong linkage disequilibrium between significant SNP limits the evidence that ELOVL3 is a strong candidate gene in fatty acid manipulation, but this does not eliminate the possible importance ELOVL3 in fatty acid manipulation, which requires further investigation.
CONCLUSION
Results identified a significant QTL on SSC14 (120 to 124 Mb) associated with stearic, oleic and SFA in Duroc pigs. The candidate gene located in this chromosomal segment is the SCD gene that has a direct influence on fatty acid composition and has been identified in previous studies. Located on the same chromosomal segment is ELOVL3. Hence, this information provides relevant evidence for SCD gene as a strong candidate gene in the improvement of fatty acid in Duroc pigs.
ACKNOWLEDGMENTS
This work was carried out with the support of “Cooperative Research Program for Agriculture Science and Technology Development” (Project No. PJ012704012018) Rural Development Administration, Republic of Korea.
Notes
CONFLICT OF INTEREST
We certify that there is no conflict of interest with any financial organization regarding the material discussed in the manuscript.