INTRODUCTION
Intramuscular fat (IMF) is a vital economic trait in pigs, strongly linked to meat quality, including tenderness, water retention, and flavor [
1–
3]. The breeding of modern pig breeds for faster growth, higher lean meat percentages, and reduced backfat thickness often reduces IMF content, compromising meat quality. With the improvement of living standards, the growing demand for nutritious and delicious pork has made improving pork quality a key goal in pig genetic breeding.
Current research has identified numerous genes and pathways involved in the complex process of IMF deposition, offering insights into improving meat quality of pig. Sequencing of the
longissimus dorsi muscle transcriptome of Wannanhua and Yorkshire pigs revealed that the DEGs of the two pig breeds may play an important role in meat quality traits [
4]. In a study by Miao et al, 1632 DEGs were identified in the intramuscular adipose tissue of Jinhua and Landrace pigs, indicating significant differences in fat and protein metabolism at the transcriptome level between these two breeds [
5]. Another study by Li et al, using transcriptome analysis, suggested that
PLIN1 is a key gene affecting the IMF content in pigs [
6]. Xu et al identified 717 genes affecting IMF content in the
longissimus dorsi muscle of Wei and Yorkshire pigs, mainly involved in pathways like fatty acid metabolism and steroid biosynthesis that regulate fat deposition [
7]. Zhang et al identified 85 genes affecting IMF content in Dingyuan pigs, primarily involved in the insulin pathway [
8]. Wang et al identified 15 candidate genes affecting fat deposition in Nanyang black pigs, including
FASN,
CAT, and
SLC25A20 [
9]. However, it should be noted that there is either no overlap or only partial overlap between the DEGs reported in different studies. This discrepancy can be attributed to various factors, including differences in species or tissues, the number of biological repeats, sampling methods, and the use of different bioinformatics pipelines. It is worth mentioning that even small changes in gene expression can have a significant impact on the final results when experimental responses are consistent [
10]. Consequently, the reliability of each individual study’s findings remains uncertain, and it is possible that some DEGs with minor responses may have been overlooked. To address these limitations, meta-analysis has become a widely adopted approach, as it can overcome the aforementioned shortcomings and provide more robust and reliable results.
Landrace pigs are famous for their high lean meat percentage and are widely used in commercial pork production, but their IMF content is typically below 2% [
11]. Songliao Black Pig is a Chinese lean-type breed with approximately 46% Duroc, 27% Landrace, and 27% Northeast Min Pig bloodlines, also has an IMF content exceeding 3.5% [
12,
13]. Thus, these two pig breeds provide an excellent comparison for studying growth performance and meat quality in pigs. In this study, we integrated our transcriptome datasets from the
longissimus dorsi muscle of six Songliao Black pigs and six Landrace pigs with nine IMF-related transcriptome datasets from National Center for Biotechnology Information (NCBI) for a meta-analysis to identify DEGs associated with varying IMF contents in pigs. Functional enrichment and quantitative trait locus (QTL) analyses were conducted to further explore the roles of these DEGs. The objective is to identify key genes influencing IMF content in pigs, uncover the molecular basis of fat deposition, and provide a reference for future breeding efforts.
MATERIALS AND METHODS
Ethical statements
The study was approved by the Animal Welfare Committee of China Agricultural University (Permit Number: DK996). During these experiments, every effort was made to minimize pain and discomfort to the animals. The study was conducted in accordance with the ARRIVE guidelines for reporting animal experiments.
Transcriptome data of this study
We collected 53 Songliao Black pigs and 132 Landrace pigs from the Ninghe Breeding Farm in Tianjin to evaluate their IMF content. All pigs were kept healthy with free access to food and water, raised under identical conditions, and had their IMF content measured upon reaching approximately 100 kg. The IMF content of Songliao Black pigs (2.8±1.28) was significantly higher (p<0.01) than that of Landrace pigs (1.28±0.36). We selected six Songliao Black pigs and six Landrace pigs from each breed for subsequent research. Longissimus dorsi muscle tissue was collected from the region between the third-to-last and fourth-to-last ribs and preserved in liquid nitrogen for subsequent RNA extraction.
Total RNA was extracted using the Trizol method, and the quality and integrity of the RNA were verified using 1% agarose gel electrophoresis. Additionally, the concentration and quality of the RNA were further confirmed by measuring the absorbance ratios at 260nm and 280nm using a Smart Spec Plus spectrophotometer (Bio-rad, Hercules, CA, USA). The RNA integrity number was assessed using the Agilent 2100 Bioanalyzer, which provides a quantitative measure of RNA degradation and overall quality. Following the instructions provided by the reverse transcription kit (Invitrogen, Carlsbad, CA, USA), cDNA libraries were prepared from the extracted RNA. After enrichment and purification, sequencing was performed using the Illumina HiSeq 2000 platform (Illumina, San Diego, CA, USA).
Public transcriptome data
To acquire relevant data pertaining to pig IMF, a search was conducted on NCBI Sequence Read Archive and PubMed using the keywords “pig”, “IMF” and “RNA-seq”. The following inclusion criteria considered while selecting the dataset:(1) The selected research is pig muscle tissue; (2) The data is derived from the transcriptome sequencing platform; (3) Each dataset comprises more than three samples;(4) Each study includes a grouping of high and low IMF; (5) Individual sample expression should be available for download.
Transcriptome data analysis
Raw data were filtered using fastp (v0.12.4) [
14], and the quality of sequencing data was evaluated using FastQC (v0.11.9) [
15]. The pig reference genome sequence (version: Sus scrofa v. 11.1) and the genome annotation file were downloaded from the Ensembl database (
https://asia.ensembl.org/Sus_scrofa/Info/Index). The index was constructed using Hisat 2 (v2.2.1) [
16], and the clean reads were aligned with the reference genome, mRNA quantification was performed using HTseq (v2.0.1) [
17] software, and the R software package DESeq2 [
18] was employed for gene differential expression analysis. Genes that satisfy p<0.05 and |log
2(Fold Change)|≥1 are defined as differentially expressed genes (DEGs).
Meta-analysis
The random effect model (REM) in the R software package MetavolcanoR was used to perform meta-analysis on the differential expression results of each dataset. Following the input requirements of MetavolcanoR, we combined the DESeq2 output from each dataset into a list to serve as the input data for MetavolcanoR, enabling the identification of genes with consistent changes. The REM integrates fold-change and their corresponding variances from each study, applying a weighted method to calculate a summary fold-change. This approach ensures that each study’s contribution is accurately reflected while accounting for study-specific variability. Additionally, REM evaluates heterogeneity among studies using Cochran’s Q statistics, providing insights into the consistency or potential biases across datasets. Finally, the REM estimates a summary p-value, representing the probability that the summary fold-change is not different from zero.
Function enrichment analysis
To gain a deeper understanding of the functions of DEGs obtained through meta-analysis, we utilized the clusterProfiler R software package [
19] to perform enrichment analysis of Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, employing the hypergeometric test for both analyses. Furthermore, we utilized the STRING database [
20] to explore protein interactions among the DEGs.
Quantitative trait locus analysis of differentially expressed genes
To identify candidate genes associated with pig IMF trait, we performed QTL analysis on DEGs. The chromosome location information of the DEGs was obtained from the pig genome annotation file. Additionally, the QTL information related to pig IMF was downloaded from the Animal QTL database [
21] (
http://www.animalgenome.org/QTLdb). After that, the DEGs were intersected with the chromosome position information of the downloaded QTLs taken using the bedtools [
22] function in Linux.
DISCUSSION
As consumers increasingly demand healthier pork with preserved organoleptic qualities, IMF content has become a key focus in pig genetic breeding. Although there are many studies on IMF content in pork, the key genes identified in different studies are rarely consistent. Therefore, we collected 10 pig muscle transcriptome datasets for meta-analysis to identify key genes influencing IMF content in pigs. We identified a potential core gene cluster affecting IMF content, with the genes SCD and FASN located within the QTLs associated with IMF.
We obtained a total of 10 DEGs datas from each transcriptome dataset. The number of DEGs identified across different datasets ranged from several hundred to over a thousand. More than one thousand DGEs were identified in the datasets of study 10 (our own test) and study 4 (PRJNA302287), both of which involved comparisons between pig breeds with significant differences in IMF content. Our data set includes Landrace pigs and Songliao Black pigs, both of which are lean-type breeds. However, Songliao Black Pig is a crossbreed of Duroc, Landrace, and Northeast Min Pig, and its IMF content is significantly higher than that of Landrace pigs [
11–
13]. The study 4 dataset involves a comparison between Wannanhua pigs and Yorkshire pigs. Yorkshire pigs are a lean-type breed, whereas Wannanhua pigs, a Chinese local dual-purpose breed for both meat and fat, have significantly higher IMF content than Yorkshire pigs [
24]. The largest number of DEGs was identified in these two studies, likely due to the large genetic differences between the breeds. However, it is important to note that although significant differences in IMF content between breeds reflect breed specificity, there are many other breed-specific traits. Therefore, the DGEs identified between breeds with significant differences in IMF content within a single study may not necessarily be related to IMF content.
There was limited overlap among these DEG datasets, with most genes being unique to individual studies and no DEG shared across four or more datasets. The PCA and correlation analyses also revealed substantial heterogeneity among samples from different datasets. This phenomenon can be attributed to a multifaceted array of factors. Firstly, the experimental conditions and designs vary significantly across different datasets, leading to distinct transcriptional profiles. Secondly, even within the same species, there exists natural variation in gene expression among individuals, which may be influenced by factors such as genetic background, age, sex, health status, and other biological variables. These factors are important reasons for the identification of varying numbers of DGEs and for limiting the identification of common IMF-affecting genes across different studies.
Meta-analysis aims to analyze and integrate findings from multiple studies to achieve a deeper understanding of research results. It generates robust conclusions by considering the sample size and standard deviation of each study [
23]. The REM of MetavolcanoR integrates fold-change and their corresponding variances from each study, applying a weighted method to calculate a summary fold-change. This approach identifies genes with consistent trends across studies and can pinpoint genes that are stably associated with IMF content. Therefore, we performed a meta-analysis of 10 transcriptomic datasets (encompassing 16,200 genes) using this method and identified 129 DEGs, with 71 genes upregulated and 58 genes downregulated in pigs with higher IMF levels. The results of GO enrichment analysis and KEGG pathway analysis indicated that these DEGs predominantly participate in biological processes and pathways related to fat synthesis and metabolism. This result shows that the DEGs identified through meta-analysis are closely related to IMF content, confirming the reliability of the meta-analysis findings.
PPI network analysis of DEGs revealed a core cluster of five interacting proteins, with
FASN and
SCD genes located within the QTLs of IMF, highlighting the core cluster association with fat deposition. The
FASN gene encodes fatty acid synthase, a pivotal enzyme involved in the de novo synthesis of fatty acids. Acetyl coenzyme A in the cytoplasm is carboxylated, and then it is catalyzed by
FASN together with malonyl coenzyme A to form saturated palmitic acid fatty acid [
25]. The research indicates that polymorphisms in the
FASN gene have a significant impact on fatty acid deposition in pigs [
26]. Additionally, it has been observed that the expression level of
FASN in adipose tissue is significantly higher in obese pigs compared to lean pigs [
27]. Moreover, the expression level of
FASN is significantly higher in pigs with high IMF compared to those with low IMF [
28], which aligns with our research findings. The
SCD gene encodes stearyl coenzyme A desaturase, a fatty acid desaturase located in the endoplasmic reticulum membrane [
29]. Stearyl coenzyme A desaturase serves as a rate-limiting enzyme that catalyzes the conversion of saturated fatty acids (SFAs) into monounsaturated fatty acids (MUFAs). It introduces carbon-carbon double bonds at the 9th and 10th carbon atoms of SFAs, palmitic acid and stearic acid, to form MUFAs. MUFAs are essential for the biosynthesis of polyunsaturated fatty acids, cholesterol, triglycerides (TGs), and phospholipids [
29,
30]. Currently, five subtypes of
SCD (
SCD1-
5) have been identified, with
SCD1 and
SCD5 found in pigs [
31].
SCD1 is significantly associated with fat deposition and composition in skeletal muscle [
32] and plays a crucial role in the differentiation of pig adipocytes, being significantly induced during adipocyte differentiation [
33]. Knockout mice lacking the
SCD1 gene exhibit resistance to diet-induced obesity, accompanied by enhanced metabolic rate and insulin sensitivity [
34]. In our study, we found that the
SCD gene was up-regulated in pigs with high IMF and was associated with two IMF-related QTLs.
Although the other three genes in the core cluster are not in the known IMF QTLs, they are all closely related to fat-related pathways. The
PLIN1 gene encodes the perilipin, which is primarily expressed in adipocytes and plays a critical role in the regulation of lipolysis [
35]. It restricts hormone-sensitive lipase, adipose triglyceride lipase (ATGL), and their co-activator CGI-58 from entering LDs to prevent excessive lipolysis [
36,
37]. Moreover, the formation of LDs mediated by
PLIN1 can promote the activation of sterol regulatory element binding protein-1 (SREBP-1) during adipogenesis, leading to the accumulation of TGs [
38]. Knockdown of the
PLIN1 gene reduces the level of TGs and the size of LDs in pig adipocytes [
6]. Leptin (LEP), the product of the obesity gene, plays a vital role in food intake regulation and fat decomposition [
39,
40].
LEP gene expression is higher in adipose tissue of obese individuals [
41], which is consistent with our study. The
G0S2 gene is involved in regulating the cell cycle and cell proliferation. Studies have shown that
G0S2 is a target gene of PPARγ [
42]. It is a highly expressed member of the PPAR family and serves as a key regulator of adipogenesis [
43,
44].
G0S2 selectively inhibits the hydrolysis of TGs by ATGL, resulting in the blockage of lipolysis and the accumulation of TGs [
45–
47]. Therefore, these five genes are considered potential core gene cluster that may influence the IMF content in pigs.
Our objective was to integrate multiple studies, expand the sample size, and thereby ensure the accuracy of our analysis, providing a valuable reference for future researchers. In comparison with previous studies, we have confirmed the association of genes such as FASN and SCD with IMF, corroborating the reliability of our research. Notably, we have also uncovered PLIN1, LEP and GOS2 are also closely related to IMF, and together with FASN and SCD, they form a core gene cluster that regulates IMF. These findings not only enrich the understanding of the transcriptional landscape of IMF but also offer new perspectives and avenues for future research. However, it’s essential to acknowledge that meta-analyses often prioritize published studies over unpublished ones, which can impact the objectivity and accuracy of the outcomes. Currently, there is a paucity of studies on the transcriptional profiling of high and low IMF in pigs. Consequently, in this meta-analysis, we included studies with relatively small datasets. In the future, we will not only need to further validate the identified core gene clusters, but also expand the sample size to better elucidate the genetic factors influencing IMF and provide a genetic foundation for improving IMF in pigs.