INTRODUCTION
Hanwoo is a unique breed of
bos taurus native to Korea, which was widely utilized in the past for agriculture, transportation, rituals, and meat consumptions. However, since meat consumption has risen due to the increase in national income and Hanwoo consumption per capita has significantly increased from 11.3 kg in 2012 to 14.8 kg in 2022, it is currently used in meat production [
1,
2]. Due to the gentle temperament, adaptability to temperatures, and high productivity of Hanwoo, trait improvement has been ongoing since 1960 under the lead of government and institutions, and the improvement is being carried out in alignment with consumer preferences in meat consumption [
1,
3]. Hanwoo has been improved by annually selecting approximately 30 superior sires through progeny testing, producing semen from these sires, distributing it to farms, and performing artificial insemination. Thus, in the case of Hanwoo, the Korean proven bull (KPN) is used as a sire, forming a group that considers both pedigree and breeding values. The carcass weight (CWT) has increased from 423.7 kg in 2017 to 430.2 kg in 2022, and marbling score (MS) has increased from 2.0 points to 2.2 points, and consequently, occurrence rate of carcasses with a quality of grade 1 or higher increased by 3.5% and occurrence rate of carcasses with a yield grade of A or B increased by 7.4% in 2022, which indicates that improvement of Hanwoo is directly linked to the improvement of grades [
4,
5].
The continuous improvement of carcass traits directly affecting the income of Hanwoo farmers is essential [
6,
7], and traits that take a long time to measure phenotypes have been improved by selecting individuals based on genetic factors to predict the individual ability early on for better selection [
8–
10]. Recently, the development of commercial chips using single nucleotide polymorphism (SNP) has enabled the rapid obtainment of large-scale genomic information, and the quantity of genome-wide association study (GWAS) research exploring new genetic factors associated with growth and carcass traits through high-density SNP analysis has gradually increased [
11–
13]. Analysis using the GWAS technique has had a significant impact on identifying genomic variants associated with various diseases or enhanced traits. The analysis primarily focused on specific mutations that affect traits. Among several GWAS-based analyses, family-based GWAS targets groups with relatively close blood relatives, and has been reported to be more effective in complex traits compared to conventional methods, as it reduces bias in genetic effects [
14,
15]. Although there has been growing interest in utilizing family information in GWAS analysis to remove genetic bias recently [
16], the state of family-based GWAS research in Hanwoo is currently insufficient.
Owing to the advancements in biology and genetic engineering technology, GWAS research is evolving towards utilizing gene networks in the analysis of candidate genes for target traits. Since carcass traits of Hanwoo are influenced by polygenic effects, they are regulated by multiple genes, and the analysis utilizing gene networks has shown that genes and the pathway are associated with carcass or meat characteristics [
17]. Therefore, through gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway extraction that assigns functional annotations to genes based on cellular and molecular metabolic processes, the roles of genes and their products can be summarized [
18,
19]. In addition, protein-protein interaction (PPI) utilizes annotations to easily explain and differentiate the functions of genes, proteins, or their products. By visualizing specific locations or functions of PPI, identifying hub genes that play crucial regulatory roles in the gene and gene expression network associated with the target traits has been streamlined [
20,
21].
Therefore, this study aimed to discover and investigate SNP markers and candidate genes that affect carcass traits through GWAS targeting the full-sib group, and to compare and analyze the functional annotation analysis (GO, KEGG, PPI) results based on the presence or absence of both paternal and maternal genomic information.
RESULTS AND DISCUSSION
The basic statistics of carcass trait and phenotype data are shown in
Table 1, and the mean and standard deviation of CWT, EMA, BFT, and MS for Group A were 490.28±80.85 kg, 103.96±16.87 cm
2, 11.71±4.50 mm, and 6.69±2.22 points, respectively. Group B were 493.28±77.59 kg, 104.56±16.28 cm
2, 11.69±4.19 mm, and 6.84±2.08 points, respectively. When compared to the nationwide Hanwoo carcass grading results in 2022, which were 411.3±76.8 kg, 92.1±15.1 cm
2, 13.1±5.5 mm, and 5.2±2.3 points, the test group of this study exhibited relatively superior carcass performance [
5]. The coefficient of variation is useful for comparing the dispersion between traits with different measurement units, and BFT and MS in this study are more than twice as high compared to CWT and EMA, which confirms a greater phenotypic variability. This is similar to the findings of Nwogwugwu et al [
27] and Alam et al [
28], where the coefficient of variation for BFT and MS traits in Hanwoo is higher compared to other traits. To assess the bias in the analysis data, a quantile-quantile (QQ) plot was constructed (
Figure 2), confirming a normal distribution between the residual quantiles and the predicted quantiles for each carcass trait. This identified the linearity of the two distributions and confirmed that the data points are linearly distributed. Therefore, the data can be considered appropriate for analysis.
The linear mixed model is reported to be suitable for utilizing family-based data [
29], and SNPs that regulates gene expression and genetic variation of SNP can contribute to phenotypic variability [
30]. For the identification of significant SNP markers, the GWAS analysis results were visualized using a Manhattan plot. The Bonferroni method and the Bonferroni threshold differ in how they control the significance level in a statistical hypothesis test. The Bonferroni method divides the significance level by the number of individual hypotheses being tested and applies it to each hypothesis. This approach can result in false negatives and miss important genetic variables. Therefore, the Bonferroni threshold is used to determine the results based on the corrected significance level by reducing the significance level. The black line on the Manhattan plot represents the adjusted Bonferroni threshold, and significant markers were identified by using −log10(p) = 4.54 for Group A and −log10(p) = 4.53 for Group B. The Bonferroni method is indicated by the navy dotted line, and top SNP markers were identified using −log10(p) = 5.84 for Group A and −log10(p) = 5.83 for Group B, and were compared between the groups (
Figure 3). In Group A, 7 significant SNP markers were identified for CWT, 3 for BFT, and 6 for MS, and among them 1 marker satisfied the Bonferroni method for each of the CWT, BFT, and MS (
Table 2). In Group B, 3 markers were identified for CWT, 1 for EMA, 1 for BFT, and 2 for MS, but only 1 marker for CWT satisfied the Bonferonni method, confirming distinct GWAS analysis results between Group A and Group B. When comparing the p-values of common SNP markers—3 for CWT and 1 for MS—between Group A and Group B, Group A showed more significant estimated values, and the top SNP marker was the same for CWT, but for BFT and MS traits, Group A was only confirmed to have 1 significant marker. Group A and Group B showed divergent results, except for 4 SNP markers, and 16 significant SNP markers were identified in Group A, while 7 were identified in Group B, confirming that utilizing genetic information from both parents and offspring is more useful for exploring genetic factors associated with carcass traits. In Group A, the inclusion of parental genomic information was considered useful for exploring related genetic factors. When comparing both thresholds used in this study, more significant markers were identified in CWT, BFT, and MS, excluding EMA. Therefore, GWAS analysis using parental genomic information is considered useful because it can identify numerous candidate genes associated with complex traits. Additionally, previous studies have reported that the quantitative trait loci (QTL) for CWT were diversely dispersed on chromosomes 14 and 19 [
13,
31], aligning with the findings of our study. Candidate genes were explored within 200 kb of the SNP marker location [
32], identifying a total of 88 candidate genes, with 82 in Group A and 42 in Group B (
Table 3). Significant markers for the EMA trait were identified only in Group B. However, candidate genes were selected for CWT, BFT, and MS traits, except for EMA.
SDCBP and
TOX, identified as candidate genes for CWT in Group A, have been reported as candidate genes for CWT of Hanwoo in other studies, but the precise functions and mechanisms has not been revealed [
8,
33].
NME1, identified as a candidate gene for CWT in both Group A and Group B, has been suggested as a gene associated with carcass traits and meat color in pigs [
34], but its direct association with BFT and EMA traits could not be confirmed.
ACACA gene, discovered as a candidate gene for MS in Group A, is a key gene for fatty acid synthesis in mammals and is expressed highly in adipose tissue. It has been reported to be associated with changes in the fatty acid composition of the subcutaneous fat layer in cattle, especially in the longissimus dorsi muscle [
35–
37].
TRNAG-CCC identified in Group B has been suggested to be associated with weight gain in Nellore cattle [
38].
ACTL8 gene that has been commonly identified in the MS of both groups is also a gene involved in muscle growth and development, and it has been reported to be associated with CWT, but its direct association with MS could not be confirmed [
39]. The MS candidate gene for Group A—
AATF and
MANF—and the MS candidate gene for Group B, the
NPR3, could not confirm their functions in cattle, but preliminary studies related to adipose tissue were identified.
AATF is a transcriptional regulatory factor involved in the inhibition of apoptosis [
40], and Rodrigues et al [
41] reported that apoptosis influences the tenderness of meat. Moreover, while
MANF deficiency accelerates fat synthesis in mice,
MANF overexpression inhibits fat synthesis [
42]. The expression level of
NPR3 has been reported to significantly increase in obese adults and children, suggesting that it plays a role in the MS of Hanwoo, but further research will be needed [
43].
Functional annotation analysis, which enables the identification of characteristics and estimated functions of genes and gene products across all organisms, was conducted to verify the biological functions in each group of genes. GO analysis is classified into cellular component (CC), which refers to the internal or external components of a cell, molecular component (MF), which indicates the role of gene products at the molecular level, and biological process (BP), which presents the functional activities of genes in organisms, tissues, and cells. According to the GO analysis, based on FDR<0.05, Group A had a total of 21 functional annotations, consisting of 9 CC, 8 BP, and 4 MF, and Group B had a total of 30 functional annotations, consisting of 11 CC, 18 BP, and 1 MF (
Figure 4). The CC and BP in Group A were identified to be common as the CC and BP in Group B, and MF exhibited distinct functionalities in each group. Moreover, Group A and Group B exhibited distinct patterns in GO. Group A had 4 MF, while Group B had 2 CC, 10 BP, and 1 MF. Notably, Group A annotations were predominantly associated with immune responses, whereas Group B annotations were mainly linked to the development and proliferation of cells and organs. In both Group A and Group B, GO: 0005882 of CC and GO: 0045109 of BP were ranked highly. According to the GO analysis results, it predominantly participates in the composition of filament and cellular cytoskeleton, and intermediate filament (GO: 0005882) constitutes cellular cytoskeleton proteins, and has a direct correlation with body weight [
44]. A single GO term was confirmed to be commonly associated with carcass traits in both groups. When comparing the FDR of the common GO function identified in Groups A and B, Group B exhibited lower FDR values. However, the difference in p-value and FDR value for the common GO between groups are attributed to the variations in the analysis, resulting from the number of input gene sets in each analysis group during the calculation of the gene list and GO term gene set. Therefore, it can be determined that utilizing the results of GO analysis conducted with a small gene set list is challenging for intergroup comparisons.
In the KEGG analysis, pathways such as ‘Staphylococcus aureus infection’ and ‘Estrogen signaling pathway’ were identified in Group A, and ‘Pyrimidine metabolism’ and ‘Nucleotide metabolism’ were additionally identified in Group B, confirming more gene functions and pathways elucidated in Group B compared to Group A. The KEGG analysis revealed the involvement of sufficient genes in pathways related to disease and immune function, such as the Staphylococcus aureus infection pathway (bta05150) and Estrogen signaling pathway (bta04915), and are predominantly associated with the Keratin genes family, which has been reported to be primarily associated with hair in cattle [
45]. Therefore, the pathway analysis groups genes participating in the same BP, but direct pathways specifically associated with the carcass traits in Hanwoo were not identified.
PPI analysis was performed for visualization to examine the functions of genes extracted from Go and KEGG (
Figure 5). In PPI analysis, interactions refer to the physical interactions between proteins, where proteins either bind to one another or interact to carry out biological functions. These experimentally observed interactions vary based on the proteins’ structure and function and are depicted in a PPI network, where each node symbolizes a protein and each edge represents an interaction between proteins. In this study, 106 nodes and 817 edges were confirmed, and the candidate genes of Group A and Group B were identified to exclude genes that are not involved in the networks from visualization. According to PPI analysis results, Group A was revealed to have more nodes and edges compared to Group B, indicating that the candidate genes of Group A have diverse set of interactions with other genes. Group A can form more complex networks due to having more genes and interactions compared to group B. This highlights the functional diversity of group A. PPI allows to confirm the unidentified interactions among proteins and genes, and can discover the expression mechanisms of proteins [
21].
GWAS is a comprehensive analysis technique that allows for the exploration of all genetic variations of SNPs associated with a specific trait, enabling the identification of genetic biomarkers [
46]. Family-based GWAS introduces a novel strategy to correct for population structure [
14,
15]. In this study, we aim to compare analysis results by using genetic information from parents to explore the genetic relationship matrix in families, including the father, mother, and their offspring. According to the comparison analysis between groups based on the utilization of both paternal and maternal genetic information, Group A was found to have a significantly higher number of SNP markers compared to Group B, and the p-values of SNP markers commonly verified were found to be more significant in Group A. The gene function associated with carcass traits was reported more in previous studies for the candidate genes identified in Group A. Although Group B revealed a higher number of functions and pathways in GO and KEGG analysis, only one function related to CWT was commonly identified between the groups in GO. When conducting GWAS analysis to investigate genes and functionalities related to carcass traits, incorporating genetic data from both parents and offspring results in more significant SNP markers. Thus, it is more beneficial for identifying causal genes related to carcass traits. Further analysis with an increased number of typical full-sib family groups is necessary for additional insights.