Go to Top Go to Bottom
Anim Biosci > Volume 38(5); 2025 > Article
Huang, Zhang, Luo, Zhang, Huang, Tian, Wu, and Li: Whole-genome resequencing reveals genetic differentiation and selection signatures among wild, local and commercial duck populations

Abstract

Objective

The purpose of this study was to systematically analyze the population genetic structure and genetic diversity among wild, local and commercial populations using whole-genome sequencing data from 416 individuals of 22 duck breeds in China and to further explore genetic pathways and candidate genes associated with importantly economic traits.

Methods

We performed principal component analysis, an unrooted neighbor-joining phylogenetic tree and ADMIXTURE to analyze the population structure. We compared the genetic diversity among wild, local and commercial populations using the effective population size, inbreeding coefficient, expected heterozygosity, observed heterozygosity, nucleotide diversity and regions of homozygosity. To detect selection signatures, we calculated the locus-specific branch length for local and commercial populations and calculated genetic differentiation coefficient and genetic diversity between egg and dual-purpose breeds.

Results

Wild, local and commercial duck populations formed three distinct genetic groups. The commercial population presented the lowest genetic diversity, the highest levels of inbreeding and the smallest effective population size. ADMIXTURE analysis also demonstrated that ducks were clearly divided into these three populations at K = 3. Selection signals in the commercial population were associated with growth and muscle development pathways, such as the mTOR signaling pathway and ErbB signaling pathway, and two key genomic regions (Chr1: 70.25 to 74.00 Mb and Chr2: 97.10 to 99.76 Mb) containing important genes, such as LRP6, BORCS5, and EDN1, were identified. In contrast, selection signals in the local population were associated with immune-related pathways involving NCAM2 and MPHOSPH6. Furthermore, PTGS2 and PLA2G4A genes were positively selected in egg breeds, whereas KCNK16, KCNK5, and KCNK17 genes were in dual-purpose breeds.

Conclusion

Because of artificial selection, wild, local and commercial populations presented obvious genetic differences. The selection signal analysis revealed that LRP6, BORCS5, and EDN1 are important for growth and muscle development; NCAM2 and MPHOSPH6 are for immune traits; and PTGS2 and PLA2G4A are for egg-related traits.

INTRODUCTION

Poultry, as an indispensable part of the human food chain, holds an important position in global agricultural production [1]. Ducks, as important poultry, are valuable not only for their meat and egg products but also for their diversity in different cultures and ecosystems [2]. China is one of the largest producers of poultry worldwide and is a rich resource of duck breeds [3]. These duck resources are genetically diverse and exhibit unique adaptability and production performance characteristics.
With the rapid development of molecular technology, whole-genome resequencing has become a powerful tool for studying animal genetic diversity, population structure, selection pressure, and demographic history [4]. Song et al [5] compared Xia’nan cattle with 178 samples from 5 breeds in a public database, which provided a basis for further breeding and improvement of cattle. Gao et al [6] analyzed genomic differences between Chinese indigenous pig breeds and commercial pigs, revealing distinct growth patterns between local and commercial breeds. Jiang et al [7] analyzed the whole-genome sequences of 101 domestic ducks to reveal the complex population history and significant gene flow between duck populations. Li et al [8] conducted a comparative analysis of eight Muscovy duck populations and reported significant differences in fatty acid metabolism and immune-related genes among different breeds. Jiang et al [7] collected genomic data from 78 ducks in China, Southeast Asia and South Asia, revealing the demographic history, gene flow, and domestication patterns of Southeast/South Asian and Chinese duck populations. Zhu et al [9] assembled three chromosome-level genomes and demonstrated the fat-suppressing function of NR2F2, which explains the variation in fat deposition between meat-type breeds. However, these studies are limited by the number of samples and breeds. To date, no study has carried out broader genetic comparative analyses among wild, local, and commercial populations including as many Chinese duck breeds as possible.
In this study, we collected 416 high-quality whole-genome resequencing datasets from 22 breeds, including wild, local, and commercial duck breeds, to represent a wide spectrum of genetic diversity in ducks in China. The objective of this study was to analyze the whole-genome data of these duck breeds comprehensively to reveal their genetic diversity, population genetic structure, selection signals, and candidate genes related to economically important traits. This study provides a scientific basis for Chinese duck protection and genetic improvement.

MATERIALS AND METHODS

Whole-genome resequencing data analysis

In this study, we downloaded publicly available whole-genome resequencing data of 484 Chinese duck individuals from the National Center for Biotechnology Information Sequence Read Archive database prior to November 26, 2023. We adopted a stringent and uniform pipeline to analyze raw genome data and filter single nucleotide polymorphism (SNP)s. Briefly, we first trimmed adaptors and removed low-quality reads via Trimmomatic (v0.39) with the following parameters: LEADING: 3 TRAILING: 3 SLIDINGWINDOW: 4:15 MINLEN: 36. We then mapped the clean reads to the Pekin duck (Anas platyrhynchos) reference genome (ZJU1.0) via the Burrows-Wheeler Aligner (v0.7.17) [10] with the parameters mem -M–R. We used SAMtools (1.13) [11] to sort bam files and the MarkDuplicates module in GATK (4.2.3.0) [12] to remove duplicates. We used mosdepth (0.3.3) [13] to calculate the sequencing depth of each sample and selected samples with sequencing depths greater than 3×. Finally, we called variants through the modules HaplotypeCaller, CombineGVCF, and GenotypeGVCF in GATK (4.2.3.0) and filtered SNPs via the module SelectVariants with the following parameters: QD<2.0 || MQ<40.0 || FS>60.0 || SOR>3.0 || MQRankSum<−12.5 || ReadPosRankSum<−8.0. We utilized BCFtools (v1.9) [11] with the parameters --min-af 0.01:minor -e ‘F_MISSING>0.9’ -m 2 -M 2 -v snps to obtain biallelic SNPs, and these SNPs were then more rigorously filtered via PLINK (v1.9) [14] with the parameters minor allele frequency<0.05 and Hardy-Weinberg equilibrium <0.000001.
After removing 68 samples with sequencing depths below 3×, we filtered 416 high-quality individual ducks with an average depth of 7× coverage for the following analysis. These samples were derived from Bioproject IDs: PRJNA599025, PRJNA658213, PRJNA888455, PRJNA409830, PRJNA419832, PRJNA450892, and PRJNA549423. We categorized these 416 ducks into wild populations (72), local populations (159), and commercial populations (185) (Table 1). The wild breed included 72 Mallard ducks, whereas the commercial populations included Pekin ducks, Mapleleaf ducks and Cherry Valley ducks. The local populations included 18 breeds, i.e., JinDing duck, Liancheng White duck, MaWang duck, Putian Black duck, SanSui duck, ShanMa duck, ShaoXing duck, Taiwan duck, Yulin-Wu duck, DongLan duck, YouXian duck, GaoYou duck, Ji’an Red duck, Longsheng-Cui duck, WenQiao duck, Rongshui-Xiang duck, Xilin-Ma duck, and Yulin-Ma duck.

Annotation of genetic variants

After quality control, we obtained 5,140,908 high-quality SNPs, and these SNPs were annotated via the ANNOVAR package [15] on the basis of the gene annotations of the Pekin duck reference genome (ZJU1.0). We divided these SNPs into five classes: exon regions, intron regions, splicing sites, upstream and downstream regions and intergenic regions.

Population genetics analysis

We performed principal component analysis (PCA) for 416 ducks by using PLINK (v1.9), with 5,140,908 SNPs shared among the three populations. We constructed an unrooted neighbor-joining (NJ) phylogenetic tree via iTOL [16] on the basis of the identity-by-state genetic distance matrix among 416 individuals calculated via PLINK (v1.9). Population genetic structure was inferred via ADMIXTURE software (v-1.3.0) [17], which employs a maximum likelihood-based method to estimate individual ancestries from multilocus SNP genotype datasets. We then performed visualization by using TBtools-II [18] while setting a predefined range of genetic clusters (K) from 2 to 12 to encompass the maximum number of lineages.
We estimated the population genetic diversity of the wild, local and commercial populations. The genetic diversity included the effective population size (Ne), inbreeding coefficient (F), expected heterozygosity (He), observed heterozygosity (Ho), nucleotide diversity (Pi), and regions of homozygosity (ROH). The effective population size (Ne) of different populations was estimated via SNeP (v1.1) [19] with independent SNPs calculated via PLINK (v1.9) with the parameters --indep-pairwise 50 5 0.2, and 72 samples from each population were used to avoid the effect of the number of samples. We selected 72 individuals from commercial and local populations on the basis of the corresponding proportion of each breed included in each population. We calculated F (--het), He (--hardy), Ho (--hardy) and ROH (--homozyg --homozyg-window-snp 50 --homozyg-window-het 1 --homozyg-snp 50 --homozyg-window-missing 5 --homozyg-window-threshold 0.05 --homozyg-density 500 --homozyg-gap 1000) for each duck of the wild population, local population, and commercial population, respectively. We calculated F, He, Ho, and ROH for each duck from the wild population, local population, and commercial population, respectively. We used VCFtools (v1.17) [12] to compute Pi values by scanning all genomic regions within 50-kb sliding windows with a step size of 25 kb. We compared the patterns of linkage disequilibrium (LD) among wild ducks, local ducks, and commercial ducks through pairwise LD estimates, which were measured as r2 values calculated from PopLDdecay (v3.42) [20]. The samples used for the LD estimates were the same as those used for the Ne calculation.

Selective signatures and functional enrichment

First, we calculated genetic differentiation coefficient (Fst) values for the pairwise populations among the wild population, local population and commercial population using VCFtools (v1.17) with 50-kb sliding windows and a step size of 25 kb. To identify selective genomic signals in commercial or local populations compared with those in the other two populations, we then derived the locus-specific branch length (LSBL) [21] from the pairwise Fst distances calculated as follows: LSBLCM–(LC+WT) = (FstCM–LC + FstCM–WT − FstLC–WT)/2 and LSBLLC–(CM+WT) = (FstCM–LC + FstLC–WT − FstCM–WT)/2, where FstCM–LC refers to the Fst value between the commercial population and the local population, FstCM–WT refers to the Fst value between the commercial population and the wild population, and FstLC–WT refers to the Fst value between the local population and the wild population. We selected the top 1% of LSBL values of the genomic windows as candidate regions. Moreover, for the local population, we also analyzed selective signatures between egg and dual-purpose breeds by calculating Fst and Pi values from VCFtools (v1.17) with 50-kb sliding windows and a step size of 25 kb. We annotated the genes related to these important genomic regions, and we utilized the clusterProfiler R package [22] for the Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses of these candidate genes. The GO and KEGG pathways were selected from the top 10 enriched pathways in terms of Benjamini-adjusted p values. For some importantly selected genomic regions, we employed LSBL and Pi values with a 10-kb sliding window and a 5-kb step size across wild, local, and commercial populations to conduct comparisons and to identify candidate genes within these selected genomic regions.

RESULTS

Whole-genome sequencing and single nucleotide polymorphism calling

We obtained 416 ducks with a wide and representative geographic distribution across various regions in China, yielding 5,140,908 high-quality SNPs distributed across 29 chromosomes (Figures 1A, 1B). As shown in Figure 1C, among these SNPs, we identified 4,321,515 SNPs in the wild population, 4,491,942 in the local population, and 3,746,643 in the commercial population. There were 2,832,649 SNPs shared among these three populations, accounting for approximately 55.1% of the total SNPs. The commercial population had 195,701 unique SNPs, the local population had 222,579 unique SNPs, and the wild population had 135,613 unique SNPs.
In the functional annotation of SNPs, the percentages of SNPs located in introns, intergenic regions, introns and exons of noncoding RNAs, untranslated regions3 regions, downstream, and exonic regions were very similar among the wild, local, and commercial populations. For example, the majority of SNPs are within introns, accounting for approximately 54.2%, 54.06%, and 54% of the total number of SNPs in the wild, local, and commercial populations, respectively. In contrast, the proportions of SNPs located in exons were relatively low, i.e., approximately 1.4% for these three populations (Figure 1D).

Population genomic analysis

PCA clearly revealed that individuals from the wild, local, and commercial populations were clustered, with PC1 and PC2 accounting for 30.61% and 11.57% of the total variation, respectively (Figure 2A). The NJ tree also confirmed that the ducks were divided into three groups: wild, local, and commercial populations (Figure 2B). Moreover, ADMIXTURE analysis indicated that the wild, local, and commercial populations were clearly distinguishable at K = 3, with a log-likelihood value of −1,637,271,699.91 and a cross-validation error of 0.49702 (Figure 2C).
As shown in Figure 3A, in terms of Ho, He, and Pi, commercial duck breeds presented the lowest values (He = 0.22±0.15, Ho = 0.19±0.13, and Pi = 0.0011±0.0009), suggesting a relatively high degree of inbreeding. This was supported by the inbreeding coefficient, where commercial breeds also had the highest value (F = 0.26±0.15). The commercial duck breeds also presented the longest average ROH length (1905.33±1371.00 kb) among the three populations. The commercial population had the smallest Ne at generation 999, except that the Ne of the wild population was lower than that of the commercial population before generation 150, while the local population consistently had the largest Ne across all generations (Figure 3B). Compared with the other two populations, the commercial population presented a greater extent of LD (r2) (Figure 3C). These results were consistent with our general understanding of the wild, local, and commercial populations.

Genomic signatures

As shown in Figure 4A, we identified 152 genes by using the top 1% LSBLCM–(LC+WT) genome regions of the commercial duck breeds. We subsequently conducted KEGG and GO enrichment analyses for these genes (Figures 4D, 4E). For the KEGG pathways, the mTOR signaling pathway and ErbB signaling pathway were significantly enriched (p<0.05). The GO enrichment analysis revealed that the enriched pathways were highly related to muscle development, such as cartilage development, connective tissue development, myofibrils and contractile fibers. Furthermore, we analyzed two important genomic regions by calculating LSBL and Pi with a 10-kb sliding window and 5-kb step and identified the candidate genes LRP6, BORCS5, MANSCI, BCL2L14, WNT7B, and PRR5 in the region Chr1: 70.25 to 74.00 Mb (Figure 4B) and BRD9 and EDN1 in the region Chr2: 97.10 to 99.76 Mb (Figure 4C).
Similarly, we detected 277 genes involved by using the top 1% of LSBLLC–(CM+WT) values (Figure 5A). Through KEGG enrichment analysis, other types of O-glycan biosynthesis and mucin type O-glycan biosynthesis were significantly enriched (p<0.05), which may be linked to the immune system and disease resistance of the local population (Figure 5E). The significantly enriched GO pathways were regulation of axonogenesis, regulation of neuron projection development, protein glycosylation, and macromolecule glycosylation (Figure 5D). These pathways have implications for neurodevelopment and protein glycosylation processes. By using LSBL and Pi values with a 10-kb sliding window and 5-kb step for two important genomic regions, we identified the candidate gene NCAM2 in the region Chr1: 103.60 to 107.58 Mb (Figure 5B) and MPHOSPH6 in the region Chr12: 3.53 to 6.95 Mb (Figure 5C).
We identified 415 genes in the overlap of the top 5% Fst and Pi ratios between egg-laying and dual-purpose ducks in the local population. For these genes, KEGG pathway analysis revealed that the ovarian steroidogenesis pathway and protein digestion and absorption pathway were significantly enriched (p<0.05) (Figure 6C). GO analysis revealed that the regulation of monoatomic anion transport were the most significantly enriched terms (Figure 6B). We identified two candidate genes, i.e., PTGS2 and PLA2G4A, which are involved in the ovarian steroidogenesis pathway, that are positively selected in the egg laying population, whereas KCNK16, KCNK5 and KCNK17 are positively selected in dual-purpose ducks and are involved in the fundamental processes of protein digestion and absorption and monoatomic anion transport.

DISCUSSION

In this study, we conducted a preliminary analysis of the most comprehensive duck dataset of whole-genome resequencing data from 416 wild, local, and commercial individuals in China. The three populations presented significant differences in genetic diversity and in the distribution of SNPs. The local population has the highest numbers of total and unique SNPs, indicating greater genetic diversity. The wild population has slightly fewer SNPs but still maintains high diversity, whereas the commercial population has the fewest SNPs and a greater proportion of unique SNPs, suggesting that it has been subjected to stronger artificial selection pressure. The trends of Pi, F, He, and Ho across wild, local and commercial populations indicated that the wild population presented relatively high genomic diversity, which was confirmed in previous whole-genome resequencing analyses [23]. The analysis of ROH and LD decay also revealed the rich genetic diversity and the presence of inbreeding in the wild population. The Ne of the local population was the highest among the three populations, largely because it included 18 breeds. Commercial populations presented lower genetic diversity and higher inbreeding coefficients, which are likely associated with strong artificial selection and breeding strategies [24]. Local populations exhibit relatively high genetic diversity, possibly due to their long-term adaptation to specific environmental conditions and limited human intervention [25]. These differences reflect the distinct evolutionary trajectories of each population in terms of breeding and environmental adaptation.
In the commercial population, we detected selection signals for genes associated with cell growth, proliferation, and differentiation, such as those in the mTOR and ErbB signaling pathways. These signals confirmed that commercial populations have been artificially selected for growth rate and meat quality [26,27]. It has been reported that LRP6 is involved in the bone construction process [28] and that BRD9 has a positive effect on the development of muscles and bones [29]. ARHGAP8 is reported for the first time in this study. Additionally, EDN1, WNT7B, PARVB, and PRR5 are related to the development of muscle and adipose tissue. EDN1 has a significant effect on the development of meat marbling [30]. WNT7B promotes bone formation and muscle development [31]. PARVB establishes and maintains cell polarity and the actin cytoskeleton, affecting muscle development [32]. PRR5 is associated with fat deposition in pigs [33].
In the local population, we observed selection signals related to the immune system and disease resistance, such as the O-glycosylation pathway [34]. This may reflect the adaptation of local duck breeds to specific environmental pressures, such as challenges from pathogens and parasites. Selection signals related to neural development and glycosylation processes may influence the environmental adaptability and reproductive capacity of local duck breeds [35,36]. NCAM2 promotes neural remodeling and synaptic plasticity, supporting adaptation to rapid environmental changes and stress [37]. Polymorphisms in the MPHOSPH6 gene are associated with an increased risk of several diseases, suggesting that it may play a key role in immune regulation and inflammatory responses [3840].
To identify selection signals between egg-laying and dual-purpose ducks in the local population, we identified signaling pathways associated with reproduction and egg production, such as ovarian steroid production. Interestingly, we found that PTGS2 and PLA2G4A, which are highly expressed in egg-laying ducks, are involved in ovarian steroid production and may play important roles in egg production. PTGS2 on Chr08: 17.77 to 17.95 Mb plays a key role in prostaglandin synthesis and exerts an important impact on ovarian function and ovulation. PLA2G4A indirectly regulates prostaglandin synthesis, thereby affecting egg production and egg production performance. These two genes were reported to influence egg production performance in chickens [41]. We also detected KCNK16, KCNK5, and KCNK17 at Chr03: 89.45 to 90.18 Mb, which are positively selected in dual-purpose ducks. According to the results of the GO enrichment analysis, these genes are involved in monoatomic anion transport and protein digestion and are critical for maintaining acid-base balance, respiratory function, and the stability of the intracellular environment.

CONCLUSION

To date, we have conducted the most comprehensive preliminary analysis of whole-genome resequencing data from 416 wild, local, and commercial ducks in China. Owing to strong artificial selection, the commercial population has the lowest genetic diversity and the highest inbreeding level. The selection signals in the commercial population are related to growth and muscle development, with LRP6, BORCS5, MANSCI, BCL2L14, WNT7B, PRR5, BRD9, and EDN1 likely playing key roles. In contrast, selection signals in the local population were associated with immune-related pathways involving NCAM2 and MPHOSPH6. Furthermore, the PTGS2 and PLA2G4A genes were positively selected in egg breeds, whereas the KCNK16, KCNK5, and KCNK17 genes were positively selected in dual-purpose breeds. This study provides valuable insights into the genetic mechanisms of local and commercial populations, supporting the development of strategies for conserving genetic diversity and optimizing breeding management.

Notes

CONFLICT OF INTEREST

No potential conflict of interest relevant to this article was reported.

AUTHORS’ CONTRIBUTION

Conceptualization: Huang Z, Li X.

Data curation: Huang Z, Zhang L, Luo M.

Formal analysis: Huang Z, Zhang L.

Writing - original draft: Huang Z, Li X.

Writing - review & editing: Huang Z, Zhang L, Luo M, Zhang X, Huang Y, Tian Y, Wu Z, Li X.

FUNDING

This work was supported by Pearl River S&T Nova Program of Guangzhou (201906010040), and Modern Agriculture Waterfowl Industry Technology System Innovation Team of Guangdong (2020KJ137), China.

ACKNOWLEDGMENTS

Not applicable.

DATA AVAILABILITY

Upon reasonable request, the datasets of this study can be available from the corresponding author.

ETHICS APPROVAL

Not applicable.

DECLARATION OF GENERATIVE AI

No AI tools were used in this article.

SUPPLEMENTARY MATERIAL

Not applicable.

Figure 1
Geographic distributions and genetic variants of three duck populations. (A) Geographical distribution of 22 duck breeds. (B) The distribution of SNPs across 29 chromosomes. (C) The shared and unique SNPs among the wild, local, and commercial duck populations. (D) Functional annotation of SNPs for wild, local, and commercial duck populations. SNP, single nucleotide polymorphism.
ab-24-0643f1.jpg
Figure 2
Population structure of 22 duck breeds based on 5,140,908 common single nucleotide polymorphisms. (A) Principal component analysis (PCA) clustering of 416 samples. (B) Identity-by-state (IBS) neighbor joining (NJ) tree constructed with 416 samples. The wild population encompasses mallards (MALs). The local population is represented by various breeds, including JinDing duck (JRD), Liancheng White duck (LCW), MaWang duck (MWD), Putian Black duck (PBD), SanSui duck (SSD), ShanMa duck (SMD), ShaoXing duck (SXD), Taiwan duck (TWD), Yulin-Wu duck (YWD), DongLan duck (DLD), YouXian duck (YXD), GaoYou duck (GYD), Ji’an red duck (JRD), Longsheng-Cui duck (LCD), WenQiao duck (WQD), Rongshui-Xiang duck (RSD), Xilin-Ma duck (XMD), and Yulin-Ma duck (YMD). Commercial populations include Pekin ducks (PKD), Mapleleaf ducks (MLD) and Cherry Valley ducks (CVDs). (C) Model-based clustering of 416 samples calculated via ADMIXTURE with K values ranging from 2 to 12.
ab-24-0643f2.jpg
Figure 3
Genetic diversity of the three duck populations. (A) Expected heterozygosity (He), observed heterozygosity (Ho), nucleotide diversity (Pi), length of regions of homozygosity (bp), and inbreeding coefficient (F) values of three duck populations. (B) Effective population size for three populations in different generations. (C) Estimation of the genome-wide linkage disequilibrium (LD) decay for three duck populations.
ab-24-0643f3.jpg
Figure 4
Genome-wide selective signals of commercial populations compared with those of wild and local populations. (A) Manhattan plots of LSBLCM–(LC+WT) values between commercial and the other two populations. (B) LSBLCM–(LC+WT) and Pi with a 10-kb sliding window and 5-kb step and candidate genes in the Chr1: 70.25–74.00 Mb region. (C) LSBL and Pi with the 10-kb sliding window and 5-kb step and candidate genes in the region in the region Chr2: 97.10–99.76 Mb. (D) Gene Ontology enrichment analysis of genes located in the top 1% of the LSBLCM–(LC+WT). (E) Kyoto Encyclopedia of Genes and Genomes enrichment analysis of genes located in the top 1% of the LSBLCM–(LC+WT). LSBL, locus-specific branch length; Pi, nucleotide diversity.
ab-24-0643f4.jpg
Figure 5
Genome-wide selective signals of local populations compared with those of wild and commercial populations. (A) Manhattan plots of LSBL values between the local population and the other two populations. (B) LSBLLC–(CM+WT) and Pi with a 10-kb sliding window and 5-kb step and candidate genes in the region Chr1: 103.6–107.57 Mb. (C) LSBLLC–(CM+WT) and Pi with the 10-kb sliding window and 5-kb step and candidate genes in the region Chr12: 3.52–6.95 Mb. (D) Gene Ontology enrichment analysis of genes located in the top 1% of the LSBLLC–(CM+WT). (E) Kyoto Encyclopedia of Genes and Genomes enrichment analysis of genes located in the top 1% of LSBLLC–(CM+WT). LSBL, locus-specific branch length; Pi, nucleotide diversity.
ab-24-0643f5.jpg
Figure 6
Genome-wide selective signals of the egg-laying population compared with those of dual-purpose populations. (A) Manhattan plots of Fst and Pi ratio values between egg-laying and dual-purpose populations. (B) Gene Ontology enrichment analysis of genes located in the top 5% of the Fst genes. (C) Kyoto Encyclopedia of Genes and Genomes enrichment analysis of genes located in the top 5% of the Fst genes. Fst, genetic differentiation coefficient; Pi, nucleotide diversity.
ab-24-0643f6.jpg
Table 1
Sample information for 416 ducks used in this study
Population Breed Count Type Location Sequencing depth (×)
Wild Mallard 72 Wild Zhejiang/Ningxia 7.54
Local JinDing duck 11 Egg Longhai, Fujian 6.11
Liancheng White duck 3 Egg Liancheng, Fujian 9.74
Mawang duck 3 Egg Mawang, Chongqing 9.05
Putian black duck 3 Egg Putian, Fujian 9.78
Sansui duck 3 Egg Sansui, Guizhou 8.98
Shanma duck 21 Egg Longyan, Fujian 5.47
Shaoxing duck 24 Egg Shaoxing, Zhejiang 4.75
Taiwan duck 3 Egg Taiwan 9.25
Youxian duck 3 Egg You, Hunan 10.19
Yulin-Wu duck 9 Dual-purpose Yulin, Guangxi 8.64
Donglan duck 9 Dual-purpose Donglan, Guangxi 8.52
Gaoyou duck 11 Dual-purpose Gaoyou, Jiangsu 6.24
Ji’an red duck 3 Dual-purpose Ji’an, Jiangxi 9.90
Longsheng-Cui duck 13 Dual-purpose Longsheng, Guangxi 8.86
Wenqiao duck 10 Dual-purpose Wenqiao, Guangxi 9.27
Rongshui-Xiang duck 10 Dual-purpose Rongshui, Guangxi 9.66
Xilin-Ma duck 10 Dual-purpose Xilin, Guangxi 8.87
Yulin-Ma duck 10 Dual-purpose Yulin, Guangxi 8.85
Commercial Pekin duck 160 Meat Beijing 6.58
Mapleleaf duck 7 Meat Beijing 4.93
CherryValley duck 18 Meat Beijing/Zhejiang 4.57

REFERENCES

1. Mottet A, Tempio G. Global poultry production: current state and future outlook and challenges. Worlds Poult Sci J 2017;73:245–56. https://doi.org/10.1017/s0043933917000071
crossref
2. Péron G, Koons DN. Integrated modeling of communities: parasitism, competition, and demographic synchrony in sympatric ducks. Ecology 2012;93:2456–64. https://doi.org/10.1890/11-1881.1
crossref pmid
3. Wang Y, Zhang R, Li J, et al. Comprehensive resistome analysis reveals the prevalence of NDM and MCR-1 in Chinese poultry production. Nat Microbiol 2017;2:16260. https://doi.org/10.1038/nmicrobiol.2016.260
crossref pmid
4. Fuentes-Pardo AP, Ruzzante DE. Whole-genome sequencing approaches for conservation biology: advantages, limitations and practical recommendations. Mol Ecol 2017;26:5369–406. https://doi.org/10.1111/mec.14264
crossref pmid
5. Song X, Yao Z, Zhang Z, et al. Whole-genome sequencing reveals genomic diversity and selection signatures in Xia’nan cattle. BMC Genomics 2024;25:559. https://doi.org/10.1186/s12864-024-10463-3
crossref pmid pmc
6. Gao J, Sun L, Pan H, et al. Genomic dissection through whole-genome resequencing of five local pig breeds from Shanghai, China. Animals 2023;13:3727. https://doi.org/10.3390/ani13233727
crossref pmid pmc
7. Jiang F, Lin R, Xiao C, et al. Analysis of whole-genome re-sequencing data of ducks reveals a diverse demographic history and extensive gene flow between Southeast/South Asian and Chinese populations. Genet Sel Evol 2021;53:35. https://doi.org/10.1186/s12711-021-00627-0
crossref pmid pmc
8. Li T, Wang Y, Zhang Z, Ji C, Zheng N, Huang Y. A comparative analysis reveals the genomic diversity among 8 Muscovy duck populations. G3 2024;14:jkae112. https://doi.org/10.1093/g3journal/jkae112
crossref pmid pmc
9. Zhu F, Yin ZT, Wang Z, et al. Three chromosome-level duck genome assemblies provide insights into genomic variation during domestication. Nat Commun 2021;12:5932. https://doi.org/10.1038/s41467-021-26272-1
crossref pmid pmc
10. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009;25:1754–60. https://doi.org/10.1093/bioinformatics/btp324
crossref pmid pmc
11. Danecek P, Bonfield JK, Liddle J, et al. Twelve years of SAMtools and BCFtools. Gigascience 2021;10:giab008. https://doi.org/10.1093/gigascience/giab008
crossref pmid pmc
12. Danecek P, Auton A, Abecasis G, et al. The variant call format and VCFtools. Bioinformatics 2011;27:2156–8. https://doi.org/10.1093/bioinformatics/btr330
crossref pmid pmc
13. Pedersen BS, Quinlan AR. Mosdepth: quick coverage calculation for genomes and exomes. Bioinformatics 2018;34:867–8. https://doi.org/10.1093/bioinformatics/btx699
crossref pmid pmc
14. Purcell S, Neale B, Todd-Brown K, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007;81:559–75. https://doi.org/10.1086/519795
crossref pmid pmc
15. Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 2010;38:e164. https://doi.org/10.1093/nar/gkq603
crossref pmid pmc
16. Letunic I, Bork P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res 2016;44:W242–5. https://doi.org/10.1093/nar/gkw290
crossref pmid pmc
17. Alexander DH, Lange K. Enhancements to the ADMIXTURE algorithm for individual ancestry estimation. BMC Bioinformatics 2011;12:246. https://doi.org/10.1186/1471-2105-12-246
crossref pmid pmc
18. Chen C, Wu Y, Li J, et al. TBtools-II: A “one for all, all for one” bioinformatics platform for biological big-data mining. Mol Plant 2023;16:1733–42. https://doi.org/10.1016/j.molp.2023.09.010
crossref pmid
19. Barbato M, Orozco-terWengel P, Tapio M, Bruford MW. SNeP: a tool to estimate trends in recent effective population size trajectories using genome-wide SNP data. Front Genet 2015;6:109. https://doi.org/10.3389/fgene.2015.00109
crossref pmid pmc
20. Zhang C, Dong SS, Xu JY, He WM, Yang TL. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics 2019;35:1786–8. https://doi.org/10.1093/bioinformatics/bty875
crossref pmid
21. Shriver MD, Kennedy GC, Parra EJ, et al. The genomic distribution of population substructure in four populations using 8,525 autosomal SNPs. Hum Genomics 2004;1:274. https://doi.org/10.1186/1479-7364-1-4-274
crossref pmid pmc
22. Wu T, Hu E, Xu S, et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innovation 2021;2:100141. https://doi.org/10.1016/j.xinn.2021.100141
crossref pmid pmc
23. Zhang Z, Jia Y, Almeida P, et al. Whole-genome resequencing reveals signatures of selection and timing of duck domestication. Gigascience 2018;7:giy027. https://doi.org/10.1093/gigascience/giy027
crossref pmid pmc
24. Hulsegge I, Calus M, Hoving-Bolink R, Lopes M, Megens HJ, Oldenbroek K. Impact of merging commercial breeding lines on the genetic diversity of Landrace pigs. Genet Sel Evol 2019;51:60. https://doi.org/10.1186/s12711-019-0502-6
crossref pmid pmc
25. Abebe AS, Mikko S, Johansson AM. Genetic diversity of five local Swedish chicken breeds detected by microsatellite markers. PLOS ONE 2015;10:e0120580. https://doi.org/10.1371/journal.pone.0120580
crossref pmid pmc
26. Zhang P, Liang X, Shan T, et al. mTOR is necessary for proper satellite cell activity and skeletal muscle regeneration. Biochem Biophys Res Commun 2015;463:102–8. https://doi.org/10.1016/j.bbrc.2015.05.032
crossref pmid pmc
27. Liu Y, Wei Y, Dou Y, et al. Effect of miR-149-5p on intramuscular fat deposition in pigs based on metabolomics and transcriptomics. BMC Genomics 2023;24:293. https://doi.org/10.1186/s12864-023-09382-6
crossref pmid pmc
28. Cavani L, Poindexter MB, Nelson CD, Santos JEP, Peñagaricano F. Gene mapping, gene-set analysis, and genomic prediction of postpartum blood calcium in Holstein cows. J Dairy Sci 2022;105:525–34. https://doi.org/10.3168/jds.2021-20872
crossref pmid
29. Lukic B, Curik I, Drzaic I, et al. Genomic signatures of selection, local adaptation and production type characterisation of East Adriatic sheep breeds. J Anim Sci Biotechnol 2023;14:142. https://doi.org/10.1186/s40104-023-00936-y
crossref pmid pmc
30. de Las Heras-Saldana S, Chung KY, Kim H, Lim D, Gondro C, van der Werf JHJ. Differential gene expression in longissimus dorsi muscle of hanwoo steers—new insight in genes involved in marbling development at younger ages. Genes 2020;11:1381. https://doi.org/10.3390/genes11111381
crossref pmid pmc
31. Chen H, Ji X, Lee WC, et al. Increased glycolysis mediates Wnt7b-induced bone formation. FASEB J 2019;33:7810–21. https://doi.org/10.1096/fj.201900201rr
crossref pmid pmc
32. Matsuda C, Kameyama K, Suzuki A, et al. Affixin activates Rac1 via betaPIX in C2C12 myoblast. FEBS Lett 2008;582:1189–96. https://doi.org/10.1016/j.febslet.2008.01.064
crossref pmid
33. Caetano-Anollés K, Rhodes JS, Garland T, et al. Cerebellum transcriptome of mice bred for high voluntary activity offers insights into locomotor control and reward-dependent behaviors. PLOS ONE 2016;11:e0167095. https://doi.org/10.1371/journal.pone.0167095
crossref pmid pmc
34. Chang YH, Weng CL, Lin KI. O-GlcNAcylation and its role in the immune system. J Biomed Sci 2020;27:57. https://doi.org/10.1186/s12929-020-00648-9
crossref pmid pmc
35. Price EO. Behavioral development in animals undergoing domestication. Appl Anim Behav Sci 1999;65:245–71. https://doi.org/10.1016/s0168-1591(99)00087-8
crossref
36. Hiyama G, Kansaku N, Kinoshita M, et al. Changes in post-translational modifications of prolactin during development and reproductive cycles in the chicken. Gen Comp Endocrinol 2009;161:238–45. https://doi.org/10.1016/j.ygcen.2009.01.007
crossref pmid
37. Sytnyk V, Leshchyns’ka I, Schachner M. Neural cell adhesion molecules of the immunoglobulin superfamily regulate synapse formation, maintenance, and function. Trends Neurosci 2017;40:295–308. https://doi.org/10.1016/j.tins.2017.03.003
crossref pmid
38. Zhang Y, Wang S, Wen X, Zhang S, Yang Y. Association of ACYP2 and MPHOSPH6 genetic polymorphisms with the risk of hepatocellular carcinoma in chronic hepatitis B virus carriers. Oncotarget 2017;8:86011–9. https://doi.org/10.18632/oncotarget.20846
crossref pmid pmc
39. Song Y, Yan M, Li J, Li J, Jin T, Chen C. Association between TNIP1, MPHOSPH6 and ZNF208 genetic polymorphisms and the coronary artery disease risk in Chinese Han population. Oncotarget 2017;8:77233–40.
crossref pmid pmc
40. Yang X, Zhang Y, Li W, et al. Association between MPHOSPH6 gene polymorphisms and IgA nephropathy risk in a Chinese Han population. Oncotarget 2017;8:72375–80.
crossref pmid pmc
41. Kim J, Macharia JK, Kim M, et al. Runs of homozygosity analysis for selection signatures in the Yellow Korean native chicken. Anim Biosci 2024;37:1683–91. https://doi.org/10.5713/ab.24.0092
crossref


Editorial Office
Asian-Australasian Association of Animal Production Societies(AAAP)
Room 708 Sammo Sporex, 23, Sillim-ro 59-gil, Gwanak-gu, Seoul 08776, Korea   
TEL : +82-2-888-6558    FAX : +82-2-888-6559   
E-mail : editor@animbiosci.org               

Copyright © 2025 by Asian-Australasian Association of Animal Production Societies.

Developed in M2PI

Close layer
prev next