Results of genome sequencing of pooled DNA from each of 10 chickens per line showed 5.1×, 6.5×, and 8.2× genome coverage for HMC, LMC, and RAN lines, respectively. The total number of SNPs was 5.3, 6.0, and 4.4 million (~0.5% of template genome) in HMC, LMC, and RAN genome, respectively (data not shown). The large number of SNPs per examined chicken line was based on data of at least 2 read coverage depths (number of read counts per nucleotide location). To identify genetic biomarkers that are responsible for regulating muscle color, high quality (Q call >40) SNPs uniquely found in HMC and LMC were selected by removing overlapping SNPs between HMC and LMC simultaneously. Then, possibly fixed mutations showing >75% SNP rates were chosen as reliable marker SNPs. As a result, a total of 1,134,655 unique SNPs were identified throughout the HMC and LMC chicken genomes. The number of SNPs in each chromosome is shown in
Figure 1. The HMC line (
Figure 1A) showed 576,886 SNPs including 229,415 parental and 347,471 non-parental segregations, while the LMC line (
Figure 1B) showed 557,769 SNPs including 253,055 parental- and 304,714 non-parental segregations (Information for all SNPs for HMC and LMC will be provided upon request). When SNPs were grouped by feature types of chromosome regions, ~50% of SNPs were in the intergenic (heterochromatic) regions and 26,800 SNPs were found in CDS (protein coding regions). Around 70% of SNPs in CDS regions were synonymous mutations that do not induce amino acid changes. A total of 2,884 SNPs (1,307 for HMC and 1,577 for LMC) could induce non-synonymous-, frameshift-, nonsense-, and no-start mutations in proteins, suggesting that the 2,884 SNPs may be inducible mutations that are part of the genetic components regulating muscle color in chickens. Of those 2,884 candidate SNPs associated with amino acid changes, 679 and 924 SNPs were parental (RAN line) segregations for HMC and LMC, respectively (data not shown; information for all SNPs for HMC and LMC will be provided upon request). To focus on the novel mutations different from parental genotypes, SNPs of non-parental segregation showing non-synonymous changes in CDS region were considered as causal mutations in this study. Potentially causal SNPs showing over 10 read depths (considered to be more reliable candidate genetic markers) were chosen for further bioinformatic analysis as described in Materials and Methods. In addition, re-scanning of each SNP position for reliable, mutational, and causal protein coding SNPs was conducted to reduce false positives due to possible errors (e.g. SNP detection by reading a position in a chicken line, which was not covered the region of the SNP in the other chicken line) in the SNP calling process, using Seqman-Pro viewer program. This process yielded 15 more reliable SNPs including 1 for HMC and 14 for LMC (
Table 1) and gene names containing SNPs that are listed in
Table 2. Genes included are: trafficking protein particle complex 8 (
TRAPPC8) for HMC and apolipoprotein B antigen (
APOB), ataxia telangiectasia mutated (
ATM), coiled-coil domain containing 88A (
CCDC88A; transcript variant 1), complement factor H (
CFH), collagen, type XXVIII, alpha 1 (
COL28A1), glycoprotein IX (
GP9 [platelet]), ligase IV (
LIG4, DNA, ATP-dependent), melanoma inhibitory activity family, member 3 (
MIA3), microtubule associated monooxygenase, calponin and LIM domain containing 2 (
MICAL2), RE1-silencing transcription factor (
REST), small nuclear RNA activating complex, polypeptide 1 (
SNAPC1, 43 kDa; transcript variant 1), TNF receptor-associated factor 1 (
TRAF1), UTP18 small subunit processome component homolog (
UTP18 [yeast]), and
LOC428119 (uncharacterized protein) for LMC.