- Research article
- Open Access
Analysis of genetic diversity in Brown Swiss, Jersey and Holstein populations using genome-wide single nucleotide polymorphism markers
BMC Research Notes volume 5, Article number: 161 (2012)
Studies of genetic diversity are essential in understanding the extent of differentiation between breeds, and in designing successful diversity conservation strategies. The objective of this study was to evaluate the level of genetic diversity within and between North American Brown Swiss (BS, n = 900), Jersey (JE, n = 2,922) and Holstein (HO, n = 3,535) cattle, using genotyped bulls. GENEPOP and FSTAT software were used to evaluate the level of genetic diversity within each breed and between each pair of the three breeds based on genome-wide SNP markers (n = 50,972).
Hardy-Weinberg equilibrium (HWE) exact test within breeds showed a significant deviation from equilibrium within each population (P < 0.01), which could be a result of selection, genetic drift and inbreeding within each breed. Hardy-Weinberg test also confirmed significant heterozygote deficit in each breed over several loci. Moreover, results from population differentiation tests showed that the majority of loci have alleles or genotypes drawn from different distributions in each breed. Average gene diversity, expressed in terms of observed heterozygosity, over all loci in BS, JE and HO was 0.27, 0.26 and 0.31, respectively. The proportion of genetic diversity due to allele frequency differences among breeds (Fst) indicated that the combination of BS and HO in an ideally amalgamated population had higher genetic diversity than the other pairs of breeds.
Results suggest that the three bull populations have substantially different gene pools. BS and HO show the largest gene differentiation and jointly the highest total expected gene diversity compared to when JE is considered. If the loss of genetic diversity within breeds worsens in the future, the use of crossbreeding might be an option to recover genetic diversity, especially for the breeds with small population size.
The importance of genetic diversity in livestock is directly related to the need for genetic improvement of economically important traits as well as to facilitate rapid adaptation to potential changes in breeding goals . Estimates of effective population size in commercial dairy populations, including Brown Swiss, Holstein and Jersey are decreasing at alarming rates to be of serious concern to the livestock industry . Recently pedigree-based studies revealed increasing rates of inbreeding and coancestry in Canadian Jersey and Holstein populations . Studies of genetic diversity are useful to the understanding of evolution of breeds, gene pool development and the level of differentiation among breeds [1, 4, 5]. Such studies are quite important for prioritizing conservation of breeds with critically low levels of diversity.
Hardy-Weinberg Equilibrium (HWE) states that in a large random mating population with no selection, mutation, or migration, the allele frequencies and the genotype frequencies are constant from generation to generation, and, hence, a simple relationship between the allele frequencies and the genotype frequencies exists . The theory of HWE has played an important role in the development of population genetics, and has frequently been used as a basis for genetic inferences .
Tests for departures from Hardy-Weinberg proportions are often used to check on random mating in populations, and the deviations from the expected frequency of homozygotes are used to estimate inbreeding coefficients . The same approach was used for estimating the inbreeding coefficient of a population by calculating the excess of homozygotes with respect to Hardy-Weinberg equilibrium expectations . The role that the variance due to differences in gene frequencies among subpopulations play in the total genotypic frequencies from amalgamating subpopulations has been demonstrated by several studies [10, 11]. Fixation indices (Fis, Fit and Fst) are the most widely used parameters for studying the genetic differentiation of populations. These indices have been originally defined in terms of the correlations of two uniting gametes [12–14]. Accordingly, Fit is the correlation between uniting gametes that generate an individual relative to the gametes of the total population. Fis is the average over all subpopulations of the correlation between uniting gametes that generate an individual relative to the gametes of their own subpopulation. Fst is the correlation between random gametes within subpopulations, relative to gametes of the total population. For example, in this study, an ideally amalgamated population of Brown Swiss and Jersey bulls would have each breed as a subpopulation. Furthermore, the relationship between fixation indices and measures of identity by decent have been illustrated in previous studies [15, 16].
Fixation indices can also be formulated entirely in terms of the allelic and genotypic frequencies in the population [11, 17, 18]. In this case the fixation indices can be expressed in terms of ratios of heterozygosities. The Fst is equal to 0 when the same allele is fixed in all populations . Allelic and genotypic frequencies may fluctuate because of finite subpopulation sizes or random variation in evolutionary forces . In view of different factors affecting probabilities of gene identity in subdivided populations, the fixation indices were redefined in terms of the observed and expected heterozygosity based on allelic and genotypic frequencies in a population . In addition, measures of inter-population gene differences and coefficients of gene differentiation (Dst and Gst, respectively) have been extensively used to describe the level of genetic diversity [11, 17].
The objective of this study was to assess the status of genetic diversity within and between BS, JE and HO breeds, using bulls genotyped with a dense SNP marker map through detailed analyses carried out via GENEPOP and FSTAT software.
Genome-wide SNP data for the three breeds were received from the Animal Improvement Programs Laboratory, USDA (Beltsville, MD, USA) in November 2009. The data consisted of 900, 2,922 and 3,535 Brown Swiss (BS), Jersey (JE) and Holstein (HO) bulls, respectively, all genotyped with the Illumina BovineSNP50K BeadChip (Illumina Inc., San Diego, CA) as part of the North American collaboration in genomic prediction in dairy cattle . Genotypes for a total of 50,972 SNPs were available for the analyses, which included all the SNPs with useable calls, without any exclusion due to minor allele frequency or correlation between SNPs. The bulls included in the analyses represented a sample of most BS and JE proven/sampled bulls in North America and a large sample of proven HO bulls in North America.
Genetic diversity analysis
Estimates of genetic diversity and statistical analyses were performed using the software GENEPOP, version 4.0 . The exact tests for deviations from HWE  were also performed using the GENEPOP package. GENEPOP uses a Markov Chain (MC) algorithm (dememorization = 10,000, batches = 100, and iterations per batch = 5,000) to estimate the P-value of the exact HWE tests . Significance levels were calculated per locus, per breed, and over all loci and pairs of breeds combined. Genetic diversity within breeds was also measured as the frequency of private alleles (PA, breed-specific alleles), the observed heterozygosity (Ho), and the expected heterozygosity (He) under HWE. The significance of breed differences was tested using the exact test of population differentiation in GENEPOP software based on allele frequencies.
Genetic differentiation between breeds was also estimated using the Fst coefficient proposed by Wright  and computed by GENEPOP.
The software FSTAT  was used to compute F-statistic , and to test them using randomisation methods. The Fst was estimated by a “weighted” analysis of variance . The most common computational formula for Fst is:
Where: δp2 the sample variance of allele frequencies over populations . Fst can therefore be described as the amount of allele frequency variance in a sample relative to the maximum possible variance. Fst can also be defined as follows :
Where: Fit is the correlation between uniting gametes that generate an individual, relative to the gametes of the total population; Fis is the average, over all subpopulations, of the correlation between uniting gametes that generate an individual relative to those of their own subpopulation.
The amount of heterozygosis (Yt) in the total population was also defined regardless of structure of the population, in terms of total population gene frequency (qt) :
Indirect estimates of gene flow were implemented in FSTAT  according to the method demonstrated by . The effective number of migrants (Nm) was estimated, assuming the n-island model of population structure, on the basis of the relationship:
Furthermore, FSTAT was used to calculate inter-population gene differences and coefficients of gene differentiation that are either dependent (Dst and Gst) or independent (Dst’ and Gst’) of the number of subpopulations . Dst is the average gene diversity between subpopulations. The gene diversity in the total population is equivalent to the sum of gene diversities within each subpopulation. Coefficient of gene differentiation (Gst) was computed as the ratio of Dst to the total population diversity.
The exact test for Hardy-Weinberg Equilibrium (HWE) within breeds showed a significant deviation in each breed (P < 0.01). Moreover, results of the exact test for HWE showed lower observed heterozygosity (Ho) than expected heterozygosity (He) in each breed (Figure 1). The Holstein bull population showed the highest average marker diversity between individuals within breeds in terms of He compared to BS and JE breeds (0.31, 0.27, and 0.26, respectively). Jersey showed higher percentage of loci with fixed alleles followed by BS and HO (Table 1). The HWE test has also confirmed significant heterozygote deficit (≥90%) in each population over several loci.
Average gene diversity over all loci, per chromosome, in BS, JE and HO, expressed in terms of Ho, are shown in Figure 2. Holsteins showed consistently higher Ho than JE and BS across all chromosomes. BS and JE had similar overall Ho, however, depending on the chromosome, one or another of the two breeds had higher Ho. Higher Ho for HO than BS and JE and similar overall Ho for BS and JE is consistent with the effective population sizes of this three breeds, which is higher for HO and lower and similar for BS and JE  Moreover, average heterozygosity of Holsteins showed a declining trend over the last four generations considering the generation interval of 5 years (Figure 3). Accordingly, Ho in HO has reduced from 0.361, when 4 generations were traced back in the pedigree, down to 0.3534, when one generation was traced back.
Population genetic differentiation of BS, JE and HO, as measured by Fst (Figure 4) showed that the breeds are genetically differentiated at each chromosome. For example, the average measure of Fst in an ideally amalgamated population of BS, JE and HO on Chromosome 18 showed that the breeds are differentiated with an average value of Fst equal to 0.16. Higher value of Fst indicates the presence of higher genetic differentiation between subpopulations, which implies that pairs of genes between individuals within subpopulations are more related than those of individuals between subpopulations. The Fst values between each pair of populations indicated that BS vs. HO population has higher genetic differentiation than the other pairs (Table 2). However, the Fst values among the last four generations in the HO were below 0.1, suggesting that there was no considerable genetic differentiation in the HO bull population in the last four generations (data not shown). This may indicate the fact that there has not been new outbred genetic material introduced to the bull population over the last four generations, except the use of commonly used sires of good genetic merit over generations. The relatedness between individuals within breed in BS vs. HO populations relative to the total population was higher (0.28) than that in BS vs. JE (0.23) and JE vs. HO (0.22), which also implies that BS and HO gene pools are more differentiated compared to the other pairs of breeds.
A summary of allelic richness, average fixation indices, frequency of private alleles per breed pair are presented in Table 2. Higher frequency of private alleles (alleles that are present in one of the breeds, but not in another) was observed in BS vs. HO followed by JE vs. HO and BS vs. JE populations. This result is also in agreement with population differentiation results, as measured by Fst values. In addition, indirect estimates of gene flow indicated the presence of higher effective number of migrants (Nm) between populations of JE and HO followed by BS and JE, while BS and HO populations showed the least Nm. This might be one explanation for the higher values of population differentiation measures, in particular higher Fst between BS and HO.
The exact-test for population differentiation of each breed pair across all loci showed highly significant differences among breeds regarding the distributions from which the alleles and genotypes were drawn from. Accordingly, the majority of loci have alleles or genotypes drawn from different distributions in the three breeds. However, there are some loci with alleles or genotypes drawn from the same distribution in all the breeds. For example, loci with alleles drawn from the same distribution in BS, JE and HO are shown for Chromosome 14 and 18 (Figure 5). This implies that alleles of those loci may not have been differentiated by selection, drift and inbreeding in the three bull populations. Moreover, the comparison of each pair of the three breeds with respect to the origin of their alleles is also presented. Accordingly, on average, alleles of 7.8, 5.5 and 3.1% of loci could be drawn from the same distribution in JE vs. HO, BS vs. JE and BS vs. HO populations, respectively (Table 3). Similar results were obtained for the percentage of loci with genotypes drawn from the same distribution (Table 4).
The amalgamated BS vs. HO population showed the highest average inter-population gene differentiation both dependent (Dst = 0.03) and independent (Dst’ = 0.07) on the number of subpopulations, and also the highest expected total heterozygosity (Yt = 0.33) compared with the ideally amalgamated populations of BS vs. JE, or JE vs. HO. Similarly, the highest Gst and Gst’ (12.5 and 19.7%, respectively) were also observed in an ideally amalgamated population of BS vs. HO. Therefore, the measure of inter-population gene differences also revealed that BS vs. HO population had the highest genetic differences compared to the other two pairs of breeds (BS vs. JE and JE vs. HO). Overall, the results indicated higher genetic diversity in an ideally amalgamated population of BS vs. HO.
The recent decline in diversity is sufficiently rapid that loss of diversity should be of concern to animal breeders . Several authors e.g.  demonstrated different models to describe deviations from Hardy-Weinberg proportions. The exact test for Hardy-Weinberg disequilibrium [9, 25] within breeds showed a significant deviation in each breed in this study (P < 0. 01). The populations also showed several loci with a significant heterozygote deficit (P < 0.01) but no loci with significant heterozygote excess, which implies the application of genetic selection and inevitably the role of random genetic drift and inbreeding in each breed. Generally, the results have showed that there are some loci (from 3.1 to 7.8%) with alleles drawn from the same distribution in all the populations. This may suggest the fact that, over time and through forces like selection and random genetic drift, the allele frequencies have been largely changed in the breeds, where very little of the original genomes are preserved.
Each breed showed considerable difference between the observed and expected number of heterozygous individuals across loci. However, in the ideally amalgamated pairs of the populations, the difference between the observed and expected number of heterozygous individuals appears to be smaller suggesting that crossbreeding could be carefully considered for increasing diversity in the future if needed. In livestock species, heterozygote deficiencies can be interpreted as the consequence of many factors, such as selection, population subdivision, or inbreeding .
Populations are said to be undifferentiated if Fst. In this study each pair of breeds showed higher values, which implies that the populations have different gene pools. However, Fst among the last four generations in HO was below 0.1, suggesting that there has been no significant introduction of more outbreed gene pool into the HO population over the last four generations. These results imply measures of population differentiation based on Fst have been described as reliable. For example, pair-wise Fst values were significantly correlated between bi-allelic loci and microsatellite datasets in Atlantic salmon, and similar result was found with regard to the overall heterozygosity .
The highest proportion of total genetic variation attributed to between breed differentiation was observed in BS vs. HO. The proportion of between breed genetic variation observed in this study was comparable to the average between breed variation (7.03%) reported in nine populations of Argentinean Creole cattle populations . Studies in the past demonstrated that Wright’s Fst results were reliable and most consistent with Reynold’s distances, Nei’s minimum distance measures and eight other genetic distance measures for ordering populations, which are widely used and well-established measures of genetic differentiation . In this study, the mean Fst indicated that BS vs. HO population has higher genetic diversity than BS vs. JE and JE vs. HO ideally amalgamated populations. The average estimates, based on microsatellites, of Fst in 20 Northern European cattle breeds was 0.11 ± 0.01 , which is comparable with the findings in this study.
To summarize, Brown Swiss, Jersey and Holstein bull populations have substantially different gene pools. An interesting result was the heterozygote deficit observed in each of the populations in this study. In livestock species, heterozygote deficiencies can be explained by several factors, such as selection, population subdivision, drift and inbreeding. Each breed showed a considerable difference between the observed and expected number of heterozygous individuals across loci. However, in the ideally amalgamated pairs of the populations, the difference between the observed and expected number of heterozygote individuals across loci appears to be smaller, suggesting that crossbreeding could be carefully considered for increasing diversity if needed in the future. At the present level of genetic diversity, crossbreeding is not a necessity, however if loss of genetic diversity within each breed worsens in the future, crossing can be considered as an option to increase total genetic diversity within breeds.
The results suggested that the within population genetic diversity accounts for a higher proportion of the total genetic diversity in ideally amalgamated populations than the diversity between populations. The results of private alleles frequencies in this study indicated that each breed might contain unique genes or gene combinations that are absent in another breed. The study demonstrates that even with a much smaller population size, BS showed similar gene diversity to the Jersey breed, while Holstein showed higher gene diversity than both breeds in agreement with their reported effective population sizes. BS and HO seem to have higher population differentiation (Fst) compared to the other pairs (BS vs. JE and JE vs. HO). If BS and HO were to be amalgamated, higher total expected gene diversity would be obtained as compared to the other pairs of breeds (BS vs. JE and JE vs. HO). If the loss of genetic diversity within breeds worsens in the future, the use of crossbreeding might be an option to recover genetic diversity, especially for the breeds with small population size.
Fis is the average over all subpopulations of the correlation between uniting gametes relative to those of their own subpopulation. Fst is the correlation between random gametes within subpopulations relative to gametes of the total population, which is a measure of subpopulations differentiation. Fit is the correlation between uniting gametes that generated the individual relative to gametes of the total population; subscripts is, st and it stand for individual relative to subpopulation, subpopulation relative to total population, and individual relative to total population, respectively.
Baker CA, Manuel C: Chemical classification of Cattle. 1 Breed groups. Anim Blood Groups Biochem Genet. 1980, 11: 127-150.
Weigel KA: Controlling inbreeding in modern breeding programs. J. Dairy Sci. 2001, 84 (E. Suppl): E177-E184.
Stachowicz K, Sargolzaei M, Miglior F, Schenkel FS: Rates of Inbreeding and Genetic Diversity in Canadian Holstein and Jersey Cattle. J. Dairy Sci. 2011, 94: 5160-5175. 10.3168/jds.2010-3308.
Blott SC, Williams JL, Haley CS: Genetic relationships among European cattle breeds. Animal Genet. 1998, 29: 273-282. 10.1046/j.1365-2052.1998.00327.x.
Bennewitz J, Meuwissen THE: A novel method for the estimation of the relative importance of breeds in order to conserve total genetic variance. Genet Sel Evol. 2005, 37: 315-337. 10.1186/1297-9686-37-4-315.
Falconer DS, Mackay TFC: Introduction to Quantitative Genetics. 1996, Prentice Hall, Harlow, UK
Guo SW, Thompson EA: Performing the Exact Test of Hardy-Weinberg Proportion for Multiple Alleles. Biometrics. 1992, 48: 361-372. 10.2307/2532296.
Robertson A, Hill W: Deviations from Hardy-Weinberg proportions: Sampling variances and use in estimation of inbreeding coefficients. Genetics. 1984, 107: 703-718.
Haldane BS: An exact test for randomness of mating. J Genet. 1984, 52: 631-635.
Weir BS, Cockerham CC: Estimating F-statistics for the analysis of population structure. Evolution. 1984, 38: 1358-1370. 10.2307/2408641.
Nei M, Chakravarti A: Drift variances of Fstand Gststatistics obtained from a finite number of isolated populations. Theoret. Popul. Biol. 1977, 11: 307-325. 10.1016/0040-5809(77)90014-4.
Wright S: Isolation by distance. Genetics. 1943, 28: 114-138.
Wright S: The genetical structure of populations. Ann Eugenics. 1951, 15: 323-354.
Wright S: The interpretation of population structure by f-statistics with special regard to systems of mating. Evolution. 1965, 19 (3): 395-420. 10.2307/2406450.
Cockerham CC: Variance of gene frequencies. Evolution. 1969, 23: 72-84. 10.2307/2406485.
Cockerham CC: Analysis of gene frequencies. Genetics. 1973, 74: 679-700.
Nei M: Analysis of gene diversity in subdivided populations. Proceeding of National Academy Science USA. 1973, 70: 3321-3323. 10.1073/pnas.70.12.3321.
Nei M: Definition and estimation of fixation indices. Evolution. 1986, 40 (3): 643-645. 10.2307/2408586.
VanRaden P, Wiggans G, Van Tassell C, Sonstegard T, Schenkel F: Benefits from cooperation in genomics. Interbull Bull. 2009, 39: 67-72.
Rousset F: GENEPOP’007: a complete re-implementation of the GENEPOP software for Windows and Linux. Mol Ecol. 2007, 8: 103-106.
Goudet J: FSTAT 18.104.22.168, a program to estimate and test gene diversities and fixation indices. Available: http://www.unil.ch/izea/softwares/fstat.html, Accessed 20 October 2008.
Slatkin M, Barton NH: A comparison of three indirect methods for estimating levels of gene flow. Evolution. 1989, 43: 1349-1368. 10.2307/2409452.
The Bovine HAPMAP Consortium: Genome-Wide Survey of SNP Variation Uncovers the Genetic Structure of Cattle Breeds. Science. 2009, 324: 528-531.
Robertson A, Hill W: Deviations from Hardy–Weinberg proportions: Sampling variances and use in estimation of inbreeding coefficients. Genetics. 1984, 107: 703-718.
Weir BS, Cockerham CC: Estimating F-statistics for the analysis of population structure. Evolution. 1984, 38: 1358-1370. 10.2307/2408641.
Maudet C, Luikart G, Taberlet P: Genetic diversity and assignment tests among seven French cattle breeds based on microsatellite DNA analysis. J Anim Sci. 2002, 80: 942-950.
Ryynanen HJ, Tonteri A, Vasemagi A, Primmer CR: A Comparison of Biallelic Markers and Microsatellites for the Estimation of Population and Conservation Genetic Parameters in Atlantic Salmon (Salmo salar). J. Heredity. 2007, 98: 692-704. 10.1093/jhered/esm093.
Giovambattista G, Ripoli MV, Peral-Garcia P, Bouzat JL: Indigenous domestic breeds as reservoirs of genetic diversity the Argentinean Creole cattle. Anim Genet. 2001, 32: 240-247. 10.1046/j.1365-2052.2001.00774.x.
Libiger O, Nievergelt CM, Schork NJ: Comparison of Genetic Distance Measures Using Human SNP Genotype Data. Hum Biol. 2009, 81: 389-406. 10.3378/027.081.0401.
Kantanene J, Olsaker I, Holm LE, Lien S, Vilkki J, Brusgaard K, Ethorsdottir E, Danell B, Adalstainsson S: Genetic diversity and population structures of 20 North European cattle breeds. The American Genetic Association. 2000, 91: 446-457.
The authors would like to thank the Animal Improvement Programs Laboratory, USDA (Beltsville, MD, USA) for providing the genotypes for this study
MGM conducted the analyses and wrote the manuscript. FSS supervised the study and critically reviewed the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
About this article
Cite this article
Melka, M.G., Schenkel, F.S. Analysis of genetic diversity in Brown Swiss, Jersey and Holstein populations using genome-wide single nucleotide polymorphism markers. BMC Res Notes 5, 161 (2012). https://doi.org/10.1186/1756-0500-5-161
- Genetic diversity