- Short Report
- Open Access
Cross-species amplification of 41 microsatellites in European cyprinids: A tool for evolutionary, population genetics and hybridization studies
BMC Research Notes volume 3, Article number: 135 (2010)
Cyprinids display the most abundant and widespread species among the European freshwater Teleostei and are known to hybridize quite commonly. Nevertheless, a limited number of markers for conducting comparative differentiation, evolutionary and hybridization dynamics studies are available to date.
Five multiplex PCR sets were optimized in order to assay 41 cyprinid-specific polymorphic microsatellite loci (including 10 novel loci isolated from Chondrostoma nasus nasus, Chondrostoma toxostoma toxostoma and Leuciscus leuciscus) for 503 individuals (440 purebred specimens and 63 hybrids) from 15 European cyprinid species. The level of genetic diversity was assessed in Alburnus alburnus, Alburnoides bipunctatus, C. genei, C. n. nasus, C. soetta, C. t. toxostoma, L. idus, L. leuciscus, Pachychilon pictum, Rutilus rutilus, Squalius cephalus and Telestes souffia. The applicability of the markers was also tested on Abramis brama, Blicca bjoerkna and Scardinius erythrophtalmus specimens. Overall, between 24 and 37 of these markers revealed polymorphic for the investigated species and 23 markers amplified for all the 15 European cyprinid species.
The developed set of markers demonstrated its performance in discriminating European cyprinid species. Furthermore, it allowed detecting and characterizing hybrid individuals. These microsatellites will therefore be useful to perform comparative evolutionary and population genetics studies dealing with European cyprinids, what is of particular interest in conservation issues and constitutes a tool of choice to conduct hybridization studies.
The Cyprinidae family is of special interest for conducting comparative differentiation, evolutionary and hybridization dynamics studies: (i) Cyprinidae is the most abundant and widespread freshwater fish family across the world ; and (ii) the Cyprinidae family is characterized by high level of inter-species hybridization (reviewed in ). An indirect way to develop microsatellite markers in species with non-sequenced genomes holds in the cross-species amplification of loci previously developed in related species (e.g. ). Here, we examined cross-species amplification success of 41 cyprinid-specific polymorphic microsatellite markers, including 10 novel loci.
This was done for 15 European cyprinid species and hybrids (Table 1). They represent 11 geographically widespread European cyprinid species (Alburnus alburnus, Alburnoides bipunctatus, C. n. nasus, L. idus, L. leuciscus, Rutilus rutilus, Squalius cephalus, Telestes souffia, Abramis brama, Blicca bjoerkna and Scardinius erythrophtalmus) and four endemic species (C. genei, C. soetta, C. t. toxostoma and Pachychilon pictum). The species sampling represents 12 of the 24 European Cyprinidae genera. We also studied two sets of hybrid specimens. The first one (Table 1; Additional file 1) consists in 48 Chondrostoma hybrids specimens (i.e. hybrids between C. t. toxostoma and C. n. nasus) from the Durance River previously characterized using the mitochondrial cytochrome b gene and four nuclear intron sequences . These hybrids (Hy) range from 10% of C. n. nasus alleles in their genome (Hy1) to 80% of C. n. nasus alleles in their genome (Hy8) (sensu). The second set of hybrids consists of 15 individuals (Table 1; Additional file 2) which exhibited an intermediate morphology between two cyprinid species or for which the species identification was not coherent among the different markers (meristic, mitochondrial or microsatellites). All specimens were beforehand identified at the species level using a morphological analysis (we used identification key based on meristic characters ) and by sequencing the 5' part of the cytochrome b gene (as described in ). The results from mitochondrial sequences of all the 440 purebred specimens were congruent with their morphological identification and species assignation could be done without any ambiguity (data not shown).
Ten novel loci were isolated from Chondrostoma toxostoma toxostoma, Chondrostoma nasus nasus and Leuciscus leuciscus following a protocol detailed elsewhere [6, 7]. The program MICROFAMILY was used to discard redundancies by detecting flanking region similarities among different loci. Sixty-nine primers pairs were designed and a total of 10 novel primer pairs were retained (Additional file 3). They were associated with clear amplification pattern, with unambiguous genotype profiles, and were polymorphic for at least one of the 15 cyprinid species. Thirty-one primers pairs from previously described microsatellite loci [6, 7, 9–19] were then integrated into the protocol. These loci were combined into multiplex PCR kits, along with the 10 novel loci. Overall, a total of 41 loci were combined into five multiplex PCR kits (Additional file 3). Amplifications and genotyping were conducted using reagents and protocols described previously .
The size range of the applicable markers with a common fluorescent dye did not overlap (Additional file 3), except for CtoG-075 and Lid8 in A. alburnus and BL1-T2 and CypG24 in L. idus. Nevertheless, the differences in genotype profiles allowed to unambiguously discriminating alleles among loci. The mean rate of amplification success per species is 95.1% (i.e. 39/41 loci) and the mean rate of polymorphic loci per species (excluding A. brama, B bjoerkna and S. erythrophtalmus because of too small sample size) is 76.6% (i.e. about 31/41 loci) (Table 2). Moreover, 23 loci (namely BL1-2b, BL1-30, BL1-84, BL1-98, BL1-153, Ca1, CtoA-247, CtoF-172, CtoG-216, CypG24, IV04, LceC1, LleA-071, LleC-090, Lsou05, Lsou08, Lsou19, Lsou34, N7K4, Ppro132, Rru4, Rser10 and Z21908) displayed amplification success in all the species tested here. The vast majority of cross-species amplifications from the European cyprinid species resulted in PCR fragments with proper microsatellite chromatograms and expected allele sizes, indicating homologous loci. The 41 retained sequences containing microsatellites were BLASTed against the complete genome of Danio rerio (e-value = 1E-20) as a proxy for testing loci homology. This procedure makes the assumption that chromosome topology is well conserved across cyprinid species (e.g. ). Eighteen of the 41 sequences produced a significant hit and were mapped onto the D. rerio genome without ambiguity (Table 3).
For populations with more than 20 samples, GENEPOP 4.0  was used to: (i) test for the Hardy-Weinberg (HW) equilibrium, (ii) estimate the heterozygosity for all loci and populations, and (iii) test linkage disequilibrium (LD) among loci within populations. MICRO-CHECKER ver. 2.2.3  was used to analyze the causes of departures from HW equilibrium: real HW disequilibrium, null alleles or scoring errors (often resulting from stuttering). Among the 41 loci, we noticed five cases where homozygotes with null alleles were detected (Additional file 1): Lsou29 in A. bipunctatus and A. alburnus; CtoE-249 in A. alburnus; BL1-T2 in L. leuciscus; and CnaD-112 and LCO5 in B. bjoerkna. These cases were discarded from the analyses dealing with HW and LD analyses. Most of the 41 loci were at HW equilibrium after False Discovery Rate (FDR) correction (modified  and applied within each species). Considering all loci and species together, we found 1218 cases of polymorphic pattern among which only sixteen cases (<1%) displaying HW disequilibrium (see Additional file 4). Most loci with HW disequilibrium showed significant excess of homozygotes. It is noticeable that all A. bipunctatus individuals exhibited two alleles at locus LCO3. Regarding their size, these alleles are very close to the variation range observed in the other cyprinid species, suggesting that this locus has experimented duplication within the genome of A. bipunctatus. This locus was therefore discarded from the A. bipunctatus data for further statistical analyses. The MICRO-CHECKER analyses indicated that homozygote excess at loci with HW equilibrium departures was mainly caused by the presence of null alleles. A total of 6152 pairwise comparisons were submitted to LD analyses, and LD was tested with 515 pairwise comparisons per species in average. After FDR correction, only 36 pairs of loci exhibited LD (0.006% of the total pairwise comparisons). A noticeable higher number of pairs of loci at LD were found in L. idus (21/496 of pairwise comparisons) and C. n. nasus (Allier River) (7/666) (Additional file 4). Interestingly, LD was found between BL1-98 and Z21908 in two cases (in C. n. nasus and L. idus), in agreement with their relative vicinity (253 Kb) on the D. rerio's chromosome 24 (Table 3). Additionally, three other pairs of loci displayed LD in two or more species: LleA-150 and Lsou34 in A. bipunctatus and L. idus; BL1-T2 and Z21908 in C. n. nasus and L. idus; and BL1-2b and BL1-T2 in C. genei, C. n. nasus, C. t. toxostoma from Serre-Ponçon Lake and L. idus (Additional file 4).
A Bayesian-based approach was used to search for the occurrence of independent genetic groups (K) in the microsatellites dataset (STRUCTURE 2.2 ; http://pritch.bsd.uchicago.edu). The burn-in length was set to 100,000 followed by 1,000,000 iterations within a Markov Chain Monte Carlo (MCMC). The 'admixture model' and the 'I-model' (independent allele frequencies) were used, with no prior population information. Parameter K was chosen to vary from 1 to 20 and five repeats were run for each K. We selected the K value for which the posterior probability of the data, Ln P(D), was maximized. Each individual was assigned to the inferred clusters according to the results from the simulation procedures (parameter "Q" representing the estimated membership coefficients for each individual in each cluster). Because choosing K can be difficult a priori (although in our case the number of species is known), we combined two main approaches : i) choosing K that maximizes the posterior probability of the data Ln P(D); ii) using the formula [Ln P(D)k - Ln P(D)k-1], where Ln P(D) is the estimated posterior probability of the data conditional to K. A. brama, B. bjoerkna and S. erythrophtalmus individuals were discarded from these analyses due to their too limited sample size (see ). Moreover, the pattern of allelic frequency differentiation between species was explored through Factorial Correspondence Analyses (FCA) using GENETIX ver. 4.05.2 . In addition, for the C. n. nasus and C. t. toxostoma specimens and their hybrids, the Bayesian clustering method implemented in the program NEWHYBRID was used to assign individuals to different genotypic classes (parental, F1, F2 or backcrosses). The method computes, by MCMC method, the Bayesian posterior probability that an individual in a sample belongs to different hybrid classes (F1, F2, and backcrosses) while simultaneously estimating allelic frequencies for parental species. The program was run five times with varying lengths of burn-in period and numbers of sweeps, as recommended by the authors.
Based on either all the 41 or 23 microsatellites loci, K = 13 was obtained from the Ln P(D) analyses for K parameter determination (see Figure 1), which approximate well the number of analysed cyprinid species. Within species, correct assignment score ranged from 97% to 99% both with 23 loci and 41 loci. Additionally, FCA could separate the 15 cyprinid analysed species (axes 1 and 2; Additional files 5 and 6). It is worth noting that the graphical discrimination was increased when the two outliers (A. bipunctatus and P. pictum) were discarded. Moreover, as highlighted by the results from the different populations of A. alburnus, C. n. nasus, C. t. toxosmtoma or S. cephalus, the quality of species identification, both in FCA or STRUCTURE analyses, did not depend on the population sampled (Figure 1; Additional files 5 and 6).
The genotypic distribution of the Chondrostoma hybrids specimens compared to their parental species (C. t. toxostoma and C. n. nasus) is summarized in Figure 2. The two purebred C. t. toxostoma populations can hardly be differentiated on axes 1 and 2 of the FCA, whereas a differentiation between the C. n. nasus sampled in Allier River and those sampled in Rhone River is displayed on axis 2. A strong coherence was found between the genome dilution as defined by  and the distribution of the hybrids between the two parental species. Indeed, Hy1 genotypes were assigned toward the C. t. toxostoma species whereas Hy8 were assigned toward the C. n. nasus species. F1 hybrids, which correspond to Hy4 and Hy5 depending on their mitochondrial sequence, were mainly intermediate between the two parental species. However, these F1 hybrids are not strictly homogeneous and the assignment scores fluctuate (Additional file 1; see also Figure 2). Variations found at the level of assignment scores for Hy4 and Hy5 hybrids may be caused by the limited number of markers (n = 5) used by , although they were discriminative markers. As for the second hybrid group, thirteen different hybrid combinations have been identified. Most of the hybrids revealed being introgressed individuals (i.e. individuals assigned to one species based on mitochondrial cytochrome b gene or morphology and to another species based on microsatellites), although most of them exhibited an intermediate morphology (Additional file 2).
To our knowledge, the development of large number (n > 20) of polymorphic microsatellite markers applicable to European cyprinid species has never been achieved to date (but see for instance ). Moreover, through the species sampling designed in this study, we demonstrated their applicability within half of the European cyprinid genera, with 24 to 37 polymorphic loci per species. A common set of 23 markers enabled the comparison of 15 species and hence allows genetic variability and recombination to be compared directly for these species. Furthermore, normalization of the PCR conditions and multiplexing make faster and cost effective the genotyping of the 41 loci. The high number of loci and wide applicability to the European cyprinid species make the developed set of markers a powerful tool for: (i) studies dealing with genetic diversity and structure of the European cyprinid species (these markers will notably be useful to assess the impact of anthropogenic factors on the cyprinid genetic diversity, and will be useful for conservation and environmental monitoring purposes); (ii) comparative studies dealing with the evolutionary pattern and history of a large set of species; (iii) species identification; and (iv) developing knowledge in hybridization processes and dynamics in Cyprinids. More specifically, the analysis of a large number of genetic markers (see ) will significantly improve the understanding of the relative impact of inter-species interactions, response to environmental effects and ecological trade-offs in cyprinid hybrid zones (as initiated by ).
Nelson J: Fishes of the World. 1994, New-York: John Wiley & Sons, 3
Costedoat C, Pech N, Chappaz R, Gilles A: Novelties in hybrid zones: Crossroads between population genomic and ecological approaches. PLoS ONE. 2007, 2: e357-10.1371/journal.pone.0000357.
Holmen J, Vøllestad LA, Jakobsen KS, Primmer CR: Cross-species amplification of 36 cyprinid microsatellite loci in Phoxinus phoxinus (L.) and Scardinius erythrophthalmus (L.). BMC Res Notes. 2009, 2: 248-10.1186/1756-0500-2-248.
Keith P, Allardi J: Atlas des poissons d'eau douce de France. 2001, Paris: Muséum National d'Histoire Naturelle
Costedoat C, Pech N, Salducci MD, Chappaz R, Gilles A: Evolution of mosaic hybrid zone between invasive and endemic species of Cyprinidae through space and time. Biol J Linn Soc. 2005, 85: 135-155. 10.1111/j.1095-8312.2005.00478.x.
Dubut V, Martin JF, Costedoat C, Chappaz R, Gilles A: Isolation and characterization of polymorphic microsatellite loci in the freshwater fishes Telestes souffia and Telestes muticellus (Teleostei: Cyprinidae). Mol Ecol Resour. 2009, 9: 1001-1005. 10.1111/j.1755-0998.2009.02539.x.
Dubut V, Martin JF, Gilles A, Van Houdt J, Chappaz R, Costedoat C: Isolation and characterization of polymorphic microsatellite loci for the dace complex: Leuciscus leuciscus (Teleostei: Cyprinidae). Mol Ecol Resour. 2009, 9: 1179-1183. 10.1111/j.1755-0998.2009.02594.x.
Meglécz E: MICROFAMILY (version 1): A computer program for detecting flanking region similarities among different microsatellite loci. Mol Ecol Notes. 2007, 7: 18-20. 10.1111/j.1471-8286.2006.01537.x.
Shimoda N, Knapik EW, Ziniti J, Sim C, Yamada E, Kaplan S, Jackson D, de Sauvage F, Jacob H, Fishman MC: Zebrafish genetic map with 2000 microsatellite markers. Genomics. 1999, 58: 219-232. 10.1006/geno.1999.5824.
Dimsoski P, Toth GP, Bagley MJ: Microsatellite characterization in central stoneroller Campostoma anomalum (Pisces: Cyprinidae). Mol Ecol. 2000, 9: 2187-2189. 10.1046/j.1365-294X.2000.105318.x.
Bessert ML, Ortí G: Microsatellite loci for paternity analysis in the fathead minnow, Pimephales promelas (Teleostei: Cyprinidae). Mol Ecol Notes. 2003, 3: 532-534. 10.1046/j.1471-8286.2003.00501.x.
Dawson DA, Burland TM, Douglas A, Le Comber SC, Bradshaw M: Isolation of microsatellite loci in the freshwater fish, the bitterling Rhodeus sericeus (Teleostei: Cyprinidae). Mol Ecol Notes. 2003, 3: 199-202. 10.1046/j.1471-8286.2003.00395.x.
Mesquita N, Cunha C, Hänfling B, Carvalho GR, Zé-Zé L, Tenreiro R, Coelho MM: Isolation and characterization of polymorphic microsatellite loci in the endangered Portuguese freshwater Squalius aradensis (Cyprinidae). Mol Ecol Notes. 2003, 3: 572-574. 10.1046/j.1471-8286.2003.00515.x.
Salgueiro P, Carvalho G, Collares-Pereira MJ, Coelho MM: Microsatellite analysis of genetic population structure of the endangered cyprinid Anaecypris hispanica in Portugal: Implications for conservation. Biol Conserv. 2003, 109: 47-56. 10.1016/S0006-3207(02)00132-5.
Baerwald MR, May B: Characterization of microsatellite loci for five members of the minnow family Cyprinidae found in the Sacramento-San Joaquim Delta and its tributaries. Mol Ecol Notes. 2004, 4: 385-390. 10.1111/j.1471-8286.2004.00661.x.
Barinova A, Yadrenkina E, Nakajima M, Taniguchi N: Identification and characterization of microsatellite DNA markers developed in ide Leuciscus idus and Siberian roach Rutilus rutilus. Mol Ecol Notes. 2004, 4: 86-88. 10.1046/j.1471-8286.2003.00577.x.
Turner TF, Dowling TE, Broughton RE, Gold JR: Variable microsatellite markers amplify across divergent lineages of cyprinid fishes (subfamily Leuciscinae). Conserv Genet. 2004, 5: 279-281. 10.1023/B:COGE.0000029998.11426.ab.
Larno V, Launey S, Devaux A, Laroche J: Isolation and characterization of microsatellite loci from chub Leuciscus cephalus (Pisces: Cyprinidae). Mol Ecol Notes. 2005, 5: 752-754. 10.1111/j.1471-8286.2005.01052.x.
Muenzel FM, Sanetra M, Salzburger W, Meyer A: Microsatellites from the vairone Leuciscus souffia (Pisces: Cyprinidae) and their application to closely related species. Mol Ecol Notes. 2007, 7: 1048-1050. 10.1111/j.1471-8286.2007.01772.x.
Ráb P, Rábová M, Pereira CS, Collares-Pereira MJ, Pelikánová Š: Chromosome studies of European cyprinid fishes: Interspecific homology of leuciscine cytotaxonomic marker-the largest subtelocentric chromosome pair as revealed by cross-species painting. Chromosome Res. 2008, 16: 863-873. 10.1007/s10577-008-1245-3.
Rousset F: GENEPOP'007: A complete reimplementation of the Genepop software for Windows and Linux. Mol Ecol Resour. 2008, 8: 103-106. 10.1111/j.1471-8286.2007.01931.x.
van Oosterhout C, Hutchinson WF, Wills DPM, Shipley P: MICRO-CHECKER: Software for identifying and correcting genotyping errors in microsatellite data. Mol Ecol Notes. 2004, 4: 535-538. 10.1111/j.1471-8286.2004.00684.x.
Benjamini Y, Hochberg Y: Controlling the false discovery rate-a practical and powerful approach to multiple testing. J Roy Stat Soc B Met. 1995, 57: 289-300.
Pritchard JK, Stephens M, Donnelly P: Inference of population structure using multilocus genotype data. Genetics. 2000, 155: 945-959.
Garnier S, Alibert P, Audiot P, Prieur B, Rasplus JY: Isolation by distance and sharp discontinuities in gene frequencies: Implications for the phylogeography of an alpine insect species, Carabus solieri. Mol Ecol. 2004, 13: 1883-1897. 10.1111/j.1365-294X.2004.02212.x.
Fogelqvist J, Niittyvuopio A, Ågren J, Savolainen O, Lascoux M: Cryptic population genetic structure: The number of inferred clusters depends on sample size. Mol Ecol Resour. 2010, 10: 314-323. 10.1111/j.1755-0998.2009.02756.x.
Belkhir K, Borsa P, Chikhi L, Raufaste N, Bonhomme F: GENETIX 4.05, logiciel sous Windows TM pour la génétique des populations. 2004, Montpellier: Laboratoire Génome Populations Interactions (UMR 5000, CNRS, Université de Montpellier II)
Anderson EC, Thompson EA: A model-based method for identifying species hybrids using multilocus genetic data. Genetics. 2002, 160: 1217-1229.
Vähä JP, Primmer CR: Efficiency of model-based Bayesian methods for detecting hybrid individuals under different hybridization scenarios and with different numbers of loci. Mol Ecol. 2006, 15: 63-72. 10.1111/j.1365-294X.2005.02773.x.
Corse E, Costedoat C, Pech N, Chappaz R, Grey J, Gilles A: Trade-off between morphological convergence and opportunistic diet behavior in fish hybrid zone. Front Zool. 2009, 6: 26-10.1186/1742-9994-6-26.
Many thanks to Rémi Grenier, Chloé Bennati-Granier and Bastien Patin for their help with the laboratory work. We are grateful to Emmanuel Corse for some key samples from Serre-Ponçon Lake. Thanks to Tom Nielsen for his help with the English editing. Data used in this work were partly produced through molecular genetic analysis technical facilities of the IFR119 'Montpellier Environnement Biodiversité'.
The authors declare that they have no competing interests.
MS, VD, JFM and JF carried out the molecular genetics laboratory work. VD, MS, EM and CC performed the statistical analyses. VD, CC and MS drafted the manuscript. AG, VD and CC conceived the study. RC participated to the financial support. All authors read, contributed to and approved the final manuscript.
Vincent Dubut, Melthide Sinama contributed equally to this work.
Electronic supplementary material
Additional file 3:Microsatellite loci, multiplex PCR conditions, levels of variability and amplifiability of 41 microsatellite loci in 15 cyprinid species. Excel Table describing the microsatellite loci, multiplex PCR conditions, levels of variability and amplifiability of 41 microsatellite loci in 15 cyprinid species. (XLS 93 KB)
Additional file 5:Biplot representations of the FCA for 15 cyprinid species (A) or 13 cyprinid species (excluding A. bipunctatus and P. pictum ) (B) using 41 microsatellites. PDF file containing the biplot representations of the FCA for 15 cyprinid species (A) or 13 cyprinid species (excluding A. bipunctatus and P. pictum) (B) using 41 microsatellites. (PDF 345 KB)
Additional file 6:Biplot representations of the FCA for 15 cyprinid species (A) or 13 cyprinid species (excluding A. bipunctatus and P. pictum ) (B) using 23 microsatellites. PDF file containing the biplot representations of the FCA for 15 cyprinid species (A) or 13 cyprinid species (excluding A. bipunctatus and P. pictum) (B) using 23 microsatellites. (PDF 402 KB)
About this article
- Markov Chain Monte Carlo
- Parental Species
- Linkage Disequilibrium Analysis
- Factorial Correspondence Analysis
- Intermediate Morphology