Cross-species amplification of 41 microsatellites in European cyprinids: A tool for evolutionary, population genetics and hybridization studies

Background Cyprinids display the most abundant and widespread species among the European freshwater Teleostei and are known to hybridize quite commonly. Nevertheless, a limited number of markers for conducting comparative differentiation, evolutionary and hybridization dynamics studies are available to date. Findings Five multiplex PCR sets were optimized in order to assay 41 cyprinid-specific polymorphic microsatellite loci (including 10 novel loci isolated from Chondrostoma nasus nasus, Chondrostoma toxostoma toxostoma and Leuciscus leuciscus) for 503 individuals (440 purebred specimens and 63 hybrids) from 15 European cyprinid species. The level of genetic diversity was assessed in Alburnus alburnus, Alburnoides bipunctatus, C. genei, C. n. nasus, C. soetta, C. t. toxostoma, L. idus, L. leuciscus, Pachychilon pictum, Rutilus rutilus, Squalius cephalus and Telestes souffia. The applicability of the markers was also tested on Abramis brama, Blicca bjoerkna and Scardinius erythrophtalmus specimens. Overall, between 24 and 37 of these markers revealed polymorphic for the investigated species and 23 markers amplified for all the 15 European cyprinid species. Conclusions The developed set of markers demonstrated its performance in discriminating European cyprinid species. Furthermore, it allowed detecting and characterizing hybrid individuals. These microsatellites will therefore be useful to perform comparative evolutionary and population genetics studies dealing with European cyprinids, what is of particular interest in conservation issues and constitutes a tool of choice to conduct hybridization studies.


Findings
The Cyprinidae family is of special interest for conducting comparative differentiation, evolutionary and hybridization dynamics studies: (i) Cyprinidae is the most abundant and widespread freshwater fish family across the world [1]; and (ii) the Cyprinidae family is characterized by high level of inter-species hybridization (reviewed in [2]). An indirect way to develop microsatellite markers in species with non-sequenced genomes holds in the cross-species amplification of loci previously developed in related species (e.g. [3]). Here, we examined cross-spe-cies amplification success of 41 cyprinid-specific polymorphic microsatellite markers, including 10 novel loci.
This was done for 15 European cyprinid species and hybrids (Table 1). They represent 11 geographically widespread European cyprinid species (Alburnus alburnus, Alburnoides bipunctatus, C. n. nasus, L. idus, L. leuciscus, Rutilus rutilus, Squalius cephalus, Telestes souffia, Abramis brama, Blicca bjoerkna and Scardinius erythrophtalmus) and four endemic species (C. genei, C. soetta, C. t. toxostoma and Pachychilon pictum). The species sampling represents 12 of the 24 European Cyprinidae genera. We also studied two sets of hybrid specimens. The first one (Table 1; Additional file 1) consists in 48 Chondrostoma hybrids specimens (i.e. hybrids between C. t. toxostoma and C. n. nasus) from the Durance River previously characterized using the mitochondrial cytochrome b gene and four nuclear intron sequences [2]. These  [2]). The second set of hybrids consists of 15 individuals (Table 1; Additional file 2) which exhibited an intermediate morphology between two cyprinid species or for which the species identification was not coherent among the different markers (meristic, mitochondrial or microsatellites). All specimens were beforehand identified at the species level using a morphological analysis (we used identification key based on meristic characters [4]) and by sequencing the 5' part of the cytochrome b gene (as described in [5]). The results from mitochondrial sequences of all the 440 purebred specimens were congruent with their morphological identification and species assignation could be done without any ambiguity (data not shown). Ten novel loci were isolated from Chondrostoma toxostoma toxostoma, Chondrostoma nasus nasus and Leuciscus leuciscus following a protocol detailed elsewhere [6,7]. The program MICROFAMILY [8] was used to discard redundancies by detecting flanking region similarities among different loci. Sixty-nine primers pairs were designed and a total of 10 novel primer pairs were retained (Additional file 3). They were associated with clear amplification pattern, with unambiguous genotype profiles, and were polymorphic for at least one of the 15 cyprinid species. Thirty-one primers pairs from previously described microsatellite loci [6,7,[9][10][11][12][13][14][15][16][17][18][19] were then integrated into the protocol. These loci were combined into multiplex PCR kits, along with the 10 novel loci. Overall, a total of 41 loci were combined into five multiplex PCR kits (Additional file 3). Amplifications and genotyping were conducted using reagents and protocols described previously [6].
For populations with more than 20 samples, GENEPOP 4.0 [21] (Table 3). Additionally, three other pairs of loci displayed LD in two or more species: LleA-150 and Lsou34 in A. bipunctatus and L. idus; BL1-T2 and Z21908 in C. n. nasus and L. idus; and BL1-2b and BL1-T2 in C. genei, C. n. nasus, C. t. toxostoma from Serre-Ponçon Lake and L. idus (Additional file 4). A Bayesian-based approach was used to search for the occurrence of independent genetic groups (K) in the microsatellites dataset (STRUCTURE 2.2 [24]; http:// pritch.bsd.uchicago.edu). The burn-in length was set to 100,000 followed by 1,000,000 iterations within a Markov Chain Monte Carlo (MCMC). The 'admixture model' and the 'I-model' (independent allele frequencies) were used, with no prior population information. Parameter K was chosen to vary from 1 to 20 and five repeats were run for each K. We selected the K value for which the posterior probability of the data, Ln P(D), was maximized. Each individual was assigned to the inferred clusters according to the results from the simulation procedures (parameter "Q" representing the estimated membership coefficients for each individual in each cluster). Because choosing K can be difficult a priori (although in our case the number of species is known), we combined two main approaches [25]: i) choosing K that maximizes the posterior probability of the data Ln P(D); ii) using the formula [Ln P(D) k -Ln P(D) k-1 ], where Ln P(D) is the estimated posterior probability of the data conditional to K. A. brama, B. bjoerkna and S. erythrophtalmus individuals were discarded from these analyses due to their too limited sample size (see [26]). Moreover, the pattern of allelic frequency differentiation between species was explored through Factorial Correspondence Analyses (FCA) using GENETIX ver. 4.05.2 [27]. In addition, for the C. n. nasus and C. t. toxostoma specimens and their hybrids, the Bayesian clustering method implemented in the program NEWHYBRID [28] was used to assign individuals to different genotypic classes (parental, F1, F2 or backcrosses). The method computes, by MCMC method, the Bayesian posterior probability that an individual in a sample belongs to different hybrid classes (F1, F2, and backcrosses) while simultaneously estimating allelic frequencies for parental species. The program was run five times with varying lengths of burn-in period and numbers of sweeps, as recommended by the authors. Based on either all the 41 or 23 microsatellites loci, K = 13 was obtained from the Ln P(D) analyses for K parameter determination (see Figure 1), which approximate well the number of analysed cyprinid species. Within species, correct assignment score ranged from 97% to 99% both with 23 loci and 41 loci. Additionally, FCA could separate the 15 cyprinid analysed species (axes 1 and 2; Additional files 5 and 6). It is worth noting that the graphical discrimination was increased when the two outliers (A. bipunctatus and P. pictum) were discarded. Moreover, as highlighted by the results from the different populations of A. alburnus, C. n. nasus, C. t. toxosmtoma or S. cephalus, the quality of species identification, both in FCA or STRUCTURE analyses, did not depend on the population sampled ( Figure 1; Additional files 5 and 6).
The genotypic distribution of the Chondrostoma hybrids specimens compared to their parental species (C. t. toxostoma and C. n. nasus) is summarized in Figure 2.
The two purebred C. t. toxostoma populations can hardly be differentiated on axes 1 and 2 of the FCA, whereas a differentiation between the C. n. nasus sampled in Allier River and those sampled in Rhone River is displayed on axis 2. A strong coherence was found between the Numbers between parentheses refer to the sample location: 1, Durance River (southeastern France); 2, Serre-Ponçon Lake (southeastern France); 3, Po River (northern Italy); 4, Allier River (central France); 5, Rhone River (southeastern France); 6, Suran River (eastern France); 7, Rhine basin (Germany); 8, Ain River (eastern France); 9, Orbieu River (southern France); 10, Buech River (southeastern France). A B genome dilution as defined by [2] and the distribution of the hybrids between the two parental species. Indeed, Hy1 genotypes were assigned toward the C. t. toxostoma species whereas Hy8 were assigned toward the C. n. nasus species. F1 hybrids, which correspond to Hy4 and Hy5 depending on their mitochondrial sequence, were mainly intermediate between the two parental species. However, these F1 hybrids are not strictly homogeneous and the assignment scores fluctuate (Additional file 1; see also Figure 2). Variations found at the level of assignment scores for Hy4 and Hy5 hybrids may be caused by the limited number of markers (n = 5) used by [2], although they were discriminative markers. As for the second hybrid group, thirteen different hybrid combinations have been identified. Most of the hybrids revealed being introgressed individuals (i.e. individuals assigned to one species based on mitochondrial cytochrome b gene or morphology and to another species based on microsatellites), although most of them exhibited an intermediate morphology (Additional file 2). To our knowledge, the development of large number (n > 20) of polymorphic microsatellite markers applicable to European cyprinid species has never been achieved to date (but see for instance [3]). Moreover, through the spe-cies sampling designed in this study, we demonstrated their applicability within half of the European cyprinid genera, with 24 to 37 polymorphic loci per species. A common set of 23 markers enabled the comparison of 15 species and hence allows genetic variability and recombination to be compared directly for these species. Furthermore, normalization of the PCR conditions and multiplexing make faster and cost effective the genotyping of the 41 loci. The high number of loci and wide applicability to the European cyprinid species make the developed set of markers a powerful tool for: (i) studies dealing with genetic diversity and structure of the European cyprinid species (these markers will notably be useful to assess the impact of anthropogenic factors on the cyprinid genetic diversity, and will be useful for conservation and environmental monitoring purposes); (ii) comparative studies dealing with the evolutionary pattern and history of a large set of species; (iii) species identification; and (iv) developing knowledge in hybridization processes and dynamics in Cyprinids. More specifically, the analysis of a large number of genetic markers (see [29]) will significantly improve the understanding of the relative impact of inter-species interactions, response to environmental  Axis 1 (11.30 %) Axis 2 (2.13 %) effects and ecological trade-offs in cyprinid hybrid zones (as initiated by [30]).