- Research article
- Open Access
Intra-platform comparison of 25-mer and 60-mer oligonucleotide Nimblegen DNA microarrays
BMC Research Notes volume 6, Article number: 43 (2013)
We performed a Nimblegen intra-platform microarray comparison by assessing two categories of flax target probes (short 25-mers oligonucleotides and long 60-mers oligonucleotides) in identical conditions of target production, design, labelling, hybridization, image analyses, and data filtering. We compared technical parameters of array hybridizations, precision and accuracy as well as specific gene expression profiles.
Comparison of the hybridization quality, precision and accuracy of expression measurements, as well as an interpretation of differential gene expression in flax tissues were performed. Both array types yielded reproducible, accurate and comparable data that are coherent for expression measurements and identification of differentially expressed genes. 60-mers arrays gave higher hybridization efficiencies and therefore were more sensitive allowing the detection of a higher number of unigenes involved in the same biological process and/or belonging to the same multigene family.
The two flax arrays provide a good resolution of expressed functions; however the 60-mers arrays are more sensitive and provide a more in-depth coverage of candidate genes potentially involved in different biological processes.
Technologies for performing genome-wide expression analyses have rapidly multiplied in recent years and different cross-platform studies have focused on target type, target production and design, labelling or hybridization protocols [1–4] as well as mathematical approaches [5–8]. Despite the tremendous progress in Next Generation Sequencing (NGS) technology and the increasing use of RNAseq approaches, different microarray platforms continue to generate large amounts of high quality expression data for a wide range of animal and plant species and are extensively applied in medical decision-making research. In general, arrays can contain oligonucleotide probes of 25, 30, 40, 50, 60, 65, 70–80 bases in length . For example, in situ synthesized arrays for human, mouse, yeast, rat, Arabidopsis, Drosophila, C. elegans, zebrafish and other species, can use 25-mers probes (Affimetrix platform), 50-mers probes (Illumina platform), 60-mers probes (Agilent platform), and 50-75-mers probes (Nimblegen platform) .
Since 1999 Roche NimbleGen provides high-density arrays for advanced gene expression analysis, synthesized by digital light processing and rapid, high-yield photochemistry using Maskless Array Synthesis (MAS) technology. These arrays present the advantage of a custom design allowing specification of the regions of interest or the targeted probes for a tailored array solution in any organism (http://www.nimblegen.com). Nimblegen 25-mers arrays were successfully used in gene expression analyses in bacteria , yeast , and human . Nimblegen 36-mers arrays were used in rice , and 50- to 75-mers were also used in bacteria [15, 16], zebrafish , Mus musculus, human , alga , poplar , Arabidopsis , rice  and many other species.
We have recently developed a flax high-density oligo-microarray platform using Nimblegen technology . Flax (Linum usitatissimum L.) is one of mankind’s oldest cultivated plants and is grown for both its cellulose-rich fibers and for its seeds rich in alpha linolenic acid (ALA, C18:3) . Our current flax platform is based upon a uniplex 385K system consisting of 8 short (25-mers) oligonucleotides per unigene and a total of 48,021 unigenes per slide. This platform represents the first high-density flax microarray system and is currently providing extremely useful biological information [24, 26]. Nevertheless, we wanted to know whether adifferent design based upon long oligonucleotides (60-mers) would improve the performances of our gene expression analyses and consequently increase the yield of meaningful biological information. In order to do this we compared two categories of flax target probes: short (25-mers) oligonucleotides and long (60-mers) oligonucleotides in identical conditions of target production, design, labelling, hybridization, image analyses, and data filtering. This comparison was realized with two different flax samples and each RNA sample was used for the two categories of arrays. Experiments were realized in order to discriminate specific gene expression profiles of two different flax tissues, and results were cross-validated using an independent method (qRT-PCR). In this paper technical parameters of array hybridizations are compared and their relevance for the generation of biologically useful information are discussed.
Results and discussion
The Nimblegen array system is based upon the hybridization of a single labelled sample (derived from RNA), followed by one-channel detection. The intensity of the hybridization signal is then used to determine target concentration. We used two contrasted samples, one from flax inner stem tissue and the other from the outer stem tissue (Additional file 1). These two tissues are easily separated without cross-contamination as previously demonstrated .
Three independent hybridizations were performed for each sample using the two array types (25-mers and 60-mers). After verifying the hybridization quality for all experiments the results obtained using the 2 arrays were compared by evaluating the precision and accuracy of expression measurements; a sub-list of 9 genes was used for the comparison of microarray data and qRT-PCR data.
Our results (GSE37980) showed that all probes present on the two types of array were capable of hybridizing successfully (signal>background). The sensitivity of both array types was demonstrated by the wide signal dynamic range obtained (log2 values of 6 to 15 for 25-mers arrays and 4 to 15 for 60-mers arrays). Hybridization quality was verified using experimental metrics reports (NimbleScan v2.5) as recommended by Roche/Nimblegen. This program generates summary statistics (interquartile density, signal range, uniformity mean, uniformity CV (coefficient of variation), number of empty features on the array, mean empty, the number of random control features present on the array, mean random). All metrics were within the recommended value range indicating that hybridization quality was satisfactory for all experiments and any potential artifacts during hybridization were registered for both array types and for all samples. Raw expression data on all flax hybridization experiments were normalized through RMA (Robust Multi-array Average) algorithms included in the NimbleScan software and 46,589 common targets on both array types were taken into account for further comparisons.
Comparison of the precision of expression measurements
Precision (also called reproducibility or repeatability) represents the degree to which repeated measurements of the same sample hybridization will show the same or similar results . In order to compare the precision of the measurements derived from each array type we used the following criteria: 1) the distribution of inter-slide variation measures; 2) inter-slide correlation of expression profiles.
Comparison of the hybridization signal intensities for all experiments in the two array types (Figure 1A), and of expression measurements between each pair-wise combination of inner vs. outer tissues (Figures 1B and 1C), show that the data from both array types is highly reproducible. Nevertheless, the 60-mers arrays presented globally higher signal intensities and lower variation measures compared to 25-mers array type as shown by the coefficient of variation (CV) and standard deviation (SD) values for both array types (Figure 1C).
Variations in signal intensities of probes corresponding to different regions of the same mRNA target have previously been observed [29, 30], and highly sequence dependent [31–33]. The hybridization efficiency between a probe and its targets is determined by the balance between the binding strength of the probe-target duplex and the formation of probe-probe dimers and secondary structures in either probes or targets [34, 35]. The duplex melting temperature is generally considered as one of the most popular measures in the evaluation of microarray probes. It gives the temperature at which half of all probes form a duplex with their target while the other half are unbound, assuming a simple two state transition . General thermodynamic models of probe-target hybridization have also recently been used to compare 25- and 45 to 75-mers tiling Nimblegen human arrays in order to calculate the thermodynamic parameters and model choice . Differences in the probe sequence seems to explain the specific variations of microarray signal intensities as the melting temperature is different for each probe set. In our experiments the 25-mers arrays were hybridized at 38°C, and the 60-mers arrays at 42°C, conforming to Nimblegen recommendations.
Inter-array comparison of expression profiles (Figure 2) showed that a strong correlation exists between the two arrays. Only a few exceptions were detected for up-regulated profiles (log2 ratio >1) on 60-mers arrays and for down-regulated profiles (log2 ratio <−1) on 25-mers arrays.
Comparison of the accuracy of expression measurements
Accuracy is defined as the degree of conformity of the measured quantity to its actual (true) value [28, 37]. To evaluate this parameter we used: 1) the number of targets showing differences in expression values between each pair-wise combination of replica slides, and 2) the concordance between relative expression values obtained on arrays with those obtained by qRT-PCR for a subset of 9 genes. The number of targets showing significant differences in expression values between each pair-wise combination of inner vs. outer tissues (Figure 3) are given for three different thresholds: -1< log2 ratio >1 (Figures 3A and 3B), -2< log 2ratio >2 (Figures 3C and 3D), and −3< log2 ratio >3 (Figures 3E and 3F). Significantly expressed targets were detected by both arrays at all threshold values used.
The 60-mers arrays showed a much greater sensibility in significant target expression and the number of up- or down-regulated targets was between 4x and 39x more important than with 25-mers arrays depending upon the threshold value used (Additional file 2). This sensibility of detection could be related to the higher intensities of signals in 60-mers arrays (Figure 1A) and the hybridization efficiency that is sequence-dependent (see Comparison of the precision of expression measurements).
The 25-mers arrays seemed to produce accurate measurements as a high number of identified targets were confirmed by 60-mers arrays (between 56 and 100% of significant expression values obtained in 25-mers arrays were detected in 60-mers arrays at the same threshold). Targets that were specifically detected using the 25- or 60-mers arrays generally presented relatively similar log2 ratio values even though they did not necessarily occur within the same threshold range. Only three targets (out of the 46,589 unigenes targeted) showed discordant expression values being significantly up-/down-regulated in one array type as compared to the other (Figure 3G). In an attempt to understand this discordance we measured the expression levels of 2 of these genes (C29324: up-regulated on the 25-mers array and down-regulated on the 60-mers array, and C50701: down-regulated on the 25-mers array and up-regulated on the 60-mers array) by qRT-PCR. Our results (Figures 3G and 4) show that qRT-PCR measurements indicate that C50701 is under-expressed in stem inner tissues in agreement with the 25-mers value but in disagreement with the 60-mers value. In contrast the qRT-PCR results indicated that C29324 was up-regulated in inner stem tissues in agreement with the 60-mers results, but not the 25-mers results. One possible explanation for the observed differences could be the existence of alternative splicing variants. Comparison of splicing predictions between genomic (http://www.phytozome.org) and EST (http://urgi.versailles.inra.fr/Species/Flax/Download-sequences) databases suggest that splice variants could exist for C50701. Nevertheless alignment of 25-mer, 60-mer probes and qRT-PCR primers (Additional file 3) showed that none would be capable of distinguishing the different potential splice variants. Further investigation using microarrays specifically designed as tiling or splice junction arrays could provide further information. Comparison of the genomic sequence, probes and primers for the C29324 unigene (Additional file 3) also provided a possible explanation for the observed discordance between 25-mers and 60-mers values. The genomic sequence (Lus 10011816) and the C29324 unigene show good alignment in the central region of the unigene but are not aligned at both extremities suggesting that the C29324 contig is not correctly assembled. Both 60-mers probes and qRT-PCR primers target the central region (correct) of the gene/transcript whereas the 25-mers target the extremities that are problematic. In conclusion, only 3 out of the 46,589 unigenes (0.0064%) targeted by the 25-mers-arrays and the 60-mers arrays showed discordant significant expressions thereby confirming the overall conformity of probe design for the 2 array platforms. Verification by qRT-PCR and analyses of sequence data for 2 unigenes showing discordant expression values indicated possible explanations for the observed contradictions.
We have previously evaluated the accuracy of the 25-mers platform by qRT-PCR cross-validation using 9 genes . We therefore adopted the same approach to validate the 60-mer platform using a subset of 9 genes showing i) no significant (1<log2 ratio>−1) expression values on both arrays (C24118, C3323, C21991, C2533), ii) significant expression value (log2 ratio>1 or log2 ratio<−1) on one array, but not the other (C57711), iii) significant expression values on both arrays (C602, C822) and iv) significant but opposed expression values (C50701, C29324) (Figure 4). For 5 out of the 9 tested genes (C24118, C3323, C21991, C602, and C2533), qRT-PCR determined expression values were not significantly different from those determined by both flax arrays. For three other genes (C822, C57711, C29324), qRT-PCR expression values were significantly different from 25-mers data, but not from 60-mers data suggesting that the 60-mers array performs better than the 25-mers. The qRT-PCR expression value of only one gene (C50701) was significantly different from 60-mers data but not from 25-mers data. Although examination of genomic and transcript data suggested that different splice variants might exist for this gene as indicated above, neither the 25-mers, nor the 60-mers probes would distinguish the different forms and it is therefore difficult to explain why the 25-mers apparently give a more accurate measure of expression levels for this unigene. The design and use of different qRT-PCR primers and further sequence analyses would enable to clarify this point.
The correlation coefficient was calculated separately between the 60-mers results and the qRT-PCR results for 6 selected genes. We deliberately excluded the unigenes C50701 and C29324 that gave discordant results between the platforms and/or the qRT-PCR data, probably resulting from assembly problems as indicated above. We also decided to exclude the unigene C57711 because of the discordant 25-mers and 60-mers values, but also because we were unable to identify the corresponding genomic sequence. The obtained value (r = 0.9832) indicated a highly statistically significant correlation (Figure 5). A similar calculation for the 25-mers platform (Figure 5) also gave a highly significant (but lower) r value (0.9414). Taken together these data indicate a good accuracy for both array types and are in agreement with other similar studies demonstrating that experimental errors were not a significant source of unwanted variability in expression profiling obtained by Affymetrix U74Av2 arrays transcriptome experiments  or custom made microarrays .
Differential gene expression in flax tissues
Our results showed that the 60-mers array detected a higher number of unigenes differentially expressed between the two flax samples and was therefore more sensitive than the 25-mers array. However, we wondered whether the increased sensitivity also represented an augmentation in the biologically-relevant information. As a first step to answering this question we functionally classified genes showing significant differential expression on the two arrays using GO (Gene Ontology in biological process category) annotations based on blast results and GOA and TAIR gene cross-referenced files [24, 26]. Functional categories of up- and down-regulated genes in inner vs. outer tissues at −1<log2 ratio<1 in the two array types are represented in Figure 6 and Additional file 2. Even if the total number of significantly expressed genes is very different in the two array types (number up-regulated genes: 529 for 25-mers, 5,030 for 60-mers; number down-regulated genes: 1,346 for 25-mers; 4,253 for 60-mers), the percentages of annotated genes involved in different functional groups are very similar for the 2 arrays. For example, 7.96%, (9 genes) and 7.84%, (43 genes) of all genes significantly more expressed in stem inner tissues were assigned to the class ‘secondary metabolites’ in the 25-mers and 60-mers data sets, respectively. Similarly, 18.58%, (102 genes) and 18.8% (277 genes) were assigned to the class ‘response to stress’, and transport 15.93%, (18 genes) and 13.32%, 87 genes) were assigned to the class ‘transport’ in the 25-mers and 60-mers data sets, respectively. The high similarity between functional class percentage values was also observed for genes showing a significant higher expression in stem outer tissues. For example, 12.56% (86 genes) and 11.41% (170 genes) were assigned to the class ‘photosynthesis’, and 19.05% (132 genes) and 18.35% (275 genes) were assigned to the class ‘response to stress’ in the 25-mers and 60-mers data sets, respectively. Generally, these observations are in close agreement with the known physiological roles of these two different tissues [26, 38, 39] and confirm the biological consistency of data reported by both array types. Taken together, these observations suggest that both arrays are able to provide a biologically-coherent global view of the flax stem transcriptome.
In order to better assess whether the increase in the number of significantly expressed genes detected by the 60-mers arrays as compared to the 25-mers arrays represented biologically relevant information we decided to focus on genes encoding enzymes responsible for the biosynthesis of lignin monomers (monolignols) and/or their oxidation (laccases). Our results (Figure 7A) show that the 25-mers array detected 3 significantly expressed unigenes corresponding to 3 multigenes families (Phenylalanine Ammonia Lyase: PAL, 4-Coumarate Ligase: 4CL, Cinnamyl Alcohol Dehydrogenase: CAD) encoding enzymes involved in monolignol biosynthesis. When the 60-mers array was used additional significantly expressed unigenes corresponding to each of these 3 multigene families were detected (5 PAL unigenes, 2 4CL unigenes and 2 CAD unigenes). Nevertheless, the significantly expressed unigene detected by the 25-mers array also corresponded to the most significantly expressed unigene detected by the 60-mers array. In addition, the 60-mers array, but not the 25-mers array, also detected significantly expressed unigenes corresponding to 3 further multigene families encoding enzymes involved in monolignol biosynthesis (Cinnamate 4-Hydroxylase: C4H, Caffeic Acid O-methyltransferase: COMT, Cinnamoyl CoenzymeA Reducatse: CCR). Although the C4H unigene expression level determined by the 25-mer array was just below the cut-off value (0.98), the COMT unigene expression level was considerably inferior (0.37). Similarly all CCR unigene expression levels determined by the 25-mers array were well below the threshold value. Similar observations could be made for those unigenes encoding enzymes (laccases) potentially involved in the oxidative polymerisation step of the lignification process. Interestingly however, one laccase unigene (C37539) showed a significant expression with the 25-mers array but not the 60-mers array.
In order to understand the possible reason for the higher sensitivity of the 60-mers arrays, we focused on PAL unigenes (Figure 7B) and examined signal intensities, unigene lengths, probe Tm, as well as the probe position and coverage of the EST (Additional file 4). Signal intensities were consistently higher on 60-mers arrays, presumably since the probe Tm average was higher, resulting in lower background. No relation was found between unigene length and array sensitivity. Both arrays covered similar unigene region lengths, generally the 60-mers probes cover 240 bp (4 duplicate probes per unigene) and the 25-mers probes cover 200 pb (8 25-bp probes per unigene) (Additional file 4).
All these observations support the hypothesis that hybridization efficiency depends on probe thermodynamic parameters as previously suggested . Similar results were found with the Agilent platform  when 25- and 60-mers arrays were compared. Agilent 60-mers arrays tended to have higher sensitivity, with an average lower detection limit as compared to 25-mers. In contrast, reproducibility of log2 ratio values, system noise and accuracies of log2 ratio determination were comparable between these two microarray types. Similarly, the overall biological information obtained with these 2 arrays was similar in agreement with our observations in flax stem tissues.
Our study compared two different flax Nimblegen high density microarray platforms based on a short- and long-oligonucleotide design. Our results showed that both array types yielded reproducible, accurate and comparable data that are coherent for expression measurements and identification of differentially expressed genes. Nevertheless, we found that the 60-mers arrays gave higher hybridization efficiencies and therefore were more sensitive allowing the detection of a higher number of unigenes involved in the same biological process and/or belonging to the same multigene family. The two flax arrays provide a good resolution of expressed functions; however the 60-mers arrays are more sensitive and provide a more in-depth coverage of candidate genes potentially involved in different biological processes.
Linum usitatissimum L. (cv. Barbara) plants were grown in a growth chamber (light/night cycles 16h (22°C)/8h (19°C), 50% humidity and light intensity of 400 μE s-1 m-2) and harvested after nine weeks of grown. The outer fiber-bearing tissues were peeled off and inner tissues (xylem) from a 15 cm long stem section were cut into short fragments before both tissues were frozen in liquid nitrogen as previously described [24, 26].
Total RNA was isolated from pooled flax inner- and outer-stems using the NucleoSpin® RNA Plant kit (Macherey-Nagel) following manufacturer’s guidelines. To obtain sufficient amount of RNA for microarray analysis (10 μg), a minimum of three extractions with up to 150 mg of fresh tissue were necessary for each sample. To eliminate DNA contamination, on column treatment was done using the RNAse-free DNAse included in the kit. RNA integrity and concentration were evaluated with RNA StdSens Chips using the ExperionTM automated eletrophoresis system (Bio-Rad). For each sample, the three RNA extracts were pooled and final concentrations were adjusted to 1 μg/μL.
Microarray design and oligo synthesis
Two types (25-mers, 60-mers) of high-density flax microarrays based on the Nimblegen 385K design format (Nimblegen Systems, Inc., Madison, WI, USA) each containing a total of 384,168 oligonucleotides were designed. The 25-mers array utilized 8 distinct, 25 bp-long oligos for each of the 48,021 contigs and the 60-mers array utilized 4 duplicate, 60 bp-long oligonucleotides for 46,589 contigs. Microarray contigs were selected from a collection of 59,000 contigs obtained by assembling the GS FLX sequences . The 46,589 contigs targeted by the 60-mers array were also targeted by the 25-mers array allowing direct comparison between the two designs.
cDNA synthesis, labeling and hybridization
Double-stranded cDNA (ds-cDNA) was synthesized from 10 μg of total RNA using an Invitrogen SuperScript ds-cDNA synthesis kit in the presence of 250 ng random hexamer primers. ds-cDNA was cleaned and labeled in accordance with the Nimblegen Gene Expression Analysis protocol (Nimblegen Systems, Inc., Madison, WI, USA). Briefly, ds-cDNA was incubated with 4 μg RNase A (Promega) at 37°C for 10 min and cleaned using phenol:chloroform:isoamyl alcohol, followed by ice-cold absolute ethanol precipitation. For Cy3 labeling of cDNA, the Nimblegen One-Color DNA labeling kit was used according to the manufacturer’s guideline detailed in the Gene Expression Analysis protocol (Nimblegen Systems, Inc., Madison, WI, USA). One μg ds-cDNA was incubated for 10 min at 98°C with 2 OD of Cy3-9mer primer. Then, 100 pmol of deoxynucleoside triphosphates and 100U of the Klenow fragment (New England Biolabs, Ipswich, MA, USA) were added and the mix incubated at 37°C for 2h30. The reaction was stopped by adding 0.1 volume of 0.5 M EDTA, and the labeled ds-cDNA was purified by isopropanol/ethanol precipitation. Microarrays were hybridized at 38°C (25-mers arrays) and at 42°C (60-mers arrays) during 16 to 18h with 6μg of Cy3 labelled ds-cDNA in Nimblegen hybridization buffer/hybridization component A in a hybridization chamber (Hybridization System - Nimblegen Systems, Inc., Madison, WI, USA). Following hybridization, washing was performed using the Nimblegen Wash Buffer kit (Nimblegen Systems, Inc., Madison, WI, USA).
Slides were scanned at 5 μm/pixel resolution using an Axon GenePix 4000B scanner (Molecular Devices Corporation, Sunnyvale, CA, USA) piloted by GenePix Pro 6.0 software (Axon). Scanned images (TIFF format) were then imported into NimbleScan software (Nimblegen Systems, Inc., Madison, WI, USA) for grid alignment and expression data analyses. Expression data were normalized through quantile normalization  and the Robust Multichip Average (RMA) algorithm  included in the NimbleScan software. Identification of genes displaying a change in expression over repetitions was accomplished with a script utilizing library functions in R with a false discovery rate (FDR) of less than 5%. The SAM  was used to identify differentially expressed genes over different conditions. Analysis was completed with the Tree-view clustering program . Functional annotation of differentially-expressed genes was based on Gene Ontology (http://www.geneontology.org/). All the microarray data have been submitted to the Gene Expression Omnibus (GEO) database  with the accession number is GSE37980.
Quantitative reverse transcriptase-PCR (qRT-PCR) analysis
For qRT-PCR analyses, 1 μg of total RNA was reverse-transcribed to single stranded cDNA using the IScript cDNA synthesis kit (Bio-Rad) according to the manufacturer’s instructions. The qRT-PCRs were carried out in 96-wells plates with a MyIQ real time PCR detection system (Bio-Rad) using iQSYBR Green PCR Kit (Bio-Rad) in a reaction volume of 20 μL (5 μL diluted cDNAs, 10 μL of 2× SYBR Green mix and primer pairs at 0.4 μM). Aliquots from the same cDNA solutions were used with all primer sets in each experiment. All PCR reactions were performed under the following conditions: 95°C for 15 min, 40 cycles of 10 s at 95°C and 30 s at 60°C. For each primer pair, a melting curve was generated in order to confirm the specificity of the amplification. The primer sequences used for all target genes are presented in Additional file 5.
Each experiment was repeated on three biological replicates, each one represented by three technical repetitions. PCR reactions on samples lacking the cDNA template or the reverse transcriptase during the cDNA synthesis were also performed as negative controls for each primer pair. The efficiency (E) value of each reaction was between 0.85 and 1.17 with R2 values higher than 0.99.
Data were analysed using Bio-Rad iQ5 software. For each primer pair, a melting curve was generated in order to confirm the specificity of the amplification. The PCR efficiencies (E) for each reaction were between 0.85 and 1.17 with R2 values higher than 0.99. The expression of each gene was normalized by using 2 reference genes, ETIF1 and ETIF4F, shown to be expressed in a stable manner in flax stem tissues .
SF is a researcher in post doctorat position; MC is a PhD student; SG is bioinformatics ingeneer; RH is researcher; GN is assistant professor in charge of qRT-PCR platform; NR is the head of Biogemma’s transcriptomics group; BT is professor; SH is professor and the Plant FibreTeam leader; AL is assistant professor in charge of transcriptomic platform.
Next generation sequencing
Robust multi-array average
Coefficient of variation
quantitative Real Time Polymerase Chain Reaction
Gene ontology annotation
The arabidopsis information resource
Phenylalanine ammonia lyase
Cinnamyl alcohol dehydrogenase
Caffeic acid O-methyltransferase
Cinnamoyl coenzymeA reducatse
Expressed sequence tag
Gene expression omnibus.
Bakay M, Chen YW, Borup R, Zhao P, Nagaraju K, Hoffman EP: Sources of variability and effect of experimental approach on expression profiling data interpretation. BMC Bioinformatics. 2002, 3: 4-10.1186/1471-2105-3-4.
Ramdas L, Coombes KR, Baggerly K, Abruzzo L, Highsmith WE, Krogmann T, Hamilton SR, Zhang W: Sources of nonlinearity in cDNA microarray expression measurements. Genome Biol. 2001, 2 (11): 1-7.
Dobbin KK, Kawasaki ES, Petersen DW, Simon RM: Characterizing dye bias in microarray experiments. Bioinformatics. 2005, 17: 17-
Woo Y, Affourtit J, Daigle S, Viale A, Johnson K, Naggert J, Churchill G: A comparison of cDNA, oligonucleotide, and Affymetrix GeneChip gene expression microarray platforms. J Biomol Tech. 2004, 5: 276-284.
Culhane AC, Perriere G, Higgins DG: Cross-platform comparison and visualisation of gene expression data using co-inertia analysis. BMC Bioinformatics. 2003, 4: 59-10.1186/1471-2105-4-59.
Bloom G, Yang IV, Boulware D, Kwong KY, Coppola D, Eschrich S, Quackenbush J, Yeatman TJ: Multi-platform, multi-site, microarray- based human tumor classification. Am J Pathol. 2004, 164: 9-16. 10.1016/S0002-9440(10)63090-8.
Wright G, Tan B, Rosenwald A, Hurt EH, Wiestner A, Staudt LM: A gene expression-based method to diagnose clinically distinct subgroups of diffuse large B cell lymphoma. Proc Natl Acad Sci USA. 2003, 100: 9991-9996. 10.1073/pnas.1732008100.
Stevens JR, Doerge RW: Combining Affymetrix microarray results. BMC Bioinformatics. 2005, 6: 57-10.1186/1471-2105-6-57.
Barrett JC, Kawasaki ES: Microarrays: the use of oligonucleotides and cDNA for the analysis of gene expression. Drug Discov Today. 2003, 8: 134-141. 10.1016/S1359-6446(02)02578-3.
Ulijasz AT, Andes DR, Glasner JD, Weisblum B: Regulation of iron transport in Streptococcus pneumoniae by RitR, an orphan response regulator. J Bacteriol. 2004, 186: 8123-8136. 10.1128/JB.186.23.8123-8136.2004.
Morschhäuser J, Barker KS, Liu TT, BlaB-Warmuth J, Homayouni R, Rogers PD: The transcription factor Mrr1p controls expression of the MDR1 efflux pump and mediates multidrug resistance in Candida albicans. PLoS Pathog. 2007, 3 (11): e164-10.1371/journal.ppat.0030164.
Chiu S, Kim K, Haus KA, Espinal GM, Millon LV, Warden CH: Identification of positional candidate genes for body weight and adiposity in subcongenic mice. Physiol Genomics. 2007, 31 (1): 75-85. 10.1152/physiolgenomics.00267.2006.
Stolc V, Li L, Wang X, Li X, Su N, Tongprasit W, Han B, Xue Y, Li J, Snyder M, Gerstein M, Wang J, Deng XW: A pilot study of transcription unit analysis in rice using oligonucleotide tiling-path microarray. Plant Mol Biol. 2005, 59 (1): 137-149. 10.1007/s11103-005-6164-5.
Abomoelak B, Hoye EA, Chi J, Marcus SA, Laval F, Bannantine JP, Ward SK, Daffé M, Liu HD, Talaat AM: MosR, a novel transcriptional regulator of hypoxia and virulence in Mycobacterium tuberculosis. J Bacteriol. 2009, 191 (19): 5941-5952. 10.1128/JB.00778-09.
Subramoni S, Venturi V: PpoR is a conserved unpaired LuxR solo of Pseudomonas putida which binds N-acyl homoserine lactones. BMC Microbiol. 2009, 9: 125-10.1186/1471-2180-9-125.
Bai X, Kim J, Yang Z, Jurynec MJ, Akie TE, Lee J, LeBlanc J, Sessa A, Jiang H, DiBiase A, Zhou Y, Grunwald DJ, Lin S, Cantor AB, Orkin SH, Zon LI: TIF1gamma controls erythroid cell fate by regulating transcription elongation. Cell. 2010, 142 (1): 133-143. 10.1016/j.cell.2010.05.028.
Badders NM, Goel S, Clark RJ, Klos KS, Kim S, Bafico A, Lindvall C, Williams BO, Alexander CM: The Wnt receptor, Lrp5, is expressed by mouse mammary stem cells and is required to maintain the basal lineage. PLoS One. 2009, 4 (8): e6594-10.1371/journal.pone.0006594.
Borghese B, Mondon F, Noël J, Fayt I, Mignot T, Vaiman D, Chapron C: Gene expression profile for ectopic versus eutopic endometrium provides new insights into endometriosis oncogenic potential. Mol Endocrinol. 2008, 22 (11): 2557-2562. 10.1210/me.2008-0322.
Dittami S, Scornet D, Petit J, Ségurens B, da Silva C, Corre E, Dondrup M, Glatting K, König R, Sterck L, Rouzé P, van de Peer Y, Cock J, Boyen C, Tonon T: Global expression analysis of the brown alga Ectocarpus siliculosus (Phaeophyceae) reveals large-scale reprogramming of the transcriptome in response to abiotic stress. Genome Biol. 2009, 10 (6): R66-10.1186/gb-2009-10-6-r66.
Ramírez-Carvajal GA, Morse AM, Dervinis C, Davis JM: The cytokinin type-B response regulator PtRR13 is a negative regulator of adventitious root development in Populus. Plant Physiol. 2009, 150: 759-771. 10.1104/pp.109.137505.
Adamczyk BJ, Fernandez DE: MIKC* MADS domain heterodimers are required for pollen maturation and tube growth in Arabidopsis. Plant Physiol. 2009, 149 (4): 1713-1723. 10.1104/pp.109.135806.
Kim EH, Kim YS, Park S, Koo YJ, Choi YD, Chung Y, Lee I, Kim J: Methyl jasmonate reduces grain yield by mediating stress signals to alter spikelet development in rice. Plant Physiol. 2009, 149 (4): 1751-1760. 10.1104/pp.108.134684.
Fenart S, Ndong YPA, Duarte J, Rivière N, van Wuytswinkel O, Lucau A, Cariou E, Neutelings G, Gutierrez L, Chabbert B, Guillot X, Tavernier R, Hawkins S, Thomasset B: Development and validation of a flax (Linum usitatissimum L.) gene expression oligo microarray. BMC Genomics. 2010, 11: 592-10.1186/1471-2164-11-592.
Deyholos MK: Bast fiber of flax (Linum usitatissimum L.): Biological foundations of its ancient and modern uses. Israel J Plant Sci. 2007, 54: 273-280. 10.1560/IJPS_54_4_273.
Huis R, Morreel K, Fliniaux O, Lucau-Danila A, Fénart S, Grec S, Neutelings G, Chabbert B, Mesnard F, Boerjan W, Hawkins S: Natural hypolignification is associated with extensive oligolignol accumulation in flax stems. Plant Physiol. 2012, 158 (4): 1893-1915. 10.1104/pp.111.192328.
Day A, Ruel K, Neutelings G, Crônier D, David H, Hawkins S, Chabbert B: Lignification in the flax stem: evidence for an unusual lignin in bast fibers. Planta. 2005, 222 (2): 234-245. 10.1007/s00425-005-1537-1.
Draghici S, Khatri P, Eklund AC, Szallasi Z: Reliability and reproducibility issues in DNA microarray measurements. Trends Genet. 2006, 22 (2): 101-109. 10.1016/j.tig.2005.12.005.
Bruun GM, Wernersson R, Juncker AS, Willenbrock H, Nielsen HB: Improving comparability between microarray probe signals by thermodynamic intensity correction. Nucleic Acids Res. 2007, 35 (7): e48-10.1093/nar/gkl1098.
Held GA, Grinstein G, Tu Y: Relationship between gene expression and observed intensities in DNA microarrays-a modeling study. Nucleic Acids Res. 2006, 34 (9): e70-10.1093/nar/gkl122.
Zhang L, Miles MF, Aldape KD: A model of molecular interactions on short oligonucleotide microarrays. Nat Biotechnol. 2003, 21 (7): 818-821. 10.1038/nbt836.
Wu Z, Irizarry R, Gentleman R, Murillo FM, Spencer F: A model based background adjustement for oligonucleotide expression arrays. JASA. 2004, 99: 909-917. 10.1198/016214504000000683.
Royce TE, Rozowsky JS, Gerstein MB: Toward a universal microarray: prediction of gene expression through nearest-neighbor probe sequence identification. Nucleic Acids Res. 2007, 35 (15): e99-10.1093/nar/gkm549.
Matveeva OV, Shabalina SA, Nemtsov VA, Tsodikov AD, Gesteland RF, Atkins JF: Thermodynamic calculations and statistical correlations for oligo-probes design. Nucleic Acids Res. 2003, 31 (14): 4211-4217. 10.1093/nar/gkg476.
Khomyakova E, Livshits MA, Steinhauser MC, Dauphinot L, Cohen-Kaminsky S, Rossier J, Soussaline F, Potier MC: On-chip hybridization kinetics for optimization of gene expression experiments. Biotechniques. 2008, 44: 109-117. 10.2144/000112622.
Mueckstein U, Leparc GG, Posekany A, Hofacker I, Kreil DP: Hybridization thermodynamics of nimblegen microarrays. BMC Bioinformatics. 2010, 11: 35-10.1186/1471-2105-11-35.
Moreau Y, Aerts S, De Moor B, De Strooper B, Dabrowski M: Comparison and meta-analysis of microarray data: from the bench to the computer desk. Trends Genet. 2003, 19 (10): 570-577. 10.1016/j.tig.2003.08.006.
Roach MJ, Deyholos MK: Microarray analysis of flax (Linum usitatissimum L.) stems identifies transcripts enriched in fibre-bearing phloem tissues. Mol Genet Genomics. 2007, 278: 149-165. 10.1007/s00438-007-0241-1.
Roach MJ, Mokshina NY, Badhan A, Snegireva AV, Hobson N, Deyholos MK, Gorshkova TA: Development of cellulosic secondary walls in flax fibers requires β-galactosidase. Plant Physiol. 2011, 156 (3): 1351-1363. 10.1104/pp.111.172676.
Fulmer-Smentek SB: Performance comparison of Agilent’s 60-mer and 25-mer in situ synthesized oligonucleotide microarrays. Agilent Technologies. 2003, 5977-5988. EN. http://www.chem.agilent.com/temp/radEB25E/00042209.pdf
Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003, 19 (2): 185-193. 10.1093/bioinformatics/19.2.185.
Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003, 4 (2): 249-264. 10.1093/biostatistics/4.2.249.
Tusher VG, Tibshirani R, Chu G: Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA. 2001, 98 (9): 5116-5121. 10.1073/pnas.091062498.
Eisen MB, Spellman PT, Brown PO, Botstein D: Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci USA. 1998, 95: 14863-14868. 10.1073/pnas.95.25.14863.
Gene Expression Omnibus database.http://www.ncbi.nlm.nih.gov/geo,
Huis R, Hawkins S, Neutelings G: Selection of reference genes for quantitative gene expression normalization in flax (Linum usitatissimum L.). BMC Plant Biol. 2010, 10: 71-10.1186/1471-2229-10-71.
Authors gratefully acknowledge financial support of the French Nord-Pas de Calais Region (Plant Teq 4 project), RH gratefully acknowledges the support of the French government (PhD grant) and MC gratefully acknowledges the support of the ANR (Agence National de Recherche) (PhD grant KBBE FIBRAGEN project). This study was also supported by the ANR project GENOLIN.
The authors declare that they have no competing interests. No funding either now or in the future, no stocks or shares was or will be perceived, no interest or financing is known from an organisation for this manuscript. No patents relating to the content of the manuscript are known, and no non-financial competing interests are to declare in relation to this manuscript.
SF and RH realized the plant cultures, RNA extractions, and transcriptomic analyses. MC and GN performed the qRT-PCR analyses, SG executed the biocomputing analyses of the results, NR and BT contributed to the microarrays design and construction, SH participated in the writing and global interpretations of paper, AL organized the various strategies of analyses and interpretation and drafted the paper. All authors read and approved the final manuscript.
Electronic supplementary material
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.