Spike-in validation of an Illumina-specific variance-stabilizing transformation

Dunning, Mark J; Ritchie, Matthew E; Barbosa-Morais, Nuno L; Tavaré, Simon; Lynch, Andy G

doi:10.1186/1756-0500-1-18

Short Report
Open access
Published: 04 June 2008

BMC Research Notes volume 1, Article number: 18 (2008)

Abstract

Background

Variance-stabilizing techniques have been used for some time in the analysis of gene expression microarray data. A new adaptation, the variance-stabilizing transformation (VST), has recently been developed to take advantage of the unique features of Illumina BeadArrays. VST has been shown to perform well in comparison with the widely-used approach of taking a log₂ transformation, but has not been validated on a spike-in experiment. We apply VST to the data from a recently published spike-in experiment and compare it both to a regular log₂ analysis and a recently recommended analysis that can be applied if all raw data are available.

Findings

VST provides more power to detect differentially expressed genes than a log₂ transformation. However, the gain in power is roughly the same as utilizing the raw data from an experiment and weighting observations accordingly. VST is still advantageous when large changes in expression are anticipated, while a weighted log₂ approach performs better for smaller changes.

Conclusion

VST can be recommended for summarized Illumina data regardless of which Illumina pre-processing options have been used. However, using the raw data is still encouraged whenever possible.

Background

Gene expression microarrays allow messenger RNA (mRNA) abundance to be quantified quickly and cost-effectively on a genome-wide scale. The production of mRNA is a key step in the process that leads from the information contained within DNA to the formation of the proteins that act within a cell. Quantifying the abundance of mRNA is therefore of interest because it provides much information regarding the state of the cell [1].

Microarrays for measuring mRNA expression make use of probes that hybridize to fluorescently-labelled sample material, where the measured level of fluorescence is used to infer the expression level of each interrogated gene. Traditionally they are constructed by attaching probes directly to a specific point on the array's surface. By contrast, the BeadArray expression platform developed by Illumina makes use of probes attached to beads that are subsequently randomly arranged on the array surface [2]. There are approximately 30 beads for each type of probe (a high degree of replication for a microarray), providing robustness against systematic spatial influences on the array.

For BeadArrays, the raw (bead-level) intensity information is stored in a proprietary format. Until recently, only summarized output (averaged values over the replicate beads on a given array) was available from Illumina's analysis software (BeadScan and BeadStudio). As such, most published studies make use of summarized Illumina data, a state that leaves the low-level (but vital) steps (e.g. image analysis, background correction, summarization) beyond the control of the data analyst.

The data from microarray experiments generally require transformation in order to facilitate simple analyses such as the confident fitting of basic linear models. Variance-stabilizing transformations are applied to microarray data in order to remove the mean-variance relationship in intensities. A log₂ transformation is the simplest variance-stabilizing transformation commonly applied to microarray data. Other, more sophisticated approaches have been developed, such as the variance-stabilizing normalisation (VSN) method of Huber et al. [3] and that of Durbin et al. [4].

The VST method [5] is an adaptation of the VSN methodology for Illumina data that exploits the replicate beads on the array. It is defined for intensity x as

f(x) = \frac{1}{c_{1}} \operatorname{arcsinh}\left( \frac{c_{2}}{\sqrt{c_{3}}} + \frac{c_{1} x}{\sqrt{c_{3}}} \right)

where c₃ is defined as the variance of bead types that estimate background noise and c₂ and c₁ respectively represent additive and multiplicative levels of error in the intensity.
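
As a minimal sketch of the formula above (not the lumi implementation), the transformation can be written in a few lines of Python. The function name and the parameter values are illustrative assumptions; in practice c₁, c₂ and c₃ are estimated from the data.

```python
import math

def vst(x, c1, c2, c3):
    """Apply f(x) = arcsinh(c2/sqrt(c3) + c1*x/sqrt(c3)) / c1 to one intensity."""
    root = math.sqrt(c3)
    return math.asinh(c2 / root + c1 * x / root) / c1

# Hypothetical parameter values, chosen only for illustration.
C1, C2, C3 = 0.1, 5.0, 400.0

# Unlike log2, the transformation is defined for zero and negative intensities.
transformed = [vst(x, C1, C2, C3) for x in (-50.0, 0.0, 100.0, 10000.0)]
```

Because arcsinh(z) is approximately ln(2z) for large z, a function of this form behaves like a logarithm at high intensities while remaining finite and monotonic near and below zero.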

Using previously published data [6], VST was found to outperform the log₂ transformation, based on the results of a mixture experiment where each sample was a pool of blood and placenta at various ratios. However, the authors commented on the lack, at that time, of a publicly available spike-in experiment, a data set that would have provided an ideal test for their method.

Coinciding with the publication of VST, Dunning et al. [7] published an independent account of such a spike-in experiment using customized MouseWG-6 BeadArrays. In addition to the approximately 48,000 probes (bead types) included as standard, the content of these chips was modified to include 33 probes targeting bacterial and viral genes absent from the mouse genome. These "spikes" were added at specific concentrations on each array, and hence the relative change in expression level of a particular spike between arrays is known a priori. The expression levels of the remaining probes ("non-spikes") should not change between arrays. Twelve different concentrations of spike were used (1000 pM, 300 pM, 100 pM, 30 pM, 10 pM, 3 pM, 1 pM, 0.3 pM, 0.1 pM, 0.03 pM, 0.01 pM and 0 pM) and each was replicated four times. Control experiments such as this have proven useful for comparing low-level analysis methods in other microarray platforms, such as Affymetrix [8].

Access to the raw bead-level data allows some of the low-level analysis steps to be explored in greater detail [7]. In particular, it was shown that the local background correction and summarization steps carried out by BeadScan and BeadStudio reduce bias and produce robust summary measurements.

The "Background normalisation" method (BGN) available in BeadStudio adjusts the intensities on each array by subtracting the average expression level of the negative controls (probes that have no targets in the genome being studied) in order that arrays might have comparable baselines. In the analysis of the spike-in experiment, it was shown that BGN resulted in many negative values, and also in increased variability of intensity at low expression levels when combined with the standard log₂ transformation. Concordant with previously published observations [6], it was concluded that BGN is not desirable.
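
The effect described above can be illustrated with a small sketch. The function names, the intensities and the choice of offset (shifting the smallest value to 1) are assumptions made for illustration; they are not taken from BeadStudio or from the analysis scripts.

```python
import math

def background_normalise(intensities, negative_controls):
    """Subtract the mean negative-control intensity from every probe on one array."""
    baseline = sum(negative_controls) / len(negative_controls)
    return [x - baseline for x in intensities]

def log2_with_offset(values):
    """log2 after shifting so the smallest value becomes 1 (only if needed)."""
    smallest = min(values)
    offset = 1.0 - smallest if smallest <= 0 else 0.0
    return [math.log2(v + offset) for v in values]

# Hypothetical summarized intensities for one array.
negative_controls = [95.0, 100.0, 105.0]
probes = [90.0, 98.0, 110.0, 2000.0]

adjusted = background_normalise(probes, negative_controls)
# adjusted is [-10.0, -2.0, 10.0, 1900.0]: low-intensity probes turn negative
logged = log2_with_offset(adjusted)
```

After subtraction, the probes near background sit close to zero, where small absolute differences become large differences on the log₂ scale.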

It has also been shown that, by using the variances of each bead type as inverse weights, the performance of linear models intended to detect differentially expressed (DE) genes can be improved [7]. This approach is generally only possible if bead-level data are available and a log₂ transformation is applied prior to calculating bead type averages and variances. We shall refer to this approach as a weighted log₂ analysis. Other advantages of having access to data at the bead-level were also shown. Naturally such data allow for detailed quality control and also for greater flexibility in the choice of statistical model.
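
The idea of inverse-variance weighting can be sketched for the simplest possible model, a weighted mean over replicate arrays. This is an illustration only: the function names and bead intensities are hypothetical, and the weight is assumed to be exactly 1/variance, which may differ in detail from the weights used in [7].

```python
import math
import statistics

def summarise_log2(bead_intensities):
    """Mean and variance of log2 bead intensities for one bead type on one array."""
    logged = [math.log2(b) for b in bead_intensities]
    return statistics.mean(logged), statistics.variance(logged)

def weighted_mean(summaries):
    """Inverse-variance weighted mean of (mean, variance) summaries."""
    weights = [1.0 / var for _, var in summaries]
    total = sum(w * m for w, (m, _) in zip(weights, summaries))
    return total / sum(weights)

# Hypothetical bead-level intensities for one bead type on two replicate arrays.
tight_array = [256.0, 250.0, 262.0, 258.0]   # beads agree closely
noisy_array = [512.0, 128.0, 1024.0, 300.0]  # beads disagree, so low weight

summaries = [summarise_log2(tight_array), summarise_log2(noisy_array)]
estimate = weighted_mean(summaries)
unweighted = statistics.mean(m for m, _ in summaries)
```

The weighted estimate stays close to the array whose beads agree, whereas the unweighted mean is pulled towards the noisy array. Note that the log₂ transformation is applied to the bead intensities before the mean and variance are computed, which is why bead-level data are needed.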

In this paper, we apply VST to data from the spike-in experiment. This offers further validation of the VST method, not only because the estimation of differential expression can be objectively assessed, but also because the microarray used is different: the mixture data used to validate VST were from a HumanRef-8 BeadArray with some 22,000 probes, rather than the 48,000-probe MouseWG-6 BeadArray used in the spike experiment. By design, BeadArrays with 48,000 probes tend to have many more probes at low intensity [7] than the HumanRef-8 BeadArray that only contains probes taken from a curated database. Since 48,000-probe BeadArrays are more widely used, it is important to confirm that VST can be applied to these higher density arrays with no impairment due to the different distribution of intensities. Additionally, we will investigate whether VST can reduce some of the problems encountered when applying a standard log₂ transformation after BGN.

Methods

The bead-level data for the spike-in experiment were read by beadarray [9] (version 1.7.11) using the default background subtraction method. These bead intensities were then filtered using a 3 median absolute deviation cut-off to remove outliers. The data were summarized and transformed (VST or log₂) as appropriate, and the arrays were then quantile normalized. The bead-level data were reprocessed using both background subtraction and background normalization, and the 'lumi' software package [5] (version 1.5.17) was used to apply either a VST (with the default settings) or a log₂ transformation (with an offset added if necessary to avoid negative values).
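
Two of the steps above, outlier removal and quantile normalization, can be sketched as follows. This is not the beadarray code: the function names and data are hypothetical, the median absolute deviation is assumed to be unscaled, and ties are ignored in the normalization.

```python
import statistics

def filter_outliers(beads, n_mads=3.0):
    """Keep beads within n_mads median absolute deviations of the median."""
    centre = statistics.median(beads)
    mad = statistics.median(abs(b - centre) for b in beads)
    return [b for b in beads if abs(b - centre) <= n_mads * mad]

def quantile_normalise(arrays):
    """Give every array the same distribution: the mean of the sorted values."""
    n_probes = len(arrays[0])
    sorted_arrays = [sorted(a) for a in arrays]
    reference = [statistics.mean(s[i] for s in sorted_arrays)
                 for i in range(n_probes)]
    normalised = []
    for a in arrays:
        # Indices of this array's probes, ordered from lowest to highest value.
        order = sorted(range(n_probes), key=lambda i: a[i])
        out = [0.0] * n_probes
        for rank, i in enumerate(order):
            out[i] = reference[rank]
        normalised.append(out)
    return normalised

# Hypothetical replicate beads for one bead type; 500 is an outlier.
kept = filter_outliers([100.0, 102.0, 98.0, 101.0, 99.0, 500.0])

# Hypothetical summarized values: two arrays, three probes each.
normalised = quantile_normalise([[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]])
```

After normalization each array contains the same set of values, and each probe keeps its rank within its own array.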

The linear model and subsequent analyses used to find DE genes between arrays with different spike concentrations have been previously described [7]. We obtained log-odds scores quantifying the evidence for differential expression for both the spike and non-spike probes. The 12 spike concentrations allow for construction of 6 independent contrasts. We considered two sets: one where neighbouring concentrations are compared to provide the greatest challenge for differentiation (1st concentration vs 2nd, 3rd concentration vs 4th, etc.) and one where a range of effect sizes would be observed by contrasting pairs symmetric about the middle concentrations (1st concentration vs 12th, 2nd concentration vs 11th, etc.). Finally, a series of smaller models were fitted, where only the 8 (of the 48) arrays featuring in the contrast of interest (4 arrays for each concentration) were considered.
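
The two sets of contrasts can be enumerated explicitly. The sketch below assumes that the concentrations are ordered from highest to lowest, as they are listed in the Background; the array labels are hypothetical.

```python
# The twelve spike concentrations (pM), ordered from highest to lowest.
concentrations = [1000, 300, 100, 30, 10, 3, 1, 0.3, 0.1, 0.03, 0.01, 0]

# Set 1: neighbouring concentrations (1st vs 2nd, 3rd vs 4th, ...).
neighbours = [(concentrations[i], concentrations[i + 1]) for i in range(0, 12, 2)]

# Set 2: pairs symmetric about the middle (1st vs 12th, 2nd vs 11th, ...).
symmetric = [(concentrations[i], concentrations[11 - i]) for i in range(6)]

# Four replicate arrays per concentration; the labels are hypothetical.
arrays = {c: [f"{c}pM_rep{r}" for r in range(1, 5)] for c in concentrations}

def arrays_for_contrast(pair):
    """The 8 arrays used when a smaller model is fitted to one contrast."""
    high, low = pair
    return arrays[high] + arrays[low]
```

Under this ordering the neighbouring set includes 100 pM vs 30 pM, and the symmetric set runs from 1000 pM vs 0 pM down to 3 pM vs 1 pM.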

ROC curves were also plotted, but not found to be informative as the spikes were consistently selected as DE, with very few false positives (data not shown). All data and scripts used in the analysis are available as supplementary material [10].

Results

Figure 1 shows MA-plots comparing arrays with spikes at 3 pM and 1 pM. When BGN is not used, VST reduces the range of observed log-ratios for the probes we expect not to change. In the absence of BGN both the log₂ transformation and VST separate the spikes well from the non-spikes, but the log-fold changes achieved from the log₂ transformation exhibit less bias.
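
For reference, the quantities plotted in an MA-plot can be computed as in the sketch below, shown here on the log₂ scale. The intensities are hypothetical, and the expected log-ratio assumes the ideal case in which intensity is proportional to spike concentration.

```python
import math

def ma_values(intensity_a, intensity_b):
    """Return (M, A): the log2-ratio and the mean log2 intensity of one probe."""
    log_a, log_b = math.log2(intensity_a), math.log2(intensity_b)
    return log_a - log_b, (log_a + log_b) / 2.0

# Hypothetical intensities for one spike on a 3 pM array and a 1 pM array,
# assuming (ideally) that intensity is proportional to concentration.
m_spike, a_spike = ma_values(300.0, 100.0)

# Hypothetical non-spike probe: unchanged between arrays, so M should be near 0.
m_non_spike, a_non_spike = ma_values(150.0, 150.0)

expected_m = math.log2(3.0 / 1.0)  # about 1.585 for a three-fold change
```

Bias in this setting means the gap between the observed M for a spike and the value expected from the known concentrations. The same differences and averages can be formed from VST-transformed values in place of log₂ values.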

Applying the transformations after BGN, we see that the MA-plot for VST is little changed. By contrast, the combination of BGN and log₂ transformation is to be avoided, with much-reduced ability to separate out the spikes from the non-spikes by considering the log₂-ratio, as we have previously noted [7].

We fitted three linear models to the entire spike-in experiment: one using VST, one using a log₂ transformation, and the weighted log₂ analysis. For two of the linear models at a time, Figure 2 displays the differences in log-odds calculated for six contrasts. VST is seen to lead to a more powerful test than a standard log₂ transformation, producing higher log-odds values for the spikes (Figure 2a/2c). At the same time, values for the non-spikes were not appreciably altered (data not shown). The difference between VST and log₂ is seen to decrease as the spike concentrations get closer together (Figure 2c).

When comparing VST to a weighted log₂ analysis (Figure 2b/2d), VST is seen to be more powerful for detecting differential expression for large differences, but the weighted log₂ analysis outperforms VST for finer comparisons (such as 100 pM vs 30 pM and 3 pM vs 1 pM).

When the models are fitted to only the arrays involved in the contrast of interest (Figure 3), the same broad results are seen. The weighted log₂ analysis, however, begins to show more sensitivity than VST even at quite extreme comparisons (e.g. 100 pM vs 0.03 pM).

Discussion

In agreement with the original investigation into VST, we find that VST offers improvements over a standard log₂ analysis. Thus, users with only the summarized output from BeadStudio will find this method beneficial. In particular, VST can cope with data that have been background normalized (BGN is implemented as the "subtract background" option in recent versions of BeadStudio). This should not be confused with local background subtraction that has already been applied to bead-level data prior to summarization.

Using a published spike-in experiment we are also able to show that VST offers greater ability to detect DE genes compared to a log₂ transformation. This improvement was seen to diminish as the spike concentrations being compared become closer. At the same time, a weighted log₂ analysis had more power than VST for finer concentration differences.

In our initial analysis of the spike-in experiment, we used all 48 arrays in the linear model. The size of such an experiment may not be typical for some researchers and therefore we repeated the analysis using fewer arrays. In this smaller experiment, VST was seen to have marginally improved log-odds over a regular log₂ analysis. Under these conditions the weighted log₂ analysis was seen to improve the detection of DE genes in most cases, especially when comparing arrays with similar spike concentrations. We note that a weighted log₂ analysis is compromised without access to bead-level data. It would be beneficial if Illumina's software had the option to work with data on the log₂ scale when creating summarized data.

In summary, we have shown that the VST method does indeed perform well, and can be applied to the popular 48,000 probe BeadArrays. However, there are still benefits to having access to the raw data.

References

1. Parmigiani G, Garrett ES, Irizarry RA, Zeger SL: The analysis of gene expression data: an overview of methods and software. The Analysis of Gene Expression Data: methods and software. Edited by: Parmigiani G, Garrett ES, Irizarry RA, Zeger SL. 2003, New York: Springer-Verlag
2. Kuhn K, Baker SC, Chudin E, Lieu MH, Oeser S, Bennett H, Rigault P, Barker D, McDaniel TK, Chee MS: A novel, high-performance random array platform for quantitative gene expression profiling. Genome Research. 2004, 14 (11): 2347-2356. 10.1101/gr.2739104.
3. Huber W, Von Heydebreck A, Sültmann H, Poustka A, Vingron M: Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics. 2002, 18 (Suppl 1): S96-S104.
4. Durbin BP, Hardin JS, Hawkins DM, Rocke DM: A variance-stabilizing transformation for gene-expression microarray data. Bioinformatics. 2002, 18 (Suppl 1): S105-S110.
5. Lin S, Du P, Huber W, Kibbe WA: Model-based variance-stabilizing transformation for Illumina data. Nucleic Acids Research. 2008, 36 (2): e11. 10.1093/nar/gkm1075.
6. Barnes M, Freudenberg J, Thompson S, Aronow B, Pavlidis P: Experimental comparison and cross-validation of the Affymetrix and Illumina gene expression analysis platforms. Nucleic Acids Research. 2005, 33 (18): 5914-5923. 10.1093/nar/gki890.
7. Dunning MJ, Barbosa-Morais NL, Lynch AG, Tavaré S, Ritchie ME: Statistical issues in the analysis of Illumina data. BMC Bioinformatics. 2008, 9: 85. 10.1186/1471-2105-9-85.
8. Cope LM, Irizarry RA, Jaffee HA, Wu Z, Speed TP: A benchmark for Affymetrix GeneChip expression measures. Bioinformatics. 2004, 20 (3): 323-331. 10.1093/bioinformatics/btg410.
9. Dunning MJ, Smith ML, Ritchie ME, Tavaré S: beadarray: R classes and methods for Illumina bead-based data. Bioinformatics. 2007, 23 (16): 2183-2184. 10.1093/bioinformatics/btm311.
10. University of Cambridge, Computational Biology Group Resources. [http://www.compbio.group.cam.ac.uk/resources.html]

Acknowledgements

We thank Gordon Smyth and Wei Shi for useful comments and discussions and Matthew Eldridge for feedback on the manuscript. The authors were supported in part by grants from the MRC (MJD), CRUK grant number C14303/A8646 (MJD, NLBM, ST and AGL) and the Isaac Newton Trust (MER). We would like to acknowledge the support of The University of Cambridge, Cancer Research UK and Hutchison Whampoa Limited.

Author information

Authors and Affiliations

Department of Oncology, University of Cambridge, CRUK Cambridge Research Institute, Li Ka Shing Centre, Robinson Way, Cambridge, CB2 0RE, UK
Mark J Dunning, Matthew E Ritchie, Nuno L Barbosa-Morais, Simon Tavaré & Andy G Lynch

Authors

Mark J Dunning
Matthew E Ritchie
Nuno L Barbosa-Morais
Simon Tavaré
Andy G Lynch

Corresponding author

Correspondence to Andy G Lynch.

Additional information

Authors' contributions

MJD performed the analysis under the supervision of MER and AGL. NLBM provided expertise on the comparison of arrays. The study was conceived out of a discussion between all of the authors. MJD, MER and AGL drafted the manuscript. All authors read, revised and approved the final manuscript.

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Dunning, M.J., Ritchie, M.E., Barbosa-Morais, N.L. et al. Spike-in validation of an Illumina-specific variance-stabilizing transformation. BMC Res Notes 1, 18 (2008). https://doi.org/10.1186/1756-0500-1-18

Received: 29 February 2008
Accepted: 04 June 2008
Published: 04 June 2008
DOI: https://doi.org/10.1186/1756-0500-1-18