Universal disease biomarker: can a fixed set of blood microRNAs diagnose multiple diseases?

Taguchi, Y-h; Murakami, Yoshiki

doi:10.1186/1756-0500-7-581

Research article
Open access
Published: 30 August 2014



BMC Research Notes volume 7, Article number: 581 (2014)


Abstract

Background

The selection of disease biomarkers is often difficult because of their unstable identification, i.e., the selection of biomarkers is heavily dependent upon the set of samples analyzed and the use of independent sets of samples often results in a completely different set of biomarkers being identified. However, if a fixed set of disease biomarkers could be identified for the diagnosis of multiple diseases, the difficulties of biomarker selection could be reduced.

Results

In this study, the previously identified universal disease biomarker (UDB) consisting of blood miRNAs that could discriminate between patients with multiple diseases and healthy controls was extended to the recently reported independent measurements of blood microRNAs (miRNAs). The performance achieved by UDB in an independent set of samples was competitive with performances achieved with biomarkers selected using lasso, a standard, heavily sample-dependent procedure. Furthermore, the development of stable feature extraction was suggested to be a key factor in constructing more efficient and stable (i.e., sample- and disease-independent) UDBs.

Conclusions

The previously proposed UDB was successfully extended to an additional seven diseases and is expected to be useful for the diagnosis of other diseases.

Background

Identification of biomarkers is important for the diagnosis of disease. By using biomarkers with high specificity for certain diseases, patients can be identified without diagnosis by doctors. After diagnosis using biomarkers, it is hoped that fewer patients will require diagnosis by a doctor. This enables doctors to diagnose a limited number of screened patients in more detail. Blood is a useful source of biomarkers. Numerous compounds/proteins in blood have been identified as effective biomarkers that allow the early diagnosis of several diseases (e.g., [1–3]). One disadvantage of this system is that distinct compounds/proteins are required to diagnose individual diseases, because diagnoses are usually based on the observation of unexpected values of compounds/proteins. When following this strategy, new compounds/proteins that increase or decrease in specific diseases should be identified. This system of biomarker identification incurs high costs because of the measurements of each biomarker. Thus, it is difficult to test for many diseases simultaneously because the number of diseases tested is proportional to the cost. The identification of a universal disease biomarker (UDB) that can diagnose multiple diseases simultaneously would be useful and economically beneficial. However, identifying a UDB using the traditional strategy of one compound/protein for one disease is unlikely.

Despite this difficulty, several studies have attempted to identify UDBs. For example, interleukin-8 (IL-8) was thought to be a UDB [4] as it was reported to be a useful biomarker for multiple diseases including urinary bladder cancer, prostatitis, acute pyelonephritis, vesicoureteral reflux, pulmonary infections, osteomyelitis, inflammatory bowel disease, chorioamnionitis, nosocomial bacterial infections, and non-Hodgkin’s lymphoma. Despite the apparent usefulness of IL-8 as a UDB, it has a strong tendency to increase non-specifically because most inflammatory conditions induce its production; it might therefore need to be considered together with other biomarkers. Another example is pHLIP and acidity, which was proposed to be a UDB for cancers [5], although it is limited to cancer diagnosis. Fendos and Engelman successfully and noninvasively labeled tumor tissues using a pH-sensitive biosensor. pHLIP also labeled tumors independent of the type of cancer. Another example of a UDB is FibroTest [6], which was used to diagnose several liver diseases including alcoholic liver disease, hepatitis B virus infection, hepatitis C virus infection, and nonalcoholic fatty liver disease. FibroTest consists of a six-parameter blood test (α2-macroglobulin, haptoglobin, apolipoprotein A1, γ-glutamyl transpeptidase, total bilirubin, and alanine transaminase) combined with the age and gender of the patient. However, these biomarkers lacked either specificity (IL-8 is used in combination with other biomarkers for accurate diagnoses) or universality (pHLIP is used only for cancer diagnosis while FibroTest is only used to diagnose liver diseases). An ideal UDB should be able to discriminate multiple diseases from normal healthy controls. One method to achieve this is the combination of multiple biomarkers, as used for the FibroTest. Although FibroTest uses fixed coefficients to construct a UDB, if varying the coupling constants allows the diagnosis of multiple distinct diseases, biomarkers that consist of multiple individual biomarkers have the potential to be UDBs.

Recently, blood microRNAs (miRNA) have been identified as promising disease biomarkers [7]; combinations of mir-498 clusters are potential biomarkers for pregnancy, although pregnancy is not a disease. Blood miRNAs were also identified as anti-doping biomarkers [8], biomarkers of peripheral arterial disease [9], acute myocardial infarction and underlying coronary artery stenosis [10], and acute graft-versus-host disease [11]. They are also stable biomarkers [12]. Furthermore, although combinatorial circulating biomarkers are considered potential effective biomarkers for various diseases [13–20], combinations for the diagnosis of individual diseases often fluctuate between studies. For example, two recent distinct studies that tried to construct combinatorial blood miRNA biomarkers for the diagnosis of Alzheimer’s disease had no common miRNAs [21, 22]. Even for the diagnosis of an individual disease, there is often no unique combination of blood miRNAs. This suggests that a UDB is unlikely to be constructed from multiple blood miRNAs.

In contrast to these studies, we recently identified a potential UDB consisting of blood miRNAs [23]. Ten to 12 common blood miRNAs could be used to diagnose 13 various diseases from normal controls. Although this demonstrated the potential of blood miRNAs to be used as a UDB, the study used samples taken from only one study with shared normal controls. Thus, further studies are required to provide convincing data. In the current study, we cross-validated the previously proposed UDB [23] of 12 fixed miRNAs by investigating whether miRNAs could diagnose an additional seven distinct diseases using blood miRNAs that were recently reported and were not available when the previous study [23] was performed. The discriminatory ability of a UDB composed of 12 fixed blood miRNAs was competitive compared with that using a conventional method and miRNAs selected by a recently proposed principal component analysis (PCA)-based unsupervised feature selection method [23].

Results and discussion

Universality of UDB

To determine whether previously identified UDBs consisting of blood miRNAs [23] were universal, we evaluated their performance using seven independent data sets targeting seven diseases (see Methods). Although 10 miRNAs were selected for each disease from a total of 13 diseases in the previous study, 12 combined blood miRNAs (hsa-miR-425, hsa-miR-15b, hsa-miR-185, hsa-miR-92a, hsa-miR-140-3p, hsa-miR-320a, hsa-miR-486-5p, hsa-miR-16, hsa-miR-191, hsa-miR-106b, hsa-miR-19b, and hsa-miR-30d) were used to form the UDB in this study. Missing miRNAs in the data sets were excluded from the discrimination.

In Figure 1, the accuracy achieved by PCA-based linear discriminant analysis (LDA, red crosses) and support vector machine (SVM, red x-marks) using UDB is shown (also red boxes in Figure 1(b)). Mean accuracies were 0.791 and 0.815, respectively, and they were consistent with the mean accuracy (0.784) estimated using PCA-based LDA with UDB in a previous study [23] (see Table 1). Values of accuracy together with sensitivity and specificity values are also listed in Table 1. It was observed that performances were independent of the methods and samples, demonstrating the usefulness of the UDB. More detailed performances and their evaluations, i.e., true and false positives and negatives in 2×2 tables together with P-values computed by Fisher’s exact test, odds ratios and the area under the receiver operating characteristic (ROC) curve (AUC), are shown in Additional file 1: Table S2.
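As a rough illustration of the evaluation measures mentioned above, the following Python sketch derives accuracy, sensitivity, specificity, the odds ratio and a two-sided Fisher's exact test P-value from a single 2×2 table. The function names and the example counts are hypothetical and are not taken from Table 1 or Additional file 1. The two-sided P-value here sums the probabilities of all tables that are no more likely than the observed one, which is one common convention and is assumed rather than confirmed by the source.

```python
from math import comb


def confusion_metrics(tp, fn, fp, tn):
    """Accuracy, sensitivity, specificity and odds ratio from a 2x2 table.

    tp/fn: patients predicted as patients/controls,
    fp/tn: controls predicted as patients/controls.
    """
    total = tp + fn + fp + tn
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        # The odds ratio is infinite when fp or fn is zero.
        "odds_ratio": (tp * tn) / (fp * fn) if fp * fn else float("inf"),
    }


def fisher_exact_two_sided(tp, fn, fp, tn):
    """Two-sided Fisher's exact test P-value for a 2x2 table."""
    row1, row2 = tp + fn, fp + tn
    col1 = tp + fp
    total = row1 + row2

    def weight(a):
        # Unnormalized hypergeometric probability of a table whose
        # top-left cell equals a (all margins held fixed).
        return comb(row1, a) * comb(row2, col1 - a)

    observed = weight(tp)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Integer weights avoid floating-point ties in the comparison.
    extreme = sum(weight(a) for a in range(lowest, highest + 1)
                  if weight(a) <= observed)
    return extreme / comb(total, col1)


# Hypothetical counts: 10 patients (8 detected) and 6 controls (5 cleared).
metrics = confusion_metrics(tp=8, fn=2, fp=1, tn=5)
p_value = fisher_exact_two_sided(tp=8, fn=2, fp=1, tn=5)
```

With these made-up counts the accuracy is 13/16 and the odds ratio is 20; the P-value is the sum of the hypergeometric probabilities of the three tables that are at most as likely as the observed one.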

Table 1 Performance of UDB with PCA-based LDA and SVM


Comparison of performances between UDB and lasso

Although Table 1 shows the usefulness of a UDB consisting of blood miRNAs, it is important to determine how effective the UDB is when compared with conventional methods (i.e., non-universal, sample-dependent sets). We performed lasso-based discrimination (see Methods) between healthy controls and patients of each disease. Lasso-based discrimination was used so that performances of feature extraction (FE) between unsupervised FE and lasso could be compared. In addition, there are generally limited numbers of individual miRNAs that exhibit significant differences between normal controls and patients (see below); thus, the usual selection based on significant differences between patients and healthy controls was difficult. The results are shown in Table 2 and Figure 1 (blue diamonds and a blue box in the boxplot). More detailed performances and their evaluations, i.e., true and false positives and negatives in 2×2 tables of lasso-based discrimination together with P-values computed by Fisher’s exact test, odds ratios and AUC, are shown in Additional file 1: Table S3. Although performances achieved by lasso-based discrimination were better than those achieved by PCA-based LDA with UDB (Table 1), those achieved by SVM with UDB were not significantly lower than those of the lasso-based discrimination (three tests were performed, the t-test, the Wilcoxon rank sum test and the Kolmogorov-Smirnov test, and none gave a P-value lower than 0.05). Because this lack of significance was caused by large fluctuations in the performances achieved by SVM with UDB, UDB might still not be as effective as lasso-based discrimination. Nevertheless, the results indicate the possibility that UDB is as effective as standard discrimination using sample-dependent (not universal) features.

Table 2 Performance of lasso-based discrimination


Stability of FE: the condition to get UDB

To understand why we could successfully identify a UDB in the previous study when no one else had identified one, the stabilities of FE were compared between lasso and PCA-based unsupervised FE. PCA-based unsupervised FE was used for the previous UDB discovery [23]. The importance of stability was previously demonstrated by Wehrens et al. [24], who showed that a stable FE improved the performance.

Figure 2 shows the stabilities S (see Methods) of lasso-based discrimination (blue diamonds). Generally, the stabilities were very low and each miRNA was selected as a biomarker in at most half of the trials. Thus, lasso does not have the ability to provide UDBs, because it could not select stable (sample-independent) biomarkers for each disease. One may suppose that the stabilities will improve if miRNAs that exhibit significant differences between healthy controls and patients are identified and selected. However, this is not currently a realistic strategy, since there are insufficient numbers of miRNAs (often <10) that exhibit significant differences between healthy controls and patients (Table 3). For coronary artery disease (CAD) and hepatocellular carcinoma (HCC), no miRNAs were identified that exhibit significant differences between normal controls and patients in the present data sets.

Table 3 The number of miRNAs that exhibit significant differences between normal controls and patients for each disease


However, PCA-based unsupervised FE (black circles in Figure 2) showed significantly larger S values than lasso. In addition, performances were comparable with those achieved by lasso (Table 4, black circles and triangles in Figure 1(a) and black box in Figure 1(b)). More detailed performances and their evaluations, i.e., true and false positives and negatives in 2×2 tables together with P-values by Fisher’s exact test, odds ratios and AUC, are shown in Additional file 1: Table S4.

Table 4 Performance of miRNAs selected by PCA-based FE with PCA-based LDA and SVM


The frequent variation of selected biomarkers between samples has been attributed to differences in data normalization. However, the results shown here indicate that it might instead be caused by the use of unsuitable and unstable FE methods. To obtain a UDB, stable FE methods should be used [23].

The study by Wehrens et al.[24] used PCA-based LDA to maximize the stability of FE, whereas the current study did not require better stability, as this is automatically obtained when using PCA-based unsupervised FE. Thus, stability achieved by PCA-based unsupervised FE is expected to be more robust than feature selections by stability maximization using PCA-based LDA. Moreover, to rank features based on stability, Wehrens et al.[24] performed time-consuming iterative cross-validations that were not required by the PCA-based unsupervised FE. Thus, PCA-based unsupervised FE methodology is less computationally challenging than feature selections by stability maximization using PCA-based LDA.

The successful identification of UDBs [23] was possibly because of stable FE methods, which we suggest are important for developing UDBs, although the stability of FE is often overlooked. To determine more efficient UDBs, searching with efficient and stable FEs is required.

The number of features selected by FE

Previously [23], the number of features selected by PCA-based unsupervised FE was fixed at 10, because data sets analyzed previously were taken from a single study. Previous studies used the same microarray to measure miRNA expression in multiple diseases. In contrast, data sets used in the current study were heterogeneous. They were collected from multiple studies performed by independent research groups. Measurements were not performed by a single microarray but by various methods including qPCR. The sources of samples were also heterogeneous, ranging from whole blood to serum or plasma. Thus, we varied the number of features selected by PCA-based unsupervised FE between diseases (Additional file 2: Figure S1 for two-dimensional embeddings of miRNAs used for FE).

Interestingly, the optimal number of selected features was common between lasso and PCA-based unsupervised FE (Additional file 2: Figure S2). This suggests that the number of miRNAs required to discriminate healthy controls from patients is not dependent on the methods used but on the samples. This is not surprising because many sets of miRNAs discriminate between patients and normal controls if miRNAs are not independent of each other. In addition, the stability of FE is important, otherwise selected features will vary between trials.

This study did not identify a UDB from the data sets we used, but rather validated the usefulness of the UDB identified in a previous study. To identify UDBs, sample preparation and measurements must be standardized to minimize the variance between samples. This should be possible because the target is always blood, independent of the disease.

Toward a mechanism-based biomarker

The UDB in this study was clearly decided by meta-analysis, and thus was not mechanism-based. However, if it also functions as a mechanism-based biomarker, this would be more plausible. To determine the possibility of using a UDB as a mechanism-based biomarker, we employed DIANA-mirpath [25]. Table 5 lists the 27 significant KEGG pathways reported by DIANA-mirpath (see Methods). Among the 27 KEGG pathways, nine were cancer pathways (bold font). There were also five pathways (bold italic) that were disease pathways other than cancers. In addition, three pathways (italicized) were cancer-related pathways and four pathways (asterisked) were parts of “Pathways in cancer” (Figure 3). Thus, there were only five pathways that were not directly related to diseases. Therefore, the miRNAs included in the UDB in this study were extensively involved in, and potentially contributed to, various disease pathways. Further experimental investigations of the expression of miRNA target genes will be required to demonstrate how UDB is involved in disease mechanisms.

Table 5 KEGG pathway analysis of 12 miRNAs included in the UDB using DIANA-mirpath [25]


Heterogeneity of blood sources

In contrast to previous research [23] where only serum samples were used, the blood sources in this study were heterogeneous, ranging from whole blood [21] to serum [26] or plasma [27] (a full list of sources is shown in Additional file 1: Table S1). One may wonder why UDB works well despite this heterogeneity of sources. However, in a previous study [23], we selected the 12 miRNAs included in UDB not based on inference accuracy but rather on stability. That study only checked sample independency, but it is likely that sample independency is also related to source independency, since sample dependency is often as large as source dependency. miRNA expression is dependent upon both the source and patients’ age, gender, and body mass index. In addition, UDB was independent of measurement methods, i.e., NGS, microarray or qPCR (a full list of measurement methods is shown in Additional file 1: Table S1). If UDB is independent of patient properties and measurement methods, it is not surprising that UDB is also independent of sources, since all sources were taken from blood. Source independency of UDB should be investigated in more detail in the future.

Usefulness of UDB as practical clinical tools

One may wonder if the expected accuracy (0.8) of UDB is useful or not. However, UDB can diagnose multiple diseases simultaneously. Therefore, by measuring 12 miRNAs in blood, over 20 diseases (14 diseases in the previous study [23] and seven diseases in this study) can be diagnosed. Thus, UDB can be used for pre-screening. For example, patients are diagnosed by UDB for the 20 diseases. Then, if patients are positive for one disease, further diagnosis using more precise biomarkers can confirm the diagnosis. This will be more effective and non-invasive than performing 20 independent diagnoses using disease-specific biomarkers.

Conclusion

In this study, we demonstrated that a predefined UDB [23] could discriminate seven diseases from healthy controls. Since the diseases and samples were not included in our previous study [23] that defined the UDB, this study suggests the robustness of UDB for disease diagnosis. The performance achieved by UDB was comparable with that of lasso, the standard sample-dependent FE. Because PCA-based unsupervised FE, used for UDB identification in a previous study, outperformed lasso in terms of stability, the use of stable FE will be a key factor for discovering UDBs.

Methods

Blood miRNA expression profiles

Seven blood miRNA expressions used in this study were from the Gene Expression Omnibus (GEO): Alzheimer’s disease (AD) (GSE46579) [21], carcinoma (GSE37472) [26], CAD (GSE49823), nasopharyngeal carcinoma (NPC) (GSE43329), HCC (GSE50013) [27], breast cancer (BC) (GSE41922) [28] and acute myeloid leukemia (AML) (GSE49665) [29]. Detailed information is shown in Additional file 1: Table S1.

Principal component analysis-based unsupervised feature extraction

To select blood miRNAs for the diagnosis of seven diseases, blood miRNAs were selected using the recently proposed PCA-based unsupervised FE as previously described [23, 30]. Briefly, suppose X is the matrix such that x_ij represents the amount of the i th miRNA expression in the j th sample. PCA is regarded as the eigenvalue problem

\frac{1}{N} X^{T} X u_{k} = \lambda_{k} u_{k}, (k = 1, \dots, M)

where N and M are the total number of miRNAs and samples, respectively. Here M is assumed to be less than N as is usual. λ_k and u_k represent the k th eigenvalue and eigenvector, respectively.

x_{ik} \equiv \sum_{j} u_{kj} x_{ij}

gives the principal component score (PCS) of i th miRNA. Using the obtained x_ik,k=1,…,D(<M), miRNAs were determined to be embedded into low D dimensional space.

Multiplying X on both sides, the following is obtained:

\frac{1}{N} (X X^{T}) (X u_{k}) = \lambda_{k} (X u_{k}), (k = 1, \dots, M)

where v_k=X u_k can be regarded as an eigenvector. Then,

x_{kj} \equiv \sum_{i} v_{ki} x_{ij}

gives the PCS of the j th sample. Using the obtained x_kj,k=1,…,D(<M), samples were regarded to be embedded into low D dimensional space.

PCA-based unsupervised FE selects outlier miRNAs in low K(<M) dimensional embedding space,

r_{Ki} > Δ

where

r_{Ki}^{2} \equiv \sum_{k = 1}^{K} x_{ik}^{2}

Typically K is taken to be two. Since these outliers could have a major contribution to the u_k’s by definition, if there are a limited number of well-defined outliers, the exclusion of miRNAs other than outliers does not alter the u_k’s. Since v_k is a linear transformation of u_k as shown above, the exclusion of miRNAs other than outliers does not alter v_k either. Thus, retaining only outlier miRNAs may also preserve the lower dimensional embeddings of samples that are important for disease diagnosis, e.g., discrimination between patients and healthy controls. Although this is only hypothetical, it explains why PCA-based unsupervised FE is expected to function well. Currently, there are no well-defined criteria for the selection of Δ. In this study, Δ was chosen so as to include a sufficient number (the majority) of outliers, which were identified by visual inspection of the two-dimensional embedding of miRNAs. A singular value decomposition-based interpretation is also available as Additional file 3: Text S1.
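The selection rule r_Ki > Δ can be sketched in dependency-free Python. This is only an illustration under stated assumptions: the eigenvectors u_k are approximated by power iteration with deflation (swapped in here for a library eigen-solver), the matrix is used without centering, exactly as written in the equations above, and the function names, the toy matrix and the threshold value are hypothetical.

```python
import math
import random


def top_eigenvectors(gram, k, iterations=500, seed=0):
    """Approximate the top-k eigenvectors of a symmetric positive
    semi-definite matrix by power iteration with deflation."""
    m = len(gram)
    work = [row[:] for row in gram]
    rng = random.Random(seed)
    vectors = []
    for _ in range(k):
        v = [rng.uniform(-1.0, 1.0) for _ in range(m)]
        for _ in range(iterations):
            w = [sum(work[a][b] * v[b] for b in range(m)) for a in range(m)]
            norm = math.sqrt(sum(value * value for value in w))
            if norm == 0.0:
                break
            v = [value / norm for value in w]
        # Rayleigh quotient gives the eigenvalue estimate for unit-norm v.
        eigenvalue = sum(
            v[a] * sum(work[a][b] * v[b] for b in range(m)) for a in range(m)
        )
        vectors.append(v)
        # Deflation: remove the component just found before the next pass.
        for a in range(m):
            for b in range(m):
                work[a][b] -= eigenvalue * v[a] * v[b]
    return vectors


def select_outlier_features(x, delta, k=2):
    """x[i][j] is the expression of feature (miRNA) i in sample j.
    Returns indices with r_Ki > delta and the list of all r_Ki."""
    n, m = len(x), len(x[0])
    # (1/N) X^T X, an M x M matrix over samples.
    gram = [
        [sum(x[i][a] * x[i][b] for i in range(n)) / n for b in range(m)]
        for a in range(m)
    ]
    us = top_eigenvectors(gram, k)
    radii = []
    for row in x:
        scores = [sum(u[j] * row[j] for j in range(m)) for u in us]
        radii.append(math.sqrt(sum(s * s for s in scores)))
    selected = [i for i, r in enumerate(radii) if r > delta]
    return selected, radii


# Toy matrix (hypothetical): two features with large values, four near zero.
toy = [
    [10.0, 10.0, 0.0, 0.0],
    [0.0, 0.0, 6.0, -6.0],
    [0.1, 0.0, 0.1, 0.0],
    [0.0, 0.1, 0.0, 0.1],
    [0.1, 0.1, 0.0, 0.0],
    [0.0, 0.0, 0.1, 0.1],
]
selected, radii = select_outlier_features(toy, delta=3.0)
```

In this toy case the two large-valued rows dominate the first two eigenvectors, so only they lie outside the radius Δ. In the study itself Δ was chosen by visual inspection of the embedding, which this sketch does not attempt to reproduce.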

Discriminatory analyses between patients and healthy controls with cross-validations

Three discriminant analyses were performed in this study as follows. The first, PCA-based LDA, a discriminant counterpart of partial least squares (PLS), is defined as discrimination using the first k PCSs (i.e., from the first to the k th PCS). First, PCA was applied to all samples. Then, PCA-based LDA was trained using only the PCSs of samples in the training set. Since the learning process includes unlabeled information of the test set, it is semi-supervised learning. Samples in the test set were predicted using the trained PCA-based LDA. LDA was performed using the lda function in R [31] and the prediction of samples in the test set was performed by the predict.lda function in R. The optimal k was determined using cross-validation. The second analysis used an SVM trained with training set samples using the svm function included in the e1071 R package with default settings (e.g., with the Gaussian kernel), other than the class.weights argument, which was set to attribute equal weights to the sets of normal controls and patients when the number of samples in normal controls differed from that of patients. Then, samples in the test set were predicted using the predict.svm function in R. Third, lasso was used for discrimination. Lasso was performed using the lars function included in the lars R package, attributing 1 and 2 to healthy controls and patients, respectively, and using the setting type=‘lasso’. Then, samples in the test set were predicted using the predict.lars function in R for s=n/100, n=0,…,100 with mode=‘fraction’. Samples with predicted values larger (less) than 1.5 were regarded to be patients (healthy controls). The optimal s was selected by cross-validation. For all cases, leave-one-out cross-validation (LOOCV) was employed.
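The LOOCV loop and the 1.5 cut-off on the 1/2 class coding can be sketched as follows. This is a minimal Python illustration, not the R workflow used in the study: ordinary least squares on a single feature is swapped in as a stand-in for lasso, and the function names and toy values are hypothetical.

```python
def fit_line(values, targets):
    """Ordinary least-squares fit of targets on one feature
    (a stand-in for the lasso regression described in the text)."""
    n = len(values)
    mean_x = sum(values) / n
    mean_y = sum(targets) / n
    sxx = sum((x - mean_x) ** 2 for x in values)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(values, targets))
    slope = sxy / sxx if sxx else 0.0
    return slope, mean_y - slope * mean_x


def predict_line(model, value):
    slope, intercept = model
    return slope * value + intercept


def loocv_accuracy(values, labels, fit, predict, cutoff=1.5):
    """Leave-one-out cross-validation with labels coded 1 (healthy
    control) and 2 (patient); predictions above the cutoff count as 2."""
    correct = 0
    for held_out in range(len(values)):
        train_x = values[:held_out] + values[held_out + 1:]
        train_y = labels[:held_out] + labels[held_out + 1:]
        model = fit(train_x, train_y)
        predicted = 2 if predict(model, values[held_out]) > cutoff else 1
        correct += int(predicted == labels[held_out])
    return correct / len(values)


# Toy data (hypothetical): one feature, four controls and four patients.
toy_values = [0.1, 0.2, 0.3, 0.2, 1.1, 1.2, 1.0, 1.3]
toy_labels = [1, 1, 1, 1, 2, 2, 2, 2]
accuracy = loocv_accuracy(toy_values, toy_labels, fit_line, predict_line)
```

Passing the fit and predict steps as arguments keeps the cross-validation loop independent of the classifier, so the same loop could wrap an LDA, SVM or lasso implementation.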

Data normalization

Since this study is a meta-analysis using data sets collected from various independent studies employing distinct measuring methods, we normalized data sets individually by distinct methods (Table 6). Data from multiple studies were treated identically and compared. In addition, some miRNAs with abnormally large values were excluded from the analysis. Excluded miRNAs were hsa-miR-486-5p (AD), hsa-miR-223 and hsa-miR-338 (CAD), and hsa-miR-451 (NPC).

Table 6 Details of data normalization


Stability test

During LOOCV, the features (miRNAs) selected in each FE run were listed. For lasso, miRNAs with non-zero coefficients β were listed by setting type=‘coefficients’ in the predict.lars function with the estimated optimal s. Because of LOOCV, FE was performed M (= the number of samples) times. Then stability was defined as

S \equiv \frac{1}{\hat{N}} \sum_{i \in \{i \mid F_{i} \neq 0\}} \frac{F_{i}}{M}

where F_i is the number of times that the i th miRNA was selected within the M FE runs. The summation was performed over miRNAs with non-zero F_i (i.e., selected at least once in the FE runs) and $\hat{N}$ is the number of miRNAs included in the summation. A larger $S$ ($\frac{1}{M} \leq S \leq 1$) indicates a more stable FE.
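The stability measure S can be computed directly from the lists of features selected in each LOOCV run. The following Python sketch assumes the selections are available as one collection of names per run; the function name and the feature names are hypothetical placeholders.

```python
from collections import Counter


def stability(selected_per_run):
    """Stability S: the mean, over features selected at least once,
    of the fraction of runs in which each feature was selected."""
    m = len(selected_per_run)
    # F_i: number of runs in which feature i was selected.
    counts = Counter(
        feature for run in selected_per_run for feature in set(run)
    )
    if not counts:
        return 0.0
    return sum(f_i / m for f_i in counts.values()) / len(counts)


# Four hypothetical LOOCV runs: "A" is always selected, "B" twice, "C" once.
runs = [{"A", "B"}, {"A", "B"}, {"A", "C"}, {"A"}]
s_value = stability(runs)  # (4/4 + 2/4 + 1/4) / 3 = 7/12
```

When every run selects the same features, each fraction is 1 and S reaches its maximum of 1; when every feature is selected in only one run, S falls to its minimum of 1/M.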

P-values computation for significant difference between healthy controls and patients

P-values for significant differences between healthy controls and patients of each disease were computed using a t-test for each miRNA. The computed P-values were adjusted by the Benjamini-Hochberg (BH) criterion [32], and miRNAs with adjusted P-values less than 0.05 were regarded to have significantly different expression between normal controls and patients.
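The BH adjustment step can be sketched in Python as below. The raw P-values are taken as given (the per-miRNA t-test is not reproduced here), and the function name and example values are hypothetical.

```python
def bh_adjust(p_values):
    """Benjamini-Hochberg adjusted P-values, returned in input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, p_values[index] * m / rank)
        adjusted[index] = running_min
    return adjusted


# Hypothetical raw P-values for four miRNAs.
raw = [0.001, 0.2, 0.03, 0.04]
adjusted = bh_adjust(raw)
significant = [i for i, q in enumerate(adjusted) if q < 0.05]
```

In this made-up example three raw P-values are below 0.05, but only the first miRNA remains significant after adjustment, which illustrates why few miRNAs may pass the criterion.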

KEGG pathway analysis of UDB using DIANA-mirpath

DIANA-mirpath [25] was employed to investigate KEGG pathways enriched by miRNA target genes. The 12 miRNAs included in the UDB were uploaded to DIANA-mirpath with the following settings: “Species” was “Human”, “FDR” correction was “yes”, “P-value threshold” was 0.05, and “Select the way to merge results” was “pathway union” (a direct link to DIANA-mirpath and the full list of KEGG pathways are shown in Additional file 3: Text S2 and Additional file 1: Table S5).

References

Hanash SM, Baik CS, Kallioniemi O:Emerging molecular biomarkers–blood-based strategies to detect and monitor cancer. Nat Rev Clin Oncol. 2011, 8 (3): 142-150. 10.1038/nrclinonc.2010.220.
Jellinger KA, Janetzky B, Attems J, Kienzl E:Biomarkers for early diagnosis of Alzheimer disease ’ALZheimer ASsociated gene’–a new blood biomarker?. J Cell Mol Med. 2008, 12 (4): 1094-1117. 10.1111/j.1582-4934.2008.00313.x.
Petricoin EF, Belluco C, Araujo RP, Liotta LA:The blood peptidome: a higher dimension of information content for cancer biomarker discovery. Nat Rev Cancer. 2006, 6 (12): 961-967. 10.1038/nrc2011.
Shahzad A, Knapp M, Lang I, Kohler G:Interleukin 8 (IL-8) - a universal biomarker?. Int Arch Med. 2010, 3: 11-10.1186/1755-7682-3-11.
Fendos J, Engelman D:pHLIP and acidity as a universal biomarker for cancer. Yale J Biol Med. 2012, 85 (1): 29-35.
Morra R, Munteanu M, Imbert-Bismut F, Messous D, Ratziu V, Poynard T:FibroMAX: towards a new universal biomarker of liver disease?. Expert Rev Mol Diag. 2007, 7 (5): 481-490. 10.1586/14737159.7.5.481.
Williams Z, Ben-Dov IZ, Elias R, Mihailovic A, Brown M, Rosenwaks Z, Tuschl T:Comprehensive profiling of circulating microRNA via small RNA sequencing of cDNA libraries reveals biomarker potential and limitations. Proc Natl Acad Sci USA. 2013, 110 (11): 4255-4260. 10.1073/pnas.1214046110.
Leuenberger N, Robinson N, Saugy M:Circulating miRNAs: a new generation of anti-doping biomarkers. Anal Bioanal Chem. 2013, 405 (309): 617-623.
Sluijter JP, Doevendans PA:Circulating microRNA profiles for detection of peripheral arterial disease: small new biomarkers for cardiovascular disease. Circ Cardiovasc Genet. 2013, 6 (5): 441-443. 10.1161/CIRCGENETICS.113.000344.
Wang F, Long G, Zhao C, Li H, Chaugai S, Wang Y, Chen C, Wang D. W:Plasma microRNA-133a is a new marker for both acute myocardial infarction and underlying coronary artery stenosis. J Transl Med. 2013, 11 (1): 222-10.1186/1479-5876-11-222.
Xiao B, Wang Y, Li W, Baker M, Guo J, Corbet K, Tsalik EL, Li QJ, Palmer SM, Woods CW, Li Z, Chao NJ, He YW:Plasma microRNA signature as a non-invasive biomarker for acute graft-versus-host disease. Blood. 2013, 122 (19): 3365-3375. 10.1182/blood-2013-06-510586.
Koberle V, Pleli T, Schmithals C, Augusto Alonso E, Haupenthal J, Bonig H, Peveling-Oberhag J, Biondi RM, Zeuzem S, Kronenberger B, Waidmann O, Piiper A:Differential stability of cell-free circulating microRNAs implications for their utilization as biomarkers. PLoS ONE. 2013, 8 (9): 75184-10.1371/journal.pone.0075184.
Sheinerman KS, Umansky SR:Circulating cell-free microRNA as biomarkers for screening, diagnosis and monitoring of neurodegenerative diseases and other neurologic pathologies. Front Cell Neurosci. 2013, 7: 150-
Recchioni R, Marcheselli F, Olivieri F, Ricci S, Procopio AD, Antonicelli R:Conventional and novel diagnostic biomarkers of acute myocardial infarction a promising role for circulating microRNAs. Biomarkers. 2013, 18 (7): 547-558. 10.3109/1354750X.2013.833294.
Dorval V, Nelson PT, Hebert SS:Circulating microRNAs in Alzheimer’s disease: the search for novel biomarkers. Front Mol Neurosci. 2013, 6: 24-
Deddens JC, Colijn JM, Oerlemans MI, Pasterkamp G, Chamuleau SA, Doevendans PA, Sluijter JP:Circulating microRNAs as novel biomarkers for the early diagnosis of acute coronary syndrome. J Cardiovasc Transl Res. 2013, 6 (6): 884-898. 10.1007/s12265-013-9493-9.
Ramshankar V, Krishnamurthy A:Lung cancer detection by screening - presenting circulating miRNAs as a promising next generation biomarker breakthrough. Asian Pac J Cancer Prev. 2013, 14 (4): 2167-2172. 10.7314/APJCP.2013.14.4.2167.
Dart DA, Waxman J, Bevan CL, Sita-Lumsden A:Circulating microRNAs as potential new biomarkers for prostate cancer. Br J Cancer. 2013, 108 (10): 1925-1930. 10.1038/bjc.2013.192.
Grasedieck S, Sorrentino A, Langer C, Buske C, Dohner H, Mertens D, Kuchenbauer F:Circulating microRNAs in hematological diseases: principles, challenges, and perspectives. Blood. 2013, 121 (25): 4977-4984. 10.1182/blood-2013-01-480079.
Redova M, Sana J, Slaby O:Circulating miRNAs as new blood-based biomarkers for solid cancers. Future Oncol. 2013, 9 (3): 387-402. 10.2217/fon.12.192.
Leidinger P, Backes C, Deutscher S, Schmitt K, Mueller SC, Frese K, Haas J, Ruprecht K, Paul F, Stahler C, Lang CJ, Meder B, Bartfai T, Meese E, Keller A:A blood based 12-miRNA signature of Alzheimer disease patients. Genome Biol. 2013, 14 (7): 78-10.1186/gb-2013-14-7-r78.
Kumar P, Dezso Z, MacKenzie C, Oestreicher J, Agoulnik S, Byrne M, Bernier F, Yanagimachi M, Aoshima K, Oda Y:Circulating miRNA biomarkers for Alzheimer’s disease. PLoS ONE. 2013, 8 (7): 69807-10.1371/journal.pone.0069807.
Taguchi YH, Murakami Y:Principal component analysis based feature extraction approach to identify circulating microRNA biomarkers. PLoS ONE. 2013, 8 (6): 66714-10.1371/journal.pone.0066714.
Wehrens R, Franceschi P, Vrhovsek U, Mattivi F: Stability-based biomarker selection. Anal Chim Acta. 2011, 705 (1-2): 15-23. 10.1016/j.aca.2011.01.039.
Vlachos IS, Kostoulas N, Vergoulis T, Georgakilas G, Reczko M, Maragkakis M, Paraskevopoulou MD, Prionidis K, Dalamagas T, Hatzigeorgiou AG: DIANA miRPath v.2.0: investigating the combinatorial effect of microRNAs in pathways. Nucleic Acids Res. 2012, 40 (Web Server issue): W498-W504.
Maclellan SA, Lawson J, Baik J, Guillaud M, Poh CF, Garnis C: Differential expression of miRNAs in the serum of patients with high-risk oral lesions. Cancer Med. 2012, 1 (2): 268-274. 10.1002/cam4.17.
Shen J, Wang A, Wang Q, Gurvich I, Siegel AB, Remotti H, Santella RM: Exploration of genome-wide circulating microRNA in hepatocellular carcinoma (HCC): MiR-483-5p as a potential biomarker. Cancer Epidemiol Biomarkers Prev. 2013, 22 (12): 2364-2373. 10.1158/1055-9965.EPI-13-0237.
Chan M, Liaw CS, Ji SM, Tan HH, Wong CY, Thike AA, Tan PH, Ho GH, Lee AS: Identification of circulating microRNA signatures for breast cancer detection. Clin Cancer Res. 2013, 19 (16): 4477-4487. 10.1158/1078-0432.CCR-12-3401.
Rommer A, Steinleitner K, Hackl H, Schneckenleithner C, Engelmann M, Scheideler M, Vlatkovic I, Kralovics R, Cerny-Reiterer S, Valent P, Sill H, Wieser R: Overexpression of primary microRNA 221/222 in acute myeloid leukemia. BMC Cancer. 2013, 13: 364. 10.1186/1471-2407-13-364.
Murakami Y, Toyoda H, Tanahashi T, Tanaka J, Kumada T, Yoshioka Y, Kosaka N, Ochiya T, Taguchi YH: Comprehensive miRNA expression analysis in peripheral blood can diagnose liver disease. PLoS ONE. 2012, 7 (10): e48366. 10.1371/journal.pone.0048366.
R Core Team: R: A Language and Environment for Statistical Computing. 2013, Vienna, Austria: R Foundation for Statistical Computing, http://www.R-project.org/
Benjamini Y, Hochberg Y: Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B (Methodological). 1995, 57 (1): 289-300.

Acknowledgments

This research was supported by KAKENHI grants 23300357 and 26120528.

Author information

Authors and Affiliations

Department of Physics, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, 112-8551, Tokyo, Japan
Y-h Taguchi
Department of Hepatology, Osaka City University, Graduate School of Medicine, 1-4-3 Asahimachi, Abeno-ku, 545-8585, Osaka, Japan
Yoshiki Murakami

Corresponding author

Correspondence to Y-h Taguchi.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

YHT and YM planned all the projects. YHT performed analyses and wrote the paper. All authors read and approved the final manuscript.

Electronic supplementary material

Additional file 1: Supporting Tables. (XLSX 16 KB)

Additional file 2: Supporting Figures. (PDF 198 KB)

Additional file 3: Supporting Texts. (ZIP 17 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Taguchi, Yh., Murakami, Y. Universal disease biomarker: can a fixed set of blood microRNAs diagnose multiple diseases? BMC Res Notes 7, 581 (2014). https://doi.org/10.1186/1756-0500-7-581

Received: 05 December 2013
Accepted: 14 August 2014
Published: 30 August 2014
DOI: https://doi.org/10.1186/1756-0500-7-581
