- Data Note
RCDB: Renal Cancer Gene Database
BMC Research Notesvolume 5, Article number: 246 (2012)
Renal cell carcinoma or RCC is one of the common and most lethal urological cancers, with 40% of the patients succumbing to death because of metastatic progression of the disease. Treatment of metastatic RCC remains highly challenging because of its resistance to chemotherapy as well as radiotherapy, besides surgical resection. Whereas RCC comprises tumors with differing histological types, clear cell RCC remains the most common. A major problem in the clinical management of patients presenting with localized ccRCC is the inability to determine tumor aggressiveness and accurately predict the risk of metastasis following surgery. As a measure to improve the diagnosis and prognosis of RCC, researchers have identified several molecular markers through a number of techniques. However the wealth of information available is scattered in literature and not easily amenable to data-mining. To reduce this gap, this work describes a comprehensive repository called Renal Cancer Gene Database, as an integrated gateway to study renal cancer related data.
Renal Cancer Gene Database is a manually curated compendium of 240 protein-coding and 269 miRNA genes contributing to the etiology and pathogenesis of various forms of renal cell carcinomas. The protein coding genes have been classified according to the kind of gene alteration observed in RCC. RCDB also includes the miRNAsdysregulated in RCC, along with the corresponding information regarding the type of RCC and/or metastatic or prognostic significance. While some of the miRNA genes showed an association with other types of cancers few were unique to RCC. Users can query the database using keywords, category and chromosomal location of the genes. The knowledgebase can be freely accessed via a user-friendly web interface at http://www.juit.ac.in/attachments/jsr/rcdb/homenew.html.
It is hoped that this database would serve as a useful complement to the existing public resources and as a good starting point for researchers and physicians interested in RCC genetics.
Renal cell carcinoma (RCC) represents a heterogeneous group of tumors differing in genetic background, responses to surgical and medical therapy and prognoses. It accounts for 3% of adult malignancy and results in over 100000 deaths worldwide annually . It is the one of the leading causes of cancer deaths in Western countries with steadily escalating incidence over the last few decades . RCC is classified based on morphological and genetic differences. This classification distinguishes metanephric adenoma oncocytoma and papillary adenoma as benign tumors from the clear cell (ccRCC), papillary/chromophilic, chromophobic (chRCC) and collecting duct RCC. This classification is important because of its prognostic implications. ccRCC is the most common and accounts for 70% of RCCs. RCC is diagnosed through imaging studies including CT and ultrasound, but kidney biopsy is an invasive technique that might result in complications and would not provide accurate diagnosis in certain situations. For early presentations, surgical extirpation through nephrectomy provides an effective treatment, but patients usually present at advanced stages, leading to poor outcomes. Even for patients without metastatic spread who undergo nephrectomy, metastatic recurrence is frequent. Apart from surgery, RCC is resistant to chemotherapy and radiotherapy. Cytokine therapy, which is reserved for patients with advanced disease, can produce partial responses in 10%–15% and durable remissions in 5% of the patients. The therapeutic approach to RCC is determined by the probability of cure, which is related directly to the stage or degree of tumor dissemination. An accurate assessment of the individual risk of disease progression and mortality after treatment is crucial to counsel patients and plan individualized surveillance protocols. Multiple studies have investigated the deregulation of genes in renal carcinogenesis at the genomic, transcriptomic as well as proteomic levels using a suite of molecular profiling techniques . These include cytogenetic studies , gene expression analyses through tissue microarrays [5, 6], serum proteomics , genomic resequencing , and microRNA profiling  and have yielded useful insights into RCC biology and clinical presentation, and have led to a rich understanding of the heterogeneity of this disease which greatly influences prognostic decisions. Despite the voluminous data available on RCC, the information is rather sporadic and scattered in literature.
In the last few years, a number of databases have emerged with a central focus on a particular cancer type as exemplified by Lung Cancer Database , Oral Cancer Database , Breast Cancer Gene Database , Cervical Cancer Database  etc., however there is no report of any such database for RCC. This work describes the development of the Renal Cancer Gene Database (RCDB) that catalogs the protein-coding and miRNA genes known to be involved in renal carcinogenesis as evidenced by biomedical literature. Due to its specific focus on RCC, unlike dbDEMC  and miR2Disease , it provides a far broader coverage of the miRNAsdysregulated in RCC. It incorporates information regarding the relevance of miRNAs to molecular classification of renal tumors (neoplasms) based on tumor type, metastatic status or prognosis group. This provides an additional advantage over other databases like miR2disease. Many of the protein-coding and miRNA genes in RCDB are useful prognostic and diagnostic markers and are therefore clinically relevant. These may also serve as therapeutic targets.
Construction and content
RCDB contains information on RCC-implicated genes compiled from research articles indexed in PubMed. The PubMed database was queried with different keywords like renal cell carcinoma, renal cancer or tumor etc. and the articles retrieved were manually scrutinized to winnow the genes affecting the etiology of RCC. The final lists of 240 protein-coding and 269 miRNA genes were identified in this way and used to populate the database. The former were grouped into following six categories (Table 1) based on the kind of gene alteration observed in RCC: 1) (silencing/downregulation through) Methylation, 2) Overexpression, 3) Downregulation, 4) Mutation and 5) Translocation and 6) Unclassified. The latter (miRNAs) were categorized according to their differential expression in the different types of RCC. This classification scheme (Table 2) includes differential expression in: 1) chromocytoma vs oncocytoma 2) ccRCC vs papillary RCC 3) Poor vs Good prognosis 4) ccRCC vs normal kidney 5) chRCC vs normal kidney 6) ccRCC vs chRCC 7) Metastatic vs Non-metastatic RCC 8) Primary vs Late metastasis. There exist few overlaps of miRNAs within these categories. RCDB is implemented as a MySQL database and the web-interface built in PHP.
Utility and discussion
The web interface query form allows users to query the protein coding genes from database using keyword, the class and the chromosome number (Figure 1a). This retrieves a list of genes (Figure 1b) where each gene entry is further linked (Figure 1c) to its specific details comprising its gene, nucleotide and protein accession numbers, its chromosomal location as well as its involvement in RCC and the PubMed records corroborating the same. The miRNAs can also be browsed in a similar way. The miRNA entries are linked to miRBase  wherever available. The ViroBLAST  tool searches a user-defined query sequence against the sequences available in the database. This offers the additional advantage of parsing the results according to E-value or score chosen by the user.
RCDB provides a comprehensive compilation of information obtained from published RCC research, complemented with the information from public databases like Swissprot, Refseq etc. It would allow the users in performing comparative studies, e.g. to deduce the genes that are shared with other cancers as well as the ones which are unique to RCC. This analysis was performed for miRNA genes in RCDB by surveying the literature for the involvement of these miRNAs in different cancers (Additional file 1). While most of the miRNAs were found to be reported in other types of malignancies, few were unique to RCC. The latter included miR455, miR219, miR509, miR627, miR648, miR510, miR379, miR136, miR376b, miR154, miR551b, miR514, miR383, miR453, miR582, miR450, miR425, miR365-1 etc.
RCDB has been developed as an integrated information source to assist the research efforts of scientists and clinicians working on renal carcinoma. Besides providing a panoramic overview of RCC, it facilitates thorough exposition of each gene by providing hyperlinks to relevant PubMed records. In future, RCDB would be updated and additional data incorporated. It is anticipated that RCDB would serve as a valuable resource to the scientific community.
Availability and requirements
Project home page: http://www.juit.ac.in/attachments/jsr/rcdb/homenew.html.
Danilin S, Sourbier C, Thomas L, Lindner Vr, Rothhut S, Dormoy Vr, Helwig J-J, Jacqmin D, Lang H, Massfelder T: Role of the RNA-binding protein HuR in human renal cell carcinoma. Carcinogenesis. 2010, 31: 1018-1026. 10.1093/carcin/bgq052.
Tun HW, Marlow LA, von Roemeling CA, Cooper SJ, Kreinest P, Wu K, Luxon BA, Sinha M, Anastasiadis PZ, Copland JA: Pathway Signature and Cellular Differentiation in Clear Cell Renal Cell Carcinoma. PLoS One. 2010, 5: e10696-10.1371/journal.pone.0010696.
Arsanious A, Bjarnason G, Yousef G: From bench to bedside: current and future applications of molecular profiling in renal cell carcinoma. Mol Cancer. 2009, 8: 20-10.1186/1476-4598-8-20.
Zhang Z, Wondergem B, Dykema K: A Comprehensive Study of Progressive Cytogenetic Alterations in Clear Cell Renal Cell Carcinoma and a New Model for ccRCC Tumorigenesis and Progression. Advances in Bioinformatics. 2010
Rogers CG, Tan M-H, Teh BT: Gene expression profiling of renal cell carcinoma and clinical implications. Urology. 2005, 65: 231-237. 10.1016/j.urology.2004.08.038.
Tavares TS, Nanus D, Yang XJ, Gudas LJ: Gene microarray analysis of human renal cell carcinoma: The effects of HDAC inhibition and retinoid treatment. Cancer Biol Ther. 2008, 7: 1607-1618. 10.4161/cbt.7.10.6584.
Xu G, Xiang CQ, Lu Y, Kang XN, Liao P, Ding Q, Zhang YF: Application of SELDI-TOF-MS to identify serum biomarkers for renal cell carcinoma. Cancer Lett. 2009, 282: 205-213. 10.1016/j.canlet.2009.03.017.
Dalgliesh GL, Furge K, Greenman C, Chen L, Bignell G, Butler A, Davies H, Edkins S, Hardy C, Latimer C, et al: Systematic sequencing of renal carcinoma reveals inactivation of histone modifying genes. Nature. 2010, 463: 360-363. 10.1038/nature08672.
Weng L, Wu X, Gao H, Mu B, Li X, Wang J-H, Guo C, Jin JM, Chen Z, Covarrubias M, et al: MicroRNA profiling of clear cell renal cell carcinoma by whole-genome small RNA deep sequencing of paired frozen and formalin-fixed, paraffin-embedded tissue specimens. J Pathol. 2010, 222: 41-51.
Wang L, Xiong Y, Sun Y, Fang Z, Li L, Ji H, Shi T, HLung DB: an integrated database of human lung cancer research. Nucleic Acids Res. 2009, 38 (suppl 1): D665-D669.
Levine AE, Steffen DL: OrCGDB: a database of genes involved in oral cancer. Nucleic Acids Res. 2001, 29: 300-302. 10.1093/nar/29.1.300.
Baasiri R, Glasser S, Steffen D, Wheeler D: The Breast Cancer Gene Database: a collaborative information resource. Oncogene. 1999, 18: 7958-7965.
Agarwal SM, Raghav D, Singh H, Raghava GPS: CCDB: a curated database of genes involved in cervix cancer. Nucleic Acids Res. 2010, 39 ((suppl 1)): D975-D979.
Yang Z, Ren F, Liu C, He S, Sun G, Gao Q, Yao L, Zhang Y, Miao R, Cao Y, et al: dbDEMC: a database of differentially expressed miRNAs in human cancers. BMC Genomics. 2010, 11: S5-
Jiang Q, Wang Y, Hao Y, Juan L, Teng M, Zhang X, Li M, Wang G, Liu Y: miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res. 2009, 37: D98-D104. 10.1093/nar/gkn714.
Griffiths-Jones S, Grocock RJ, van Dongen S, Bateman A, Enright AJ: miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2005, 34: D140-D144.
Deng W, Nickle DC, Learn GH, Maust B, Mullins JI: ViroBLAST: a stand-alone BLAST web server for flexible queries of multiple databases and user's datasets. Bioinformatics. 2007, 23: 2334-2336. 10.1093/bioinformatics/btm331.
The author declares that she has no competing interests.
JR conceptualized the study, developed the database and wrote the manuscript.