- Research note
- Open Access
Transcriptional network analysis on brains reveals a potential regulatory role of PPP1R3F in autism spectrum disorders
BMC Research Notesvolume 11, Article number: 489 (2018)
This study aims at identifying master regulators of transcriptional networks in autism spectrum disorders (ASDs).
With two sets of independent RNA-Seq data generated on cerebellum from patients with ASDs and control subjects (N = 39 and 45 for set 1, N = 24 and 38 for set 2, respectively), we carried out a network deconvolution of transcriptomic data, followed by virtual protein activity analysis. We identified PPP1R3F (Protein Phosphatase 1 Regulatory Subunit 3F) as a candidate master regulator affecting a large body of downstream genes that are associated with the disease phenotype. Pathway analysis on the identified targets of PPP1R3F in both datasets indicated alteration of endocytosis pathway. Despite a limited sample size, our study represents one of the first applications of network deconvolution approach to brain transcriptomic data to generate hypotheses that may be further validated by large-scale studies.
Autism spectrum disorders (ASD) comprise a set of highly inheritable neurodevelopmental conditions characterized by impairments in social communication, repetitive behaviors and restricted interests [1, 2]. ASDs are estimated to affect 1 in 68 children in the United States, and boys are 4.5 times more likely than girls to develop ASDs . Several studies showed that the heritability of autistic phenotypes is estimated to be around 90% [4, 5]. The number of genes potentially implicated in ASDs is rapidly growing, mainly from large-scale genetic studies such as next generation sequencing (NGS) [6,7,8,9,10,11,12] and genome-wide association studies (GWAS) [13,14,15,16]. Although these studies have substantially advanced our understanding of the etiology of ASDs, the underlying molecular mechanisms remain elusive . Transcriptome analysis is gaining momentum as a complementary approach to genetic association studies , enabling us to understand the molecular pathophysiology of ASDs.
A number of studies have evaluated whole-genome gene expression that may contribute to the onset of ASD. In a large-scale RNA-Seq effort, matched brain regions from subjects affected with ASDs and controls were utilized to identify neuronal genes strongly dysregulated in cortical regions . Utilizing microarray technology, Voineagu et al.  demonstrated consistent differences in transcriptome organization between autistic/normal human brain tissues using gene co-expression network analysis. However, the potential molecular drivers of co-expressed modules have not been identified . Despite applications of co-expression network approaches in the inference of regulatory machinery in ASD , state-of-the-art approaches such as network deconvolution methods are barely adopted in this area. Network deconvolution methods have been successfully used to study prostate differentiation  and cancers . They can overcome limitations of the existing methods such as connecting genes with indirect interactions leaving their mutual causal effects aside as well as suffering from the exponentially increasing computational cost, etc. . These methods can illuminate the underlying transcription circuitry of diseases and illustrate potential regulation drivers. For example, with transcriptional network deconvolution approach, we have recently provided novel insights on post-traumatic stress disorder (PTSD)  by identifying several genes as drivers of innate immune function. In the current study, we used ARACNe (algorithm for reconstruction of accurate cellular networks)  to deconvolve cellular networks. In this approach, gene–gene co-regulatory patterns are first identified using mutual information (MI), and the constructed networks are further pruned by removing indirect connections where two genes are co-regulated through one or more intermediaries. Using two of the largest transcriptomic datasets of postmortem brain tissues from ASD individuals and control subjects by Parikshak et al.  and Gupta et al. , we reconstructed the transcriptional networks followed by virtual protein activity analysis, to identify “master regulators” (MRs) that may differentially regulate the expression levels of multiple downstream genes in the cerebellum region of ASD individuals and controls.
Network construction and analysis tools are explained in the Additional file 1. Upon constructing the transcriptional networks, we used an algorithm called VIPER (virtual inference of protein-activity by enriched regulon analysis ). VIPER aims at inferring the protein activity of a MR by a systematic analysis of the expression patterns of its targets (regulons). VIPER directly integrates target mode of regulation indicating whether targets are repressed or activated given the statistical confidence in regulator–target interactions and target overlap between different regulators in order to obtain the enrichment of a protein regulon in differentially expressed genes . Compared to the existing approaches such as T-profiler , gene set enrichment analysis (GSEA) , and Fisher’s exact test , VIPER supports seamless integration of genes with different likelihoods of representing activated, repressed or undetermined targets.
Both datasets contain multiple regions including cerebellum, which is relevant for ASDs since specific cerebellar zones can affect neocortical substrates for social interaction and cognitive functions such as language and executive functions [28,29,30]. Abnormalities of the cerebellum, which is believed to be involved in cognitive functions, can in part underlie autistic symptoms . Several other brain regions, such as gyral surface of the anterior cingulate cortex and ventromedial prefrontal cortex , posterior superior temporal sulcus (pSTS) , amygdala, orbital frontal cortex, and fusiform gyrus  are also known to be ASD-relevant. We reasoned that in the same brain region, there should be highly active proteins whose expression regulates a large set of target genes and such patterns should be replicated in an independent dataset. Our preliminary finding indicates PPP1R3F (Protein Phosphatase 1 Regulatory Subunit 3F) as a potential master regulator (MR). The framework of the in silico experiments is illustrated in Fig. 1. Influence of dysregulation of this gene on ASD pathogenesis was then examined.
Results and discussion
We first used the data from Parikshak et al.  to construct the regulatory networks. This data is part of a large RNA-Seq repository on post-mortem human brain tissue (39 cases vs. 45 controls) from cerebellum, frontal cortex, temporal cortex, prefrontal cortex, and visual cortex. During the process of network deconvolution (see Methods in Additional file 1), pairwise MI between all of the available transcripts were obtained. Next, the constructed network was trimmed to remove genetic intermediaries, resulting in potential direct connections between MRs and their targets (we used the recommended P value threshold of 10−8, as a measure of confidence of regulatory relationships between two genes ). This analysis yielded a repertoire of 672,973 interactions, 23,935 regulators, and 24,847 targets in the constructed network using the dataset from Parikshak et al. . We similarly analyzed the second dataset from Gupta et al. , a RNA-Seq data of post-mortem brain tissues with more samples of cerebellum region than other brain regions. Using the same network construction settings on this dataset  containing 24 cases and 38 controls, we deconvolved a network of 297,870 interactions containing 12,040 regulators and 12,529 targets. Both constructed networks are provided in Additional files 2 and 3.
After applying VIPER, we compared the list of significant MRs at FDR ≤ 0.05. We identified PPP1R3F as the only MR shared between the two datasets. Given the small sample size of the data, it is possible that our analysis was underpowered and may have missed other relevant MRs in ASDs. Figure 2 illustrated how downregulation of this MR influences the expression of its regulons in the constructed networks of both data sets. PPP1R3F was significantly downregulated in Parikshak et al. data (FDR from one-sided t test: 0.029) as well as Gupta et al. data (FDR = 3.58 × 10−4).
PPP1R3F is one of the type-1 protein phosphatase (PP1) regulatory subunits. Protein phosphorylation is a key mechanism by which cells regulate signaling transduction pathways, and PPP1 family enzymes are associated with dephosphorylation of several proteins such as TGF-ß cascade . PPP1R3F has been found to be important to neuronal activities . A systematic resequencing of X-chromosome synaptic genes in a group of individuals with ASD (122 males and 20 females) has identified a rare non-synonymous variant in PPP1R3F that can predispose to developing ASDs . This potentially damaging variant, c.733T > C, was observed in a boy with a diagnosis of asperger syndrome and was transmitted from a mother who suffered from learning disabilities and seizures .
Further, we examined the overlaps between PPP1R3F regulons and known candidate genes implicated in ASD and its related disorders (Table 1). The most significant overlap was found with SFARI gene list  (P = 8 × 10−4), followed by overlap with an intellectual disability database gene list (P = 0.072) . The overlaps with other ASDs candidate gene lists also showed trends towards to being significant. These results suggest the potential relevance of the predicted PPP1R3F network to ASDs.
Since PPP1R3F is a sex-linked gene, we accounted for differences between its expression in male and female samples with ASDs. In the Parikshak et al. data set (from Ref. ) there were 32 males and 7 females with ASDs while there were 39 male controls compared to 6 female controls. The gender information is not available on the Gupta et al. dataset . We found no difference of PPP1R3F expression between male and female samples with ASDs in the Parikshak et al. dataset  (FDR = 0.644; two-sided t test), although this may be due to the small sample size. Nevertheless, to account for possible sex effects on the structure of the constructed network, we re-constructed the regulatory network using only male samples in the Parikshak et al. dataset  (i.e., 32 cases and 39 controls). Following the virtual protein activity analysis, we observed that PPP1R3F remained as a significant MR (VIPER enrichment P value = 0.0186). We note that constructing a network by using only female samples is significantly underpowered and leads to an unreliable network with a large number of false positive connections. These suggest that PPP1R3F likely acts independently from potential sex-based gene expression differences, and our observation of PPP1R3F as a MR was unlikely to be a sex-related artifact. Additionally, we conducted the same analyses on the gene expression data from prefrontal cortex, and did not find PPP1R3F as a significant MR (activity FDR = 0.1364). We should note that the number of samples from other brain regions were too small to be used for network analysis. Our finding thus suggests a potential role of PPP1R3F in developing ASDs by modulating a large body of genes in the cerebellum region.
We next conducted pathway enrichment analysis on the PPP1R3F regulons from both networks. We found that the gene targets are enriched for endocytosis pathway in both the Parikshak et al. dataset  (FDR = 5 × 10−3, fold enrichment = 8.26) and the Gupta et al. dataset  (FDR = 8 × 10−4, fold enrichment = 8.42). “Endocytosis” is the only significantly enriched pathway on both data sets. Combining both sets of gene targets (n = 177) (Supplementary Fig. 1 in Additional files 4 and 5) yielded a more significant enrichment of the endocytosis pathway (FDR = 4.85 × 10−4, fold enrichment = 8.97).
Since ASDs are commonly recognized as brain disorders, we further examined whether the identified MR is mainly expressed in the brain. We looked up PPP1R3F in GTEx consortium portal , and found that compared to other tissues, PPP1R3F is predominantly expressed in various brain regions such as frontal cortex and cerebellum (Supplementary Fig. 2 in Additional file 4). We also checked BrainSpan Atlas of the Developing Human Brain (http://brainspan.org) where we found that PPP1R3F is not expressed until 37 weeks post-conception. While remaining unexpressed in some brain regions, it is modestly expressed in 4 month postnatal stage in some brain regions including cerebellum. We further probed the expression of each of the 177 targets of PPP1R3F in GTEx and identified the tissues in which they are highly expressed. We found that 89 out of the 177 target genes of PPP1R3F are highly expressed in various brain regions compared to other tissues (P= 5.51 × 10−5, Fisher’s exact test; number of protein coding genes in GTEx = 20,900, number of protein coding genes highly expressed in the brain in GTEx = 7528). The enrichment of the expressed PPP1R3F target genes for those highly expressed in the brain supports the pathophysiological relevance of PPP1R3F to ASDs.
In this study, we performed exploratory analysis on two small-scale RNA-Seq data sets, and used a network deconvolution algorithm to construct regulatory networks. Applying virtual protein activity analysis on both networks, we identified PPP1R3F as a MR of 177 targets genes. Gene set enrichment analysis on the PPP1R3F regulons suggested that PPP1R3F may exert its functional effects through regulating endocytosis, a pathway that has been previously implicated in neuropsychiatric disorders .
We acknowledge that our study is limited by the small sample size (due to the scarcity of brain tissues), and the results thus need further replications. Nonetheless, our study generates a testable hypothesis that may be validated by large-scale studies in the future. Additionally, further experimental validation of the regulatory effects of PPP1R3F on its downstream targets as predicted by our network analysis may provide novel insights on possible pathophysiological role of PPP1R3F as a MR of ASD gene network.
autism spectrum disorders
next generation sequencing
genome-wide association studies
post-traumatic stress disorder
algorithm for reconstruction of accurate cellular networks
virtual inference of protein-activity by enhanced regulon analysis
gene set enrichment analysis
de la Torre-Ubieta L, Won HJ, Stein JL, Geschwind DH. Advancing the understanding of autism disease mechanisms through genetics. Nat Med. 2016;22:345–61.
Li J, Wang L, Guo H, Shi L, Zhang K, Tang M, Hu S, Dong S, Liu Y, Wang T, et al. Targeted sequencing and functional analysis reveal brain-size-related genes and their networks in autism spectrum disorders. Mol Psychiatry. 2017;22:1282.
Christensen DL, Bilder DA, Zahorodny W, Pettygrove S, Durkin MS, Fitzgerald RT, Rice C, Kurzius-Spencer M, Baio J, Yeargin-Allsopp M. Prevalence and characteristics of autism spectrum disorder among 4-year-old children in the autism and developmental disabilities monitoring network. J Dev Behav Pediatry. 2016;37:1–8.
Freitag CM. The genetics of autistic disorders and its clinical relevance: a review of the literature. Mol Psychiatry. 2007;12:2–22.
Sandin S, Lichtenstein P, Kuja-Halkola R, Larsson H, Hultman CM, Reichenberg A. The familial risk of autism. JAMA. 2014;311:1770–7.
Iossifov I, Ronemus M, Levy D, Wang Z, Hakker I, Rosenbaum J, Yamrom B, Lee YH, Narzisi G, Leotta A, et al. De novo gene disruptions in children on the autistic spectrum. Neuron. 2012;74:285–99.
O’Roak BJ, Vives L, Girirajan S, Karakoc E, Krumm N, Coe BP, Levy R, Ko A, Lee C, Smith JD, et al. Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature. 2012;485:246–50.
Yu TW, Chahrour MH, Coulter ME, Jiralerspong S, Okamura-Ikeda K, Ataman B, Schmitz-Abe K, Harmin DA, Adli M, Malik AN, et al. Using whole-exome sequencing to identify inherited causes of autism. Neuron. 2013;77:259–73.
De Rubeis S, He X, Goldberg AP, Poultney CS, Samocha K, Cicek AE, Kou Y, Liu L, Fromer M, Walker S, et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature. 2014;515:209–15.
Iossifov I, O’Roak BJ, Sanders SJ, Ronemus M, Krumm N, Levy D, Stessman HA, Witherspoon KT, Vives L, Patterson KE, et al. The contribution of de novo coding mutations to autism spectrum disorder. Nature. 2014;515:216–21.
Yean RKC, Merico D, Bookman M, Howe JL, Thiruvahindrapuram B, Patel RV, Whitney J, Deflaux N, Bingham J, Wang Z, et al. Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat Neurosci. 2017;20:602–11.
Yuen RK, Thiruvahindrapuram B, Merico D, Walker S, Tammimies K, Hoang N, Chrysler C, Nalpathamkalam T, Pellecchia G, Liu Y, et al. Whole-genome sequencing of quartet families with autism spectrum disorder. Nat Med. 2015;21:185–91.
Weiss LA, Arking DE, Gene Discovery Project of Johns H, the Autism C, Daly MJ, Chakravarti A. A genome-wide linkage and association scan reveals novel loci for autism. Nature. 2009;461:802–8.
Wang K, Zhang H, Ma D, Bucan M, Glessner JT, Abrahams BS, Salyakina D, Imielinski M, Bradfield JP, Sleiman PM, et al. Common genetic variants on 5p14.1 associate with autism spectrum disorders. Nature. 2009;459:528–33.
Autism Spectrum Disorders Working Group of The Psychiatric. Genomics C: meta-analysis of GWAS of over 16,000 individuals with autism spectrum disorder highlights a novel locus at 10q24.32 and a significant overlap with schizophrenia. Mol Autism. 2017;8:21.
Anney R, Klei L, Pinto D, Regan R, Conroy J, Magalhaes TR, Correia C, Abrahams BS, Sykes N, Pagnamenta AT, et al. A genome-wide scan for common alleles affecting risk for autism. Hum Mol Genet. 2010;19:4072–82.
Gupta S, Ellis SE, Ashar FN, Moes A, Bader JS, Zhan J, West AB, Arking DE. Transcriptome analysis reveals dysregulation of innate immune response genes and neuronal activity-dependent genes in autism. Nat Commun. 2014;5:5748.
Voineagu I, Wang X, Johnston P, Lowe JK, Tian Y, Horvath S, Mill J, Cantor RM, Blencowe BJ, Geschwind DH. Transcriptomic analysis of autistic brain reveals convergent molecular pathology. Nature. 2011;474:380–4.
Parikshak NN, Swarup V, Belgard TG, Irimia M, Ramaswami G, Gandal MJ, Hartl C, Leppa V, Ubieta LT, Huang J, et al. Genome-wide changes in lncRNA, splicing, and regional gene expression patterns in autism. Nature. 2016;540:423–7.
Dutta A, Le Magnen C, Mitrofanova A, Ouyang X, Califano A, Abate-Shen C. Identification of an NKX3.1-G9a-UTY transcriptional regulatory network that controls prostate differentiation. Science. 2016;352:1576–80.
Alvarez MJ, Shen Y, Giorgi FM, Lachmann A, Ding BB, Ye BH, Califano A. Functional characterization of somatic mutations in cancer using network-based inference of protein activity. Nat Genet. 2016;48:838–47.
Doostparast Torshizi A, Armoskus C, Forrest MP, Zhang S, Zhang H, Souaiaia T, Evgrafov OV, Knowles JA, Duan J, Wang K. Deconvolution of transcriptional networks identified TCF4 as a master regulator in schizophrenia. BioRxiv. 2018. https://doi.org/10.1101/133363.
Doostparast Torshizi A, Wang K. Deconvolution of transcriptional networks in post-traumatic stress disorder uncovers master regulators driving innate immune system function. Sci Rep. 2017;7:14486.
Margolin AA, Wang K, Lim WK, Kustagi M, Nemenman I, Califano A. Reverse engineering cellular networks. Nat Protoc. 2006;1:662–71.
Boorsma A, Foat BC, Vis D, Klis F, Bussemaker HJ. T-profiler: scoring the activity of predefined groups of genes using gene expression data. Nucleic Acids Res. 2005;33:W592–5.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102:15545–50.
Abatangelo L, Maglietta R, Distaso A, D’Addabbo A, Creanza TM, Mukherjee S, Ancona N. Comparative study of gene set enrichment methods. BMC Bioinf. 2009;10:275.
Wang SS, Kloth AD, Badura A. The cerebellum, sensitive periods, and autism. Neuron. 2014;83:518–32.
Becker EB, Stoodley CJ. Autism spectrum disorder and the cerebellum. Int Rev Neurobiol. 2013;113:1–34.
Hampson DR, Blatt GJ. Autism spectrum disorders and neuropathology of the cerebellum. Front Neurosci. 2015;9:420.
Rogers TD, McKimm E, Dickson PE, Goldowitz D, Blaha CD, Mittleman G. Is autism a disease of the cerebellum? An integration of clinical and pre-clinical research. Front Syst Neurosci. 2013;7:15.
Balsters JH, Apps MA, Bolis D, Lehner R, Gallagher L, Wenderoth N. Disrupted prediction errors index social deficits in autism spectrum disorder. Brain. 2017;140:235–46.
Alaerts K, Woolley DG, Steyaert J, Di Martino A, Swinnen SP, Wenderoth N. Underconnectivity of the superior temporal sulcus predicts emotion recognition deficits in autism. Soc Cogn Affect Neurosci. 2014;9:1589–600.
Kaiser MD, Pelphrey KA. Disrupted action perception in autism: behavioral evidence, neuroendophenotypes, and diagnostic utility. Dev Cogn Neurosci. 2012;2:25–35.
Korrodi-Gregorio L, Silva JV, Santos-Sousa L, Freitas MJ, Felgueiras J, Fardilha M. TGF-beta cascade regulation by PPP1 and its interactors -impact on prostate cancer development and therapy. J Cell Mol Med. 2014;18:555–67.
Piton A, Gauthier J, Hamdan FF, Lafreniere RG, Yang Y, Henrion E, Laurent S, Noreau A, Thibodeau P, Karemera L, et al. Systematic resequencing of X-chromosome synaptic genes in autism spectrum disorder and schizophrenia. Mol Psychiatry. 2011;16:867–80.
Abrahams BS, Arking DE, Campbell DB, Mefford HC, Morrow EM, Weiss LA, Menashe I, Wadkins T, Banerjee-Basu S, Packer A. SFARI gene 2.0: a community-driven knowledgebase for the autism spectrum disorders (ASDs). Mol Autism. 2013;4:36.
http://dnatesting.uchicago.edu/tests/intellectual-disability-exome-panel. Accessed 5 Jan 2018.
Consortium GT. The genotype-tissue expression (GTEx) project. Nat Genet. 2013;45:580–5.
Loebrich S. The role of F-actin in modulating Clathrin-mediated endocytosis: lessons from neurons in health and neuropsychiatric disorder. Commun Integr Biol. 2014;7:e28740.
http://gfuncpathdb.ucdenver.edu/iddrc/iddrc/home.php. Accessed 4 Jan 2018.
Chiurazzi P, Pirozzi F. Advances in understanding - genetic basis of intellectual disability. F1000Res. 2016;5(F1000 Faculty Rev):599.
Turner TN, Yi Q, Krumm N, Huddleston J, Hoekzema K, Stessman HAF, Doebley AL, Bernier RA, Nickerson DA, Eichler EE, Hoekzema K. denovo-db: a compendium of human de novo variants. Nucleic Acids Res. 2017;45:D804–11.
ADT designed the experiments, conducted the analysis and computations, and wrote the manuscript. JD advised on data analysis and edited the manuscript. KW designed the experiments, supervised data analysis and edited the manuscript. All authors read and approved the final manuscript.
The authors would like to thank the PsychENCODE consortium for providing the data. Data were generated as part of the PsychENCODE Consortium, supported by: U01MH103339, U01MH103365, U01MH103392, U01MH103340, U01MH103346, R01MH105472, R01MH094714, R01MH105898, R21MH102791, R21MH105881, R21MH103877, and P50MH106934 awarded to: Schahram Akbarian (Icahn School of Medicine at Mount Sinai), Gregory Crawford (Duke), Stella Dracheva (Icahn School of Medicine at Mount Sinai), Peggy Farnham (USC), Mark Gerstein (Yale), Daniel Geschwind (UCLA), Thomas M. Hyde (LIBD), Andrew Jaffe (LIBD), James A. Knowles (USC), Chunyu Liu (UIC), Dalila Pinto (Icahn School of Medicine at Mount Sinai), Nenad Sestan (Yale), Pamela Sklar (Icahn School of Medicine at Mount Sinai), Matthew State (UCSF), Patrick Sullivan (UNC), Flora Vaccarino (Yale), Sherman Weissman (Yale), Kevin White (UChicago) and Peter Zandi (JHU). We would also like to thank the providers of the Gupta et al. dataset (The Arking Lab at the McKusick-Nathans Institute of Genetic Medicine of Johns Hopkins University) to generate the brain gene expression data and make the data freely available.
The authors declare that they have no competing interests.
Availability of data and materials
Consent for publication
Ethics approval and consent to participate
This study is supported by NIH grant MH108728 to K.W.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.