Physical activity and blood gene expression profiles: the Norwegian Women and Cancer (NOWAC) Post-genome cohort
BMC Research Notes volume 13, Article number: 283 (2020)
The influence of physical activity (PA) on the immune system has emerged as a new field of research. Regular PA may promote an anti-inflammatory state in the body, thus contributing to the down-regulation of pro-inflammatory processes related to the onset and progression of multiple diseases. We aimed to assess whether overall PA levels were associated with differences in blood gene expression profiles, in a cohort of middle-aged Norwegian women. We used information from 977 women included in the Norwegian Women and Cancer (NOWAC) Post-genome cohort. Information on PA and covariates was extracted from the NOWAC database. Blood samples were collected using the PAXgene Blood RNA collection system, and gene expression profiles were measured using Illumina microarrays. The R-package limma was used for the single-gene level analysis. For a target gene set analysis, we used the global test R-package with 48 gene sets, manually curated from the literature and relevant molecular databases.
We found no associations between overall PA levels and gene expression profiles at the single-gene level. Similarly, no gene sets reached statistical significance at adjusted p < 0.05. In our analysis of healthy, middle-aged Norwegian women, self-reported overall PA was not associated with differences in blood gene expression profiles.
Physical activity (PA) is one of the major modifiable risk factors for several diseases, along with other lifestyle factors such as smoking, alcohol consumption, and diet. PA is a complex phenomenon, which includes concepts such as exercise and training, as well as occupational, leisure time, household and transportation activities, all at different intensity, duration and frequency . Studies have shown that PA is associated with reduced risk of both communicable  and non-communicable diseases like cardiovascular diseases, diabetes, overweight/obesity, and cancers of the breast, endometrium and colon [3,4,5,6,7]. From a public health perspective, the health effects of the total PA level of the general population is of particular interest.
The physiological and molecular mechanisms of the association between PA and health are not fully understood. At the physiological level, PA influences energy expenditure, metabolism, cardiorespiratory and muscular fitness, and body composition, with subsequent consequences for disease risk . The main hormonal systems at play in the link between PA and disease include sex steroids [8, 9], adipokines , as well as insulin and insulin-like growth factors . In recent years, the influence of PA on the immune system has emerged as a new field of research. Regular PA may promote an anti-inflammatory state in the body, thus contributing to the down-regulation of pro-inflammatory processes related to the onset and progression of multiple diseases . Exercise in acute bouts in clinical trials in humans, on the other hand, may lead to muscle tissue damage and localized inflammation, with systemic release of cytokines that may act either pro- or anti-inflammatory [12, 13]. Hence, the immunological response to PA differs according to the intensity and duration of PA, but several other factors also modify the response. These factors include age, diet, level of PA, and baseline level of inflammation in the body [12, 14].
New epidemiological evidence is demonstrating the importance of increased everyday PA and reduced sedentary behavior to minimize the risk of disease, and new insights on immunological mechanisms of exercise are emerging. However, understanding how PA levels in a general population influences immunological mechanisms remains a challenge. Here, we have assessed whether PA levels are associated with differences in the gene expression profiles of immune cells in whole-blood samples collected in the NOWAC Post-genome cohort, a nationally representative, population-based cohort of middle-aged women.
The NOWAC study  is a nationally representative, prospective cohort study which includes more than 170,000 middle-aged women. The participants answered one or more 4- or 8-page questionnaires on lifestyle, dietary factors and health. In the years 2003–2006, approx. 50,000 of the NOWAC women donated a blood sample eligible for gene expression analysis and answered concurrently a 2-page questionnaire, collectively forming the NOWAC Post-genome cohort (further details in ). For the present cross-sectional study, 977 cancer-free women were randomly drawn from the NOWAC Post-genome cohort.
Information on overall PA and age at inclusion was extracted from questionnaires answered no more than 1 year prior to blood sampling. All other variables were extracted from the 2-page questionnaire accompanying the blood sample. PA level was self-reported on a 10-increment scale from 1 to 10. The question on PA was stated as follows (see Additional file 1: Table S1): “By physical activity we mean both work in and outside the home, as well as training/exercise and other physical activity, such as walking, etc. Mark the number that best describes your level of physical activity.” For the present analyses, we defined five PA categories by combining levels 1 + 2 (very low), 3 + 4 (low), 5 + 6 (moderate), 7 + 8 (high), and 9 + 10 (very high) from the 10-increment scale. Smoking (during the last week yes/no) and use of medication (during the last week yes/no) were defined as smoking or taking medication during the past week before giving the blood sample. Medication was used as a proxy to gain a comprehensive impression of the participants’ health status, and was grouped according to the Anatomical Therapeutic Chemical (ATC) classification, and the following classes were assessed: N (nervous system, including analgesics), C (cardiovascular system), M (musculo-skeletal system, including non-steroidal anti-inflammatory drugs), and B (blood and blood forming organs). We excluded the lowest PA category from analyses due to low n (36), as well as high body mass index (BMI) (27.8) and high frequency of medication use (78%) in that group. We also excluded women with missing information on either PA, BMI, smoking, or medication use (n = 70).
Blood samples were collected using the PAXgene Blood RNA collection system, Preanalytix/Qiagen, Hilden, Germany), they were kept at − 80 °C until shipment to the Genomics Core Facility at the Norwegian University of Science and Technology for analysis. Total RNA was isolated in accordance with the manufacturer’s protocol (PAXgene Blood miRNA isolation Kit). RNA purity was assessed by NanoDrop ND 8000 spectrophotometer (ThermoFisher Scientific, Wilmington, DE, USA), and RNA integrity by Bioanalyzer capillary electrophoresis (Agilent Technologies, Palo Alto, CA, USA). The mRNA was amplified and labeled using the Illumina TotalPrepT-96 RNA Amplification Kit (Ambion Inc., Austin, TX, USA), and hybridized to Illumina HumanHT-12 Expression BeadChip microarrays (Illumina, Inc. San Diego, CA, USA). The raw microarray images were processed in Illumina GenomeStudio. Preprocessing of the microarray dataset is described in . The main steps of the preprocessing included (1) removal of outliers, (2) background correction (using the R package limma, function nec), and (3) probe filtering based on Illumina quality control measures, detection in < 1% of samples, or probes that were not mapped to an Illumina ID. The dataset was quantile normalized and log2 transformed using the R package lumi, functions lumiN and lumiT. The R packages lumi: nuID2RefSeqID and illuminaHumanv4.db were used for annotation of Illumina IDs to gene symbols. The final gene expression dataset included 7741 probes and 977 individuals. After applying the exclusion criteria based on PA levels and missing information on covariates, our analytical sample size consisted of 871 women.
We used independent sample t-test and chi-square statistics to compare different levels of physical activity according to age, smoking, BMI, and use of medication at the time of the blood sample. We looked for differential expression first at the single gene level (R package limma ), and secondly at the gene set level (R package: global test ). In the search for differentially expressed single genes, we compared PA levels low versus high, and low versus very high, using false discovery rate-adjusted p < 0.05 as a significance threshold. Secondly, we checked for linear trends in gene expression, using the four highest PA categories as a continuous variable. All single gene level analyses were adjusted for variables that were significantly different (p < 0.05) between one or more comparison groups: BMI, smoking, and use of ATC class N medications. In addition to single-gene level analyses, we carried out a targeted gene set level analysis using global test . As input, we curated gene sets from the literature and the Array Express database of gene expression studies (n = 21, Additional file 2: Table S2). Also, gene sets representing general processes related to PA was extracted from Molecular Signatures Database (MSigDB , n = 27). The general processes included inflammatory pathways, oxidation, oxidative stress, and cell cycle pathways, as well as specific molecular pathways like PTEN signaling and Toll like receptor signaling.
The mean age of women in our sample was 54.3, mean BMI was 25.3, and 25% were current smokers (Table 1). The women in the high and very high PA level groups were less likely to be smokers, had a lower BMI and were more likely to have used ATC N type medications compared to women in the low PA category. Age differed statistically significant between low and very high PA groups. However, the actual difference in number of years was small (1.4 years), thus we considered it not to be biologically relevant, and consequently we did not adjust for age (Table 2).
At the single-gene level, we found no associations between PA levels and gene expression profiles, neither in categorical analysis of low versus high, nor in low versus very high PA levels. Table 3 shows the top ten genes sorted by fold change. Similarly, analyzing PA levels as a continuous variable did not produce results that met our criteria for statistical significance (adjusted p < 0.05). For gene set analysis, we first employed the overall test for significance in the global test R-package. In the comparison of low versus high PA levels and low versus very high PA, the p-values were 0.82 and 0.63, respectively. As recommended by Goeman et al. when overall statistically significant level is low , we proceeded with a targeted approach for gene set testing. The 48 gene sets we chose were tested using the global test, but no gene sets reached statistical significance at adjusted p < 0.05.
In the present cohort study of Norwegian, middle-aged women, we explored associations between everyday PA levels and blood gene expression profiles. Using statistical methods that are sensitive to low-magnitude associations, we found no statistically significant associations when comparing low versus high PA, low versus very high, or when using PA as a continuous variable.
Most published studies on the molecular effects of PA has focused on exercise as the exposure, in contrast to the total, everyday level of PA. Available studies often use an experimental approach by subjecting participants to exercise interventions, and include young participants, frequently males. In comparison, the present study includes participants representing the general, middle-aged female population, and provide insight into the magnitude of the effects of differing levels of total, everyday PA. Our findings are more relevant to the general population, and may serve to moderate results from controlled trials of exercise.
In addition to the single gene level analysis limma, we chose the global test when analyzing gene sets. The global test method used herein is found to be among the most sensitive methods, and not to be prone to false positive findings [21,22,23].
A main strength of this study is the large study population, as compared to the available literature on molecular mechanisms of PA. Smaller studies are prone to selection bias and reduced generalizability. Our large sample size gives the opportunity to discuss associations in the general population, with increased generalizability of the findings as compared to smaller studies. Further, the full-genome expression analysis method gives a global view of the actively transcribed genes in the entire blood cell pool, as opposed to studying single genes and single cell types. Finally, a validation study using objective measures showed that the global PA scale is able to rank study participants from very low to high PA levels, although the dose is not possible to determine .
In our analysis of healthy, middle-aged Norwegian women, self-reported PA was not associated with differences in blood gene expression profiles. To our knowledge, this is the first study assessing the association between blood gene expression profiles and everyday PA levels in a general, female population. When comparing studies of exercise to our study of overall everyday PA levels, the lack of knowledge on potential threshold levels of PA for immunological effects becomes evident. Thus, more research is needed to identify the level of PA needed for positive immunological effects, and whether these levels differ between population strata. To identify this, PA needs to be assessed using objective measures.
A limitation of our work is that in the observational study design, we cannot exclude the possibility of residual confounding by diet or other lifestyle factors, which may influence our results. Furthermore, we did not have available data on blood cell subpopulation distributions, which may be considered a potential confounding factor. Our sample size was relatively large compared to other published studies, however, it is possible that some of the analyses were underpowered, as we confined our analytical sample only to women that were within tested PA categories. Even though the PA scale used was shown to be reliable in correctly separating women based on their PA level, this does not exclude the possibility that the PA levels were overestimated across the PA categories used in the analyses. However, we do not expect that this overestimation differs along the scale. Further, the PA scale used captures total PA levels, including occupational PA. It was previously shown that occupational PA was associated with a lower level of perceived health . This indicates that occupational PA might have unfavorable effect on anti-inflammatory gene expression, in other words, it might counteract the favorable effects of non-occupational PA. Finally, there was some time gap between questionnaires and blood sampling, so our results will only reflect potential long-term effects of overall, habitual PA levels. Taken together, our exposure measurement may not be very precise, driving our results toward the null due to potential misclassification.
Availability of data and materials
The dataset used will be made available upon request. Please contact the corresponding author.
Anatomical Therapeutic Chemical Classification
Body mass index
False discovery rate
Molecular Signatures Database
Norwegian Women and Cancer
PetteeGabriel KK, Morrow JR Jr, Woolsey AL. Framework for physical activity as a complex and multidimensional behavior. J Phys Act Health. 2012;9(1):S11–8.
Pape K, et al. Leisure-time physical activity and the risk of suspected bacterial infections. Med Sci Sports Exerc. 2016;48(9):1737–44.
Morris JN, et al. Coronary heart-disease and physical activity of work. Lancet. 1953;265(6796):1111–20.
WCRF and AICR. Continuous update project Expert Report 2018. Physical activity and the risk of cancer. 2018. dietandcancerreport.org. Accessed Dec 2018.
Moore SC, et al. Association of leisure-time physical activity with risk of 26 types of cancer in 1.44 million adults. JAMA Intern Med. 2016;176(6):816–25.
Lee IM, et al. Effect of physical inactivity on major non-communicable diseases worldwide: an analysis of burden of disease and life expectancy. Lancet. 2012;380(9838):219–29.
Ding D, et al. The economic burden of physical inactivity: a global analysis of major non-communicable diseases. Lancet. 2016;388(10051):1311–24.
Matthews CE, et al. Effects of exercise and cardiorespiratory fitness on estrogen metabolism in postmenopausal women. Cancer Epidemiol Biomark Prev. 2018;27(12):1480.
Ennour-Idrissi K, Maunsell E, Diorio C. Effect of physical activity on sex hormones in women: a systematic review and meta-analysis of randomized controlled trials. Breast Cancer Res. 2015;17(1):139.
Friedenreich CM, Neilson HK, Lynch BM. State of the epidemiological evidence on physical activity and cancer prevention. Eur J Cancer. 2010;46(14):2593–604.
Friedenreich CM, et al. Changes in insulin resistance indicators, IGFs, and adipokines in a year-long trial of aerobic exercise in postmenopausal women. Endocr Relat Cancer. 2011;18(3):357–69.
Gjevestad GO, Holven KB, Ulven SM. Effects of exercise on gene expression of inflammatory markers in human peripheral blood cells: a systematic review. Curr Cardiovasc Risk Rep. 2015;9(7):34.
Campbell JP, Turner JE. Debunking the myth of exercise-induced immune suppression: redefining the impact of exercise on immunological health across the lifespan. Front Immunol. 2018;9:648.
Sellami M, et al. Effects of acute and chronic exercise on immunological parameters in the elderly aged: can physical activity counteract the effects of aging? Front Immunol. 2018;9:2187.
Lund E, et al. Cohort profile: the Norwegian Women and Cancer Study—NOWAC—Kvinner og kreft. Int J Epidemiol. 2008;37:36–41.
Dumeaux V, et al. Gene expression analyses in breast cancer epidemiology: the Norwegian Women and Cancer postgenome cohort study. Breast Cancer Res. 2008;10(1):R13.
Gunter C-C, Holden M, Holden L. Preprocessing of gene-expression data related to breast cancer diagnosis. Oslo: Norwegian Computing Center; 2014.
Smyth GK. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3:Article 3.
Goeman JJ, et al. A global test for groups of genes: testing association with a clinical outcome. Bioinformatics. 2004;20(1):93–9.
Subramanian A, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102(43):15545–50.
Tarca AL, Bhatti G, Romero R. A comparison of gene set analysis methods in terms of sensitivity, prioritization and specificity. PLoS ONE. 2013;8(11):e79217.
Fridley BL, Jenkins GD, Biernacka JM. Self-contained gene-set analysis of expression data: an evaluation of existing and novel methods. PLoS ONE. 2010;5(9):e12693.
Nguyen T-M, et al. Identifying significantly impacted pathways: a comprehensive review and assessment. Genome Biol. 2019;20(1):203.
Borch KB, et al. Criterion validity of a 10-category scale for ranking physical activity in Norwegian women. Int J Behav Nutr Phys Act. 2012;9:2.
Bogaert I, De Martelaer K, Deforche B, Clarys P, Zinzen E. Associations between different types of physical activity and teachers’ perceived mental, physical, and work-related health. BMC Public Health. 2014;30(14):534. https://doi.org/10.1186/1471-2458-14-534.
We gratefully acknowledge the participants of the NOWAC Post-genome study, the founder of NOWAC prof. Eiliv Lund, and the technical and administrative staff of NOWAC.
All laboratory work was carried out by the Genomics Core Facility (GCF), Norwegian University of Science and Technology (NTNU). GCF is funded by the Faculty of Medicine and Health Sciences at NTNU and Central Norway Regional Health Authority. This study was funded by UiT The Arctic University of Norway, and the European Research Council (ERC-AdG 232997-Tice). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Ethics approval and consent to participate
All women gave written, informed consent. Handling of personal information in the NOWAC study was approved by the Norwegian Data Protection Authority (ref. 07/00030). Collection and storage of biological material in NOWAC was approved by the Regional Ethical Committee of Northern Norway (REK-Nord) in accordance with the Norwegian Biobank Act (ref. REK-Nord 2014/1605).
Consent for publication
The authors have no competing interests to declare.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Olsen, K.S., Lukic, M. & Borch, K.B. Physical activity and blood gene expression profiles: the Norwegian Women and Cancer (NOWAC) Post-genome cohort. BMC Res Notes 13, 283 (2020). https://doi.org/10.1186/s13104-020-05121-2