Skip to main content

The UK BiLEVE and Mendelian randomisation: using multivariable instrumental variables to address “damned if you, damned if you don’t” adjustment problems



To explore the use of multivariable instrumental variables to resolve the “damned if you do, damned if you don’t” adjustment problem created for Mendelian randomisation (MR) analysis using the smoking or lung function related phenotypes in the UK Biobank (UKB).


“damned if you do, damned if you don’t” adjustment problems occur when both adjusting and not-adjusting for a variable will induce bias in an analysis. One instance of this occurs because the genotyping chip of UKB participants differed based on lung function/smoking status. In simulations, we show that multivariable instrumental variables analyses can attenuate potential collider bias introduced by adjusting for a proposed covariate, such as the UKB genotyping chip. We then explore the effect of adjusting for genotyping chip in a multivariable MR model exploring the effect of smoking on seven medical outcomes (lung cancer, emphysema, hypertension, stroke, heart diseases, depression, and disabilities). We additionally compare our results to a traditional univariate MR analysis using genome-wide analyses summary statistics which had and had not adjusted for genotyping chip. This analysis implies that the difference in genotyping chip has introduced only a small amount of bias.

Peer Review reports


Mendelian randomisation (MR) is an increasingly popular method for inferring the causal effect of modifiable exposures on epidemiological outcomes [1]. In an MR study, genetic variants which are robustly associated with the exposure of interest are used as instruments in an instrumental variables analysis.

One of the most popular resources for conducting MR analyses is the UK Biobank (UKB) [2]. The UKB is a large (approximately half a million participants) population cohort study of Britons. The UKB has been used in the MR literature to explore the causal effects of smoking-related phenotypes [3]. However, the UKB genotyping was rolled out over several steps.

Participants enrolled in the UK BiLEVE study were genotyped using different instruments (‘genotyping chip’) than other participants [4]. Because UK BiLEVE was not randomly sampled, there is a worry that differences in genotyping between participants could cause (confounding) bias if the sampling probability is associated with risk factors for the outcome phenotype of interest. Because of this, there is general advice to adjust UKB GWASs by genotyping chip [5]. However, the UK BiLEVE study selected participants who were in a tail or centre of the distribution for lung function and smoking. This poses a problem for genetic designs, like MR, in the UKB exploring smoking and lung function-related phenotypes. If genotyping chip is determined by UK BiLEVE enrolment, which is in turn determined by lung function and smoking status, then adjusting for genotyping chip could introduce collider bias.

Situations where no adjustment will result in bias, but where the required covariate is a collider (see Fig. 1) have been described as “damned if you do, damned if you don’t” adjustment problems in a recent review of types of covariate controls [6]. This review concluded that there were no satisfactory methods for addressing this bias, and they suggested authors should implement sensitivity analyses when encountering this type of problem. Existing guidelines for conducting genetic analysis in the UKB had made similar suggestions of running analyses with and without adjustment for genotyping chip when a phenotype might relate to lung function or smoking [5].

Fig. 1
figure 1

An example “damned if you do, damned if you don’t” adjustment problem. Adjusting for M will result in collider bias, but not adjusting for M will introduce confounding

In this article, we propose a complementary sensitivity analysis to address this type of adjustment paradox, with specific emphasis on addressing bias due to UK BiLEVE in MR studies. Multivariable IV (MVIV) is an extension of traditional IV analysis to include more than one exposure [7]. A traditional IV analysis, like MR, assumes that the instrument is robustly associated with the exposure, can affect the outcome only via the exposure, and that there is no ‘back door’ path from the instrument to the outcome. MVIV modifies these assumptions so that: 1) the instrument is robustly associated with the exposure(s) conditional on the other covariate(s), can affect the outcome only via one of the exposures, and that, conditional on all covariates, there is no ‘back door’ path from the instrument to the outcome. While the effect estimates of a standard IV analysis are the total effect of the exposure on the outcome, MVIV effect estimates should be interpreted as the direct effect of the exposure conditional on the covariates. Because of this, MR applications of MVIV have shown that it can be used to address bias, like collider bias [8, 9], by ensuring that the effect estimate of interest is conditionally independent of a known biasing phenotype. Intuitively then, adding the genotyping chip as a second exposure to an MR model using chip adjusted genome-wide summary statistics should remove any collider bias introduced by adjusting for genotyping chip. The “damned if you do, damned if you don’t” paradoxes explored here differs from existing applications of MVIV which have focused on settings where the variable being adjusted for is only one of a confounder [10], pleiotropic [7], or a collider [8, 9], whereas here we consider a setting where it is both a collider and either pleiotropic or a confounder.

Fig. 2
figure 2

Directed Acyclic Graph of data generative model used in simulation. Here, G1 is the genetic liability to the exposure, and G2 the genetic liability to the potential covariate. C is the potential covariate, and U1 and U2 are confounders. X and Y are the exposure and outcome respectively. M is additionally pleotropic and a collider

Main text



We ran a simple simulation to provide a proof of concept that MVIV can be used to address “damned if you do, damned if you don’t” adjustment problems. We report our simulations using the ADEMP (aims, data-generating mechanisms, estimands, methods, and performance measures) approach [11].

Data-generating mechanisms

We simulated a setting in which there is an exclusion restriction violation (i.e. where the instrument causes the outcome via a path not mediated by the exposure), but where adjusting for this violation would introduce M bias (Fig. 2). More formally, we simulated 100 single nucleotide polymorphisms (SNPs, which are common genetic variants) as independent and identically distributed binomial variables with the following parameters.

$$SNP \sim B\left( {2,N\left( {0.5,{{0.1}^2}} \right)} \right)$$

The distribution of allele frequencies was chosen to be loosely based on the one observed for the genome wide significant SNPs in the UKB GWAS of lifetime smoking by Wooton and colleagues [3].

We then simulated two confounders as independent and identically distributed normal variables with the following parameters:

$$C \sim N\left( {0,{1^2}} \right)$$

We then defined the exposure as.

$$E = {C_1} + \sum\nolimits_1^{50} {\left[ {N\left( {0.1,{{0.05}^2}} \right) * SNP} \right]} + {\varepsilon _1}$$

where ε is an error term such that ε1 ~ N(0, 12).

We defined the potential covariate for blocking the exclusion restriction violation as.

$$P = {C_2} + \sum\nolimits_{51}^{100} {\left[ {N\left( {0.1,{{0.05}^2}} \right) * SNP} \right]} + {\varepsilon _2}$$

where ε2 ~ N(0, 12).

Finally we defined the outcome as.

$$Y = E + {C_1} + {C_2} + P + {\varepsilon _3}$$

where ε3 ~ N(0, 12).

The phenotypic beta values chosen in this simulation were chosen purely arbitrarily. However, biases are generally more visible with larger effect estimates. By choosing betas values of 1 we therefore hoped to clearly illustrate any possible effect of using MVIV. The exact conclusions of our simulation are therefore not expected to generalise to any specific applied setting.


The causal effect of the exposure on the outcome.

Methods: We compare three methods for estimating the causal effect of the exposure on the outcome:

  1. 1)

    We ran an inverse variance weighted (intercept free) regression of By ~ Bx + 0, where By is the SNP-outcome association and Bx is the SNP-exposure association, and where the Bx and By were estimated in linear models which additionally adjusted for P.

  2. 2)

    An inverse-variance weighted (intercept free) regression of By ~ Bx + 0, where the Bx and By were not estimated in linear models which did not additionally adjust for P.

  3. 3)

    An inverse-variance weighted (intercept free) regression of By ~ Bx + Bp + 0, where Bp is the SNP-covariate association, and where the Bx and By were estimated in linear models which additionally adjusted for P.

For readers less familiar with the MR literature, it is worth noting that the intercept-free weighted regression is equivalent to an inverse variance weighted meta-analysis of the IV Wald ratios for each SNP. In addition, [1] and [2] only used the first 50 simulated SNPs (i.e. those which associated with the exposure), while [3] used all 100 simulated SNPs (i.e. SNPs which associated with either exposure). By, Bx, and Bp were all estimated in non-overlapping samples of 250,000 participants.

Performance measure

The mean bias in the causal effect of the exposure on the outcome over 1000 iterations.

Results of the simulation

The simulation found that both adjusting and not adjusting the linear model for the covariate resulted in bias (mean bias = -0.445 [MC SE = 0.002] and 0.972 [MC SE = 0.001] for the adjusted and not adjusted analysis respectively). On the other hand, the MVIV model attenuated most of the bias (mean bias = -0.064 [MC SE = 0.002]).

Applied example with the UKB genotyping chip

We used a two-sample MR analysis of the effect of smoking on seven outcomes (lung cancer, emphysema, depression, hypertension, stroke, heart diseases, and diabetes) in the UKB as an applied example. The outcomes were chosen because there is existing literature implying a causal association between smoking and these outcomes [12,13,14,15,16,17]. We ran three versions of this analysis: (1) using univariable MR to estimate the effect of smoking on the outcomes where the UKB smoking, lung cancer and emphysema GWASs had adjusted for genotyping chip, (2) using univariable MR to estimate the effect of smoking on the outcomes where the UKB smoking, lung cancer and emphysema GWASs had not adjusted for genotyping chip, and (3) using multivariable MR estimate the effect smoking on the outcomes adjusted for genotyping chip where the UKB smoking, lung cancer and emphysema GWASs had adjusted for genotyping chip.

We used the Wootton et al. UKB lifetime smoking GWAS as a source of SNP-exposure associations [3], which we standardised by dividing the effect estimate and standard error by 0.6940093. To estimate genotype-chip association we ran a GWAS (adjusted for age, sex, and the first 10 principal components of ancestry) using BOLT-LMM in the MRC-IEU UKB GWAS pipeline. Full methods for both GWASs described elsewhere [3, 18]. In both the univariate and multivariate setting, we selected genome-wide significant SNPs associated with the exposure(s) of interest as genetic instruments, and then clumped this list using an r2 of 0.001 and kb of 10,000. We additionally implemented the FIQT WCC on the exposure GWASs to correct for any effect of Winner’s curse [19].

We used the Elsworth’s UKB GWASs in the MRC-IEU OpenGWAS platform as a source of SNP-emphysema and -hypertension associations [20]; as well as Nikpay et al’s GWAS of CAD, Malik et al’s GWAS of stroke, Wang et al’s GWAS of lung cancer, Howard et al’s GWAS of depression and the FinnGen round 5 GWAS of disabilities [21,22,23,24,25]. Details on genotyping, quality control, and phenotyping can be found in the original publications and on the UKB website ( All outcome GWASs were on the odds ratio scale. We harmonised the exposure and outcome samples, and removed palindromic SNPs whose effect allele could not be inferred using based on minor allele frequency. We used four MR estimators: IVW, weighted mode, weighted median, and MR Egger. We additionally estimated the heterogeneity in the MR Wald ratios using the Cochrane Q statistic as a control for exclusion restriction violations. The univariate MR analysis was implemented using the TwoSampleMR R package [26, 27]. Multivariable MR analyse additionally used the MVMR, MendelianRandomization, and MVMRMode R packages [28,29,30].

Table 1 presents the results of this analysis. These broadly show highly consistent findings across the three methods, with most of the changes in estimates smaller than the standard error of each point estimate. Overall, this would therefore broadly imply that there is minimal collider or confounding bias introduced by adjusting or not adjusting for genotyping chip.


Here we have shown that MVIV can, in theory, be used to attenuate bias when “damned if you do, damned if you don’t” adjustment problems occur in an IV analysis. We then apply this to the UKB and show that, despite differences in genotyping depending on lung function and smoking status of participants, the UK BiLEVE study appears to have introduced only a small amount of bias into our estimates of the causal effect of smoking on lung cancer.

There are three complications to the application of our findings to address differences in genotyping chip in the UKB, which we believe mean that our proposal should be used to supplement, rather than replace the existing guidance of performing both a chip-adjusted, and no-chip-adjusted, analysis as a sensitivity analysis. Firstly, MVIV has additional parameters than univariable IV analyses, and will therefore be even less precise. Secondly, the collinearity of exposures (such as smoking and genotype chip) can also introduce conditionally weak instruments into analyses which would have strong instruments in a university setting. Although not an issue in our applied analysis, this could become a major issue in analysis using weaker instruments, such as parental smoking status. Hence, authors should come to a judgment about which method will have a lower mean squared error and then use the alternatives as sensitivity analyses. Thirdly, because enrolment into the UK BiLEVE study was determined by smoking and lung function, it could be argued that it is, in effect, a proxy of these variables. If this is the case, then adjusting for genotyping chip in a model would potentially do something equivalent to adjusting for a mediator in a traditional regression analysis, and therefore introduce bias. This underpins the importance of not using the MVIV analysis to replace the existing guidelines.

A final, but related, limitation when applying our proposal to other settings is that there has to be a way to validly instrument the proposed covariate. Since there are many settings, especially when using summary data IV analysis like two-sample MR, when study-specific variables, such as UKB genotyping chip, this may be more common than the authors would hope.

Table 1 Results of applied example. UVMR = univariable Mendelian randomisation, MVMR = multivariable Mendelian randomisation

Data Availability

The code and GWAS summary statistics used in this study is available from The GWAS summary statistics will also be made available from the MRC-IEU OpenGWAS project.

This project was conducted using UK Biobank application no. 15,825. UK Biobank was established by the Wellcome Trust medical charity, Medical Research Council, Department of Health, Scottish Government and the Northwest Regional Development Agency. It has also had funding from the Welsh Government, British Heart Foundation, Cancer Research UK and Diabetes UK. UK Biobank is supported by the National Health Service (NHS). UK Biobank is open to bona fide researchers anywhere in the world.


  1. Davies NM, Holmes MV, Smith GD. Reading mendelian randomisation studies: a guide, glossary, and checklist for clinicians. BMJ. 2018;362:k601.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Woolf B, Di Cara N, Moreno-Stokoe C, Skrivankova V, Drax K, Higgins JPT et al. Investigating the transparency of reporting in two-sample summary data Mendelian randomization studies using the MR-Base platform. International Journal of Epidemiology. 2022;dyac074.

  3. Wootton RE, Richmond RC, Stuijfzand BG, Lawn RB, Sallis HM, Taylor GMJ, et al. Evidence for causal effects of lifetime smoking on risk for depression and schizophrenia: a mendelian randomisation study. Psychol Med. 2020;50(14):2435–43.

    Article  PubMed  Google Scholar 

  4. Wain LV, Shrine N, Miller S, Jackson VE, Ntalla I, Soler Artigas M, et al. Novel insights into the genetics of smoking behaviour, lung function, and chronic obstructive pulmonary disease (UK BiLEVE): a genetic association study in UK Biobank. Lancet Respir Med. 2015;3(10):769–81.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Mitchell R, Hemani G, Dudding T, Corbin L, Harrison S, Paternoster L. UK Biobank Genetic Data: MRC-IEU Quality Control, version 2 [Internet]. [cited 2022 Jun 22]. Available from:

  6. Cinelli C, Forney A, Pearl J. A Crash Course in Good and Bad Controls [Internet]., Rochester NY. ; 2020 [cited 2022 Jun 22]. Available from:

  7. Burgess S, Thompson SG. Multivariable mendelian randomization: the use of pleiotropic genetic variants to estimate causal effects. Am J Epidemiol. 2015;181(4):251–60.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Schooling CM, Lopez PM, Yang Z, Zhao JV, Au Yeung SL, Huang JV. Use of Multivariable Mendelian Randomization to Address Biases Due to Competing Risk Before Recruitment. Frontiers in Genetics [Internet]. 2021 [cited 2022 Apr 6];11. Available from:

  9. Gilbody J, Borges MC, Smith GD, Sanderson E, Multivariable. MR can mitigate bias in two-sample MR using covariable-adjusted summary associations [Internet]. medRxiv; 2022 [cited 2022 Nov 3]. p. 2022.07.19.22277803. Available from:

  10. Woolf B, Rajasundaram S, Gill D, Sallis HM, Munafò MR. Assessing the Causal Effects of Environmental Tobacco Smoke Exposure: A meta-analytic Mendelian randomisation study [Internet]. medRxiv; 2023 [cited 2023 May 24]. p. 2023.03.30.23287949. Available from:

  11. Morris TP, White IR, Crowther MJ. Using simulation studies to evaluate statistical methods. Stat Med. 2019;38(11):2074–102.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Khuder SA, Milz S, Jordan T, Price J, Silvestri K, Butler P. The impact of a smoking ban on hospital admissions for coronary heart disease. Prev Med. 2007;45(1):3–8.

    Article  PubMed  Google Scholar 

  13. Doll R, Hill AB. Lung Cancer and other causes of death in relation to smoking. Br Med J. 1956;2(5001):1071–81.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Narkiewicz K, Kjeldsen SE, Hedner T. Is smoking a causative factor of hypertension? Blood Press. 2005;14(2):69–71.

    Article  PubMed  Google Scholar 

  15. FRANKLIN W CLOWELLF, MICHELSON A L, W. SCHILLER I. CHRONIC OBSTRUCTIVE PULMONARY EMPHYSEMA:. A DISEASE OF SMOKERS*. Annals of Internal Medicine [Internet]. 2008 Dec 1 [cited 2022 Jun 22]; Available from:

  16. Shah RS, Cole JW. Smoking and stroke: the more you smoke the more you stroke. Expert Rev Cardiovasc Ther. 2010;8(7):917–32.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Chaiton MO, Cohen JE, O’Loughlin J, Rehm J. A systematic review of longitudinal studies on the association between depression and smoking in adolescents. BMC Public Health. 2009;9(1):356.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Ruth Mitchell E, MRC IEU UK Biobank. GWAS pipeline version 2 [Internet]. data.bris. 2019 [cited 2022 Mar 12]. Available from:

  19. Bigdeli TB, Lee D, Riley BP, Vladimirov V, Fanous AH, Kendler KS et al. FIQT: a simple, powerful method to accurately estimate effect sizes in genome scans [Internet]. bioRxiv; 2015 [cited 2022 Mar 12]. p. 019299. Available from:

  20. Elsworth B, Lyon M, Alexander T, Liu Y, Matthews P, Hallett J et al. The MRC IEU OpenGWAS data infrastructure [Internet]. bioRxiv; 2020 [cited 2022 Mar 30]. p. 2020.08.10.244293. Available from:

  21. Kurki MI, Karjalainen J, Palta P, Sipilä TP, Kristiansson K, Donner K et al. FinnGen: Unique genetic insights from combining isolated population and national health register data [Internet]. medRxiv; 2022 [cited 2022 Apr 14]. p. 2022.03.03.22271360. Available from:

  22. Nikpay M, Goel A, Won HH, Hall LM, Willenborg C, Kanoni S, et al. A comprehensive 1,000 genomes-based genome-wide association meta-analysis of coronary artery disease. Nat Genet. 2015;47(10):1121–30.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Malik R, Chauhan G, Traylor M, Sargurupremraj M, Okada Y, Mishra A, et al. Multiancestry genome-wide association study of 520,000 subjects identifies 32 loci associated with stroke and stroke subtypes. Nat Genet. 2018;50(4):524–37.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Wang Y, McKay JD, Rafnar T, Wang Z, Timofeeva MN, Broderick P, et al. Rare variants of large effect in BRCA2 and CHEK2 affect risk of lung cancer. Nat Genet. 2014;46(7):736–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Howard DM, Adams MJ, Clarke TK, Hafferty JD, Gibson J, Shirali M et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions.: Supplementary Information [Internet]. Genetics; 2018 Oct [cited 2019 Apr 17]. Available from:

  26. R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computin [Internet]. 2021. Available from:

  27. Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife. 2018;7:e34408.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Sanderson E, Spiller W, Bowden J. Testing and correcting for weak and pleiotropic instruments in two-sample multivariable mendelian randomization. Stat Med. 2021;40(25):5434–52.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Yavorska OO, Burgess S. MendelianRandomization: an R package for performing mendelian randomization analyses using summarized data. Int J Epidemiol. 2017;01(6):1734–9.

    Article  Google Scholar 

  30. Woolf B, Gill D, Grant AJ, Burgess S, MVMRmode. Introducing an R package for plurality valid estimators for multivariable Mendelian randomisation [Internet]. medRxiv; 2023 [cited 2023 Jan 20]. p. 2023.01.09.23284345. Available from:

Download references


This work was carried out using the computational facilities of the Advanced Computing Research Centre, University of Bristol -

We are very grateful to Kate Tilling for providing feedback on the simulation methods and interpretation of the study.


Benjamin Woolf is funded by an Economic and Social Research Council (ESRC) South West Doctoral Training Partnership (SWDTP) 1 + 3 PhD Studentship Award (ES/P000630/1). BW, HMS and MRM are all members of the Medical Research Council (MRC) Integrative Epidemiology Unit at the University of Bristol (MC_UU_00011/7).

Author information

Authors and Affiliations



BW conceived and implemented the analyses. DG, HS, and MM supervised. All authors reviewed the manuscript.

Corresponding author

Correspondence to Benjamin Woolf.

Ethics declarations

Ethics approval and consent to participate

UKB received ethical approval from the North West Multi-Centre Research Ethics Committee (REC reference 11/NW/0382). All participants provided written informed consent to participate in the study. Data from the UKB are fully anonymised. All methods were performed in accordance with the relevant guidelines and regulations.

Consent for publication


Competing interests

DG is employed part-time by Novo Nordisk. The other authors declare no conflicts of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Woolf, B., Gill, D., Sallis, H. et al. The UK BiLEVE and Mendelian randomisation: using multivariable instrumental variables to address “damned if you, damned if you don’t” adjustment problems. BMC Res Notes 16, 157 (2023).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: