- Research note
- Open Access
Investigating psychometric properties of the Thai version of the Zarit Burden Interview using rasch model and confirmatory factor analysis
BMC Research Notes volume 13, Article number: 120 (2020)
- The Correction to this article has been published in BMC Research Notes 2020 13:227
The Zarit Burden Interview (ZBI) has been widely used to assess caregiver burden. Few research papers have investigated the Thai version of the ZBI. The study aimed to examine the psychometric properties of the Thai version of both the full length (ZBI-22) and short versions (ZBI-12) using Rasch analysis and confirmatory factor analysis among a sample of Alzheimer’s disease caregivers.
The ZBI-22, but not the ZBI-12, fitted the Rasch measurement model with respect to unidimensionality. Five items from ZBI-22 and two items from ZBI-12 were misfitting. Half of the ZBI items showed disordered categories or thresholds and were locally dependent. CFA revealed that three-factor and four-factor models fitted the data best for ZBI-22 and ZBI-12, respectively. Reliability was good for both forms of the ZBI (α = 0.86–0.92). Significant correlations were found with caregiver’s perceived stress, anxiety/depression, pain and mobility but not with self-care and usual activity (p > 0.05), indicating convergent and discriminant validity. To conclude, the Thai version of the ZBI-22, but not the ZBI-12, was supported as a reliable and unidimensional scale among Alzheimer’s disease caregivers. Some misfitting items of the ZBI undermined the unidimensionality of the scale and need revision.
Caregiver burden is a state in which the physical and psychological well-being, family relations or financial status of the caregiver may be threatened by providing the necessary care to another person. Caregiving, especially for elderly people with dementia, usually causes burden among caregivers [2,3,4,5]. Studies have shown that burden can ultimately lead to depression and to poor treatment outcomes for patients.
Thailand is becoming an aging society, which will increase the number of dependent individuals and the likelihood that households need caregivers, so caregiver burden deserves attention. Studies have shown that 40 to 70% of caregivers perceive a burden [9, 10].
One of the oldest and most common measures of dementia caregiver burden is the Zarit Burden Interview (ZBI) [11, 12]. It measures multidimensional aspects including physical, emotional, financial and social burden and the relationship with the care receiver. It originated as a 29-item questionnaire but has since been translated into many languages and revised to a 22-item form (ZBI-22). Shorter forms of the ZBI, ranging from 1 to 14 items, have been developed by researchers over the past decades. However, the most widely used is the ZBI-12, introduced by Bédard et al. The ZBI-12 has shown good psychometric properties in various languages and cultures [16,17,18,19,20,21,22,23,24,25]. Regarding its factor structure, the number of ZBI dimensions ranges from 2 to 5 [15, 20]. Due to its multidimensional nature, burden may not be accurately captured by a global score [26,27,28]. The ZBI has demonstrated high correlations with other psychological tools [17, 23,24,25, 29]. For the ZBI-12, factor analysis revealed a two-factor rather than a unidimensional model, despite the scale being shorter. The correlation between the ZBI-12 and ZBI-22 was 0.96 in the initial study. To capture a global score of burden, a unidimensional ZBI was developed using item response theory (IRT), yielding a different set of items for the short scale compared with the former 12-item ZBI.
To our knowledge, only one study has examined the Thai version of the ZBI-12 using factor analysis. The psychometric properties of the Thai ZBI have never been tested among Thai dementia caregivers, nor with IRT or the Rasch measurement model. Therefore, the present study aimed to examine the ZBI construct by means of convergent, discriminant and concurrent validity, using both Rasch analysis and confirmatory factor analysis (CFA).
One hundred and two caregivers of patients with Alzheimer’s disease, who were diagnosed and treated by neurologists at Maharaj Nakorn Chiang Mai Hospital, participated in the study. Primary caregivers aged 18 years or older who had been providing care for at least 1 month were recruited. The exclusion criterion was inability to communicate due to either a language barrier or a severe mental health problem.
Data were collected at an outpatient clinic through structured interviews by one physician (MP) who had no role in patient care planning. All gave written informed consent before completing the questionnaires. The questionnaires included sociodemographic data, records related to caregiving and specific measures, which were ZBI, Perceived stress scale (PSS), Patient Health Questionnaire (PHQ-9), and EQ-5D.
The ZBI is a caregiver-reported questionnaire measuring the burden the respondent feels in providing care to the patient. Currently, it has two widely used forms, ZBI-22 and ZBI-12, with a Likert scoring scale between 0 (never) and 4 (nearly always) [15, 32]. Studies showed high correlation in both ZBI-22 and ZBI-12 with the Caregiver Activity Survey, and with other tools [25, 33].
The Thai (translated) version of the ZBI was used in this study with permission from Professor Zarit and Mapi Research Trust. In the study sample, Cronbach’s alpha was 0.921 for the ZBI-22 and 0.865 for the ZBI-12.
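As an illustration of the reliability coefficient reported for each instrument, the following is a minimal sketch of Cronbach’s alpha computed from a respondents-by-items score matrix. The function name `cronbach_alpha` and the demonstration responses are hypothetical and are not study data; the study itself reports values obtained with SPSS.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])  # number of items
    # Sample variance of each item across respondents
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of the total (summed) score
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical 0-4 responses from five caregivers on four items (not study data)
demo_scores = [
    [0, 1, 0, 1],
    [2, 2, 1, 2],
    [1, 1, 1, 0],
    [3, 4, 3, 3],
    [2, 3, 2, 2],
]
demo_alpha = cronbach_alpha(demo_scores)
```

Alpha approaches 1 when items rise and fall together across respondents, which is why redundant or locally dependent items can inflate it.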
The PSS is a self-reporting, 10-item questionnaire measuring the extent to which individuals perceive stress. The 5-response Likert scale ranges from 0 (not at all) to 4 (the most). The Thai version of the PSS showed a Cronbach’s alpha of 0.85. It correlated with other measures including the State Trait Anxiety Inventory, but negatively correlated with the Rosenberg Self-Esteem Scale. The study sample showed a Cronbach’s alpha of 0.850.
The PHQ-9 is a self-reporting, 9-item questionnaire measuring the extent to which an individual feels bothered by depressive symptoms over the past 2 weeks. The 4-response Likert scale ranges from 0 (not at all) to 3 (nearly every day). The Thai version of the PHQ-9 showed a Cronbach’s alpha of 0.79 and a positive association with the HAM-D. The study sample showed a Cronbach’s alpha of 0.849.
The EQ-5D is a self-reporting questionnaire measuring health-related quality of life. It comprises 5 items assessing 5 domains of health state: mobility, self-care, usual activities, pain and anxiety/depression, with a 5-response scale ranging from 1 (no problem) to 5 (severe problem). All 5 domains were combined into an index score with a maximum of 1.000. An intraclass correlation coefficient of 0.987 for the EQ-5D index score and a significant correlation with the WHOQOL-BREF have been noted. In the study sample, Cronbach’s alpha was 0.723.
Sociodemographic data were analyzed using descriptive statistics. Pearson’s or Spearman’s rank correlation was used for correlational analysis. Because the same items appear in both forms of the ZBI, the correlation between them overestimates the “true” correlation, so a corrected correlation between the two forms was also calculated.
Based on measurement theory, a scale should demonstrate that all items contribute to the same construct and that its response steps increase monotonically. These properties can be examined with the Rasch model. The following approach was used for analysis.
We tested the ZBI against the EQ-5D subscales, hypothesizing that the ZBI should relate more to anxiety/depression than to mobility. We expected to find a low-to-moderate correlation between the ZBI and both the PSS and PHQ-9 to demonstrate concurrent validity.
The Rasch model belongs to the family of item-response latent trait models; it is a probabilistic logistic model in which the response to a particular item is influenced by the qualities of both the person and the item. More details can be found elsewhere. The partial credit Rasch model was used, with the following criteria. First, unidimensionality and local independence were evaluated: (a) the first principal component of the residuals (or first contrast) should have an eigenvalue less than 2, (b) the disattenuated correlation should be > 0.7 and (c) item fit statistics (INFIT and OUTFIT mean-square), indicating the consistency of each item with the other items, should be between 0.70 and 1.30. To evaluate local independence, a standardized residual correlation should be less than 0.3. Second, response categories should function properly, with ordered categories and thresholds, as expected for measurement. Third, a reliability coefficient of 0.80 or higher and of 0.90 or higher was considered acceptable for person reliability and item reliability, respectively.
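To make the partial credit model concrete, the sketch below computes the category probabilities for a single polytomous item given a person measure and the item’s step thresholds, both in logits. It follows the standard formulation of the model (Masters, 1982); the function names and threshold values are hypothetical, and this is not the estimation routine used by Winsteps.

```python
import math


def pcm_probabilities(theta, thresholds):
    """Probability of each category 0..m for one item under the partial credit model."""
    # Cumulative sums of (theta - delta_k); category 0 has an empty sum of 0.
    cumulative = [0.0]
    for delta in thresholds:
        cumulative.append(cumulative[-1] + (theta - delta))
    peak = max(cumulative)  # subtract the maximum for numerical stability
    weights = [math.exp(c - peak) for c in cumulative]
    total = sum(weights)
    return [w / total for w in weights]


def expected_score(theta, thresholds):
    """Model-expected item score for a person located at theta."""
    probs = pcm_probabilities(theta, thresholds)
    return sum(category * p for category, p in enumerate(probs))


# Hypothetical, ordered thresholds for a five-category (0-4) item
demo_thresholds = [-1.5, -0.5, 0.5, 1.5]
demo_probs = pcm_probabilities(0.0, demo_thresholds)
```

When thresholds are ordered, as in this hypothetical item, each category is the most probable response somewhere along the burden continuum; disordered thresholds mean that at least one category is never the most likely response.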
To test how well the data were modeled by the unidimensional construct, CFA was performed for both ZBI-22 and ZBI-12. The Weighted Least Square Mean and Variance corrected method of estimation was used because of the nonnormality and ordinal nature of the items. Model fit was assessed using Chi square (p > 0.05), the comparative fit index and the Tucker Lewis Index, where values of 0.95 or higher are preferable. A root mean square error of approximation value < 0.08 was indicative of an acceptable model fit.
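The fit indices named above can be written as simple functions of the model and baseline Chi square statistics. The sketch below uses the common textbook formulas; it is an illustration only, since Mplus applies scaled Chi square statistics under WLSMV and some software uses N rather than N − 1 in the RMSEA, so these functions will not necessarily reproduce its output. All input values are hypothetical, apart from the sample size of 102.

```python
import math


def rmsea(chi2, df, n):
    """Root mean square error of approximation (N - 1 variant)."""
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))


def cfi(chi2_model, df_model, chi2_base, df_base):
    """Comparative fit index from model and baseline (null) model statistics."""
    d_model = max(chi2_model - df_model, 0.0)
    d_base = max(chi2_base - df_base, d_model)
    return 1.0 if d_base == 0 else 1.0 - d_model / d_base


def tli(chi2_model, df_model, chi2_base, df_base):
    """Tucker Lewis Index from model and baseline (null) model statistics."""
    ratio_base = chi2_base / df_base
    ratio_model = chi2_model / df_model
    return (ratio_base - ratio_model) / (ratio_base - 1.0)


# Hypothetical Chi square values; n = 102 matches the study sample size
demo_rmsea = rmsea(chi2=100.0, df=50, n=102)
demo_cfi = cfi(100.0, 50, 1066.0, 66)
demo_tli = tli(100.0, 50, 1066.0, 66)
acceptable = demo_rmsea < 0.08 and demo_cfi >= 0.95 and demo_tli >= 0.95
```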
For CFA, Mplus, Version 8.4 was used (Muthén and Muthén 2015). Rasch analysis used Winsteps, Version 4.4.8 (Beaverton, Oregon: Winsteps.com). All other analyses were performed using IBM SPSS, Version 22 (SPSS Inc., Chicago, IL, USA).
The average age of the caregiver sample was 55 years (SD = 12.9); most were women (77.5%). According to ZBI level, the sample was reported to have low burden. The quality of life index score was quite high on average, while perceived stress and depressive symptoms were low (Table 1).
Regarding the distribution of the ZBI items, some had unacceptable kurtosis (> ±3), attributable to the high frequency of zero responses on these items (Additional file 1: Table S1).
Correlation analysis showed that ZBI-22 had a coefficient of 0.855 (p < 0.01) with ZBI-12 for the uncorrected correlation, and 0.784 (p < 0.01) for the corrected correlation. Both ZBI-22 and ZBI-12 were significantly related to PHQ-9, PSS, the EQ-5D index score, and the mobility, pain and anxiety/depression subscales, but not to self-care and usual activity, indicating convergent and discriminant validity (Table 2).
Rasch analysis results showed that the unexplained variance in the first contrast yielded eigenvalues of 2.52 and 3.03, implying a possible second dimension. However, because the disattenuated correlation between person measures was > 0.7, the second dimension could be noise for ZBI-22. Five items of ZBI-22 and two items of ZBI-12 were misfitting. Five pairs of items from ZBI-22 and three pairs of items from ZBI-12 had standardized residual correlations above 0.2, indicating item dependency in both forms of the ZBI. For category function, 33 to 50% of items were found to have disordered categories or thresholds (Table 3). For this reason, the five original rating categories were combined in different ways until the criteria were best met. This was obtained by rescaling as follows: 0 = 0; 1 = 1; 2 = 2; 3 = 3 and 4 = 3 for ZBI-22, and 0 = 0; 1 = 1; 2 = 2; 3 = 2 and 4 = 3 for ZBI-12. After rescaling, the data fit the Rasch model better, as the number of misfitting items decreased while reliability increased. All reliability values were in an acceptable range.
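The rescaling described above amounts to a simple recoding of each response before re-running the analysis. The two mappings in the following sketch are taken directly from the text; the function name `collapse_categories` and the demonstration responses are hypothetical.

```python
# Category mappings as reported for the rescaled analyses
ZBI22_RECODE = {0: 0, 1: 1, 2: 2, 3: 3, 4: 3}  # categories 3 and 4 merged
ZBI12_RECODE = {0: 0, 1: 1, 2: 2, 3: 2, 4: 3}  # categories 2 and 3 merged


def collapse_categories(responses, recode):
    """Return one respondent's item responses recoded with the given mapping."""
    return [recode[r] for r in responses]


# Hypothetical responses covering every original category (not study data)
demo_responses = [0, 1, 2, 3, 4]
demo_zbi22 = collapse_categories(demo_responses, ZBI22_RECODE)
demo_zbi12 = collapse_categories(demo_responses, ZBI12_RECODE)
```

Both mappings reduce the five original categories to four, but they merge different neighbouring categories, so rescaled scores from the two forms are not on the same raw metric.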
The CFA showed that the unidimensional model did not fit the data for either version of the ZBI. A three-factor model provided the best fit statistics for ZBI-22, while a four-factor model with correlated error terms for items 11 and 12 provided the best fit statistics for ZBI-12 (Additional file 2: Table S2).
The present study aimed to evaluate the psychometric properties of the Thai version of the ZBI among caregivers of patients with Alzheimer’s disease. Consistent with related studies, neither ZBI-22 nor ZBI-12 demonstrated a unidimensional scale in the CFA [49, 50], even though the ZBI-22 seemed to be favored over ZBI-12. Three-factor and four-factor models fitted the data best for ZBI-22 and ZBI-12, respectively. However, the disattenuated correlation (> 0.70) suggested that ZBI-22, but not ZBI-12, could be sufficiently unidimensional.
The pairs of error variances that CFA suggested should be correlated corresponded to the locally dependent items identified by Rasch analysis. This was consistent with Ballesteros et al.’s study in that both items, “should do more” and “could do a better job caring”, were excluded from the new 12-item ZBI. In addition to these two items, more pairs were shown to be locally dependent. Violations of local independence in a unidimensional scale can lead to inflated estimates of reliability, providing a false impression of the accuracy and precision of estimates.
Disordered categories and thresholds indicated that respondents had difficulty discriminating between response categories given their level of caregiver burden. In ZBI-22, the response categories were collapsed from five to four by merging categories 3 (quite frequently) and 4 (nearly always). Oddly, for ZBI-12, merging categories 2 (sometimes) and 3 (quite frequently) yielded better results. It remains unclear why the participants responded differently to the same items in the two scales.
Two suggestions follow from our findings if interpretation of mean scores or of changes in total scores is to be meaningful. The first is to revise or remove the locally dependent and misfitting items 11 and 12 of ZBI-12 to make it a better unidimensional scale. The second is to select the fitting items with ordered categories and thresholds from ZBI-22 to form a new short ZBI.
Taken together, the Thai version of ZBI-12 may not be regarded as a unidimensional, interval rating scale of burden among caregivers of patients with Alzheimer’s disease. The ZBI-22 showed sufficient unidimensionality. Some items should be removed if ZBI-12 is to be used.
Limitations and future study
Clinicians should interpret the results in light of the limited sample size. Replication in a larger sample is encouraged. Test–retest reliability, sensitivity to change and equivalence testing in different populations and cultures are warranted.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
ZBI: Zarit Burden Interview
CFA: Confirmatory factor analysis
PCA: Principal component analysis
RMSEA: Root mean square error of approximation
CFI: Comparative fit index
TLI: Tucker Lewis Index
WLSMV: Weighted Least Square Mean and Variance Corrected
Sherman CW, Burgio LD, Kowalkowski JD. Chapter 10—Assessment of Dementia Family Caregivers. In: Lichtenberg PA, editor. Handbook of Assessment in Clinical Gerontology (Second Edition). San Diego: Academic Press; 2008. p. 243–71.
Alfakhri AS, Alshudukhi AW, Alqahtani AA, Alhumaid AM, Alhathlol OA, Almojali AI, Alotaibi MA, Alaqeel MK. Depression among caregivers of patients with Dementia. Inquiry. 2018;55:46958017750432.
Schulz R, McGinnis KA, Zhang S, Martire LM, Hebert RS, Beach SR, Zdaniuk B, Czaja SJ, Belle SH. Dementia patient suffering and caregiver depression. Alzheimer Dis Assoc Disord. 2008;22:170–6.
Covinsky KE, Newcomer R, Fox P, Wood J, Sands L, Dane K, Yaffe K. Patient and caregiver characteristics associated with depression in caregivers of patients with dementia. J Gen Intern Med. 2003;18:1006–14.
Ostojic D, Vidovic D, Bacekovic A, Brecic P, Jukic V. Prevalence of anxiety and depression in caregivers of Alzeheimer’s dementia patients. Acta Clin Croat. 2014;53:17–21.
del-Pino-Casado R, Rodríguez Cardosa M, López-Martínez C, Orgeta V. The association between subjective caregiver burden and depressive symptoms in carers of older relatives: a systematic review and meta-analysis. PLoS ONE. 2019;14:e0217648.
Kuzuya M, Enoki H, Hasegawa J, Izawa S, Hirakawa Y, Shimokata H, Akihisa I. Impact of caregiver burden on adverse health outcomes in community-dwelling dependent older care recipients. Am J Geriatr Psychiatry. 2011;19:382–91.
Phetsitong R, Vapattanawong P, Sunpuwan M, Volker M. State of household need for caregivers and determinants of psychological burden among caregivers of older people in Thailand: an analysis from national surveys on older persons. PLoS ONE. 2019;14:e0226330.
Chaobankrang C, Anothaisintawee T, Kittichai K, Boongird C. Predictors of depression among thai family caregivers of Dementia patients in primary care. Int J Gerontol Geriatr Res. 2019;3:007–13.
Sittironnarit G. Quality of life and subjective burden of primary dementia caregivers in Bangkok, Thailand. Asian J Psychiatry. 2020;48:101913.
Zarit SH, Todd PA, Zarit JM. Subjective burden of husbands and wives as caregivers: a longitudinal study. Gerontologist. 1986;26:260–6.
Zarit SH. Diagnosis and management of caregiver burden in dementia. Handb Clin Neurol. 2008;89:101–6.
Zarit Burden Interview (ZBI) http://mapi-trust.org/.
Liu ZW, Zhou W, Chen XC, Zhang XY, Hu M, Xiao SY. Assessment of burden among family caregivers of schizophrenia: psychometric testing for short-form zarit burden interviews. Front Psychol. 2018;9:2539.
Bédard M, Molloy DW, Squire L, Dubois S, Lever JA, O’Donnell M. The Zarit Burden Interview: a new short version and screening version. Gerontologist. 2001;41:652–7.
Gratão ACM, Brigola AG, Ottaviani AC, Luchesi BM, Souza É, Rossetti ES, de Oliveira NA, Terassi M, Pavarini SCI. Brief version of Zarit Burden Interview (ZBI) for burden assessment in older caregivers. Dement Neuropsychol. 2019;13:122–9.
Chattat R, Cortesi V, Izzicupo F, Del Re ML, Sgarbi C, Fabbo A, Bergonzini E. The Italian version of the Zarit Burden Interview: a validation study. Int Psychogeriatr. 2011;23:797–805.
Goncalves-Pereira M, Zarit SH. The Zarit Burden Interview in Portugal: validity and recommendations in dementia and palliative care. Acta Med Port. 2014;27:163–5.
Iecovich E. Psychometric properties of the Hebrew version of the Zarit Caregiver Burden scale short version. Aging Ment Health. 2012;16:254–63.
Lu L, Wang L, Yang X, Feng Q. Zarit Caregiver Burden interview: development, reliability and validity of the Chinese version. Psychiatry Clin Neurosci. 2009;63:730–4.
Ojifinni OO, Uchendu OC. Validation and reliability of the 12-item Zarit Burden Interview among informal caregivers of elderly persons in Nigeria. Arch Basic Appl Med. 2018;6:45–9.
Ozer N, Yurttaş A, Akyil R. Psychometric evaluation of the Turkish version of the Zarit Burden Interview in family caregivers of inpatients in medical and surgical clinics. J Transcult Nurs. 2012;23:65–71.
Seng BK, Luo N, Ng WY, Lim J, Chionh HL, Goh J, Yap P. Validity and reliability of the Zarit Burden Interview in assessing caregiving burden. Ann Acad Med Singapore. 2010;39:758–63.
Tang JY, Ho AH, Luo H, Wong GH, Lau BH, Lum TY, Cheung KS. Validating a Cantonese short version of the Zarit Burden Interview (CZBI-Short) for dementia caregivers. Aging Ment Health. 2016;20:996–1001.
Wang G, Cheng Q, Wang Y, Deng YL, Ren RJ, Xu W, Zeng J, Bai L, Chen SD. The metric properties of Zarit caregiver burden scale: validation study of a Chinese version. Alzheimer Dis Assoc Disord. 2008;22:321–6.
Springate BA, Tremont G. Dimensions of caregiver burden in dementia: impact of demographic, mood, and care recipient variables. Am J Geriatr Psychiatry. 2014;22:294–300.
Branger C, O’Connell ME, Morgan DG. Factor analysis of the 12-item zarit burden interview in caregivers of persons diagnosed with Dementia. J Appl Gerontol. 2016;35:489–507.
Cheng S-T. Dementia caregiver burden: a research update and critical analysis. Curr Psychiatry Rep. 2017;19:64.
Lin C-Y, Wang J-D, Pai M-C, Ku L-JE. Measuring burden in dementia caregivers: confirmatory factor analysis for short forms of the Zarit Burden Interview. Arch Gerontol Geriatr. 2017;68:8–13.
Ballesteros J, Santos B, González-Fraile E, Muñoz-Hermoso P, Domínguez-Panchón AI, Martín-Carrasco M. Unidimensional 12-Item Zarit Caregiver Burden Interview for the assessment of dementia caregivers’ burden obtained by item response theory. Value Health. 2012;15:1141–7.
Silpakit O, Silpakit C, Chomchuen R. Psychometric study of the Thai version of Zarit Burden Interview in psychiatric caregivers. J Ment Health Thai. 2015;23:12–24.
Zarit SH, Reever KE, Bach-Peterson J. Relatives of the impaired elderly: correlates of feelings of burden. Gerontologist. 1980;20:649–55.
Ko K-T, Yip P-K, Liu S-I, Huang C-R. Chinese version of the Zarit Caregiver Burden Interview: a validation study. Am J Geriatr Psychiatry. 2008;16:513–8.
Cohen S, Kamarck T, Mermelstein R. A global measure of perceived stress. J Health Soc Behav. 1983;24:385–96.
Wongpakaran N, Wongpakaran T. The Thai version of the PSS-10: an investigation of its psychometric properties. Biopsychosoc Med. 2010;4:6.
Kroenke K, Spitzer RL, Williams JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. 2001;16:606–13.
Lotrakul M, Sumrithe S, Saipanish R. Reliability and validity of the Thai version of the PHQ-9. BMC Psychiatry. 2008;8:46.
The EuroQol Group. EuroQol–a new facility for the measurement of health-related quality of life. Health Policy. 1990;16:199–208.
Tongsiri S, Cairns J. Estimating population-based values for EQ-5D health states in Thailand. Value Health. 2011;14:1142–5.
Li L, Liu C, Cai X, Yu H, Zeng X, Sui M, Zheng E, Li Y, Xu J, Zhou J, Huang W. Validity and reliability of the EQ-5D-5L in family caregivers of leukemia patients. BMC Cancer. 2019;19:522.
Levy P. The correction for spurious correlation in the evaluation of short-form tests. J Clin Psychol. 1967;23:84–6.
Tennant A, Conaghan PG. The Rasch measurement model in rheumatology: what is it and why use it? When should it be applied, and what should one look for in a Rasch paper? Arthritis Care Res. 2007;57:1358–62.
Masters GN. A Rasch model for partial credit scoring. Psychometrika. 1982;47:149–74.
Wright BD, Linacre JM. Reasonable mean-square fit values. Rasch Meas Transact. 1994;8:370–1.
Christensen KB, Makransky G, Horton M. Critical values for Yen’s Q(3): identification of local dependence in the rasch model using residual correlations. Appl Psychol Meas. 2017;41:178–94.
Andrich D. An expanded derivation of the threshold structure of the polytomous rasch model that dispels any “threshold disorder controversy”. Educ Psychol Measur. 2012;73:78–124.
Hu LT, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Model. 1999;6:1–55.
Browne MW, Cudeck R. Alternative ways of assessing model fit. Sociological Methods Res. 1992;21:230–58.
Landfeldt E, Mayhew A, Straub V, Bushby K, Lochmüller H, Lindgren P. Psychometric properties of the Zarit Caregiver Burden Interview administered to caregivers to patients with Duchenne muscular dystrophy: a Rasch analysis. Disabil Rehabil. 2019;41:966–73.
Siegert RJ, Jackson DM, Tennant A, Turner-Stokes L. Factor analysis and Rasch analysis of the Zarit Burden Interview for acquired brain injury carer research. J Rehabil Med. 2010;42:302–9.
Marais I, Andrich D. Effects of varying magnitude and patterns of response dependence in the unidimensional Rasch model. J Appl Meas. 2008;9:105–24.
The authors wish to thank all the patients and caregivers for their participation and encouragement for us to do this research.
This research was supported by the Research Medical Fund, Grant Number 066-2561, Faculty of Medicine, Chiang Mai University.
Ethics approval and consent to participate
This study was approved by the research ethics committee of the Faculty of Medicine, Chiang Mai University (PSY-2560-05110). All patients provided written informed consent to the study.
Consent for publication
Consent for publication is not applicable.
The authors declare that they have no competing interests.
Pinyopornpanish, K., Pinyopornpanish, M., Wongpakaran, N. et al. Investigating psychometric properties of the Thai version of the Zarit Burden Interview using rasch model and confirmatory factor analysis. BMC Res Notes 13, 120 (2020). https://doi.org/10.1186/s13104-020-04967-w
- Zarit Burden Interview (ZBI)
- Alzheimer’s disease
- Rasch analysis
- Perceived stress
- Psychometric property