Axillary and rectal thermometry in the newborn: do they agree?
© Charafeddine et al.; licensee BioMed Central Ltd. 2014
Received: 22 August 2014
Accepted: 29 August 2014
Published: 31 August 2014
Accurate measurement of body temperature is critical for the assessment of a newborn’s general well-being. In nursery settings, the gold standard rectal thermometry has been replaced by the axillary method. However, evidence pertaining to the agreement between axillary and rectal thermometry in the newborn is controversial. In this cross-sectional study, the agreement between axillary and rectal temperature in newborns, as well as the effects of neonatal, maternal and environmental factors on this agreement were investigated.
The mean difference between axillary and rectal temperatures was compared in stable term and preterm newborns using paired t-test for the means of differences, Pearson correlation coefficient (r), and the Bland-Altman plot. Stepwise multivariate regression assessed predictors of this difference in the overall group and by gestational age categories.
The study included 118 newborns with gestational ages ranging from 29 to 41 weeks, median birth weight of 2980 grams (IQR: 2321.3-3363.8). Axillary and rectal temperatures correlated significantly (r = 0.5, p = 0.000) and had similar overall means but differed in 34–36 weeks gestation newborns (p = 0.01). Correlation between both methods increased with advancing gestational age being highest in term newborns (r = 0.6, p = 0.000). Bland-Altman plots revealed good agreement in gestational ages above 29 weeks. The difference between measurements increased with Cesarean delivery (ß = 0.2; 95% CI: 0.02, 0.38), but decreased with advancing chronological age (ß = -0.01; 95% CI: -0.02,-0.01), and with gestational age (ß = -0.05; 95% CI: -0.08,-0.01).
In clinically stable term and preterm infants, axillary thermometry is as reliable as rectal measurement. Predictors of agreement between the two methods include gestational age, chronological age and mode of delivery. Further studies are needed to confirm this agreement in sick newborns and in extremely premature infants.
KeywordsAxillary temperature Diagnostic accuracy Newborn Rectal temperature Thermometry methods
Body temperature is an essential vital sign that reflects the wellbeing of a newborn. Temperature variation can be an indication of maladaptation to the external environment, as well as a sign of serious illness. Hence, accurate measurement of a newborn’s body temperature is critical for early detection of serious conditions, and for appropriate and timely intervention or treatment. Rectal thermometry, the gold standard method of temperature measurement is more invasive than skin or tympanic thermometry , and has therefore been replaced by the less invasive axillary method in nursery settings, including neonatal intensive care units (NICU) [1–3]. However, evidence pertaining to the agreement between axillary and rectal temperature measurements in the newborn is controversial, with conflicting results regarding the accuracy and precision of axillary temperature.
A systematic search of the literature for studies comparing axillary and rectal thermometry in the newborn reveals one systematic review  that identified two studies in neonates with opposite results and significant heterogeneity. Subsequently, seven original studies were published revealing controversial results [3, 5–10]. In four studies [5–7, 9] there was poor agreement between rectal and axillary measurements using the Bland-Altman method , whereas two studies reported good correlation between axillary and rectal temperature measurements [3, 10]. In the seventh study , skin temperature measured from the back correlated with rectal measurement better than skin temperature obtained from the abdomen. The level of agreement between the two methods was reported only by Friedrichs, et al. .
In view of the existing controversy, we conducted this study that aims at assessing the agreement between axillary and rectal thermometry in term and preterm neonates of different gestational ages, as well as identifying the neonatal, maternal or environmental factors that may affect this agreement.
This was an observational, cross-sectional study conducted in the Normal Nursery and Neonatal Intensive Care Unit (NICU) of the American University of Beirut Medical Center (AUBMC), Lebanon. Between December 2012 and July 2013, all newborns who were admitted to the Normal Nursery or NICU were screened for inclusion in the study. Neonates whose age was less than six hours were excluded, as well as those who suffered from any of the following conditions: critical clinical status, necrotizing enterocolitis, disseminated intravascular coagulation, bleeding disorders or thrombocytopenia, immunodeficiency, intraventricular hemorrhage, congenital anomalies, therapeutic-induced hypothermia, neurologic disorders, and rectal pathology such as rectal injury, imperforate anus, or rectal surgery.
Neonates satisfying the inclusion criteria were subjected to axillary and rectal temperature measurements after obtaining parental written informed consent. For each neonate, one paired temperature recording was performed in the same sequence by the same investigator (MN): one axillary temperature reading (less invasive method) followed immediately by one rectal temperature reading (more invasive method), using the same digital thermometer Welch Allyn® Sure Temp® Plus Model 690 (Welch Allyn, Inc., San Diego, California), according to the manufacturer’s instructions for proper device use. Rectal measurements were obtained by gentle insertion of the rectal probe two centimeters into the rectum.
For each neonate the following data were collected: gestational age, chronological age, gender, birth weight, birth length, head circumference, mode of delivery, mode of maternal anesthesia, resuscitation at delivery and type of resuscitation; admission status and type of placement (crib, normal humidity isolette, high humidity isolette, warmer). For newborns who were admitted to NICU, we also recorded the Newborn’s Clinical Risk Index for Babies (CRIB) Score  along with the initial and current diagnosis.
Neonates were divided into four categories according to their gestational age: term (≥37 weeks), late preterm (340/7 to 366/7 weeks), early preterm (>28 to < 34 weeks), and very small preterm (≤28 weeks). Our primary outcome was the mean difference between axillary and rectal measurements. Sample size calculation was carried for the entire cohort taking into consideration the minimum number of subjects to be recruited from the subgroup of very small preterm newborns while maintaining at least 80% power, since the number of preterm infants born at or below 28 weeks of gestation is small compared to the other gestational age categories. Considering a desirable mean maximum difference between axillary and rectal temperature of <0.3°C, and a mean difference in standard deviation (SD) of <0.5°C (the quoted accuracy of most mercury-in-glass thermometers) [7, 9], the sample size needed to detect a difference of 0.3°C, with SD of 0.5°C, α of 0.05, and power of at least 80% was 24 newborns in each gestational age category.
We used paired t-test to compare the means (SD) of differences between axillary and rectal measurements, and Pearson correlation coefficient (r) to investigate the correlation between the two methods. Analysis was done separately for term, late and early preterm newborns. No such analysis was conducted for the very small preterm neonates below 29 weeks of gestation since none met our inclusion criteria during the study conduct. To assess predictors of the difference between the axillary and rectal temperatures, we carried out stepwise multivariate regression analyses, with the outcome being the difference in temperature between the two methods, and the independent variables being those that showed significance at the bivariate association, as well as variables of clinical importance (age, gender, gestational age, birth weight, birth length, mode of delivery, maternal anesthesia, and delivery room resuscitation). To build the model, the entry level of significance was set at 0.1 and the level of retaining variables in the model was set at 0.2.
The degree of agreement between axillary and rectal measurements was assessed using the Bland-Altman plot, which is a scatterplot of the difference of the two measurements against the mean of the two measurements . The plot generates three horizontal reference lines that are superimposed on the scatterplot: one line represents the average difference between the measurements, along with 2 lines that mark the standard deviation of the differences (±2SD). If the two temperature measurement methods are comparable, then differences should be small, with the mean of the differences close to 0, and with no systematic variation with the mean of the two measurements. The Statistical Package for Social Sciences (SPSS, version 21) was used for data management and analyses. A p-value of <0.05 was considered statistically significant.
≥ 37 wks
(N = 25)
(N = 30)
(N = 63)
(N = 118)
Male gender, N (%)
Cesarean delivery, N (%)
Weight-for-date, N (%)
Nursery setting, N (%)
Maternal anesthesia, N (%)
Placement, N (%)
Resuscitation at birth, N (%)
Age (Days), Median
Birth weight (Grams), Median
Birth length, Median
Birth head circumference, Median
Crib score, Range
Axillary temperature (°C), Mean ± SD
Rectal temperature (°C), Mean ± SD
Comparison of axillary and rectal temperatures
r (p value)
Linear regression model * for predicting the difference between axillary and rectal temperature
95% CI for B
In this study of clinically stable term and preterm newborns, axillary temperature was in good agreement with rectal temperature measurements. Moreover, there was significant correlation between the two methods, but this correlation was best observed in term newborns. The difference between the axillary and rectal measurements increased with Cesarean delivery but decreased with advancing gestational age and with increasing chronological age.
The main strength of our study is its inclusion of sufficient number of neonates in each of the gestational age categories to allow separate group analysis while still maintaining 80-90% power. Moreover, it is the first study to show that with advancing neonatal maturity and chronological age, there is a higher degree of agreement between axillary and rectal methods, and that Cesarean delivery may reduce this agreement thus decreasing the accuracy of the axillary method. However, since our subjects were clinically stable term and preterm neonates, our findings may not be generalizable to all newborns. Inference to sick neonates or preterm infants born at less than 29 weeks gestation is limited in view of lack of similar infants in our cohort.
Our findings agree with those of Falzon et al.  who reported a significant correlation between axillary and rectal temperature (r = 0.73, p < 0.0001) but differ with respect to the degree of agreement between the two methods. Whereas we found good agreement, Falzon et al. had poor agreement between the two types of measurements with 95% of axillary measurements falling within 2.5-3°C range around respective paired rectal measurements, using the Bland-Altman method. Additionally, axillary temperatures were consistently lower than rectal ones, with a mean (SD) difference of 0.38(0.76)°C and wide variability. To note, this study included children from birth to 4 years of age but did not provide specific information relating to the subgroup of neonates.
In a larger study that included 282 NICU infants born between 24 and 43 weeks gestation, Helder et al.  investigated the correlation between digital rectal and probe skin temperature, measured over the back and the abdomen. Skin temperature measured over the back had a stronger correlation (r = 0.77) and better agreement with digital rectal thermometry than abdominal skin temperature (r = 0.56). In contrast to our study, Helder et al. found that the correlation between skin and rectal measurements were best for infants with the lowest birth weight (<1000grams; r = 0.9; p < 0.001 for back skin temperature) in the first days of life , findings that are in an opposite direction to ours. This difference in results may be due to the fact that both studies measured skin temperature using different methods (probe versus digital) and at different sites (back/abdomen versus axilla). In the study of Friedrichs et al. , temperature obtained from the left axilla had higher correlation with rectal measurements as compared to that of the right axilla. Our results also contradict those of Hissink et al. , Hutton et al. , and Lee et al. . Comparing the agreement between axillary and rectal thermometry using the Bland-Altman method, all above studies reported significant differences in healthy and sick term and preterm neonates (range: 25–42 weeks gestation), including those in NICU settings. Moreover, Hissink et al. found that axillary temperature was lower than rectal ones, and that increasing postnatal age increased the difference between the two measurements .
This study provides reassuring evidence regarding the accuracy of axillary thermometry in nursery settings. In newborns at or above 29 weeks gestation that are clinically stable, axillary thermometry is a reliable method for assessing the general well-being of the newborn, therefore guiding decision-making. However, further studies are needed to confirm its accuracy in sick newborns who are clinically unstable and in very small preterm infants less than 29 weeks of gestation.
This study was approved by the Institutional Review Board of the American University of Beirut.
We are grateful for Miss Soumayya Ayyash for her help in data collection and entry.
This study was supported by a grant from the American University of Beirut Research Board.
- National Association of Neonatal Nurses (NANN): Neonatal thermoregulation guidelines for practice. 1997, Glenview, IL: NANNGoogle Scholar
- Haddock BJ, Merrow DL, Swanson MS: The falling grace of axillary temperatures. Pediatr Nurs. 1996, 22: 121-125.PubMedGoogle Scholar
- Uslu S, Ozdemir H, Bulbul A, Comert S, Bolat F, Can E, Nuhoglu A: A comparison of different methods of temperature measurements in sick newborns. J Trop Pediatr. 2011, 57 (6): 418-423. 10.1093/tropej/fmq120.PubMedView ArticleGoogle Scholar
- Craig JV, Lancaster GA, Williamson PR, Smyth RL: Temperature measured at the axilla compared with rectum in children and young people: systematic review. BMJ. 2000, 320: 1174-1178. 10.1136/bmj.320.7243.1174.PubMedPubMed CentralView ArticleGoogle Scholar
- Falzon A, Grech V, Caruana B, Margo A, Attard-Montalto S: How reliable is axillary temperature measurement?. Acta Paediatr. 2003, 92: 309-313. 10.1111/j.1651-2227.2003.tb00551.x.PubMedView ArticleGoogle Scholar
- Hissink Muller PC, van Berkel LH, de Beaufort AJ: Axillary and rectal temperature measurements poorly agree in newborn infants. Neonatology. 2008, 94: 31-34. 10.1159/000112840.PubMedView ArticleGoogle Scholar
- Hutton S, Probst E, Kenyon C, Morse D, Friedman B, Arnold K, Helsley L: Accuracy of different temperature devices in the postpartum population. JOGNN. 2009, 38: 42-49. 10.1111/j.1552-6909.2008.00302.x.PubMedView ArticleGoogle Scholar
- Helder OK, Twisk JWR, van Goudoever JB, van Elburg RM: Skin and rectal temperature in newborns. Acta Paediatr. 2011, 101: e240-e242.View ArticleGoogle Scholar
- Lee G, Flannery-Bergey D, Randall-Rollins K, Curry D, Rowe S, Teague M, Tuininga C, Schroeder S: Accuracy of temporal artery thermometry in neonatal intensive care infants. Adv Neonatal Care. 2011, 11: 62-70. 10.1097/ANC.0b013e3182087d2b.PubMedView ArticleGoogle Scholar
- Friedrichs J, Staffileno BA, Fogg L, Jegier B, Hunter R, Portugal D, Saunders JK, Penner JL, Peashey JM: Axillary temperatures in full-term newborn infants. Using evidence to guide safe and effective practice. Adv Neonatal Care. 2013, 13: 361-368. 10.1097/ANC.0b013e3182a14f5a.PubMedView ArticleGoogle Scholar
- Bland JM, Altman DG: Statistical method for assessing agreement between two methods of clinical measurement. Lancet. 1986, 1 (8476): 307-310.PubMedView ArticleGoogle Scholar
- Cockburn F, Cooke RWI: The crib (clinical risk index for babies) score: a tool for assessing initial neonatal risk and comparing performance of neonatal intensive care units. Lancet. 1993, 342 (8865): 193-198. 10.1016/0140-6736(93)92296-6.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.