Factors associated with genital human papillomavirus infection among adult females in the United States, NHANES 2007–2010
BMC Research Notes volume 7, Article number: 544 (2014)
Patients with human papillomavirus (HPV) infection are at risk of developing cancer later in their life. Current research estimates the prevalence of genital HPV infection and explores the factors that are associated with the infection.
The National Health and Nutrition Examination Survey 2007–2010 was used in this research study. The study population included females in the United States aged 18–59 years. The weighted prevalence of HPV infection was 41.9%. An estimated 59.4% of non-Hispanic black females had HPV infection. In a multivariate analysis, number of sexual partners, race, age, education level, marital status, income, smoking, and insurance status were associated with HPV infection. HPV infection was 5.77 times more likely for women with >11 sexual partners compared to women with 0–1 partners. Non-Hispanic black females were 1.87 times more likely to have HPV infection compared to non-Hispanic white females. Participants with only a high school degree had a 58% increased prevalence compared to college-educated women. Uninsured women had a 39% increased prevalence compared to those with insurance.
This study found that 41.9% of U.S. females aged 18–59 years tested positive for genital HPV infection. We determined that individuals with more sexual partners, with a lower education level, with non-Hispanic black race, and with no insurance were the populations at greatest risk. It is necessary to continue monitoring the prevalence of this infection in the general population to provide a basis for effective treatment and prevention in the target populations.
Genital human papillomavirus (HPV) is the most common sexually transmitted infection in the United States. HPV includes a group of more than 150 related viruses. Many of these viruses can be easily spread through direct skin contact during sexual intercourse . More than 50% of individuals engaging in sexual activities are infected with at least one type of HPV in their lifetime. An estimated 42.5% of U.S. females aged 14–59 years had a genital HPV infection in 2003–2006 . Some patients with HPV infection can be restored back to health, while infection progresses to cancer in others .
More than 40 HPV types can infect the genital areas of men and women . These types can be classified as high-risk, probable high-risk, low-risk, and undetermined risk for the development of cervical cancer. Low-risk types (HPV 6, 11) are mostly associated with genital warts. High-risk types (HPV 16, 18) can contribute to precancerous lesions, low-grade cervical intraepithelial lesions and high-grade cervical intraepithelial lesions, as well as anogenital cancers . HPV infection is a major cause of cervical cancer , and 11,818 women in the U.S. were diagnosed with cervical cancer in 2010 .
The primary goals of our current research are to identify the potential factors associated with HPV infection and to estimate the prevalence of infection for U.S. females from 2007 to 2010. Eventually, we aim to help create programs targeting high-prevalence populations to prevent HPV infection and lower their risk of getting cervical cancer.
Methods and materials
Data was obtained from the National Health and Nutrition Examination Survey (NHANES) 2007–2010 . The survey data includes information about demographic and socioeconomic status, mental health, and dental health, as well as physiological and laboratory measurements. The survey interviews about 12,000 people biannually.
Survey design and population
The NHANES used a multistage probability sample design to select the participants. Consenting participants completed a household interview followed by a physical examination and interviews at a Mobile Examination Center (MEC). Non-Hispanic (NH) black and low income groups were oversampled in the NHANES to allow for an accurate statistical estimation in these population groups . The protocol was approved by the National Center for Health Statistics (NCHS) institutional review board. Further information regarding study design and methods for oversampling is available on the NHANES website .
From 2007 to 2010, 10,010 females of all ages were interviewed. The combined unweighted household interview response rate for that period was 78.9%; the examination response rate was 76.3%. All females aged 18–59 years (n = 4242) who visited the MEC were asked to self-collect a cervicovaginal swab sample. Out of all samples collected, 3738 (88.1%) were reported as positive or negative and were used in the final analysis. Combining the data for years 2007 through 2010 was justified. There was no significant difference in HPV prevalence between 2007–2008 and 2009–2010 for all but 5 of 37 HPV types, as detected by the Linear Array assay (data not shown).
Demographic and behavioral data
Demographic information, including gender, age, race, education, marital status, and country of birth, was obtained from all participants during the household interviews. The poverty index was calculated according to the U.S. Census definition. This method divides total family income by the poverty threshold after adjusting for family size at the time of the interview.
Sexual history information, including if the participant had ever had sex and the age of first sexual experience, was self-reported by participants using an audio computer-assisted self-interview. Respondents who reported having sex (described as vaginal, oral, or anal) were asked additional questions about their lifetime sexual history and about any sexual encounters in the prior 12 months. These additional questions addressed number of sexual partners, sexual orientation, and condom usage.
Specimen collection and processing, laboratory methods
As described by Dunne et al. , self-collected cervicovaginal swab samples were obtained from female participants aged 18–59 who had an examination in the MEC. Swabs were given to the NHANES personnel, stored at room temperature, and mailed within 1 week to the Centers for Disease Control and Prevention (CDC) laboratory. There, they were kept at 4°C and extracted within 1 month of collection.
Multiplex polymerase chain reaction (PCR) was used for detection of 37 HPV types within the Alphapapillomavirus genera. Samples were reported as HPV positive if any of the 37 HPV deoxyribonucleic acid (DNA) types were detected, including high-risk (16, 18, 26, 31, 33, 35, 39, 45, 51, 52, 53, 56, 58, 59, 66, 68, 73, 82) and low-risk (6, 11, 40, 42, 54, 55, 61, 62, 64, 67, 69, 70, 71, 72, 81, 82 subtype IS39, 83, 84, 89) types. If the strips were positive for any of the types, the sample was coded as positive. If the strips were negative for all of the types and beta-globin was detected, the sample was coded as negative. If there was no beta-globin present in the sample and no HPV type was detected, the sample was coded as inadequate [8, 11–14].
We estimated the overall prevalence of infection for any HPV type with respect to sociodemographic and sexual behavioral characteristics. Due to the complexity of the design, all estimates were measured using 4-year MEC sample weights provided by NCHS to account for the unequal probabilities of selection and adjustment for nonresponse. The weighting methodology has been described previously . Taylor series linearization was used to estimate variance in a complex cluster survey design . Confidence intervals (CIs) were calculated using a logit transformation with the standard error of the logit prevalence based on the delta method and applying SUDAAN estimated standard errors .
The Wald χ2 statistic was used to assess bivariate association between HPV and sociodemographic or behavioral characteristics. No adjustments were made to the p- values for multiple comparisons. An unconditional logistic regression model was used to determine associations between genital HPV infection and these factors. Variables for adjustment in multivariate logistic regression models were selected based on bivariate associations with p-values <0.2. Goodness of fit for the final step of the model was assessed using the Hosmer-Lemeshow Satterthwaite adjusted F test.
SAS software version 9.4 for Windows (SAS Institute Inc. Gary, North Carolina) and SAS callable SUDAAN 11.0.1  were used for the statistical analyses. Two-sided p-values less than 0.05 were considered statistically significant.
HPV types and species by oncogenic risk category are presented in Figure 1. In the NHANES 2007–2010, there were 3738 females aged 18 to 59 years with an HPV evaluation result in the final analysis. The 37 HPV types were grouped as high-risk (Figure 1 Left Panel) or low-risk (Figure 1 Right Panel). The prevalence of high-risk HPV types 33 and 58 was less than 2%; HPV strains 16 and 18 had a prevalence of 4.92% and 1.77%, respectively. HPV strain 53 had the highest prevalence of 6.26% in the high-risk HPV category, and HPV strain 62 had the highest prevalence of 5.93% in the low-risk category.
In a univariate analysis (Table 1), the overall weighted prevalence of HPV infection was 41.9%. An estimated 59.4% NH black females and 38.7% of NH white females, the lowest prevalence, were positive for HPV. There was a statistically significant difference between subjects of different races. The Prevalence Ratio for NH black females was 2.3 times that of NH white females.
There was a lower trend of HPV infection with increased age, higher education level, and decreased family poverty level. There was a bimodal pattern in the prevalence of HPV for age, with a high HPV prevalence of 56.1% in the 18–24 age group and with a second peak in the 40–44 age group. HPV prevalence was 48% for women with less than a high school education, whereas it was 31.4% for those with a college education. Participants living in a family with an income-to-poverty ratio (PIR) of <130% showed a 55.3% prevalence of HPV infection. In terms of marital status, prevalence was lowest (29.4%) for married women and highest (>47%) for women classified as single, divorced, or never married.
The association of HPV infection with behavioral factors is presented in Table 2. Females who smoked <100 cigarettes in their lifetime (Prevalence Ratio [PR] = 0.52) were less likely to have HPV infection. Females who did not have health insurance (PR = 1.9) or a routine place to receive health care (PR =1.39) were more likely to have HPV infection.Females who reported never having sex were 71% less likely to have HPV infection compared to females who reported having sex. The prevalence of HPV infection increased 89% for females having sex before 16 years of age compared to after 16. There was a trend of increasing HPV infection in women with a higher number of sexual partners in their lifetime (or yearly). The prevalence of infection was 8.9 times higher for women with >11 sexual partners in their lifetime compared to women with 0–1 partners. This trend was noticed among all races. Overall, NH black females showed the highest prevalence of HPV infection regardless of their number of partners (Figure 2). Compared to women who used a condom during sex, those who did not use a condom had twice the prevalence of HPV infection. Lastly, women who identified themselves as being ‘lesbian’ or ‘bisexual’ had a 72% increased prevalence of HPV infection compared to those who identified as ‘straight.’
In a multivariate logistic regression analysis, adjusting for other factors, the number of lifetime sexual partners had a significant association with HPV infection. HPV infection was 5.77 times more likely for women with >11 partners compared to women with 0–1 partners. NH black females were 1.87 times more likely to have an HPV infection compared to NH white females. Age was negatively associated with the prevalence of HPV, except for an increased peak in the 44–49 age group. Having a college degree and being married were also associated with lower HPV prevalence. Family income-to-poverty ratio, a habit of smoking cigarettes, and insurance status remained significant factors associated with HPV infection.
When the results were adjusted for other factors, country of birth, recreational drug usage, ever having sex, age at first sexual encounter, condom usage during sex, and sexual orientation were no longer associated with HPV infection.
Discussion and conclusion
From the NHANES 2007–2010 data set, we found that the overall prevalence of genital HPV infection was 41.9% (95% CI: 39.6-44.2%) in females aged 18–59 in the United States. We compared this result with two previous NHANES analyses observing data for females in the United States. Our estimate was similar to the result of the first study, which found a 42.5% (95% CI: 40.3-44.7%) prevalence for females aged 14–59 in the 2003–2006 data set . The confidence intervals overlapped, indicating no statistical difference between these two estimations. Our estimate was higher than the result of the second study, which found a prevalence of 26.8%  among females, aged 14–59 in the 2003–2004 data set. This discrepancy could be due to a change in the laboratory methods to detect HPV DNA resulting in increased sensitivity of the test [2, 18, 19].
In our study, there was a bimodal prevalence of genital HPV for age, consistent with the same two previous NHANES studies [2, 10]. This may indicate increased sexual activity in the 18–24 age group [20, 21] causing the transmission of HPV by sexual contact. The reason for the increased prevalence in the 45–49 age group is unclear. We speculate that it may be due to an increased incidence , a difference in sexual behaviors across birth cohorts , or a change in marital status.
HPV infection was associated with certain racial and ethnic groups. NH black females had an increased prevalence even when controlling for factors such as number of sexual partners in their lifetime (Figure 2) or other factors (Table 3). Interestingly, even in the group with 0–1 sexual partners, NH black females had twice the prevalence of HPV infection compared to other racial and ethnic groups. This indicates that number of sexual partners is not the only factor that impacts the prevalence of HPV infection.
Some studies have shown an inconsistent association between education level and prevalence [24, 25]. However, in others, education level was negatively associated with HPV infection [2, 10]. In our study, with increased education, the prevalence of HPV infection decreased. We speculate that this trend could be connected to factors such as increased awareness of HPV or adoption of safe sexual practices. Safe practices might include, but are not limited to, regular usage of condoms and limiting the number of sexual partners. Therefore, health education interventions could be introduced to reduce the HPV prevalence by increasing awareness, encouraging safe sex , teaching about the routes of HPV transmission, and promoting the use of the HPV vaccine .
The number of sexual partners played a statistically significant role in HPV infection. This finding mirrors those seen in other studies [2, 10]. We also found an association of health insurance status with HPV infection. A meta-analysis showed that cost of vaccination and lack of insurance coverage are barriers that prevent women from obtaining the vaccine . Individuals with private health insurance are more likely to hear about the HPV vaccine and three times more likely to get the vaccine compared to uninsured patients or those with public insurance plans . In addition, poverty and having smoked <100 cigarettes in their lifetime were associated with HPV infection.
In conclusion, 41.9% of U.S. females aged 18–59 years tested positive for genital HPV infection. This study found that an increased number of sexual partners, a lower level of education, non-Hispanic black race, and a lack of insurance were factors of concern with HPV infection. Continuing to monitor the prevalence of HPV in the general population can establish a basis for possible interventions focusing on at-risk groups.
Availability of supporting data
All data used in this research can be downloaded from the following website: http://www.cdc.gov/nchs/nhanes/nhanes_questionnaires.htm.
Use of data from the NHANES 2007–2010 is approved by the National Center for Health Statistics (NCHS) Research Ethics Review Board (ERB) Approval for NHANES 2009–2010 (Continuation of Protocol #2005-06), NHANES 2007–2008 (Continuation of Protocol #2005-06) and NHANES 2005–2006 (Protocol #2005-06).
Centers for Disease Control and Prevention
National Health and Nutrition Examination Survey
Mobile Examination Center
National Center for Health Statistics
Polymerase chain reaction
Statistical analysis system
Survey data analysis.
Human Papillomavirus (HPV) Vaccines. [http://www.cancer.gov/cancertopics/factsheet/prevention/HPV-vaccine]
Hariri S, Unger ER, Sternberg M, Dunne EF, Swan D, Patel S, Markowitz LE: Prevalence of genital human papillomavirus among females in the United States, the National Health And Nutrition Examination Survey, 2003–2006. J Infect Dis. 2011, 204 (4): 566-573.
CDC: Human Papillomavirus–Associated Cancers — United States, 2004–2008. MMWR Morb Mortal Wkly Rep. 2012, 61 (15): 630-632.
Basic Information about HPV-Associated Cancers. [http://www.cdc.gov/cancer/hpv/basic_info/]
Baseman JG, Koutsky LA: The epidemiology of human papillomavirus infections. J Clin Virol. 2005, 32 (Suppl 1): S16-24.
Gillison ML, Chaturvedi AK, Lowy DR: HPV prophylactic vaccines and the potential prevention of noncervical cancers in both men and women. Cancer. 2008, 113 (10 Suppl): 3036-3046.
Cervical Cancer Statistics.http://www.cdc.gov/cancer/cervical/statistics/,
Questionnaires, Datasets, and Related Documentation.http://www.cdc.gov/nchs/nhanes/nhanes_questionnaires.htm,
Johnson CL, Paulose-Ram R, Ogden CL e: National Health and Nutrition Examination Survey: Analytic guidelines, 1999–2010. National Center for Health Statistics. Vital Health Stat 2. 2013, 2 (161): 1-15.
Dunne EF, Unger ER, Sternberg M, McQuillan G, Swan DC, Patel SS, Markowitz LE: Prevalence of HPV infection among females in the United States. JAMA. 2007, 297 (8): 813-819.
Steinau M, Swan DC, Unger ER: Type-specific reproducibility of the Roche linear array HPV genotyping test. J Clin Virol. 2008, 42 (4): 412-414.
Onyekwuluje JM, Steinau M, Swan DC, Unger ER: A real-time PCR assay for HPV52 detection and viral load quantification. Clin Lab. 2012, 58 (1–2): 61-66.
Gravitt PE, Peyton CL, Alessi TQ, Wheeler CM, Coutlee F, Hildesheim A, Schiffman MH, Scott DR, Apple RJ: Improved amplification of genital human papillomaviruses. J Clin Microbiol. 2000, 38 (1): 357-361.
Sanders AE, Slade GD, Patton LL: National prevalence of oral HPV infection and related risk factors in the U.S. adult population. Oral Dis. 2012, 18 (5): 430-441.
Korn E,B,G: Analysis of Health Surveys. 1999, New York, NY: Wiley
Casella G, Berger R: Statistical Inference. 1990, Duxbury: Pacific Grove, CA
RTI: SUDAAN Language Manual, Volumes 1 and 2, Release 11. 2012, Research Triangle Park, NC: Research Triangle Institute, First Edition
Unger ER, Steinau M, Lin JMS, Patel SS, Swan DC: Impact of HPV Assay on Observed Population Prevalence. Diagn Mol Pathol. 2011, 20 (2): 101-104.
Castle PE, Gravitt PE, Solomon D, Wheeler CM, Schiffman M: Comparison of linear array and line blot assay for detection of human papillomavirus and diagnosis of cervical precancer and cancer in the atypical squamous cell of undetermined significance and low-grade squamous intraepithelial lesion triage study. J Clin Microbiol. 2008, 46 (1): 109-117.
Bosch FX, Burchell AN, Schiffman M, Giuliano AR, de Sanjose S, Bruni L, Tortolero-Luna G, Kjaer SK, Munoz N: Epidemiology and Natural History of Human Papillomavirus Infections and Type-Specific Implications in Cervical Neoplasia. Vaccine. 2008, 26: K1-K16.
de Sanjose S, Diaz M, Castellsague X, Clifford G, Bruni L, Munoz N, Bosch FX: Worldwide prevalence and genotype distribution of cervical human papillomavirus DNA in women with normal cytology: a meta-analysis. Lancet Infect Dis. 2007, 7 (7): 453-459.
Garcia-Pineres AJ, Hildesheim A, Herrero R, Trivett M, Williams M, Atmetlla I, Ramirez M, Villegas M, Schiffman M, Rodriguez AC, Burk RD, Hildesheim M, Freer E, Bonilla J, Bratti C, Berzofsky JA, Pinto LA: Persistent human papillomavirus infection is associated with a generalized decrease in immune responsiveness in older women. Cancer Res. 2006, 66 (22): 11070-11076.
Finer LB: Trends in premarital sex in the United States, 1954–2003. Public Health Rep. 2007, 122 (1): 73-78.
Arbyn M, de Sanjose S, Saraiya M, Sideri M, Palefsky J, Lacey C, Gillison M, Bruni L, Ronco G, Wentzensen N, Brotherton J, Qiao YL, Denny L, Bornstein J, Abramowitz L, Giuliano A, Tommasino M, Monsonego J: EUROGIN 2011 roadmap on prevention and treatment of HPV-related disease. Int J Cancer. 2012, 131 (9): 1969-1982.
Applebaum KM, Furniss CS, Zeka A, Posner MR, Smith JF, Bryan J, Eisen EA, Peters ES, McClean MD, Kelsey KT: Lack of association of alcohol and tobacco with HPV16-associated head and neck cancer. J Natl Cancer Inst. 2007, 99 (23): 1801-1810.
Marek E, Dergez T, Rebek-Nagy G, Szilard I, Kiss I, Ember I, Gocze P, D’Cruz G: Effect of an educational intervention on Hungarian adolescents’ awareness, beliefs and attitudes on the prevention of cervical cancer. Vaccine. 2012, 30 (48): 6824-6832.
Markowitz LE, Hariri S, Lin C, Dunne EF, Steinau M, McQuillan G, Unger ER: Reduction in human papillomavirus (HPV) prevalence among young women following HPV vaccine introduction in the United States, National Health and Nutrition Examination Surveys, 2003–2010. J Infect Dis. 2013, 208 (3): 385-393.
Ratanasiripong NT: A Review of Human Papillomavirus (HPV) Infection and HPV Vaccine–Related Attitudes and Sexual Behaviors among College-Aged Women in the United States. J Am Coll Health. 2012, 60 (6): 461-470.
Pourat N, Jones JM: Role of insurance, income, and affordability in human papillomavirus vaccination. Am J Manag Care. 2012, 18 (6): 320-330.
The authors wish to acknowledge the NHANES 2007–2010 participants and investigators as well as the National Center for Health Statistics and its Research Data Center for making the public data available. However, the authors take full and sole responsibility for the integrity of the data analysis and the contents of this article. We also wish to thank Talicia A. Tarver and Shenika McCary for their assistance in preparing this manuscript and the reviewers for their critical comments and suggestions.
The authors have no financial or non-financial competing interests to disclose.
RS carried out design of the study, data management, statistical analysis, and writing of the manuscript. HT and SD helped to write and finalize the manuscript. LL conceived the study, participated in its design, and helped to write the manuscript. GM helped in the study’s design and in writing the manuscript. All authors read and approved the final manuscript.
About this article
Cite this article
Shi, R., Devarakonda, S., Liu, L. et al. Factors associated with genital human papillomavirus infection among adult females in the United States, NHANES 2007–2010. BMC Res Notes 7, 544 (2014). https://doi.org/10.1186/1756-0500-7-544