Adjusting HIV prevalence estimates for non-participation: an application to demographic surveillance
McGovern, M. E., Marra, G., Radice, R. ORCID: 0000-0002-6316-3961 , Canning, D., Newell, M-L. & Bärnighausen, T. (2015). Adjusting HIV prevalence estimates for non-participation: an application to demographic surveillance. Journal of the International AIDS Society, 18(1), article number 19954. doi: 10.7448/ias.18.1.19954
Abstract
INTRODUCTION: HIV testing is a cornerstone of efforts to combat the HIV epidemic, and testing conducted as part of surveillance provides invaluable data on the spread of infection and the effectiveness of campaigns to reduce the transmission of HIV. However, participation in HIV testing can be low, and if respondents systematically select not to be tested because they know or suspect they are HIV positive (and fear disclosure), standard approaches to deal with missing data will fail to remove selection bias. We implemented Heckman-type selection models, which can be used to adjust for missing data that are not missing at random, and established the extent of selection bias in a population-based HIV survey in an HIV hyperendemic community in rural South Africa.
METHODS: We used data from a population-based HIV survey carried out in 2009 in rural KwaZulu-Natal, South Africa. In this survey, 5565 women (35%) and 2567 men (27%) provided blood for an HIV test. We accounted for missing data using interviewer identity as a selection variable which predicted consent to HIV testing but was unlikely to be independently associated with HIV status. Our approach involved using this selection variable to examine the HIV status of residents who would ordinarily refuse to test, except that they were allocated a persuasive interviewer. Our copula model allows for flexibility when modelling the dependence structure between HIV survey participation and HIV status.
RESULTS: For women, our selection model generated an HIV prevalence estimate of 33% (95% CI 27-40) for all people eligible to consent to HIV testing in the survey. This estimate is higher than the estimate of 24% generated when only information from respondents who participated in testing is used in the analysis, and the estimate of 27% when imputation analysis is used to predict missing data on HIV status. For men, we found an HIV prevalence of 25% (95% CI 15-35) using the selection model, compared to 16% among those who participated in testing, and 18% estimated with imputation. We provide new confidence intervals that correct for the fact that the relationship between testing and HIV status is unknown and requires estimation.
CONCLUSIONS: We confirm the feasibility and value of adopting selection models to account for missing data in population-based HIV surveys and surveillance systems. Elements of survey design, such as interviewer identity, present the opportunity to adopt this approach in routine applications. Where non-participation is high, true confidence intervals are much wider than those generated by standard approaches to dealing with missing data suggest.
Publication Type: | Article |
---|---|
Additional Information: | © 2015 McGovern ME et al; licensee International AIDS Society. This is an Open Access article distributed under the terms of the Creative Commons Attribution 3.0 Unported (CC BY 3.0) License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited |
Publisher Keywords: | HIV prevalence; non-participation; missing data; selection bias; Heckman-type selection models; demographic surveillance. |
Subjects: | D History General and Old World > DT Africa H Social Sciences > HA Statistics R Medicine > RA Public aspects of medicine R Medicine > RA Public aspects of medicine > RA0421 Public health. Hygiene. Preventive Medicine |
Departments: | Bayes Business School > Actuarial Science & Insurance |
SWORD Depositor: |
Available under License Creative Commons: Attribution 3.0.
Download (407kB) | Preview
Export
Downloads
Downloads per month over past year