Original Research

Multiple imputation using chained equations for missing data in survival models applied to multidrug-resistant tuberculosis and HIV data

Sizwe Vincent Mbona, Principal Ndlovu, Henry Mwambi, Shaun Ramroop
Journal of Public Health in Africa | Vol 14, No 8 | a91 | DOI: https://doi.org/10.4081/jphia.2023.2388 | © 2024 Sizwe Vincent Mbona, Principal Ndlovu, Henry Mwambi, Shaun Ramroop | This work is licensed under CC Attribution 4.0
Submitted: 03 April 2024 | Published: 30 August 2023

About the author(s)

Sizwe Vincent Mbona, Department of Statistics, Durban University of Technology, Durban, South Africa
Principal Ndlovu, Department of Statistics, University of South Africa, Pretoria, South Africa
Henry Mwambi, School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, South Africa
Shaun Ramroop, School of Mathematics, Statistics and Computer Science, University of KwaZulu-Natal, Pietermaritzburg, South Africa

Abstract

Background. Missing data are a prevalent problem in almost all types of data analyses, such as survival data analysis. Objective. To evaluate the performance of multivariable imputation via chained equations in determining the factors that affect the survival of multidrug-resistant-tuberculosis (MDR-TB) and HIV-coinfected patients in KwaZulu-Natal. Materials and Methods. Secondary data from 1542 multidrug-resistant tuberculosis patients were used in this study. First, data from patients with some missing observations were deleted from the original data set to obtain the complete case (CC) data set. Second, missing observations in the original data set were imputed 15 times to obtain complete data sets using a multivariable imputation case (MIC). The Cox regression model was fitted to both the CC and MIC data, and the results were compared using the model goodness of fit criteria [likelihood ratio tests, Akaike information criterion (AIC), and Bayesian Information Criterion (BIC)]. Results. The Cox regression model fitted the MIC data set better (likelihood ratio test statistic =76.88 on 10 df with P<0.01, AIC =1040.90, and BIC =1099.65) than the CC data set (likelihood ratio test statistic =42.68 on 10 df with P<0.01, AIC =1186.05 and BIC =1228.47). Variables that were insignificant when the model was fitted to the CC data set became significant when the model was fitted to the MIC data set. Conclusion. Correcting missing data using multiple imputation techniques for the MDR-TB problem is recommended. This approach led to better estimates and more power in the model.</p


Keywords

missing data; multiple imputation; multidrug-resistance tuberculosis

Metrics

Total abstract views: 2671
Total article views: 1807

 

Crossref Citations

1. Machine learning algorithms to predict the risk of admission to intensive care units in HIV-infected individuals: a single-centre study
Jialu Li, Yi Ding, Yiwei Hao, Chengyu Gao, Jinjing Xiao, Ying Liu, Yining Zhao, Qinlan Li, Lulu Xing, Hongyuan Liang, Liang Ni, Fang Wang, Sa Wang, Di Yang, Guiju Gao, Jiang Xiao, Hongxin Zhao
Virology Journal  vol: 22  issue: 1  year: 2025  
doi: 10.1186/s12985-025-02900-w

2. Triglyceride-glucose index in predicting the risk of new-onset diabetes in the general population aged 45 years and older: a national prospective cohort study
Yingqi Shan, Qingyang Liu, Tianshu Gao
BMC Endocrine Disorders  vol: 25  issue: 1  year: 2025  
doi: 10.1186/s12902-025-01848-w

3. Investigating the Factors Influencing Traditional Male Circumcision and Its Contribution to HIV Transmission Amongst Men in Lesotho: A Multilevel Binary Logistic Regression Approach
Sizwe Vincent Mbona, Anisha Ananth, Retius Chifurira
International Journal of Environmental Research and Public Health  vol: 22  issue: 7  first page: 993  year: 2025  
doi: 10.3390/ijerph22070993

4. Assessing of case–cohort design: a case study for breast cancer patients in Xinjiang, China
Mengjuan Wu, Tao Zhang, Chunjie Gao, Ting Zhao, Lei Wang, Gang Sun
Frontiers in Oncology  vol: 14  year: 2024  
doi: 10.3389/fonc.2024.1306255

5. Comparative effectiveness of laparoscopic versus open colectomy in colon cancer patients: a study protocol for emulating a target trial using cancer registry data
Semaw Ferede Abera, Gabriele Robers, Anika Kästner, Ulrike Stentzel, Kerstin Weitmann, Wolfgang Hoffmann
Journal of Cancer Research and Clinical Oncology  vol: 151  issue: 1  year: 2025  
doi: 10.1007/s00432-024-06057-x