 Research article
 Open Access
 Published:
Comparison of three classes of Marginal Risk Set Model in predicting infant mortality among newborn babies at Kigali University Teaching Hospital, Rwanda, 2016
BMC Pediatrics volume 20, Article number: 62 (2020)
Abstract
Background
The Infant Mortality Rate (IMR) in SubSaharan Africa (SSA) remains the highest relatively to the rest of the world. In the past decade, the policy on reducing infant mortality in SSA was reinforced and both infant mortality and parental death decreased critically for some countries of SSA. The analysis of risk to death or attracting chronic disease may be done for helping medical practitioners and decision makers and for better preventing the infant mortality.
Methods
This study uses popular statistical methods of resampling and one selected model of multiple events analysis for measuring the survival outcomes for the infants born in 2016 at Kigali University Teaching Hospital (KUTH) in Rwanda, a country of SSA, amidst maternal and child’s socioeconomic and clinical covariates. Dataset comprises the newborns with correct information on the covariates of interest. The Bootstrap Marginal Risk Set Model (BMRSM) and Jackknife Marginal Risk Set Model (JMRSM) for the available maternal and child’s socioeconomic and clinical covariates were conducted and then compared to the outcome with Marginal Risk Set Model (MRSM). That was for measuring stability of the MRSM.
Results
The 2117 newborns had the correct information on all the covariates, 82 babies died along the study time, 69 stillborn babies were observed while 1966 were censored. Both BMRSM JMRSM and MRSM displayed the close results for significant covariates. The BMRSM displayed in some instance, relatively higher standard errors for nonsignificant covariates and this emphasized their insignificance in MRSM. The models revealed that female babies survive better than male babies. The risk is higher for babies whose parents are under 20 years old parents as compared to other parents’ age groups, the risk decreases as the APGAR increases, is lower for underweight babies than babies with normal weight and overweight and is lower for babies with normal circumference of head as compared to those with relatively small head.
Conclusion
The results of JMRSM were closer to MRSM than that of BMRSM. Newborns of mothers aged less than 20 years were at relatively higher risk of dying than those who their mothers were aged 20 years and above. Being abnormal in weight and head increased the risk of infant mortality. Avoidance of teenage pregnancy and provision of clinical care including an adequate dietary intake during pregnancy would reduce the IMR in Kigali.
Background
The discrepancy in IMR and low life expectancy of the SSA versus the other parts of the world attracts several researchers. The report of the World Bank in 2011 pointed that the IMR was 75/1000 in SSA versus 11/1000 in developed countries [1]. The same report pointed that half of the ten million children who die every year is in SSA. The World Bank dataset from 1960 to 2005 suggests that low life expectancy at birth in SSA is relatively higher in Middle Africa as compared to other subregional disparities of SSA [2]. The World Bank records of 2017 indicated that the IMR was 51.50/1000 in SSA [3]. Central African Republic had the highest IMR of 87.60/1000, the lowest IMR were found in Mauritius (11.60/1000), the IMR in Rwanda was 28.90/1000. Several studies on factors that could lower the infant mortality have been done and recommendations were suggested but the IMR remains a problem in SSA.
The multiple events model for infant mortality at the Kigali University Teaching Hospital analysed in [4] leaves a question on whether the adopted model is stable. The main causes of instability may be the correlation of the covariates or relatively small sample size [5]. One of the ways of assessing instability in survival regression models is a use of resampling techniques [6]. The analysis in [4] is a none resampled model that used the primary dataset of the year 2016. Two observable events per subject are death and the occurrence of at least one of the common conditions that may also cause the longterm death to infants. It was found that the Marginal Risk Set Model (MRSM) also known as the Wei, Lin and Weissfeld Model (WLWM) fit the data well. The WLWM is among the multiplicative methods for analysing ordered events found in [7]. Other multiplicative models include the AndersenGill Model (AGM) and the Prentice, Williams and Peterson Model (PWPM) [8].
The present study uses two popular nonparametric methods of resampling namely bootstrap which is based on the random samples with replacement [9], and jackknife method that is based on sampling by leaving out one observation at time [9]. The size of the sample in [4] is 2117 and the record is effective in the year 2016. The longterm results could be assumed according to the stability potentially observed after resampling. Several manuscripts on resampling in survival analysis are limited on the resampled Cox proportional hazards model and on estimating standard errors of the survival and hazard functions such as in [6, 10,11,12,13] where bootstrap is involved [13,14,15,16]; in which the jackknife is implicated or [17,18,19,20,21,22] where hazard and survival functions with their respective standard errors are of interest. The present study analyses the bootstrapbased MRSM with 1000 replicates and the jackknifebased MRSM. The results are then compared to that of the MRSM.
Methods
Dataset
The time to event data of 2117 newborns at the KUTH is recorded from the 1st January to the 31st December 2016. At KUTH, all newborns are recorded in registries with all details of parents and clinical outcomes of each newborn. The information in registry provides references on card indexes that provide information on clinical behavior of babies after leaving the hospital. KUTH as a site of interest in this study is a central Hospital where most of complicated childbirths countrywide are transferred. In 2016, KUTH recorded relatively high incidence of stillborn cases (69 stillborn babies or 3.259%) and relatively high infant mortality rate (3.873%). Table 1 summarises the information on newborns at KUTH along the study time.
The study is interested on subjects with a correct information on the covariates of interests. The two events per subject are observed namely the death and the incidence of at least one chronic disease or complication such as severe oliguria, severe prematurity, very low birth weight, macrosomia, severe respiratory distress, gastroparesis, hemolytic, trisomy, asphyxia and laparoschisis. Apart from the event status and the time to event, 11 covariates are recorded and subdivided in demographic covariates which include the age and the place of residence for parents; clinical covariates for female parents that include obstetric antecedents, type of childbirth and previous abortion. Clinical covariates for babies include APGAR; gender, number of births at a time, weight, circumference of the head, and height. Table 2 gives a description of the variables of interest.
Statistical methods
Marginal risk set model
Assume that h(tx_{i}) is the hazard function of the survival time T given the p fixed covariates x_{i} = (x_{i1}, x_{i2},. .., x_{ip}). Let h_{0}(t) be the hazard function when x_{i} = (0, 0,. .., 0) for all i, then
where β = (β_{1}, β_{2},. .., β_{p})^{’} is a pdimensional vector of model parameters [23]. Define an indicator function as.
δ_{ij}(t) = 1 if individual i is at risk of the j^{th} event and δ_{ij}(t) = 0 otherwise.
The marginal risk set model (MRSM) or the Wei Lin and Weisfeld Model (WLWM) assumes that events are unordered where each event has its own stratum and each data point appears in all strata [4, 24]. In other words, the k^{th} time interval per subject is in the k^{th} stratum, k = 1, 2,. .., n.
The hazard function for the j^{th} event for the individual i is given by
Maximum likelihood and parameter estimation
Let]0, τ_{i} [be the interval of time in which the individual i is observed with n_{i} the number of events of the individual i along]0, τ_{i} [and Assume that two events cannot occur simultaneously in continuous time. The probability density function for the outcome n_{i} along]0, τ_{i} [is given by.
L(Φ) \( =\prod \limits_{i=1}^n{L}_i\left(\varphi \right) \)
where
In (3), individual i has n_{i} events with n_{i} ≥ 0 at times t_{i1} ≤ t_{i2} ≤ · · · ≤ t_{ini} .
The appropriate partial likelihood functions for tied time to event data is well described in [24] and in [25] and include Breslow’s, Efron’s and Cox’s techniques. The maximum likelihood estimates are given by a system
where α is known as the baseline parameter vector while β is a vector of model parameters. The NewtonRaphson method is one of numerical methods used for solving system (4). The adequacy checking of the likelihood estimates is done by finding the elements ℑ_{αα}, ℑ_{αβ}, ℑ_{βα} and ℑ_{ββ} of the information matrix ℑ and assume that as \( n\to \infty, \hat{\varPhi}\varPhi \mapsto N\left(0,{\Im}^{1}\left(\hat{\varPhi}\right)\right) \) [4, 26].
In MRSM, n is assumed to be the maximum number of events per subject while τ_{k}, k = 1, 2, ...n are times to events per subject along the study time with range [0, T]. The study time is partitioned into n + 1 intervals of the form
STATA 15 provides results of the MRSM by applying the Cox Proportional Hazards Model (CPHM) to the dataset in the setup (5). The test of proportional hazards assumption is done by checking patterns of survival functions per groups of each covariate. Figure 1 presents the patterns of survival functions per groups of each covariate using KaplanMeier estimation. The patterns are approximately parallel for the covariates of interest. This allows a construction of the MRSM for all the covariates.
Resampled MRSM
The Bootstrap Marginal Risk set Model (BMRSM) is the inference of model (2) based on bootstrap samples (see Appendix). The BMRSM consists of applying model (2) to each of the B bootstrap samples x_{i}^{*k}, ∀k ∈ [1, B] of covariates x_{i}, ∀i ∈ [1, p]. Bootstrap model parameter estimation in presence of tied events uses either Breslow, Efron or Cox approach. The bootstrap standard error is obtained by using Eq. (6) of the Appendix.
As for the BMRSM, the Jackknife Marginal Risk Model (JMRSM) consists of applying model (2) to each of the n jackknife samples x_{i}^{*k} of covariates x_{i}, i ∈ [1, p] with a use of Breslow, Efron or Cox approach for estimating the jackknife model parameters. The Jackknife standard error is given by Eq. (7) found in the Appendix.
Results
Using Breslow estimation [27], Table 3 presents unadjusted MRSM, BMRSM, JMRSM and corresponding adjusted models. Unadjusted and adjusted MRSM, BMRSM and JMRSM are also presented in Tables 4 and 5 for Efron [28] and Cox estimation [29].
The results of the unadjusted JMRSM are relatively close to that of the unadjusted MRSM (Table 3). The standard errors in JMRSM and MRSM are close for all covariates. The standard errors in BMRSM and MRSM are also close for covariates except for all levels of covariates childbirth where the standard error in BMRSM is about 4 times that of MRSM and the upper levels of covariates weight, head and height where the standard error in BMRSM is about 20 times that of MRSM. Significance difference in levels of covariates is found at the same covariates for both MRSM, BMRSM and JMRSM except at the upper level of the covariate abortion where significance is suggested by the MRSM. Following the recommendations of Parzen and Lipsitz [30], the χ^{2} test statistics suggest a higher performance of the JCPHM as compared to the CPHM and BCPHM since the χ^{2} is relatively everywhere lower for the JCPHM..
Discussion
The overall results of MRSM, BMRSM and JMRSM by different approaches of ties handling (Tables 3, 4 and 5) are not critically different as expected. The STATA default method (Breslow) is then of interest in the analysis. The JMRSM is adopted for checking stability since the results are closer to that of MRSM than that of BMRSM. The similarity between MRSM and JMRSM suggests that the MRSM may be stable. The global analysis upholds the significance difference of all levels of covariates age, gender, number and APGAR and intermediate levels of covariates weight and head.
The resampled adjusted models by Breslow technique of handling tied events suggest that the risk of death or attracting a chronic disease of babies whose parents’ age range from 20 to 34 years old is lower than that of babies whose parents are under 20 years old and that of babies whose parents are 35 years and above. Basinga et al. [31] argue that the unintended pregnancy induces abortion in Rwanda, their study suggests a relatively higher rate of teenage unintended pregnancies as compared to the other age ranges, this contributes on the first hand, to the increase of infant mortality rate. On the second hand, the study by Olausson et al. [32] confirms a relatively higher risk for teenage pregnancies due to biological immaturity. As for the advanced maternal age, Lampinen et al. [33] point that it is associated with relatively poorer outcomes to pregnancies due to the observed higher incidence of chronic medical conditions among older women.
The results show that the risk for male babies is higher than that of female babies. This complies with the usual better survival outcome of the females as reports several manuscripts such as [34] or [35]. Multiple babies survive better than singleton babies; this is however against the results from studies conducted in SubSaharan Africa by Monden and Smits [36] and Pongou et al. [37]. This may be due to the small number of multiple newborns recorded at KUTH along the year 2016. The survival outcomes of babies whose APGAR is below 4/10 are poorer than that of babies with higher APGAR score. Babies whose weight range from 2500 g to 4500 g survive better than those whose weight is below 2500 g and those whose weight is above 4500 g while babies whose circumference of head range from 32 cm to 36 cm survive better than those whose circumference of head is below 32 cm. The results of APGAR, weight and circumference of the head comply with the recommendations of the clinical medicine and related manuscripts such as [38] for example.
The study shows that the BMRSM is close to JMRSM and MRSM for all significant covariate but the BMRSM shows relatively higher standard errors for some nonsignificant covariates. The discrepancy between standard errors after resampling for covariates such as childbirth, weight, head and height suggests the instability of the MRSM at these specific covariates and this emphasizes their nonsignificance in the MRSM.
The present analysis is limited on eleven covariates. Unavailable covariates concerning parents that could improve models are, for example, demographic covariates such as the parent’s education level, employment and income; behavioral covariates namely smoking habit, alcohol consumption and dietary and physiotherapeutic variables such as sports activity level. These variables are not recorded in registry at KUTH.
Conclusion
Marginal Risk Set Model (MRSM) and related resampling using Bootstrap (BMRSM) and Jackknife (JMRSM) are described and compared with a use of the dataset on infant mortality. The JMRSM and MRSM displayed relatively close results. The risk is higher for babies whose parents are under 20 years old parents as compared to older parents. Babies born with APGAR greater or equal to 7/10 were found to have a better survival outcome than those born with APGAR less than 4/10 and those whose APGAR range between 4/10 and 6/10. The risk is lower for underweight babies as compared to babies with normal weight and overweight. The survival outcomes for babies with normal circumference of head were found to be better than those with relatively small head. The study suggests that pregnancy of under 20 years old parents should be avoided, also appropriate clinical ways of keeping pregnancy against any cause of infant abnormality could help in lowering infant mortality.
Availability of data and materials
The dataset used is confidential. Some information on it is available from authors on reasonable request.
Abbreviations
 AGM:

AndersenGill Model
 APGAR:

Appearance, Pulse, Grimace, Activity and Respiration
 BCPHM:

Bootstrap Cox Proportional Hazards Model
 BMRSM:

Bootstrap Marginal Risk Set Model
 CPHM:

Cox Proportional Hazards Model
 CPHM:

Jackknife Cox Proportional Hazards Model
 IMR:

Infant Mortality Rate
 JMRSM:

Jackknife Marginal Risk Set Model
 KUTH:

Kigali University Teaching Hospital
 MRSM:

Marginal Risk Set Model
 PWPM:

Prentice, Williams and Peterson Model
 SSA:

SubSaharan Africa
 WLWM:

Wei, Lin and Weissfeld Model
References
Ester PV, Torres A, Freire JM, Hernández V, Gil A. Factors associated to infant mortality in SubSaharan Africa. J Public Health Afr. 2011;2(e27):112–6.
Adetunji J, Bos ER. Disease and Mortality in SubSaharan Africa. The International Bank for Reconstruction and Development/ the World Bank. 2nd ed; 2006. URL http://www.ncbi.nlm.nih.gov/books/NBK2292/
Gatabazi P, Melesse SF, Ramroop S. Infant mortality at the Kigali University Teaching Hospital: Application of Aalen additive hazards model and comparison with other classical survival models. Afr Popul Stud. 2019;33(2):4834–51.
Gatabazi P, Melesse SF, Ramroop S. Multiple events model for the infant at Kigali University teaching hospital. Open Public Health J. 2018;11:464–73.
Hamada C. Bootstrap Cox regression using SAS procedures. SAS Institute Japan Ltd. 1995;211:1258–62.
Utzet F, Sánchez A. Some applications of the bootstrap to survival analysis. Anuario de Psicologia. 1992;55:155–67.
Cook RJ, Lawless JF. The statistical analysis of recurrent events. New York: Springer; 2007.
Wei LJ, Glidden DV. An overview of statistical methods for multiple failure time data in clinical trials. Stat Med. 1997;16:833–9.
B Efron and R J Tibshirani. An introduction to the bootstrap. Chapman & Hall/CRC, 1994.
Bělašková SB, Fišerová E, Krupičková S. Study of bootstrap estimates in Cox regression model with delayed entry. Mathematica. 2013;52(2):21–30.
Xu G, Sen B, Ying Z. Bootstrapping a changepoint Cox model for survival data. Electron J Stat. 2014;8(1):1345–79.
Xiao L, YaoHua W, DongSheng T. Jackknifed random weighting for Cox proportional hazards model. Sci China Math. 2012;55(4):775–86.
Gatabazi P, Melesse SF, Ramroop S. Resampled Cox proportional hazard model for the infant mortality at the Kigali University teaching hospital teaching hospital. Open Public Health J. 2019;12:136–44.
Wang Z, Wu Y, Zhao LC. Approximation by randomly weighting method in censored regression model. Sci China Ser A. 2009;52:567–76.
Zheng ZG. Random weighting method. Acta Math Appl Sin. 1987;10:247–53.
Zheng ZG, Tu D. Random weighting method in regression models. Sci Sinica Series A. 1988;31:1442–59.
James LF. A study of a class of weighted bootstraps for censored data. Ann Stat. 1997;25:1595–621.
Quan H, Tsai WY. Jackknife for the proportional hazards model. J Stat Comput Simul. 1992;43:163–76.
Sauerbrei W, Schumacher M. A bootstrap resampling procedure for model building: application to the Cox regression model. Stat Med. 1992;11:2093–109.
Efron B. Censored data and the bootstrap. J Am Stat Assoc. 1981;76:312–9.
Hjort N. Bootstrapping Cox’s regression model. Technical Report, Stanford University, Dept. of Statistics, vol. 241; 1985.
Kim J. Conditional bootstrap methods for censored data. PhD dissertation, Florida State University, Dept, of Statistics; 1990.
Collet D. Modeling survival data in medical research. 2nd ed. London: Chapman & Hall; 2003.
Wei LJ, Lin DY, Weissfeld L. Regression analysis of multivariate incomplete failure time data by modelling marginal distributions. J Am Stat Assoc. 1989;84(408):1065–73.
Gatabazi P, Kabera G. Survival analysis and its stochastic process approach with application to diabetes data; 2015.
Sankaran PG, Anisha P. Shared frailty model for recurrent event data with multiple causes. J Appl Stat. 2011;38(12):2859–68.
Breslow NE. Covariance analysis of censored survival data. Biometrics. 1974;30:89–99.
Efron B. The efficiency of Cox’s likelihood function for censored data. J Am Stat Assoc. 1977;72:557–65.
Cox DR. Regression models and lifetables (with discussion). J R Stat Soc. 1972;34(2):187–220 Series B.
Parzen M, Lipsitz SR. A global goodnessoffit statistic for Cox regression models. Biometrics. 1999;55:580–4.
Basinga P, Moore AM, Singh S, Remez L, Birungi F, Nyirazinyoye L. Unintended pregnancy and induced abortion in Rwanda: causes and consequences. New York: Guttmacher Institute; 2012.
Olausson PO, Cnattinius S, Haglund B. Teenage pregnancies and risk of late fetal death and infant mortality. Br J Obstet Gynaecol. 1999;106:116–21.
Lampinen R, Julkunen KV, Kankkunen P. A review of pregnancy in women over 35 years of age. Open Nurs J. 2009;3:33–8.
Pongou R. Why is infant mortality higher in boys than in girls? A new hypothesis based on preconception environment and evidence from a large sample of twins. Demography. 2013;50:421–44.
Zarulli V, Jones JAB, Oksuzyanc A, LindahlJacobsen R, Christensen K, Vaupel JW. Women live longer than men even during severe famines and epidemics. Proc Natl Acad Sci U S A. 2018;115(4):E832–40.
Monden CWS, Smits J. Mortality among twins and singletons in SubSaharan Africa between 1995 and 2014: a pooled analysis of data from 90 demographic and health surveys in 30 countries. Proc Natl Acad Sci U S A. 2017;5(7):E673–9.
Pongou R, Shapiro D, Tenikue M. Mortality convergence of twins and singletons in subSaharan Africa. Demogr Res. 2019;41(36):1047–58.
Janssen PA, Thiessen P, Klein MC, Whitfield MF, Ying C, CullisKuhl SC. Standards for the measurement of birth weight, length and head circumference at term in neonates of European, Chinese and South Asian ancestry. Open Med. 2007;1(2):74–88.
Acknowledgements
We thank DELTAS Africa Initiative and the University of KwaZuluNatal for jointly supporting this work.
Funding
Writing the manuscript is founded by the University of KwaZuluNatal. The study design, data collection, data analysis and interpretation are funded by the Wellcome Trust, grant 107754/Z/15/ZDELTAS Africa via SubSaharan Africa Consortium for Advanced Biostatistics (SSACAB).
Author information
Authors and Affiliations
Contributions
SFM participated in the study design and data acquisition and reviewed the study, SR participated in the study conceptualization and reviewed the study, PG collected the dataset, analysed the dataset and reported the text. All authors reviewed and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
The present study was approved by the Kigali University Teaching Hospital where dataset was taken from the hospital database, with consent that the names of both parents and children cannot be published.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Bootstrap and Jackknife resampling methods
Bootstrap
Consider the p fixed covariates x_{i} = (x_{i1}, x_{i2},. .., x_{in}) in Eq. (2) where x_{i j,i∈[1,p]} are independent and identically distributed possibly with distribution F_{θ} where θ is the statistical parameter of interest. Consider the distribution function F_{Rn} of a random variable R_{n}(x, F_{θ}). A bootstrap method as described in [9], consists of generating samples.
x_{i}^{*} = x_{i}^{*1}, x_{i}^{*2}, …, x_{i}^{*B},
where x_{i}^{*k}, k ∈ [1, B] are random samples of size n drawn with replacement from the sample x_{i}.
The variables of x_{i}^{*k} are independent and identically distributed with distribution \( {\hat{F}}_{\theta, n} \), given x; \( {\hat{F}}_{\theta, n} \) is an estimator of F_{θ} from x_{i}; B is a number of bootstrap samples also known as replications.
The estimated standard error of the bootstrap statistic of interest is given in Efron and Tibshirani [9] as
where \( {\hat{\theta}}^{\ast }(b) \) is an estimate of the statistic of interest from the b^{th} bootstrap sample,
b = 1, 2,. .. .B
Jackknife
Consider the p fixed covariates x_{i} = (x_{i1}, x_{i2},. .., x_{in}) in Eq. (2).
Let θ be a statistic of interest. The jackknife samples consist of leaving out one observation at a time, that is n samples.
x_{i}^{*} = (x_{i1}, x_{i2},. .., x_{i k − 1}, x_{i k + 1},. .., x_{in}) ∀ k ∈ [1, n] [9].
The jackknife standard error estimate as propose [9], is
where θ^{∗}(i), i ∈ [1, n] is a statistic of interest for the i^{th} jackknife sample.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Gatabazi, P., Melesse, S.F. & Ramroop, S. Comparison of three classes of Marginal Risk Set Model in predicting infant mortality among newborn babies at Kigali University Teaching Hospital, Rwanda, 2016. BMC Pediatr 20, 62 (2020). https://doi.org/10.1186/s1288702019451
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s1288702019451
Keywords
 Infant mortality
 Survival analysis
 Marginal risk set model
 Resampling
 Covariate
 Rwanda