- Open Access
Reliability of visual assessment of neonatal jaundice among neonates of black descent: a cross-sectional study from Tanzania
BMC Pediatrics volume 21, Article number: 383 (2021)
Jaundice is common among neonates and if untreated can lead to kernicterus. Diagnosing neonatal jaundice (NJ) using Kramer’s method (visual assessment) is considered user-friendly in resource-limited areas. However, there are conflicting findings on reliability of the Kramer’s method in the diagnosis of NJ, particularly of black descent. Therefore, study aimed to determine the accuracy of Kramer’s method in comparison to the total serum bilirubin (TSB) test in the diagnosis of NJ among neonates of black descent in Tanzania.
A cross-sectional study was conducted between June and July 2020 at Muhimbili National Hospital (MNH) in Dar es Salaam Tanzania. A total of 315 neonates were recruited consecutively. In each neonate, jaundice was assessed using Kramer’s method and TSB test. NJ A total of 315 neonates were recruited i. A 2 X 2 table was created for the determination of sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive and negative likelihood ratios (+LR/−LR), and diagnostic accuracy (effectiveness) of Kramer’s method. Cohen kappa (κ) was used to analyze the agreement between Kramer’s method and TSB. Association between independent variables and presence of jaundice were assessed using the chi-square test and the p < 0.05 was considered to be statistically significant.
The prevalence of NJ was 49.8% by Kramer’s method and 63.5% by TSB. The Sensitivity, Specificity, PPV, and NPV of Kramer’s method were 70.5, 86.1, 89.8, and 62.6%, respectively. The +LR and –LR were 5.07 and 0.34, respectively. The diagnostic accuracy of Kramer’s method was 76.1%. There was a moderate agreement between Kramer’s method and TSB results (κ = 0.524, P<0.001). No significant relationship was observed between the independent variables and the presence of NJ.
Kramer has a good positive predictive value. However, due to low sensitivity and NPV one cannot say that overall predictive ability is good. Also, clinical assessment by Kramer’s method should not be used for screening of NJ. Further studies are needed to investigate the utility of other non-invasive techniques in detecting NJ among neonates of black descent.
Neonatal jaundice (NJ) affects one in two neonates worldwide  and the problem is much bigger in sub-Saharan Africa and South Asia. About 60% of the term and 80% of the preterm babies suffer jaundice during the first 7 days of life and is the major reasons why neonates attend emergency department [2, 3]. Neonates with jaundice are prone to bilirubin encephalopathy and kernicterus which contribute to neonatal morbidity and mortality .
Early detection/diagnosis and appropriate management of NJ are very crucial in order to prevent its complications. Visual assessment (Kramer’s rule), transcutaneous bilirubinometry (TcB), and TSB are the approaches used for the detection of NJ .
Kramer’s method is user friendly because it is non-invasive, does not require any machine and so can be used effectively in resource limited settings. The technique was introduced by Kramer, and showed a positive correlation between progressions of skin discoloration and serum bilirubin levels [5, 6] due to skin thickness differences. According to Kramer, the assessment should begin from the head towards the feet which portray the five zones of the cephalo-caudal progression of jaundice. It is important to examine all zones because studies reported that bilirubin concentration differ significantly between each dermal zone.
Controversial findings on reliability of this technique for detecting NJ have been reported worldwide. Studies conducted in Pakistan (2010) and Indonesia (2017) reported that Kramer’s method is reliable in detection of NJ and darker skin color is not a matter of concern [7, 8]. However, a study conducted in United State of America (USA) reported that Kramer’s method alone is not reliable in detecting NJ especially in darkly pigmented neonates . Besides, study conducted in South Africa reported that Kramer’s method was able to pick only 17% out of the 52% neonates with hyperbilirubinemia .
Despite those conflicting results, Kramer’s method is the backbone in diagnosis of NJ in most of the low-level health facilities in low and middle income countries like Tanzania. Therefore, this study aimed to determine the sensitivity, specificity and accuracy of Kramer’s method in detecting NJ in comparison with TSB test in Tanzania setting.
Study design, period and setting
A hospital-based cross-sectional study aimed at assessing the reliability of the Kramer’s method in detecting NJ was conducted between June and July 2020 at Muhimbili National Hospital (MNH), Tanzania. MNH is a national hospital located in Dar es Salaam center, receiving patients from different parts of the city and regions of Tanzania. The neonatal unit at MNH is one the biggest in the country, serving up to 500 neonates per month. The unit has adequate numbers of skilled pediatricians, medical doctors and nurses. In addition, the MNH has a well-equipped central laboratory with capacity of performing numerous tests, including total serum bilirubin.
Study population and eligibility criteria
Black descent neonates aged less than 28 days who were admitted at MNH and whose mothers/guardians provided written consent to participate in the study were enrolled. Neonates with rashes and those who were severely sick (in life support, in oxygen masks) were excluded because assessment of NJ using Kramer’s method cause physical discomfort to the babies.
Sample size, sample size calculation and sampling technique
A total of 315 neonates were recruited into this study. The formula for ‘diagnostic sample size calculation’ was used to calculate the sample size  (i.e. n = Z2 x Specificity x (1-specificity) / e2 x (1-prevalence)). Specificity of Kramer’s method of 89% from a study conducted in Indonesia , prevalence of NJ of 50% in Tanzania (unpublished article), precision of 5 and 95% (z = 1.96) confidence interval were used to obtain 283.5. Considering 10% non-respondent rate, a total of 315 neonates were obtained. Participants were recruited consecutively.
Data collection procedure
Data were collected using case report form (CRF) comprising of 3 sections; demographic information (age, gender, birthweight, gestational age, and delivery mode), Kramer scores and TSB results. The CRF was developed following literature review and experts’ consultation. Each neonate was subjected to visual assessment using Kramer’s method and TSB test. Neonatal jaundice was defined by yellowish discoloration of the skin by visual assessment or TSB level of ≥ 85 μmol/l by TSB .
Visual assessment using Kramer’s rule
Two medical doctors and 1 pediatrician with experience in neonatal assessment performed the visual assessment on routine ward round. For each neonate, visual assessment using Kramer’s method was performed by the 2 doctors separately and discrepancies were resolved by the pediatrician. Visual assessment of the 5 dermal zones were scored as no jaundice =0, jaundiced at level of head and neck = 1, jaundiced at upper trunk (above umbilicus) =2, jaundiced at lower trunk and thighs (below umbilicus) =3, jaundiced at arms and lower legs =4, jaundiced palms and soles = 5 . The discrepancy of one value in Kramer score was acceptable. Discrepancies of more than one value in Kramer score were resolved by consulting a pediatrician. Kramer score recorded for each neonate was the average score of the two examiners. During assessment, the neonate was fully undressed, and assessment was conducted under blue fluorescent light. Besides, by using the thumb, the skin of neonate was blanched to observe the underlying skin color from the head to toes following standard procedure described by Devi et al. .
About 2–2.5 mls of blood sample was withdrawn from each neonate’s femoral vein using 2.5 cc syringe into a green topped heparinized vacutainer tube. The collected sample was transported to the laboratory for processing within 2 h using a cooler box containing ice packs. In the laboratory, the sample was centrifuged using a centrifuging machine to obtain serum. The obtained serum was analyzed for bilirubin concentration using the spectrophotometric chemistry analyzer (Architect chemistry analyzer, USA). Bilirubin determination was based on the run of bilirubin with diazotized sulfanilic acid. Azobilirubin concentration was measured at an absorbance of 500-600 nm which is proportionate to bilirubin concentration.
Data was entered and analyzed by SPSS version 23 software. Frequencies and percentages were used to summarize categorical variables and continuous variables were summarized by using median (interquartile ranges (IQR)). For Kramer’s method, those who scored 0 were grouped as no jaundice while 1–5 were considered to have jaundice. TSB test of ≥85 μmol/l were considered to have jaundice and below that as no jaundice. Sensitivity, specificity, predictive values and diagnostic effectiveness (accuracy) were obtained by contingency Tables (2 × 2 table for diagnostic test) . Diagnostic accuracy results were defined as very weak (> 50–60%), weak (> 60–70%), moderate (> 70–80%), good (> 80–90%), or very good (> 90–100%). Positive likelihood ratio (+LR) >10 and negative likelihood ratio (−LR) < 0.1 were considered to be a good test . Cohen kappa statistic (κ) was used to determine agreement of Kramer’s method and TSB results, with (κ) of 0.21–0.40, 0.41–0.60, 0.61–0.80 and 0.81–1.00 being fair, moderate, good and very good agreement, respectively . Association of independent variables and NJ was done using the chi-square test (X2 test) and p < 0.05 was considered to be statistically significant.
580 neonates were admitted at the MNH neonatal unit during the study period. 247 were not eligible because written consent was not obtained as mothers were left at health facilities where they gave birth due to obstetric complications/management and 333 were enrolled. Out of 333, 18 were excluded because blood samples were not taken  and TSB test results were misplaced . Therefore, a total of 315 were subjected to analysis Fig. 1.
Socio-demographic characteristics of the participants
Of 315 respondents with complete data, 182 (57.8%) were males with the median (IQR) age of 4 [2,3,4,5,6,7,8] days. More than half (58.4%) of the neonates were delivered by spontaneous vaginal delivery and half of the neonates 158 (50.2%) were born at preterm with median (IQR) birth weight of 2500 (1700–3000) grams Table 1.
Magnitude of NJ by Kramer’s method, TSB and associated factors
The prevalence of NJ was 63.5% by TSB and 49.8% by Kramer’s method. The median (IQR) TSB for neonates who were clinically not jaundiced was 65(40–106) μmol/l and 178 (121–238) μmol/l for neonates who were clinically jaundiced. Following chi-square test, all variables including age (p = 0.46), gender (p = 0.77), delivery mode (p = 0.13), birthweight (p = 0.19) and gestational age (0.78) were not significantly associated with the occurrence of NJ.
Diagnostic accuracy of the Kramer’s method
Kramer’s method had a Sensitivity-70.5%, Specificity-86.1%, PPV-89.8%, and NPV-62.6% in detecting NJ. It had 5.07 LR of detecting jaundice and 0.34 LR of not detecting jaundice among neonates with a diagnostic accuracy of 0.761. Kramer’s method had moderate agreement with TSB test (κ = 52.4, P < 0.01) Table 2.
Diagnostic performance of Kramer’s method based on term, preterm, and birthweight
For neonates born at term, the Kramer test had 75.3% sensitivity, 82.8% specificity, 86.4% PPV, 69.7% NPV and 56.4% Cohen kappa in comparison to 66.4% sensitivity, 82.8% specificity, 93.0% PPV, 56.0NPV and 48.8% Cohen kappa for preterm neonates, P < 0.001 Table 3.
The study aimed at determining the reliability of Kramer’s method for assessment of NJ among neonates of black descent. Our study found the Kramer’s method to have moderate diagnostic accuracy (76%), moderate sensitivity (70.5%), and good specificity (86.1%). Besides, Kramer’s method was found to have good ability of predicting presence of NJ (89.8%).
Our findings are consistent with the study conducted in Indonesia, in which sensitivity and specificity of Kramer’s method reported to be 76.92 and 89.47% respectively. This study was conducted among neonates of black descent and the diagnostic accuracy found to be moderate which is in line with findings from various studies that reported the diagnostic accuracy of Kramer’s method to be poor for neonates of black ethnicity compared to other groups . Hence, our study emphasizes that Kramer’s method may be used as a predictor (PPV =89.8%) of NJ rather than confirmatory test for the presence of NJ. In contrast to our findings, the Indonesian study reported good (86.27%) diagnostic accuracy of Kramer’s method . The observed discrepancy could be due to differences in sample size and characteristics of study participants since our study involved only neonates of black descent while the study in Indonesia included neonates of black and white descent. In addition, our findings are similar to what was reported in Switzerland in which the diagnostic accuracy of Kramer’s method was found to be moderate (73%) .
Our study also demonstrated slightly increases in sensitivity of Kramer’s method when detecting jaundice among term (75.3%) and normal birthweight (80%) neonates and increase in specificity among neonates with low birthweight (89. 4%). These findings are comparable to what was reported in Denmark and Switzerland where sex, birthweight and gestational age were demonstrated to be the determinants of the diagnostic accuracy of the visual assessment technique [15, 16]. These findings suggest the consideration of gestational age and birthweights when using Kramer’s method in predicting NJ.
TcB, a non-invasive technique, can be used to detect NJ at an early stage. In comparison to TSB, this technique is able to detect bilirubin concentration easily, does not cause heel-puncture pain or irritation to the child, and is simple to calibrate. TcB, however, is unable to detect accurate bilirubin concentrations after 10 mg/dl, necessitating TSB confirmation. Furthermore, since the TcB machine costs thousands of dollars, it is not readily available in resource-limited settings like Tanzania. Similarly, since hemoglobin, melanin, skin types, and age all affect readings, most non-invasive methods cannot be used to diagnose severe disease [17,18,19].
Lastly this study showed that the occurrence of NJ was independent of age, gender, delivery mode, birthweight and gestational age. These findings are comparable to a study done by Aprillia et al. (2017) which reported that gender, skin color and gestational age were not the determinants for the occurance of NJ. However, our findings are inconsistent with a review conducted in USA (2011) which reported that physiological jaundice was expected to peak between 3 and 5 days of life . Additionally, a study conducted in India showed that hyperbilirubinemia was much more common in female than male neonates because albumin concentration differs between gender . The observed difference might be due to differences in study population as the review in USA only included neonates of less than 5 days of life and study in India recruited only neonates with low birthweight).
The fact that this study was conducted at the tertiary referral hospital implies that the prevalence of NJ might have been overestimated. Also, Kramer method could be affected by intra- observer variability which was tacked by reconciling individual observations to the level of one-unit difference in scores, and computing Cohen kappa statistics for reliability during data analysis.
Kramer has a good positive predictive value. However, due to low sensitivity and NPV one cannot say that overall predictive ability is good. Also clinical assessment by Kramer’s method should not be used for screening of NNJ. Further studies are needed to investigate the utility of other non-invasive techniques in detecting NJ among neonates of black descent.
Availability of data and materials
The datasets analyzed during this study are available from the corresponding author on reasonable request.
Case report form
Low birth weight
Muhimbili national hospital
Muhimbili university of health and allied sciences
Normal birth weight
Negative predictive value
Positive predictive value
Statistical product and service solution
Spontaneous vaginal delivery
Total serum bilirubin
United states of America
Brits H, Adendorff J, Huisamen D, Beukes D, Botha K, Herbst H, et al. The prevalence of neonatal jaundice and risk factors in healthy term neonates at National District Hospital in Bloemfontein. Afr J Prim Health Care Fami Med. 2018;10(1):1–6.
Ekwochi U, Osuorah CD, Ndu IK. Correlation between total serum bilirubin and clinico-laboratory parameters of babies admitted for neonatal jaundice in a resource-limited setting. Int J Clinicopathological Correlation. 2018;2(2):21.
Cohen RS, Wong RJ, Stevenson DK. Understanding neonatal jaundice: a perspective on causation. Pediatr Neonatol. 2010;51(3):143–8.
Kulkarni S, Dolas A, Doibale M. Risk factors of neonates with indirect hyperbilirubinemia in a tertiary care hospital. Int J Basic Appl Med Sci. 2014;4(1):395–9.
Wan A, Daud SM, Teh S, Choo Y, Kutty FM. Management of neonatal jaundice in primary care. Malaysian Fam Physician. 2016;11(2–3):16.
Devi S, Dash M, Chitra F. Detection of neonatal jaundice among the newborn using Kramer’s criteria. Epidemiology (Sunnyvale). 2018;8(355):2161–1165.1000355.
Hatzenbuehler L, Zaidi AK, Sundar S, Sultana S, Abbasi F, Rizvi A, et al. Validity of neonatal jaundice evaluation by primary health-care workers and physicians in Karachi, Pakistan. J Perinatol. 2010;30(9):616.
Aprillia Z, Gayatri D, Waluyanti FT. Sensitivity, Specificity, and Accuracy of Kramer Examination of Neonatal Jaundice: Comparison with Total Bilirubin Serum. Compr Child Adolesc Nurs. 2017;40(sup1):88–94.
Keren R, Tremont K, Luan X, Cnaan A. Visual assessment of jaundice in term and late preterm infants. Arch Dis Child Fetal Neonatal Ed. 2009;94(5):F317–F22.
Negida A, Fahim NK, Negida Y. Sample size calculation guide-part 4: how to calculate the sample size for a diagnostic test accuracy study based on sensitivity, specificity, and the area under the roc curve. Advanced J Emerg Med. 2019;3(3):e33-e.
Porter ML, Dennis MBL. Hyperbilirubinemia in the term newborn. Am Fam Physician. 2002;65(4):599.
Kramer LI. Advancement of dermal icterus in the jaundiced newborn. Am J Dis Children. 1969;118(3):454–8.
Florkowski CM. Sensitivity, specificity, receiver-operating characteristic (ROC) curves and likelihood ratios: communicating the performance of diagnostic tests. Clin Biochem Rev. 2008;29(Suppl 1):S83.
Šimundić A-M. Measures of diagnostic accuracy: basic definitions. Med Biol Sci. 2008;22(4):61.
Szabo P, Wolf M, Bucher H, Haensse D, Fauchere J, Arlettaz R. Assessment of jaundice in preterm neonates: comparison between clinical assessment, two transcutaneous bilirubinometers and serum bilirubin values. Acta Paediatr. 2004;93(11):1491–5.
Knudsen A, Ebbesen F. Cephalocaudal progression of jaundice in newborns admitted to neonatal intensive care units. Neonatology. 1997;71(6):357–61.
Saini N, Kumar A, Khera P. Non-invasive bilirubin detection technique for jaundice prediction using smartphones. Int J Comput Sci Inf Secur. 2016;14(8):1060.
Karon BS, Teske A, Santrach PJ, Cook WJ. Evaluation of the BiliChek noninvasive bilirubin analyzer for prediction of serum bilirubin and risk of hyperbilirubinemia. Am J Clin Pathol. 2008;130(6):976–82.
Gupta A, Kumar A, Khera P, editors. Jaundice prediction through non-invasive techniques: Issues and challenges. 2015 Annual IEEE India Conference (INDICON); 2015: IEEE.
Dennery PA, Seidman DS, Stevenson DK. Neonatal hyperbilirubinemia. N Engl J Med. 2001;344(8):581–90.
We thank all the staff members of the neonatal unit at MNH for their tireless support during this study. We acknowledge the MUHAS physiology department members for their contributions, critics and inputs to this work. In addition, we acknowledge management of University of Dodoma for supporting the study. Special thanks to the neonates as well as all parents and guardians who allowed their neonates to be involved in this study.
Ethics approval and consent to participate
Ethical clearance was obtained from Muhimbili University of Health and Allied Sciences (MUHAS) institutional’ review board with reference number DA.287/298/01A/. All procedures performed on study participants were in accordance with the Helsinki declaration. Permission to conduct this study at MNH was obtained from the Office of Research and Publications. Informed consent was obtained from all mothers/guardians before enrollment of their neonates into the study. Participants’ numbers and codes were used instead of names during data collection, analysis and presentation to ensure confidentiality.
Consent for publication
The authors declare that they have no competing interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Dionis, I., Chillo, O., Bwire, G.M. et al. Reliability of visual assessment of neonatal jaundice among neonates of black descent: a cross-sectional study from Tanzania. BMC Pediatr 21, 383 (2021). https://doi.org/10.1186/s12887-021-02859-x
- Neonatal jaundice
- Black descent
- Diagnostic accuracy