Limitations and opportunities of whole blood bilirubin measurements by GEM premier 4000®
© The Author(s). 2017
Received: 7 November 2016
Accepted: 21 March 2017
Published: 29 March 2017
Neonatal hyperbilirubinemia has traditionally been screened by either total serum bilirubin or transcutaneous bilirubin. Whole blood bilirubin (TwB) by the GEM Premier 4000® blood gas analyzer (GEM) is a relatively new technology and it provides fast bilirubin results with a small sample volume and can measure co-oximetry and other analytes. Our clinical study was to evaluate the reliability of TwB measured by the GEM and identify analytical and clinical factors that may contribute to possible bias.
440 consecutive healthy newborn samples that had plasma bilirubin ordered for neonatal hyperbilirubinemia screening were included. TwB was first measured using the GEM, after which the remainder of the blood was spun and plasma neonatal bilirubin was measured using the VITROS 5600® (VITROS).
62 samples (14%) were excluded from analysis due to failure in obtaining GEM results. Passing-Bablok regression suggested that the GEM results were negatively biased at low concentrations of bilirubin and positively biased at higher concentrations relative to the VITROS results (y = 1.43x-61.13). Bland-Altman plots showed an overall negative bias of the GEM bilirubin with a wide range of differences compared to VITROS. Both hemoglobin concentration and hemolysis affected the accuracy of the GEM results. Clinically, male infants had higher mean bilirubin levels, and infants delivered by caesarean section had lower hemoglobin levels. When comparing the number of results below the 40th percentile and above the 95th percentile cut-offs in the Bhutani nomogram which would trigger discharge or treatment, GEM bilirubin exhibited poor sensitivity and poor specificity in contrast to VITROS bilirubin.
An imperfect correlation was observed between whole blood bilirubin measured on the GEM4000® and plasma bilirubin on the VITROS 5600®. The contributors to the observed differences between the two instruments were specimen hemolysis and the accuracy of hemoglobin measurements, the latter of which affects the calculation of plasma-equivalent bilirubin. Additionally, the lack of standardization of total bilirubin calibration particularly in newborn specimens, may also account for some of the disagreement in results.
KeywordsNeonatal Hyperbilirubinemia Screening Whole blood bilirubin
Jaundice is the most common condition that requires medical attention in newborns. Although it is benign and transient in most cases, some neonates with severe hyperbilirubinemia are at risk for bilirubin-induced neurologic dysfunction (BIND), which occurs when unconjugated bilirubin crosses the blood-brain barrier and binds to brain tissue causing irreversible neurologic damage if left untreated . Prevention of BIND requires appropriate clinical assessment for risk factors, timely measurement of bilirubin levels and early treatment with either phototherapy or exchange transfusion . Universal screening for newborn hyperbilirubinemia by measuring either total serum bilirubin (TSB) or transcutaneous bilirubin (TcB) has been adopted by many countries . TSB has been the clinical standard for determining risk for kernicterus since the publication of Bhutani (1999) . TcB is known to be less accurate at high bilirubin levels and may be affected by skin pigmentation (although improved with newer models) and skin thickening, making it unsuitable for older neonates [5, 6]. Most importantly, TcB is a physiologically different parameter than TSB because TcB assesses mainly extravascular bilirubin whereas TSB reflects the intravascular bilirubin concentration .
Whole blood bilirubin (TwB) analyzed on a blood gas instrument is a new and promising alternate method for neonatal hyperbilirubinemia screening. Compared to TSB it requires a smaller sample volume, has a faster turnaround time, and offers concurrent measurement of a full range of analytes (blood gas, electrolytes, glucose, lactate and co-oximetry), which allows for an efficient and comprehensive assessment of newborn status especially in critically ill neonates. Given these potential advantages of TwB, we conducted a prospective clinical study to compare TwB to TSB in the laboratory. Our objectives were to compare TwB results on the GEM Premier 4000® blood gas analyzer (Instrumentation Laboratory, Bedford, MA) (GEM) against plasma neonatal bilirubin results on the VITROS 5600® (Ortho Clinical Diagnostics, Rochester, NY) (VITROS) and to examine whether bilirubin measurement is influenced by various pre-analytical, analytical or clinical factors.
Three hundred and thirty-one newborns for whom bilirubin testing was clinically required were enrolled between May and August 2014 (total of 440 samples as some neonates had multiple samples). The inclusion criteria were for the neonates to be <14 days postnatal age and that sufficient blood volume was collected to enable measurement of both whole blood and plasma bilirubin. Capillary whole blood samples were collected into plain heparinized microtainers. TwB was measured using the GEM after which the remainder of the specimen was centrifuged and the clinically required plasma neonatal bilirubin was measured using the unconjugated and conjugated bilirubin (BuBc) slide on the VITROS. Clinical data such as gestational age, gender, birth weight, postnatal age, Apgar score and delivery methods (caesarean section (CS) vs vaginal delivery) were extracted from chart reviews. This study was approved by the Research Ethics Committee of the University of British Columbia (H13-02615). Consent was not required as this was a quality assessment project in the lab and no additional blood was drawn.
Laboratory analysis principles
TwB measured by the GEM is a point-of-care test that was approved by the Federal Drug Administration in 2010. The blood gas analyzer measures total bilirubin along with the hemoglobin (Hb) fractions in hemolyzed whole blood samples by direct spectrophotometry in the co-oximetry module. In order to distinguish each component, multi-wavelengths are used and multi-variate regression algorithms established in a spectral library are compared when measuring a whole blood sample. Plasma equivalent bilirubin results are then reported using a formula [Bili plasma = bili whole blood/(1-hematocrit)], and hematocrit (Hct) is calculated as [Hct = 0.03x Hb]. The constant 0.03 represents the average concentration of hemoglobin (g/dL) within the red blood cells. GEM bilirubin is calibrated based on total bilirubin spectrum made of known levels of conjugated and unconjugated bilirubin. Since the GEM uses a direct spectrophotometric assay, it cannot distinguish different fractions of bilirubin.
Neonatal bilirubin concentrations in heparinized plasma were measured using the multi-layered slide technique by reflectance spectrophotometry on a VITROS 5600® analyzer. In the BuBc slide, plasma distributes and reacts with reagents while Hb and other interfering compounds are trapped and masked from being measured. Bu and Bc are measured simultaneously. The VITROS calibrator is composed of high grades of Bu that is traceable to NIST 916a and the synthetic ditaurobilirubin disodium salt for Bc. The Hemolysis index (H index) is a quantitative estimate of Hb that can be measured spectrophotometrically using the VITROS and was used to assess the effect of hemolysis on the measurement of whole blood bilirubin by the GEM.
The study population was characterized by descriptive statistics. Passing-Bablok regression, Bland-Altman plots , and paired sample t-tests were used to examine the differences between the whole blood bilirubin (GEM) and BuBc measurements (VITROS). To determine the relationship between the difference in measurements by the GEM and the VITROS and clinical factors such as gestational age, gender, birth weight, Apgar score and delivery mode, we used Welch’s t-tests for comparisons between two groups, ANOVA for comparisons among more than two groups, and linear regression for relationships with continuous variables. Apgar scores were dichotomized into <7 and ≥7 groups for comparison. All comparisons of the above clinical variables were conducted on the first sample from each neonate.
We used generalized additive modeling to describe the association between the difference in methods and Hb concentration (as the relationship was clearly non-linear) and linear modeling to describe the association between the difference in methods and hemolysis (H index) on all samples, including the repeat samples on some neonates.
Finally, we assessed the predictive value of the GEM whole blood bilirubin results by applying the Bhutani nomogram  to classify them into the different risk categories. We also classified the VITROS values against the nomogram and used the numbers in each of the risk categories as the standard against which to compare the GEM results. We calculated sensitivity, specificity, negative predictive value (NPV), and positive predictive value (PPV) for risk cutoffs at the <40th percentile and >95th percentile, which would trigger discharge and treatment, respectively. Analyses were carried out using R version 3.2.4.
Demographic and clinical variables of the 318 unique neonates
Gestational age (weeks), mean (SD)
Postnatal age (hours), median (IQR)a
Mode of delivery
APGAR at 1 min <7
APGAR at 5 min <7
Hemoglobin effects/hemolysis effects
Clinical factors contributing to the differences in bilirubin measurement between the GEM and VITROS methods
A statistically significant difference in bilirubin levels was detected between male and female infants. When plasma bilirubin was measured using the BuBc slide, male infants had higher levels with an average concentration of 163.5 umol/L and standard deviation of 58.2 umol/L than female infants with an average of 143.4 umol/L and a standard deviation of 52.1 umol/L (Welch’s t = −3.2, df = 313, p = 0.001). A significant difference in whole blood bilirubin levels between male and female infants (Welch’s t = −3.2, df = 312.9, p = 0.002) was also detected when measured with the GEM methodology. Similar to the BuBc slide, male infants had higher whole blood bilirubin concentrations when compared to female infants (162.9 umol/L for male; 136.1 umol/L for female) when measured with the GEM method. Although method differences were observed in gender specific bilirubin levels, these differences were not statistically significant.
Similarly, we found that there were no significant effects of mode of delivery, gestational age, birth weight and Apgar scores on the difference between the two methods (data not shown). However, we found that there was a significant difference in the mean Hb concentrations in neonates delivered by CS in comparison to those delivered vaginally (Welch's t = −2.86, df = 230.1, p = 0.005). Babies born via CS had lower hemoglobin levels on average (190.8 g/L vs 197.7 g/L).
Sensitivity, specificity, and predictive values for VITROS nomogram vs GEM nomogram
GEM results of >95th percentile on the Bhutani nomogram compared to VITROS results of >95th percentile
Negative (no treatment)
42 (true positives)
50 (false positives)
Negative (no treatment)
23 (false negatives)
258 (true negatives)
GEM results of <40th percentile on the Bhutani nomogram compared to VITROS results of >95th percentile
Negative (no discharge)
87 (true positives)
55 (false positives)
Negative (no discharge)
24 (false negatives)
207 (true negatives)
This clinical study compared the bilirubin measurements between the whole blood GEM methodology and the plasma VITROS BuBc slide methodology in healthy newborns with capillary samples. We found an imperfect correlation between TwB and neonatal bilirubin such that bilirubin levels were underestimated at low concentrations by the GEM method while at concentrations greater than 142 umol/L, the GEM method tended to overestimate relative to the VITROS method. When classifying the risks for neonatal hyperbilirubinemia based on the Bhutani nomogram with these two methods, GEM exhibited low sensitivity and low positive predictive value in comparison to VITROS. Many factors including pre-analytical, analytical and clinical may have contributed to the observed differences between the methods.
First, although TwB is measured, the instrument reports plasma equivalent bilirubin using a calculation that relies on the measurement of Hb. While this conversion facilitates comparison with the Bhutani nomogram for risk stratification, inaccurate Hb leads to inaccurate plasma equivalent bilirubin results. Additionally, TwB was unreportable in 14% of samples because Hb was outside of the manufacturer’s claimed measurable range (30 g/L - 230 g/L). This observation could be related to the methodology used by the GEM to measure Hb or pre-analytical factors such as microclot formation due to poor sample mixing which may have affected the accuracy of the co-oximetry with falsely high Hb results. Interestingly, we did find that healthy term newborns within the first 2 days of life have higher normal Hb levels up to 239 g/L . Neonatal Hb levels may be even higher due to polycythemia or other conditions and this will definitely challenge the analytical range of Hb by the GEM methodology (upper analytical range is 230 g/L). Specimens with Hb exceeding the 230 g/L will lead to a failure in generating a bilirubin result and a high failure rate would be problematic for a diagnostic device being considered as a screening tool. We also found that hemolysis can affect the difference between the GEM and the VITROS. Since all whole blood samples have to be hemolyzed before co-oximetry analysis, it is puzzling why hemolysis can affect accuracy. Based on the similar trends observed for both hemoglobin effect and hemolysis effect, we postulate this is just another reflection that high Hb concentration in the hemolyzed specimen can affect accuracy.
Second, the difference between results by the two methods might be related to differences in calibration. The GEM bilirubin is calibrated to total bilirubin, whereas the BuBc slides on the VITROS is calibrated to the respective fractions. Two tests are available from Ortho Diagnostics: neonatal bilirubin (BuBc) and total bilirubin (TBIL) , with BuBc being recommended for testing in neonates under 2 weeks of age and TBIL for patients 15 days old and above by the manufacturer . The sum of Bu and Bc is not the same as total bilirubin because the latter contains not only Bu and Bc, but also delta bilirubin which is a bilirubin covalently bounded with albumin. Due to the lack of interference by Hb , the simultaneous measurement of Bu and Bc, and the absence of a clinically significant difference between BuBc and total bilirubin measurements in newborns (neonates have <2% delta bilirubin) [11, 12], we only offer BuBc in our hospital to avoid clinician confusion. Commercial calibrators used by field methods are known to affect the accuracy of bilirubin measurements on chemistry analyzers  and the clinical impact by different calibrators has been published recently. Kuzniewicz et al.  reported that the recalibration of the BuBc method by the manufacturer in 2012 led to a 39% relative reduction in infants with a TSB level of 257 umol/L and more than 50% reduction in both birth hospital phototherapy and readmissions for phototherapy. Similarly, the implementation of a new formulation of total bilirubin by another manufacturer (Roche diagnostics) also affected neonatal phototherapy rates even compared to its previous formulation . Our comparison of GEM total bilirubin to BuBc instead of TBIL would add another challenge for the comparison with the whole blood bilirubin method by GEM. Standardization is needed for all bilirubin methods, which would improve the care for neonatal hyperbilirubinemia and allow for neonates to be monitored using different methods during the course of their disease.
We also analyzed possible contributing clinical factors to the observed differences in the methods and found a significant difference between methods for male and female infants. Additionally, we found that male infants had higher bilirubin levels than female infants in our patient population, which supports a previous report that male gender is associated with a higher risk of developing hyperbilirubinemia . This suggests that the relationship between the difference in the methods and infant sex was at least partially driven by the relationship between average bilirubin levels and infant sex.
For comparisons at the >95th percentile level which would trigger treatment, sensitivity and PPV were 65 and 46%, respectively. These results suggest that had GEM bilirubin been used as the screening tool for risk stratification based on the Bhutani nomogram, 35% of neonates that would have been treated by VITROS results would not have been treated by the GEM results. Similarly, for low risk neonates (<40th percentile), who can be safely discharged, the sensitivity and PPV were 78 and 61%, respectively. This suggests that only 61% of those low-risk levels by GEM would be classified as low-risk by VITROS. These results support the conclusion that GEM bilirubin is neither sensitive nor specific enough to screen for neonatal hyperbilirubinemia.
Finally, we found that neonates delivered by CS had lower Hb concentrations than neonates delivered vaginally. This is consistent with the previous finding that neonates post vaginal deliveries have higher hemoglobin levels which could be due to the recommendation of delayed cord clamping which increases placental-fetal transfusion , whereas the duration of placental transfusion in CS tends to be shorter since an immediate cord clamping is often performed to avoid maternal bleeding, infections or other surgery-related complications . A systematic review and meta-analysis study also indicated that CS compared with vaginal delivery, was associated with an increased placental residual blood volume which would cause a decreased level of hematological indices including hemoglobins in the neonates . These interesting findings could be achieved in this study because of the availability of Hb measurements by whole blood gas analyzers. This is extremely useful when caring for babies with hyperbilirubinemia due to hemolytic causes as simultaneous availability of whole blood bilirubin and Hb results would reduce the total blood volume and time needed for both tests in the lab.
This study suggests that TwB by the GEM is not yet ready for neonatal hyperbilirubinemia screening. Handling the difficult matrix of whole blood samples from newborns requires technological improvements to control pre-analytical and analytical variables. Ultimately, improvement of total bilirubin calibration with standardization and the accuracy of Hb measurements will help TwB become utilized clinically. With technical improvement, we believe whole blood bilirubin has the potential to become a valid screening tool considering the benefits of having additional analytes especially hemoglobin.
GEM Premier 4000® blood gas analyzer
Total serum bilirubin
Whole blood bilirubin
We gratefully acknowledge the mentorship of Dr. Cathy Halstead and the keen assistance of the medical laboratory technologists and assistants of the chemistry lab, especially Ms. Leslie Trowski. We also thank Michelle Dittrick for REB application and two nursing students: Abraham Cabaccang & Anita Wong for collecting demographic data of the study.
No internal or external funding for this manuscript.
Availability of data and materials
The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.
LW conceptualized and designed the study, drafted the initial manuscript, and approved the final manuscript as submitted; AA carried out the initial analyses, reviewed and revised the manuscript, and approved the final manuscript as submitted; BJ, KH and ML were involved in the interpretation of data, revising the manuscript and approved the final manuscript as submitted; MB designed the data collection sheets, coordinated and supervised data collection, critically reviewed the manuscript, and approved the final manuscript as submitted.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
This study was approved by the Research Ethics Committee of the University of British Columbia (H13-02615). Consent was not required as this was a quality assessment project in the lab and no additional blood was drawn.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Shapiro SM. Definition of the clinical spectrum of kernicterus and bilirubin-induced neurologic dysfunction (BIND). J Perinatol. 2005;25(1):54–9.View ArticlePubMedGoogle Scholar
- American Academy of Pediatrics Subcommittee on Hyperbilirubinemia. Management of hyperbilirubinemia in the newborn infant 35 or more weeks of gestation. Pediatrics. 2004;114(1):297–316.View ArticleGoogle Scholar
- Raimondi F, Ferrara T, Borrelli AC, Schettino D, Parrella C, Capasso L. Neonatal hyperbilirubinemia: a critical appraisal of current guidelines and evidence. J Pediatr Neonatal individualized Med. 2012;1(1):25–32.Google Scholar
- Bhutani VK, Johnson L, Sivieri EM. Predictive ability of a predischarge hour-specific serum bilirubin for subsequent significant hyperbilirubinemia in healthy term and near-term newborns. Pediatrics. 1999;103(1):6–14.View ArticlePubMedGoogle Scholar
- El-Beshbishi SN, Shattuck KE, Mohammad AA, Petersen JR. Hyperbilirubinemia and transcutaneous bilirubinometry. Clin Chem. 2009;55(7):1280–7.View ArticlePubMedGoogle Scholar
- Bosschaart N, Kok JH, Newsum AM, Ouweneel DM, Mentink R, van Leeuwen TG, Aalders MC. Limitations and opportunities of transcutaneous bilirubin measurements. Pediatrics. 2012;129(4):689–94.View ArticlePubMedGoogle Scholar
- Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–10.View ArticlePubMedGoogle Scholar
- Cousineau J, Anctil S, Carceller A, Gonthier M, Delvin EE. Neonate capillary blood gas reference values. Clin Biochem. 2005;38(10):905–7.View ArticlePubMedGoogle Scholar
- VITROS Chemistry Products BuBc Slides, TBIL Slides and Bilirubin Supplement [Package inserts]. http://techdocs.orthoclinical.com/TechDocs/TechDocSearch.aspx?tID=0&culture=en-ca. Accessed 4 Nov 2016.
- Lo SF, Jendrzejczak B, Doumas BT. Total or neonatal bilirubin assays in the Vitros 5,1 FS: hemoglobin interference, hemolysis, icterus index. Clin Chem. 2007;53(4):799–800.View ArticlePubMedGoogle Scholar
- Langbaum ME, Farber SJ, Rosenthal P. Automated total and neonatal bilirubin values in newborns: is a distinction clinically relevant? Clin Chem. 1992;38(9):1690–3.PubMedGoogle Scholar
- Brett EM, Hicks JM, Powers DM, Rand RN. Delta bilirubin in serum of pediatric patients: correlations with age and disease. Clin Chem. 1984;30(9):1561–4.PubMedGoogle Scholar
- Lo SF, Doumas BT. The status of bilirubin measurements in U.S. laboratories: why is accuracy elusive? Semin Perinatol. 2011;35(3):141–7.View ArticlePubMedGoogle Scholar
- Kuzniewicz MW, Greene DN, Walsh EM, McCulloch CE, Newman TB. Association between laboratory calibration of a serum bilirubin assay, neonatal bilirubin levels, and phototherapy use. JAMA Pediatr. 2016;170(6):557–61.View ArticlePubMedGoogle Scholar
- Lyon ME, Baerg KL, Olson TN, Agnew BL, Smith-Fehr JC, Lyon AW. The clinical impact of implementing the Roche® bilirubin total Gen.3 method on neonate phototherapy. Clin Biochem. 2015;48(16–17):1171–3.View ArticlePubMedGoogle Scholar
- Newman TB, Xiong B, Gonzales VM, Escobar GJ. Prediction and prevention of extreme neonatal hyperbilirubinemia in a mature health maintenance organization. Arch Pediatr Adolesc Med. 2000;154(11):1140–7.View ArticlePubMedGoogle Scholar
- World Health Organization. WHO recommendations for the prevention and treatment of postpartum haemorrhage. http://apps.who.int/iris/bitstream/10665/75411/1/9789241548502_eng.pdf. Accessed 4 Nov 2016.
- Shirvani F, Radfar M, Hashemieh M, Soltanzadeh MH, Khaledi H, Mogadam MA. Effect of timing of umbilical cord clamp on newborns’ iron status and its relation to delivery type. Arch Iran Med. 2010;13(5):420–5.PubMedGoogle Scholar
- Zhou YB, Li HT, Zhu LP, Liu JM. Impact of cesarean section on placental transfusion and iron-related hematological indices in term neonates a systemic review and meta-analysis. Placenta. 2014;35(1):1–8.View ArticlePubMedGoogle Scholar