Development and validation of a model for early diagnosis of biliary atresia
BMC Pediatrics volume 23, Article number: 549 (2023)
Background and aims
Early diagnosis of biliary atresia (BA), particularly distinguishing it from other causes of neonatal cholestasis (NC), is challenging. This study aimed to design and validate a predictive model for BA by using the data available at the initial presentation.
Infants presenting with NC were retrospectively identified from tertiary referral hospitals and constituted the model design cohort (n = 148); others were enrolled in a prospective observational study and constituted the validation cohort (n = 21). Clinical, laboratory, and abdominal ultrasonographic features associated with BA were assessed. A prediction model was developed using logistic regression and decision tree (DT) analyses.
Three predictors, namely, gamma glutamyl transpeptidase (γGT) level, triangular cord sign (TC sign), and gallbladder abnormalities, were identified as factors for diagnosing BA in multivariate logistic regression, which was used to develop the DT model. The area under the receiver operating characteristic (ROC) curve (AUC) value for the model was 0.905, which was greater than those for γGT level, TC sign, or gallbladder abnormalities alone in the prediction of BA.
A simple prediction model combining liver function and abdominal ultrasonography findings can provide a moderate and early estimate of the risk of BA in patients with NC.
Neonatal cholestasis (NC) is a relatively common clinical issue that presents a complex diagnostic challenge for clinicians . Cholestasis includes a complex set of aetiologies. Therefore, the identification of life-threatening and treatable causes of cholestasis is a high priority. Biliary atresia (BA), which has an overall incidence of approximately 1 in 8000 to 1 in 17,000 , can rapidly progress to biliary cirrhosis and hepatic failure, necessitating liver transplantation. BA is the most common cause of liver transplantation in children  and should be distinguished from other nonsurgical causes of cholestasis in a timely manner. At present, early Kasai portoenterostomy (KP) is associated with longer periods of survival without liver transplantation, and in neonates over 60 days of age, the jaundice disappearance rate decreased to 57.0% . Some of the single methods reported in the literature, such as gamma glutamyl transpeptidase (γGT) and triangular cord sign (TC sign), can diagnose BA with moderate accuracy in an NC patient population [5, 6]; most of these methods were based on single studies, and their accuracy requires improvement. Although a few scoring systems have been proposed [7, 8], these involved invasive diagnostic methods, such as liver biopsy, and did not target patients within 2 months of age.
In this study, we aimed to develop and validate a simple and early predictive diagnosis model based on noninvasive diagnostic methods, including clinical, laboratory, and imaging data, to predict the risk of BA in patients with NC. The results of this study may offer a novel and better algorithm for the early diagnosis of BA and hold potential for clinical application.
Patients and methods
This retrospective and prospective study included two consecutive cohorts of infant patients with NC (1041 in total) who were collected, reviewed, and analysed from the Department of Pediatrics of West China Second University Hospital, Sichuan University, China. Of these, 781 patients were aged over 60 days, 62 patients were assigned to the BA group, and 107 patients were assigned to the non-BA group. The first cohort (model design cohort) consisted of 148 infants between December 2008 and December 2017. The second cohort (validation cohort) consisted of 21 consecutively recruited infants with NC between January 2018 and December 2018. NC was defined by direct or conjugated bilirubin (DBIL) concentration > 17.1 µmol/L, if total serum bilirubin (TBIL) concentration ≤ 85.5 µmol/L, or DBIL concentration > 20% of the TBIL concentration, if TBIL concentration > 85.5 µmol/L . The inclusion criteria were a diagnosis of NC and age ≤ 60 days when visiting our centre. The exclusion criteria were parental refusal of Kasai surgery; loss to follow-up; and other severe systematic deformities, such as BA splenic malformation syndrome.
After complete history-taking, thorough clinical examination, and routine investigation, the diagnosis of BA was confirmed by laparotomy and intraoperative cholangiography (IOC) prior to KP. A diagnosis of BA was ruled out on the basis of specific laboratory tests based on the expected aetiology, negative IOC results in some patients, and follow-up evaluations. Diagnosis in the validation cohort was confirmed as BA by IOC and as non-BA either by confirming the actual aetiology or by excluding the possibility of BA by IOC in cases where the aetiology could not be reached. Data from routine investigations, including stool colour assessment; measurement of TBIL and DBIL concentrations, direct bilirubin/total bilirubin ratio (DBIL/TBIL), alanine transaminase (ALT) concentration, aspartate transaminase (AST) concentration, alkaline phosphatase (ALP) concentration, total bile acid (TBA) concentration, lactate dehydrogenase (LDH) concentration, and gamma glutamyl transpeptidase (γGT) concentration; complete blood count; ultrasonography; and Doppler ultrasonography were reviewed.
Ultrasonography was performed using 9 − 3 MHz and 5 − 2 MHz Philips IU 22 and 12 − 5 MHz and 5 − 2 MHz Philips HD 11 (Royal Philips Electronics, the Netherlands). Combined low-frequency and high-frequency ultrasound was used to examine portal hepatic microcysts for strong echogenic fibrous masses and gallbladder abnormalities. Patients fasted for 4 h before examination and were re-examined 30 min after feeding to evaluate gallbladder contractility. Parameters assessed included the size of the liver, TC sign positivity and abnormal gallbladder. Liver enlargement was defined when the maximum oblique diameter of the right liver exceeded the upper limit of the normal age (90 mm) , the thickness of the echogenic anterior wall of the right portal vein just proximal to the right portal vein bifurcation site, which was used to identify the TC sign , and TC sign positivity was defined when the thickness of TC sign was over the cut-off value of the ROC curve. the length of the gallbladder in longitudinal scanning with a description of its wall regularity, and the thickness of the gallbladder in longitudinal scanning with a description of its wall regularity. Abnormal gallbladder findings were defined as follows: (1) gallbladder length evaluated as less than the normal lower limit for age (15 mm) ; (2) gallbladder thickness evaluated as greater than the normal upper limit for age (3 mm) ; or (3) absence of the gallbladder . Ultrasonographic examination of patients was performed by an experienced ultrasound doctor.
Descriptive results were expressed as number (percentage), mean ± standard deviation (mean ± SD), or median (interquartile range, IQR) (for data that were not normally distributed). For quantitative data, chi-square tests or Fisher’s exact tests were performed to detect the statistical significance of differences between groups, while analysis of variance (ANOVA), t tests, and Wilcoxon tests were used for continuous variables. Binary univariate and multivariate logistic regression analyses were performed for each variable and are presented herein. The diagnostic performance was expressed as sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and accuracy (percentage of correctly identified patients), and all were expressed as percentages. The cut-off values for optical clinical performance (best sensitivity and specificity simultaneously) of individual parameters and the overall accuracy of the scoring system were determined from the ROC.
A decision tree (DT) was constructed using the R package rpart, and a DT plot was drawn using the rattle package. In short, the root node or the first question was “Was γGT > 184 U/L in the patient”? In the subsequent classification trees, “no” indicated a branch to the right, while “yes” represented a branch to the left. The terminal nodes were used to predict BA or non-BA. DT was built for prediction using the RF package with 500 regression trees. The results were considered significant if the p value was ≤ 0.05. Statistical analysis was performed using SPSS Statistics 20.0 and SPSS Modeller 18 software for Windows (SPSS Inc., Chicago, IL, USA).
Demographic, laboratory, and clinical characteristics of the study participants
A total of 148 infants between December 2008 and December 2018 met the eligibility criteria and were retrospectively enrolled. Of these, 52 infants (35.1%) were diagnosed with BA, and the remaining 96 infants (64.9%) had cholestasis due to causes other than BA (non-BA). The mean age was 48 ± 10 days in the BA group and 44 ± 11 days in the non-BA group (P = .02). The majority of non-BA patients were male (55.2%), although the sex distribution was nearly equal in the BA (males, 46.2%) group (P = .31). Stool colour, DBIL, DBIL/TBIL, γGT, gallbladder abnormalities, and positive TC signs were significantly different between the BA and non-BA groups (P < .01), whereas the two groups did not show any differences in ALT, AST, TBIL, LDH, ALP, or hepatomegaly (P > .05). Data for the other characteristics, including clinical findings (stool colour), laboratory findings (ALT, AST, TBIL, DBIL, DBIL/TBIL, γGT, LDH, and ALP), and ultrasonography (hepatomegaly, gallbladder abnormalities, and TC sign) are also listed in Table 1.
Univariate logistic regression analysis of variables significantly associated with BA
Univariate and multivariate logistic regression analyses were performed to determine independent variables associated with BA. Statistically significant differences in variables, including stool colour, DBIL, DBIL/TBIL, γGT, gallbladder, and TC signs, were identified between the BA and non-BA groups (Table 1). The γGT and TC signs showed a medium independent prediction property with AUC > 0.7. However, the AUCs for stool colour, DBIL, DBIL/TBIL, and gallbladder findings were < 0.7 (Table 2).
Establishment and validation of the logistic regression-based nomogram in predicting BA
A nomogram to predict BA was developed on the basis of multivariable logistic regression analysis using the six factors that were identified to be significantly different between the BA and non-BA groups, namely, stool colour, DBIL, DBIL/TBIL, γGT, abnormal gallbladder findings, and TC sign positivity. We found that stool colour, γGT levels, abnormal gallbladder findings, and TC sign positivity were significantly associated with BA (P = < .01), whereas DBIL and DBIL/TBIL were not (P > .05); thus, these four factors were used as predictors to build the nomogram prediction model for BA. The relationship between these factors and BA was assessed using multivariate logistic regression; the resulting data are presented in Table 3.
Establishment of the DT model in predicting BA
The DT for the prediction of BA included four study variables: stool colour, γGT, abnormal gallbladder, and TC sign positivity. For the establishment of the DT model, the first question, also known as the root node, was (1) Was γGT > 184 U/L in the patient? In the classification tree, “no” represents a branch to the left. Infant patients who met this criterion were classified as non-BA. If the answer was “yes,” the second question was (2) Was the TC sign positive? Infant patients who met this criterion were classified as BA. For patients who did not meet the criteria, the tree further queried (3) Was the gallbladder normal? If the answer was “no,” the patients were classified as BA. If the answer was “yes,” the patients were classified as non-BA (Fig. 1). The DT model could discriminate BA with 77.0% sensitivity and 86.0% specificity and an overall diagnostic accuracy of 83.0%.
Validation of the BA diagnostic model
To verify the applicability of this model, a second cohort of infants with cholestatic liver disease was tested, which included infants with BA (n = 10) and those with other infantile cholestatic liver diseases (n = 11). The mean age was 52 ± 8 days in the BA group and 45 ± 11 days in the non-BA group (P = .21). The majority of BA patients were male (60%), and a few non-BA patients were male (36.4%) (P = .28). Data for other characteristics, including γGT, abnormal gallbladder and TC sign positivity, are also listed in Table 4.
Only one of the patients with BA (1/10) was wrongly categorized into the non-BA group, the γGT concentration of the patient was 134 U/L classified as non-BA according to Fig. 1 node 2, and we had no opportunity to use other indicators because there was no branch below node 2. The TC sign was positive, and the gallbladder was abnormal. One of the patients without BA (1/11) was wrongly categorized into the BA group. The γGT concentration of the patient was 192 U/L, classified as BA. It is possible that the γGT concentration of the patient was nearly the cut-off. The model could discriminate BA with 90.0% sensitivity and 90.9% specificity and an overall diagnostic accuracy of 90.5%. The AUC was 0.91, which was better than those of γGT concentration, gallbladder abnormality, and TC sign positivity alone (0.86 0.63, and 0.86).
Accurate diagnosis of BA using existing diagnostic approaches is challenging primarily because of the overlapping features between BA and other forms of NC attributable to different causes. Moreover, the current diagnostic methods may involve radiation exposure or are costly, highly technical, and invasive. We developed a predictive and simple diagnostic model to differentiate BA from other causes of NC at an early stage. Our diagnostic model was composed of three variables based on simple laboratory and imaging findings, which can be obtained cost-effectively, quickly, and noninvasively without exposure to radiation. Validation of the model revealed its high discrimination ability, which was better than those of γGT concentration, gallbladder abnormalities, and TC signs positive alone. Therefore, this model can help clinicians easily and promptly diagnose BA before 60 days.
Early accurate diagnosis of BA is critical for timely intervention with KP to restore bile flow and slow down the progression of this disease in infants [4, 14]. The performance of KP before 60 days of age is known to have better outcomes, and the survival rates with native liver decreased when the age at surgery increased . Therefore, we excluded patients older than 60 days from the study. To our knowledge, this is the first study conducted with infants within 60 days of age.
The current preoperative variables used for the diagnosis of BA primarily include laboratory indices, such as bilirubin , DBIL , and γGT  concentrations. Some studies have reported using γGT to diagnose BA [6, 18], although the cut-off values varied across these studies. Rendon-Macias revealed that a γGT level > 250 U/L had high diagnostic value for BA , and Tang reported that a γGT level > 300 U/L had high diagnostic value . Since the reference value of neonatal γGT level is greatly affected by age, an increasing number of recent studies have suggested that different cut-off values should be used to evaluate infants at different ages . In this study, the cut-off value of γGT was 184 U/L, and the difference from other studies could be attributed to the fact that we selected infants within 60 days of age, which represents a younger population than that reported in previous studies [6, 18].
There are some other parameters that require consideration in this regard. Some indicators we think important, such as clay-like stools, were not included in the model. We found that the odds ratio of clay-like stool was less than that of other stools, perhaps because the early stools of children with BA may show yellow, yellow‒green, or other normal infant stool colours. However, when the bile excretion channel is partially obstructed, stool can appear pale yellow, and the degree of obstruction increases with age, finally resulting in clay or grey‒white stools. The patients we included had early-stage BA. This may be the reason that clay-like stool is not included in the model. Ultrasonography can help rule out biliary malformations such as congenital choledochocoele. It offers the advantages of simplicity, non-invasiveness, cost-effectiveness, and dynamic observation. This is a routine inspection item for children with NC [20, 21]. The TC sign is a very important diagnostic feature . Many studies have reported that the TC sign shows high specificity for diagnosing BA [5, 22]. However, in younger infants, the TC sign may not have completely formed, making its evaluation unclear; thus, the positive rate of the TC sign varies greatly in different age groups . A TC sign > 3.4 mm  or > 4 mm  has been reported to show high diagnostic specificity. In this study, the cut-off TC sign was 3.6 mm. Children with BA often show poor development of the gallbladder or cystic duct, morphological changes, and retraction of the gallbladder. Our study found that the sensitivity and specificity of TC sign positivity were 63.0% and 90%, respectively, and those of gallbladder abnormalities were 91.9% and 48.4%, respectively, in accordance with the results reported by Yoon . The model was accurate for the diagnosis of BA and has potential for clinical application.
In addition to abdominal ultrasound, several medical imaging techniques have been used for screening BA, including cross-sectional magnetic resonance imaging (MRI), hepatobiliary scintigraphy, cholangiopancreatography (MRCP) , duodenal tube test (DTT) and liver biopsy . Yang reported that serum matrix metalloproteinase-7 (MMP-7) may be a reliable biomarker for BA, with high sensitivity (98.7%) and specificity (95.0%) . However, while MMP-7 evaluations may be possible in paediatric liver disease centres, for most clinicians, the MMP-7 level is not a primary evaluation parameter, since many clinicians may be unaware of its significance and most hospitals may not be equipped to perform MMP-7 assays.
Most recently, Kim et al. established an ultrasonography and hepatobiliary scintigraphy-based score for the diagnosis of BA in infants with jaundice and reported good discrimination ability (AUC = 0.98) . However, this score required hepatobiliary scintigraphy, involved exposure to radiation and was technically difficult. El-Guindi developed a diagnostic score for BA that included clinical, laboratory, ultrasonographic, and histopathological parameters, which showed a high accuracy rate of 98.8% in predicting BA . However, liver biopsy is invasive and technically difficult. Thus, existing diagnostic methods appear to have a number of limitations: they are costly and/or involve exposure to radiation, technical difficulty, or highly invasive procedures.
Although our study offered useful information about the value of the nomogram for diagnosing BA, it has a number of limitations that must be acknowledged. First, the model was established retrospectively, and a selection bias may exist as a result. Second, we aimed to discriminate BA from NC in the early stage when KP was feasible, so only infants within 60 days of age were included. Thus, the model may not be appropriate for infants aged > 60 days. Third, the sensitivity of model development was 77.0%, which means that approximately 23.0% of patients (BA with γGT less than 184 U/L) will be mistaken for non-BA because the TC sign and gallbladder cannot be used in these patients. Therefore, the sensitivity of the model should be further improved in the future. For example, different γGT thresholds should be analysed to improve sensitivity, or a diagnosis score system should be developed that combines all indicators. Furthermore, we initially reviewed the data for more than 1000 patients, but most of them did not have BA, while others had BA but were more than 60 days of age; thus, these patients were excluded from our study. Finally, this model was only based on regular liver function and abdominal ultrasonography markers, and other biomarkers were not assessed. Thus, future studies should recruit more infants to validate the model.
By using liver function and abdominal ultrasonography data, we developed a simple and easily applicable model that showed moderate discrimination ability for the diagnosis of BA. This model is only for patients younger than 60 days. For patients aged > 60 days, the model may not be appropriate and may need to be combined with other indicators. However, the model may facilitate prediction of the risk of BA for patients younger than 60 days.
Availability of data and materials
The data that support the findings of this study are available from the corresponding author on reasonable request.
Gamma glutamyl transpeptidase
- TC sign:
Triangular cord sign
Receiver operating characteristic
Area under the receiver operating characteristic curve
Direct or conjugated bilirubin
Total serum bilirubin
Laparotomy and intraoperative cholangiography
Direct bilirubin/total bilirubin ratio
Total bile acid
Analysis of variance
Positive predictive value
Negative predictive value
Magnetic resonance imaging
Duodenal tube test
Gottesman LE, Vecchio D, Aronoff SC. Etiologies of conjugated hyperbilirubinemia in infancy: a systematic review of 1692 subjects. BMC Pediatr. 2015;15:1–8.
Jimenez-Rivera C, Jolin-Dahel KS, Fortinsky KJ, et al. International incidence and outcomes of biliary atresia. J Pediatr Gastroenterol Nutr. 2013;56:344–54.
Moyer V, Freese DK, Whitington PF, et al. Guideline for the evaluation of cholestatic jaundice in infants: recommendations of the North American society for pediatric gastroenterology, hepatology and nutrition. J Pediatr Gastroenterol Nutr. 2004;39:115–28.
Nio M, Sasaki H, Wada M, et al. Impact of age at Kasai operation on short-and long–term outcomes of type III biliary atresia at a single institution. J Pediatr Surg. 2010;45:2361–3.
Nemati M, Rafeey M, Shakeri A. Ultrasound findings in biliary atresia: the role of triangular cord sign. Pakistan J Biol Sci. 2009;12:95–7.
Tang K-S, Huang L-T, Huang Y-H, et al. Gamma-glutamyl transferase in the diagnosis of biliary atresia. Acta Paediatr Taiwanica. 2007;48:196.
Dong R, Jiang J, Zhang S, et al. Development and validation of novel diagnostic models for biliary atresia in a large cohort of Chinese patients. EBioMedicine. 2018;34:223–30.
El-Guindi MA-S, Sira MM, Sira AM, et al. Design and validation of a diagnostic score for biliary atresia. J Hepatol. 2014;61:116–23.
De Bruyne R, Van Biervliet S, Velde V. Clinical practice: neonatal cholestasis. Eur J Pediatrics. 2011;170:279–84.
Konuş O, Ozdemir A, Akkaya A, et al. Normal liver, spleen, and kidney dimensions in neonates, infants, and children: evaluation with sonography. AJR Am J Roentgenol. 1998;171:1693–8.
Koob M, Pariente D, Habes D, et al. The porta hepatis microcyst: an additional sonographic sign for the diagnosis of biliary atresia. Eur Radiol. 2017;27:1812–21.
McGahan JP, Phillips H, Cox K. Sonography of the normal pediatric gallbladder and biliary tract. Radiology. 1982;144:873–5.
Sun Y, Zheng S, Qian Q. Ultrasonographic evaluation in the differential diagnosis of biliary atresia and infantile hepatitis syndrome. Pediatr Surg Int. 2011;27:675–9.
Shneider BL, Magee JC, Karpen SJ, et al. Total serum bilirubin within 3 months of hepatoportoenterostomy predicts short-term outcomes in biliary atresia. J Pediatr. 2016;170:211-217.e212.
Carvalho ED, Santos JL, Silveira TR, et al. Biliary atresia: the Brazilian experience. J Pediatr (Rio J). 2010;86:473–9.
Wang KS, Surgery So, Fetus Co, et al. Newborn screening for biliary atresia. Pediatrics. 2015;136:e1663-1669.
Harpavat S, Finegold MJ, Karpen SJ. Patients with biliary atresia have elevated direct/conjugated bilirubin levels shortly after birth. Pediatrics. 2011;128:e1428-1433.
Rendón-Macías ME, Villasís-Keever MA, Castañeda-Muciño G, et al. Improvement in accuracy of gamma-glutamyl transferase for differential diagnosis of biliary atresia by correlation with age. Turk J Pediatr. 2008;50:253.
Chen X, Dong R, Shen Z, et al. Value of gamma-glutamyl transpeptidase for diagnosis of biliary atresia by correlation with age. J Pediatr Gastroenterol Nutr. 2016;63:370–3.
Wang L, Yang Y, Chen Y, et al. Early differential diagnosis methods of biliary atresia: a meta-analysis. Pediatr Surg Int. 2018;34:363–80.
Zhou L, Shan Q, Tian W, et al. Ultrasound for the diagnosis of biliary atresia: a meta-analysis. Am J Roentgenol. 2016;206:W73–82.
Di Serafino M, Esposito F, Mercogliano C, et al. The triangular cord sign. Abdom Radiol (New York). 2016;41:1867–8.
Lee SM, Cheon J-E, Choi YH, et al. Ultrasonographic diagnosis of biliary atresia based on a decision-making tree model. Korean J Radiol. 2015;16:1364–72.
Yoon HM, Suh CH, Kim JR, et al. Diagnostic performance of sonographic features in patients with biliary atresia: a systematic review and meta-analysis. J Ultrasound Med. 2017;36:2027–38.
Brittain JM, Kvist N, Johansen LS, et al. Hepatobiliary scintigraphy for early diagnosis of biliary atresia. Dan Med J. 2016;63:A5253.
Mandelia A, Lal R, Mutt N. Role of hepatobiliary scintigraphy and preoperative liver biopsy for exclusion of biliary atresia in neonatal cholestasis syndrome. Indian J Pediatr. 2017;84:685–90.
Yang L, Zhou Y, Xu Pp, et al. Diagnostic accuracy of serum matrix metalloproteinase-7 for biliary atresia. Hepatology. 2018;68:2069–77.
Kim JR, Hwang J-Y, Yoon HM, et al. Risk estimation for biliary atresia in patients with neonatal cholestasis: development and validation of a risk score. Radiology. 2018;288:262–9.
We acknowledge the editors and anonymous reviewers for insightful suggestions on this work. We also thank Nature Research Editing Service for English language editing.
This study was funded by the Research Foundation of Guangzhou Women and Children’s Medical Center (No.1600048-04,No.1600060-04) and the Science and Technology Programme of Guangzhou (No.2023A04J0604).The fundings hospital had no role in the study design, recruitment of individuals, data analysis, or writing of the report.
Ethics approval and consent to participate
The study was approved by the Research Ethics Committee of West China Second University Hospital, Sichuan University, and the requirement for informed consent was waived owing to the observation nature of this study. This study was performed in compliance with the Declaration of Helsinki and other relevant regulations.
Consent for publication
All authors agreed to publish this article.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Gong, Z., Lin, L., Lu, G. et al. Development and validation of a model for early diagnosis of biliary atresia. BMC Pediatr 23, 549 (2023). https://doi.org/10.1186/s12887-023-04370-x