Computer-based facial recognition as an assisting diagnostic tool to identify children with Noonan syndrome

Huang, Yulu; Sun, Haomiao; Chen, Qinchang; Shen, Junjun; Han, Jin; Shan, Shiguang; Wang, Shushui

doi:10.1186/s12887-024-04827-7

Research
Open access
Published: 24 May 2024

Computer-based facial recognition as an assisting diagnostic tool to identify children with Noonan syndrome

Yulu Huang¹^na1,
Haomiao Sun^3,4^na1,
Qinchang Chen²,
Junjun Shen²,
Jin Han⁵,
Shiguang Shan^3,4 &
…
Shushui Wang^1,2

BMC Pediatrics volume 24, Article number: 361 (2024) Cite this article

411 Accesses
Metrics details

Abstract

Background

Noonan syndrome (NS) is a rare genetic disease, and patients who suffer from it exhibit a facial morphology that is characterized by a high forehead, hypertelorism, ptosis, inner epicanthal folds, down-slanting palpebral fissures, a highly arched palate, a round nasal tip, and posteriorly rotated ears. Facial analysis technology has recently been applied to identify many genetic syndromes (GSs). However, few studies have investigated the identification of NS based on the facial features of the subjects.

Objectives

This study develops advanced models to enhance the accuracy of diagnosis of NS.

Methods

A total of 1,892 people were enrolled in this study, including 233 patients with NS, 863 patients with other GSs, and 796 healthy children. We took one to 10 frontal photos of each subject to build a dataset, and then applied the multi-task convolutional neural network (MTCNN) for data pre-processing to generate standardized outputs with five crucial facial landmarks. The ImageNet dataset was used to pre-train the network so that it could capture generalizable features and minimize data wastage. We subsequently constructed seven models for facial identification based on the VGG16, VGG19, VGG16-BN, VGG19-BN, ResNet50, MobileNet-V2, and squeeze-and-excitation network (SENet) architectures. The identification performance of seven models was evaluated and compared with that of six physicians.

Results

All models exhibited a high accuracy, precision, and specificity in recognizing NS patients. The VGG19-BN model delivered the best overall performance, with an accuracy of 93.76%, precision of 91.40%, specificity of 98.73%, and F1 score of 78.34%. The VGG16-BN model achieved the highest AUC value of 0.9787, while all models based on VGG architectures were superior to the others on the whole. The highest scores of six physicians in terms of accuracy, precision, specificity, and the F1 score were 74.00%, 75.00%, 88.33%, and 61.76%, respectively. The performance of each model of facial recognition was superior to that of the best physician on all metrics.

Conclusion

Models of computer-assisted facial recognition can improve the rate of diagnosis of NS. The models based on VGG19-BN and VGG16-BN can play an important role in diagnosing NS in clinical practice.

Peer Review reports

Introduction

Patients suffering from most genetic syndromes (GSs) have characteristic craniofacial appearances, and the facial dysmorphia in people with the same GS is usually similar [1]. In 2003, Loos et al. [2] were the first to use a machine learning-based method to classify five syndromes with an accuracy of 76%. Advances in the technologies used for facial analysis have spawned a large number of studies on automatic facial recognition for the identification of GSs [3,4,5]. With improvements in data storage and computational power in recent years, the convolution neural network (CNN) has emerged as the most important method of facial recognition. Unlike traditional methods of machine learning, the CNN can automatically extract the most discriminative features from the input images by using the convolution operation. In 2020, Qin et al. [6] used the CNN to develop a model for the identification of Down syndrome (DS) from the facial features of subjects. It achieved an accuracy of 95.87% and a specificity of 97.40% in distinguishing subjects with DS from healthy subjects. In 2021, Pan et al. [7] established an automatic system based on the CNN to identify patients with Turner syndrome (TS) based on their facial features. It obtained an accuracy of 96.9% and a specificity of 97%. These studies show that facial recognition can be used for the identification of a variety of GSs.

Noonan syndrome (NS) (OMIM: 163,950), first reported by Noonan and Ehmke [8], is one of the most common types of GSs. NS is a genetically heterogeneous disorder, with an estimated prevalence of one in 1,000–2,500, that is caused by germline mutations in the 12 critical genes of the highly conserved Ras/mitogen-activated protein kinases (MAPK) pathway [9]. It is characterized by distinctive facial features, a short stature, congenital heart defect, and developmental delays of varying degrees. Researchers have explored facial recognition based on traditional machine learning for the identification of NS [3, 10,11,12]. However, few studies have considered the identification of NS based on facial recognition by using the CNN. In past work, our research team developed a model of facial recognition for identifying NS by using the CNN that achieved an accuracy of 81% in distinguishing between NS patients and patients suffering from other GSs¹³. The accuracy of this model needs to be further improved.

Patients and methods

Patients and dataset

We constructed a dataset consisting of 3,948 frontal facial images of 233 NS patients, 863 patients suffering from other GSs, and 796 healthy children.

Photographs of 78 NS patients (Figs. 1), 285 patients suffering from other GSs, and 796 healthy children were collected from the Guangdong Provincial Peoples’ Hospital in China from January 2017 to June 2023. We used one to 10 frontal photos of each subject. The diagnoses of NS and the other GSs were confirmed through genetic testing. Nineteen healthy children were subjected to genetic testing to confirm that they did not have any GS, while the other healthy children were evaluated by two pediatric geneticists for the same purpose. Only one patient in our hospital belonged to the Zhuang ethnic group, while all the other GS patients were ethnic Han. We also collected photographs of 37 NS patients from the medical literature [8, 13,14,15,16,17,18,19], as well as photographs of 118 NS patients and 578 patients suffering from other GSs from the GestaltMatcher database [20] (GMDB, available at: https://db.gestaltmatcher.org). One frontal photograph of each patient was collected from the literature and the GMDB. All facial images collected from the literature and the GMDB were sufficiently clear for model construction. Information on the ages of the NS patients, data for whom were collected from the literature and the GMDB, was not obtained. We used an online software called Facial Age (https://www.facialage.com/) to estimate their ages. This information is provided in Supplemental File 1.

The training set contained 365 images of 186 NS patients, 2,203 images of 705 patients suffering from other GSs, and 630 images of 630 healthy children. The test set contained 124 images of 47 NS patients, 460 images of 158 patients suffering from other GSs, and 166 images of 166 healthy children. 2,663 images of 863 patients suffering from a total of 78 kinds of other GSs were collected (Table 1). A total of 3,948 images of 1,892 subjects were considered.

Table 1 Other genetic syndromes

Full size table

This study was authorized by the Research Ethics Committee of Guangdong Provincial People’s Hospital (Project No. KY-Z-2020-033-04). Written informed consent was obtained from the guardians of the patients. Permission to use images from the GMDB was also obtained. All facial images collected from the literature and the GMDB were used only for AI-based research.

Image pre-processing

Image pre-processing consisted of three steps: face detection, data augmentation, and image normalization. Face detection was performed by using the multi-task convolutional neural network (MTCNN), which applies a cascade structure with three multi-task networks: a proposal network (P-Net), a refinement network (R-Net), and an output network (O-Net) [21]. Each image was initially resized to different scales to build an image pyramid. The images were then input to a three-stage cascaded framework. In the first stage, copies of the images were fed to P-Net, which generated candidate bounding boxes containing the faces of the subjects. R-Net, a complex version of the CNN, refined the windows in the second stage to reject a large number of windows that did not feature the faces of the subjects. Finally, O-Net, a more powerful version of the CNN, was applied to process the output of R-Net to extract more features from it, and high-confidence bounding boxes for the face and five landmark points (left eye, right eye, tip of the nose, and left and right corners of the mouth) were generated.

Augmentation technology was used to increase the diversity of the data and expand the sample size. The images were randomly manipulated through rotation, jittering, cropping, and flipping while the facial features of the subjects remained unchanged. The images were normalized to 256 pixels × 256 pixels to match the number of dimensions of the input. The normalized images were then used to construct seven models.

Architectures

The following CNN architectures were used to construct seven models: VGG16, VGG19, VGG16-BN, VGG19-BN, ResNet50, MobileNet-V2, and SENet.

VGGNet [22] is an architecture that applies small 3 × 3 convolution kernels to capture the receptive field. VGG16 consists of 13 convolutional layers and three fully connected layers. VGG19 adds three convolutional layers to VGG16. Batch normalization (BN) [23] is a mechanism that reduces the variation in the distribution of the inputs to each mini-batch and expedites the training of the neural networks. BN is inserted before every ReLU in the convolutional layers of the VGG16-BN and VGG19-BN models. The mini-batch size was set to 64 in this study. ResNet50 [24] consists of 49 convolutional layers and an average pooling layer. It contains residual blocks in which the inputs can be passed directly to the outputs. The residual blocks avoid the problem of the vanishing gradient and help learn complex features. SENet [25] introduces an architectural unit called the squeeze-and-excitation (SE) block based on ResNet. The SE block performs three sequential operations on the input image: squeezing, excitation, and reweighting. MobileNet [26,27,28] is a lightweight CNN designed for mobile and embedded devices. MobileNet-V2 introduces a linear bottleneck and an inverted residual structure to avoid losing a large amount of information.

Construction and testing of CNN models

We used the ImageNet dataset to pre-train the CNN architectures to expedite their convergence by enabling them to capture the general characteristics of the images. The extracted parameters of the facial features were stored in the convolutional and pooling layers, which were frozen to ensure that the parameters of the pre-trained networks could not be adjusted by back-propagation. Pre-training was followed by the initialization of the weights of each CNN model. The last classification layer (softmax) was replaced with a fully connected layer comprising two outputs for binary classification. Each model was then fine-tuned by redesigning and training the fully connected layers on the training set. All models were trained for a maximum of 100 epochs, with a mini-batch size of 64. The training data were randomly shuffled before every epoch. The initial learning rate was set to 0.1, and decayed according to a cosine annealing schedule. At the beginning of each training epoch, we used the initial parameters to perform a forward calculation on all images and recorded the predicted probability of NS for each image. The cross-entropy loss function was subsequently used to calculate the loss between the output labels and the real labels. Following this, the Adam optimizer was used to perform back-propagation and update the model parameters. These steps were repeated until each model finally converged.

The test set was used to evaluate the performance of each model. For each input image, each model provided a probability that predicted whether the corresponding patient suffered from NS. When this probability exceeded 50%, the patient was classified as an NS patient, while the relevant patient was classified as a GS patient or a healthy subject if the probability was below 50%.

Comparison between models and physicians

Three junior pediatricians (with three to five years of clinical experience) and three senior pediatricians (with more than 15 years of clinical experience) were invited to identify NS patients based on the facial images in the test set. Each image was shown to them without any attendant clinical information. The physicians were then given 10 s to determine whether the individual shown in the photos suffered from NS.

Evaluation metrics

The results of prediction were divided into four categories: true positive (TP), true negative (TN), false positive (FP), and false negative (FN). Accuracy, precision, specificity, F1 score, the receiver operating characteristic (ROC) curve, and the area under the curve (AUC) were used to evaluate classification performance. The probability of each image in the test set was used to generate the ROC curves, based on which the AUC was computed. These metrics were calculated as follows:

$$Accuracy=\frac{TP+TN}{TP+FP+TN+FN}$$

(2)

$$Precision=\frac{TP}{TP+FP}$$

(3)

$$Recall=\frac{TP}{TP+FN}$$

(4)

$$Specificity=\frac{TN}{TN+FP}$$

(5)

$${F}_{1}=2*\frac{Precision*Recall}{Precision+Recall}$$

(6)

The framework for constructing the diagnostic models for NS is illustrated in Fig. 2.

Results

We constructed seven face recognition-assisted diagnostic models for NS patients by using VGG16, VGG19, VGG16-BN, VGG19-BN, ResNet, MobileNet, and SENet. The accuracy, precision, specificity, and F1 score of each model are presented in Table 2, while their ROC curves are shown in Fig. 3. The VGG19-BN model delivered the best overall performance, with an accuracy of 93.76%, precision of 91.40%, specificity of 98.73%, and F1 score of 78.34%. The VGG16-BN model achieved the highest AUC value of 0.9787. Models based on the VGGNet architectures (VGG16, VGG19, VGG16-BN, and VGG19-BN) outperformed the other models (ResNet50, MobileNet-V2, and SENet) overall.

Table 2 Performance of the CNN models

Full size table

The performance of the six physicians is shown in Table 3. Senior pediatrician 2 achieved the highest accuracy (74.00%), precision (75.00%), and specificity (88.33%). Junior pediatrician 1 achieved the highest F1 score of 71.76%. The mean scores of accuracy, precision, specificity, and the F1 score of all six physicians were 63.50%, 59.91%, 67.78%, and 59.82%, respectively. All CNN models outperformed the physicians in terms of accuracy, precision, specificity, and the F1 score.

Table 3 Performance of each physician

Full size table

Discussion

Noonan syndrome is a multi-system genetic disorder that presents with development delays, congenital heart disease, renal anomalies, and a distinctive facial appearance. The facial features include a high forehead, hypertelorism, ptosis, inner epicanthal folds, down-slanting palpebral fissures, a round nasal tip, and posteriorly rotated ears. This characteristic facial morphology is an important clue for identifying NS. However, these facial characteristics are most prominent in infancy, and become less apparent with age in many people with NS. Their facial features may range from subtle to typical.

Facial recognition technology has been applied for the identification of GSs. In 2003, Loos et al. were the first to use facial recognition technology to classify five genetic syndromes. They [2] used the Gabor wavelet(GW) transform, a traditional machine learning-based method, to pre-process 55 photographs of patients of mucopolysaccharidosis type III, Cornelia de Lange syndrome, fragile X syndrome, Prader–Willi syndrome, and Williams–Beuren syndrome. A comparison of the feature vectors of 32 facial nodes led to an accuracy of classification of 42/55 (76%). Since then, a number of traditional machine learning-based methods have been developed for identifying GSs. In 2013, Zhao et al. [29] collected 100 frontal facial photographs of 50 patients with GSs as well as those of 50 healthy children to construct models of facial recognition for identifying DS. They constructed four traditional machine learning models based on the support vector machine (SVM) with the radial basis function (RBF) kernel, linear SVM, k-nearest neighbor (k-NN), and random forest (RF). The SVM with the RBF kernel achieved the best performance, with an accuracy of 94.6% and a precision of 93.3%. These results show that facial recognition technology can be used to accurately identify DS. In 2017, Kruszka et al. [10] developed a model of facial recognition to identify NS based on the SVM. They used 161 images of 161 subjects with NS from 20 countries. The facial analysis technology was able to identify NS patients in all population groups with a sensitivity and specificity of 88% and 89%, respectively. In another study, Porras et al. trained an SVM model to distinguish between patients with NS and those with Williams–Beuren syndrome, and obtained an accuracy of 85.68%. The models of facial recognition used in the above studies were all trained by using traditional machine learning-based methods. These methods require a long time for computations as well as manual feature extraction, which is laborious. In addition, the extracted features often lack a high-level representation of the face, which results in the loss of valuable information and leads to inaccurate detection [30].

With improvements in data storage and computational power, CNN has emerged as the most important method of facial recognition. The CNN is an automatic machine learning technique that does not require manually labeled images. It has exhibited impressive performance in many tasks of image classification. Porras et al. [31] developed a CNN-based model of genetic screening that used facial images of 1,400 children with 128 kinds of GSs, in addition to 1,400 matched controls. The dataset contained only one facial image from each participant. The images were obtained from three publicly available databases and the archives of the Children’s National Hospital (Washington, DC, USA). This CNN-based model achieved an accuracy of 88% and a specificity of 86% in terms of GS detection. These results show that the CNN-based model of recognition can be used to screen GS patients. DeepGestalt, a CNN-based algorithm, was introduced for identifying GS based on facial recognition in 2014. It has been incorporated into a smartphone app called Face2Gene (http://www.face2gene.com/, FDNA Inc, Boston MA USA) [32,33,34]. In 2023, Luis et al. [1] used Face2Gene to identify NS in a sample of Colombian subjects, and obtained a top-1 accuracy of 66.7% and a top-5 accuracy of 77.8%. In 2021, our team [35] developed a CNN model of facial recognition based on the ResNet architecture and the ArcFace loss function for identifying NS patients. The model achieved an accuracy of 92% in distinguishing between NS patients and healthy subjects. However, it recorded an accuracy of only 81% when it was used to distinguish between NS cases and other GSs. To meet the requirements of clinical practice, the performance of these models for NS identification still needs to be improved.

In the present study, we developed seven models of facial recognition for identifying NS based on VGGNet, ResNet50, MobileNet-V2, and SENet. These CNN-based classification architectures have been widely used for image recognition in recent years, and are characterized by small kernels, deep network structures, and few parameters. VGGNet uses small convolution kernels of size 3 × 3 to construct the network. The use of the residual block allows ResNet to solve the vanishing gradient problem. MobileNet is lightweight and suitable for low-power devices without GPUs. The SE block enables the network to automatically recalibrate the feature maps by selectively emphasizing informative channels. In the context of our study, these models can help physicians distinguish between NS patients and those suffering from other GSs as well as healthy children. Each CNN model outperformed the six pediatricians who were recruited for this study. In addition, the models based on the VGG network series achieved good performance. Similar results have been obtained in other studies on few-shot learning in the domain of medical image-based recognition. Krushi Patel et al. [36] constructed five models for classifying colorectal polyps based on different CNN architectures (VGG, ResNet, DenseNet, SENet, and MnasNet). The training set included 119 endoscopy videos of patients suffering from colorectal polyps. The VGG network achieved the best performance with an accuracy of 79.78%. In 2021, Liu et al. [37] developed five models of facial recognition for William syndrome patients by using the VGG16, VGG19, ResNet18, ResNet34, and MobileNet-V2 architectures respectively. The VGG19 model achieved the best performance, followed by the VGG16 model. The authors presumed that the VGG network series might be more suitable than the other networks for recognition tasks involving a limited number of samples [38].

BN is a technique for training deep neural networks that standardizes the inputs to a layer for each mini-batch. It aims to reduce the internal covariate shift to accelerate the training process [23]. BN can provide three major benefits. Firstly, it increases the training speed by normalizing inputs of each layer to have zero mean and unit variance. Secondly, BN acts as a regularizer and allow the model to converge with a high learning rate. Thirdly, BN is able to prevent over-fitting, so it can replace Dropout and Local Response Normalization to simplify the network [39]. BN also has a beneficial effect on gradient flow through the network by reducing the dependence of the gradients on the scale of the parameters and their initial values. In this study, the VGG19-BN model achieved the best overall performance, with the highest accuracy, precision, and specificity. The VGG16-BN model achieved the highest score (0.9787) in terms of the AUC, followed by the VGG19-BN model (0.9415). The addition of BN can thus improve the performance of the VGG model. One study [38] on the automatic image classification showed that BN can accelerate the convergence of the model, improve its precision, and reduce anticipation loss.

Limitations

This study has four main limitations: (1) A reliable diagnostic model typically relies on a sufficiently large dataset. As NS is a rare genetic disease, a limited number of facial images were collected for this study. (2) Only a portion of the healthy children considered here were genetically tested, while the other healthy children were evaluated by two pediatric geneticists to exclude the presence of GSs. Thus, some of them might have been undiagnosed patients of GSs. However, this probability is extremely low, as none of healthy children manifested any symptoms of GSs. (3) A suitable training set for models of facial recognition should include information from a multi-racial population. Only one patient in our center was from the Zhuang ethnic group, while all the other GSs patients were ethnic Han. (4) The facial features of NS are most prominent in infancy and become less apparent with age. Owing to the limited sample size, we did not stratify the data according to the age of the patients when developing models of facial recognition. We plan to collect more data on NS patients of different ages to optimize our models in future work.

Conclusion

The results of our study demonstrate that the computer-assisted model of facial recognition can improve the diagnosis of NS. The models of facial recognition based on VGG-19 BN and VGG-16 BN can thus play an important role in the diagnosis of NS in clinical practice.

Data availability

All data used during the study are available from the corresponding authors upon reasonable request.

Abbreviations

NS:: Noonan syndrome
GSs:: genetic syndromes
MTCNN:: multi-task convolutional neural network
SENet:: Squeeze-and-Excitation Network
CNN:: convolutional neural network
DS:: Down syndrome, TS: Turner syndrome, MAPK: mitogen-activated protein kinases, GMDB: GestaltMatcher database
P-Net:: proposal network
R-Net:: refinement network
O-Net:: output network
BN:: Batch normalization
TP:: true positive
TN:: true negative
FP:: false positive
FN:: false negative
ROC:: receiver operating characteristic
AUC:: area under the curve
GW:: Gabor wavelet
SVM:: support vector machine
RBF:: radial basis function
k-NN:: k-nearest neighbor
RF:: random forest

References

Echeverry-Quiceno LM, Candelo E, Gómez E, et al. Population-specific facial traits and diagnosis accuracy of genetic and rare diseases in an admixed Colombian population[J]. Sci Rep. 2023;13(1):6869.
Article CAS PubMed PubMed Central Google Scholar
Loos HS, Wieczorek D, Wurtz RP et al. Computer-based recognition of dysmorphic faces[J]. European Journal of Human Genetics.
Boehringer S, Vollmar T, Tasse C, et al. Syndrome identification based on 2D analysis software[J]. Eur J Hum Genetics: EJHG. 2006;14(10):1082–9.
Article Google Scholar
Qiang J, Wu D, Du H, et al. Review on facial-recognition-based applications in Disease Diagnosis[J]. Bioengineering. 2022;9(7):273.
Article PubMed PubMed Central Google Scholar
Saraydemir Ş, Taşpınar N, Eroğul O, et al. Down syndrome diagnosis based on gabor wavelet transform[J]. J Med Syst. 2012;36(5):3205–13.
Article PubMed Google Scholar
Qin B, Liang L, Wu J, et al. Automatic Identification of Down Syndrome using facial images with deep convolutional neural Network[J]. Diagnostics (Basel Switzerland). 2020;10(7):487.
PubMed Google Scholar
Pan Z, Shen Z, Zhu H, et al. Clinical application of an automatic facial recognition system based on deep learning for diagnosis of Turner syndrome[J]. Endocrine. 2021;72(3):865–73.
Article CAS PubMed Google Scholar
Yaoita M, Niihori T, Mizuno S, et al. Spectrum of mutations and genotype–phenotype analysis in noonan syndrome patients with RIT1 mutations[J]. Hum Genet. 2016;135(2):209–22.
Article CAS PubMed Google Scholar
Saint-Laurent C, Mazeyrie L, Yart A, et al. Novel therapeutic perspectives in Noonan syndrome and RASopathies[J]. European Journal of Pediatrics; 2023.
Kruszka P, Porras AR, Addissie YA, et al. Noonan syndrome in diverse populations[J]. Am J Med Genet: A. 2017;173(9):2323–34.
Article CAS PubMed Google Scholar
Tekendo-Ngongang C, Kruszka P. Noonan syndrome on the African Continent[J]. Birth Defects Res. 2020;112(10):718–24.
Article CAS PubMed Google Scholar
Porras AR, Summar M, Linguraru MG. Objective differential diagnosis of noonan and williams–beuren syndromes in diverse populations using quantitative facial phenotyping[J]. Volume 9. Molecular Genetics & Genomic Medicine; 2021. 5.
Nemcikova M, Vejvalkova S, Fencl F, et al. A novel heterozygous RIT1 mutation in a patient with noonan syndrome, leukopenia, and transient myeloproliferation—a review of the literature[J]. Eur J Pediatrics. 2016;175(4):587–92.
Article CAS Google Scholar
Cordeddu V, Yin JC, Gunnarsson C, et al. Activating mutations affecting the dbl homology domain of SOS2 cause noonan syndrome[J]. Hum Mutat. 2015;36(11):1080–7.
Article CAS PubMed PubMed Central Google Scholar
Pagnamenta AT, Kaisaki PJ, Bennett F, et al. Delineation of dominant and recessive forms of LZTR1 -associated noonan syndrome[J]. Clin Genet. 2019;95(6):693–703.
Article CAS PubMed PubMed Central Google Scholar
Sarkozy A, Carta C, Moretti S, et al. Germline BRAF mutations in Noonan, LEOPARD, and cardiofaciocutaneous syndromes: molecular diversity and associated phenotypic spectrum[J]. Hum Mutat. 2009;30(4):695–702.
Article CAS PubMed PubMed Central Google Scholar
Leung GKC, Luk HM, Tang VHM, et al. Integrating functional analysis in the next-generation sequencing diagnostic pipeline of RASopathies[J]. Sci Rep. 2018;8(1):2421.
Article PubMed PubMed Central Google Scholar
Passarge E, Robinson PN, Graul-Neumann LM. Marfanoid–progeroid–lipodystrophy syndrome: a newly recognized fibrillinopathy[J]. Eur J Hum Genet. 2016;24(9):1244–7.
Article CAS PubMed PubMed Central Google Scholar
Li X, Yao R, Tan X, et al. Molecular and phenotypic spectrum of noonan syndrome in Chinese patients[J]. Clin Genet. 2019;96(4):290–9.
Article CAS PubMed Google Scholar
Hsieh TC, Bar-Haim A, Moosa S, et al. GestaltMatcher facilitates rare disease matching using facial phenotype descriptors[J]. Nat Genet. 2022;54(3):349–57.
Article CAS PubMed PubMed Central Google Scholar
Zhang K, Zhang Z, Li Z, et al. Joint face detection and alignment using multi-task cascaded convolutional networks[J]. IEEE Signal Process Lett. 2016;23(10):1499–503.
Article Google Scholar
Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[M]. arXiv; 2015.
Ioffe S, Szegedy C. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift[M]. arXiv, 2015.
He K, Zhang X, Ren S et al. Deep residual learning for image recognition[M]. arXiv, 2015.
Hu J, Shen L, Albanie S et al. Squeeze-and-excitation networks[M]. arXiv, 2019.
Howard AG, Zhu M, Chen B, et al. MobileNets: efficient convolutional neural networks for mobile vision applications[M]. arXiv; 2017.
Sandler M, Howard A, Zhu M et al. MobileNetV2: Inverted residuals and linear bottlenecks[M]. arXiv, 2019.
Searching. for MobileNetV3 | IEEE Conference Publication | IEEE Xplore[EB/OL]. [2023-08-09]. https://ieeexplore.ieee.org/document/9008835.
Qian Zhao, Rosenbaum K, Okada K et al. Automated down syndrome detection using facial photographs[C]//2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). Osaka: IEEE, 2013: 3670–3673.
Bi B, Wang Y, Zhang H, et al. Microblog-HAN: a micro-blog rumor detection model based on heterogeneous graph attention network[J]. PLoS ONE. 2022;17(4):e0266598.
Article CAS PubMed PubMed Central Google Scholar
Porras AR, Rosenbaum K, Tor-Diez C, et al. Development and evaluation of a machine learning-based point-of-care screening tool for genetic syndromes in children: a multinational retrospective study[J]. Lancet Digit Health. 2021;3(10):e635–43.
Article CAS PubMed Google Scholar
Gurovich Y, Hanani Y, Bar O, et al. Identifying facial phenotypes of genetic disorders using deep learning[J]. Nat Med. 2019;25(1):60–4.
Article CAS PubMed Google Scholar
Latorre-Pellicer A, Ascaso Á, Trujillano L, et al. Evaluating Face2Gene as a tool to identify cornelia de lange syndrome by facial phenotypes[J]. Int J Mol Sci. 2020;21(3):1042.
Article CAS PubMed PubMed Central Google Scholar
Ciancia S, Goedegebuure WJ, Grootjen LN, et al. Computer-aided facial analysis as a tool to identify patients with silver–russell syndrome and prader–willi syndrome[J]. Eur J Pediatrics. 2023;182(6):2607–14.
Article CAS Google Scholar
Yang H, Hu XR, Sun L, et al. Automated facial recognition for noonan syndrome using novel deep convolutional neural network with additive angular margin loss[J]. Front Genet. 2021;12:669841.
Article PubMed PubMed Central Google Scholar
Patel K, Li K, Tao K, et al. A comparative study on polyp classification using convolutional neural networks[J]. PLoS ONE. 2020;15(7):e0236452.
Article CAS PubMed PubMed Central Google Scholar
Liu H, Mo ZH, Yang H, et al. Automatic facial recognition of Williams-Beuren Syndrome based on deep convolutional neural Networks[J]. Front Pead. 2021;9:648255.
Article Google Scholar
Ren R, Zhang S, Sun H, et al. Research on Pepper External Quality Detection based on Transfer Learning Integrated with convolutional neural Network[J]. Sensors. 2021;21(16):5305.
Article PubMed PubMed Central Google Scholar
Zhou Y, Yuan C, Zeng F et al. An Object Detection Algorithm for Deep Learning Based on Batch Normalization[M]//Qiu M. Smart Computing and Communication: Vol. 10699. Cham: Springer International Publishing, 2018: 438–448.

Download references

Acknowledgements

The authors would like to thank the parents and children who enrolled in the study. Their outstanding support and contributions are gratefully appreciated.

Funding

This study was supported by the National Natural Science Foundation of China (Grant No. 82070321).

Author information

Yulu Huang and Haomiao Sun contributed equally to this work.

Authors and Affiliations

Department of Pediatric Cardiology, Guangdong Cardiovascular Institute, Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences, No. 106, Zhongshan 2nd Road, Yuexiu District, Guangzhou, China
Yulu Huang & Shushui Wang
Department of Pediatric Cardiology, Guangdong Provincial People’s Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, No. 106, Zhongshan 2nd Road, Yuexiu District, Guangzhou, China
Qinchang Chen, Junjun Shen & Shushui Wang
Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, No. 6 South Science Academy Road, Haidian District, Beijing, China
Haomiao Sun & Shiguang Shan
University of Chinese Academy of Sciences, No. 80 Zhongguancun Road East, Haidian District, Beijing, China
Haomiao Sun & Shiguang Shan
Prenatal diagnosis center, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, No. 9 Jinsui Road, Tianhe District, Guangzhou, China
Jin Han

Authors

Yulu Huang
View author publications
You can also search for this author in PubMed Google Scholar
Haomiao Sun
View author publications
You can also search for this author in PubMed Google Scholar
Qinchang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Junjun Shen
View author publications
You can also search for this author in PubMed Google Scholar
Jin Han
View author publications
You can also search for this author in PubMed Google Scholar
Shiguang Shan
View author publications
You can also search for this author in PubMed Google Scholar
Shushui Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Shushui Wang: design of the work, funding acquisition, project administration, supervision, reviewing and editing. Shiguang Shan: software, interpretation of data. Yulu Huang: data curation, methodology, data analysis, writing original draft. Haomiao Sun: software, data analysis. Qinchang Chen: data collection, data analysis. Junjun Shen: data analysis, Jin Han: data collection. All authors contributed to the article. All authors reviewed the manuscript and approved the submitted version.

Corresponding authors

Correspondence to Shiguang Shan or Shushui Wang.

Ethics declarations

Ethics approval and consent to participate

The studies involving human participants were reviewed and approved by Research Ethics Committee of Guangdong Provincial People’s Hospital, Guangdong Academy of Medical Sciences Project No. KY-Z-2020-033-04). Written informed consent to participate in this study was provided by the participants’ legal guardian. Permission to use the images in GMDB was also obtained.

Consent for publication

Written informed consent was obtained from the individual(s), and minor(s)’ legal guardian, for the publication of any potentially identifiable images or data included in this article.

Competing interests

The authors declare that there is no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Huang, Y., Sun, H., Chen, Q. et al. Computer-based facial recognition as an assisting diagnostic tool to identify children with Noonan syndrome. BMC Pediatr 24, 361 (2024). https://doi.org/10.1186/s12887-024-04827-7

Download citation

Received: 11 January 2024
Accepted: 10 May 2024
Published: 24 May 2024
DOI: https://doi.org/10.1186/s12887-024-04827-7

Computer-based facial recognition as an assisting diagnostic tool to identify children with Noonan syndrome

Abstract

Background

Objectives

Methods

Results

Conclusion

Introduction

Patients and methods

Patients and dataset

Image pre-processing

Architectures

Construction and testing of CNN models

Comparison between models and physicians

Evaluation metrics

Results

Discussion

Limitations

Conclusion

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Pediatrics

Contact us