Skip to main content

Screening tools for autism in culturally and linguistically diverse paediatric populations: a systematic review

Abstract

Background

Autism Spectrum Disorder (ASD) has varying prevalence rates worldwide, often higher in culturally diverse populations. Cultural differences can affect autism symptom recognition. Language barriers and differing healthcare attitudes may delay diagnosis and intervention. Most autism screening tools were developed in Western, predominantly Caucasian populations, and their appropriateness in culturally and linguistically diverse (CALD) contexts remains uncertain. There is a lack of comprehensive data on the accuracy of these tools in identifying autism in culturally and linguistically diverse groups. Consequently, it is unclear whether current screening tools are culturally sensitive and appropriate.

Methods

A research protocol was registered in PROSPERO (CRD42022367308). A comprehensive search of literature published from inception to October 2022 was conducted using the following databases: PubMed, Medline Complete, Scopus, PsychInfo and CINAHL Complete. The articles were screened using pre-determined inclusion and exclusion criteria. Data extracted included participant demographics, screening tool psychometric properties (validity, reliability, accuracy) and acceptability. A narrative synthesis approach was used.

Results

From the initial retrieval of 2310 citations, 51 articles were included for analysis. The studies were conducted in 32 different countries with screening tools in the following languages: Chinese, Spanish, Korean, Turkish, Arabic, Kurdish, Persian, Serbian, Italian, French, Sinhala, Taiwanese, Finnish, Northern Soho, Albanian, German, Japanese, Vietnamese, Farsi, Greek and English. There was no data on acceptability of the screening tools in CALD populations. Validity, reliability, and accuracy ranged from poor to excellent with consistently high performance by screening tools devised within the populations they are intended for.

Conclusions

The review evaluated autism screening tools in culturally diverse populations, with a focus on validity, reliability, and acceptability. It highlighted variations in the effectiveness of these tools across different cultures, with high performance by tools devised specifically for the intended population, emphasizing the need for culturally sensitive screening tools. Further research is needed to improve culturally specific, reliable autism screening tools for equitable assessment and intervention in diverse communities.

Peer Review reports

Background

Autism Spectrum Disorder (ASD) is a multifaceted neurodevelopmental condition marked by difficulties in social communication and the occurrence of repetitive, limited, and/or sensory behaviours and interests [1]. The prevalence of autism worldwide is approximately 1 in 100 [2]; however estimates vary, with some countries finding that as many as 1 in 36 children are on the autism spectrum [3]. Prevalence rates also exhibit some gender disparities in the latest Centers for Disease Control and Prevention (CDC) reports, with autism being diagnosed approximately four times more commonly among boys than among girls [3]. Recently, research has shown that, contrary to prior beliefs, autism prevalence is lower among Caucasian children compared to other racial and ethnic groups [3].

These groups are commonly recognised internationally as those of a “non-English speaking background” or NESB, however this review and its inclusion criteria followed the Australian definition of “culturally and linguistically diverse” (CALD) to refer to these populations, where they are defined as people born in non-English speaking countries, and/or who do not speak English in their home [4]. As such, the CALD terminology includes those who are NESB.

In this regard, although autism has been found to be more common among children from culturally diverse and non-English speaking backgrounds (NESB) [3], rates vary among geographic regions and ethnic demographics [5]. This variation may be due to both actual disparities in prevalence as well as a reflection of other factors indexing social vulnerability. These may include exposure to social risk factors, limited awareness and opportunities for early identification, influence of social stigma and/or variations in the services and methods used for screening, diagnosing and determining autism prevalence [6,7,8].

Another factor implicated in differences in prevalence, ansed a potential barrier to screening is cultural differences in symptom recognition. This may make it difficult to recognise symptoms of autism, particularly in non-Caucasian populations. A study by Matson et al. [9] discussed variation in autism symptoms between children from Israel, South Korea, the United Kingdom (UK) and the United States of America (USA). They found that in certain cultures, such as in Native American and East Asian cultures [10, 11] reduced eye contact and non-verbal communication, stereotypical ASD symptoms, are favoured and hence less likely to be considered a sign of concern and more so a sign of respect,[10, 11]. Additionally, pointing with the index finger is considered inappropriate in Chinese culture [12]. Other studies have also commented on cultural differences in displays of emotion and facial expression [13] as well as preference for increased interpersonal space in Japanese culture [14]. This may be interpreted as a lack of engagement in social communication by individuals from Western cultures affecting autism screening and evaluations [14].

Certain features also considered stereotypical of autism such as hand-flapping and rocking were uncommon among children who had been diagnosed in Africa [15]. Given that these are less noticeable, they become less reliable indicators of pathology. While certain symptoms may not manifest, there could be the emergence of behaviours that are less conspicuous within Caucasian populations. This highlights the potential difficulties that may arise during assessment for identification including the measures used, especially if the assessment is not culturally appropriate [16].

In addition to differences in what are considered to be signs of concern, it has been found that both language differences and variations in cultural attitudes and beliefs related to the role of healthcare services may contribute to delays in health-seeking efforts [17]. Ou et al. [18] found in a study on Australian infants that even if the mother spoke English proficiently, infants from culturally and linguistically diverse (CALD) communities were less likely to access health services and receive mental health interventions. A study by Hussain et al. [19] found a significant link between CALD status and delayed age at which developmental concerns were raised, as well as higher severity of autism symptoms at the time of professional input. In this regard, a multicultural cohort study found an inverse care law in that those from the most disadvantaged backgrounds with the highest risk for developmental problems were least likely to engage with developmental surveillance programs for early identification, thereby missing opportunities for early intervention [17, 20]. These barriers to identifying autism result in a reduced rate of identification and treatment and as such, failed opportunities to capitalise on a period of increased neuroplasticity [21]. Studies have shown, and it is now well-known, that early identification and intervention for autism is essential to enhance the developmental outcomes and quality of life for affected individuals as well as their families [22]. On the other hand, there are significant repercussions of a missed diagnosis. Lupindo et al. [23] found these not only resulted in missed opportunity to provide support and scaffolding during a critical period of development but also had serious implications for psychological well-being during childhood as well as into adult life.

To investigate differences in the rates and presentations of autism in non-Caucasian children tools used for screening children for autism must be scrutinised. Most autism screening tools originated in the USA or the UK, both high-income countries comprising of a predominantly Caucasian population [24], and are now being extrapolated for use in cultures distinct from their places of origin [25].

Clinicians typically utilise the Diagnostic and Statistical Manual’s criteria, recently in its Fifth Edition, Text Revision (DSM-V-TR) [26] to define a diagnosis of autism. For a diagnosis of ASD according to the DSM-V criteria, a child must have persistent deficits in all three categories of social communication and interaction as well as meeting two of four categories related to restricted, repetitive patterns of behaviour, interests or activities. Most available screening tools have been based on the earlier DSM-IV [1] criteria [27].

Although various tools have been validated for the identification of autism, their applicability and accuracy in diverse cultural and linguistic contexts remain relatively unexplored. While tools have often been adapted for use and assessed in specific cultures [28] – demonstrating they can be used – it is less clear whether these tools should be used to adequately capture autism symptoms among diverse populations. Despite the increasing number of autism screening tools, there remains a lack of comprehensive pooled data on their accuracy in identifying children with cultural or linguistic diversity. Given the acknowledged disparities in interpretation of signs and symptoms of autism among CALD groups, it is questionable whether the current screening tools are cross-culturally sensitive as well as appropriate.

This systematic review aimed to comprehensively assess the existing literature on screening tools for autism in CALD paediatric populations specifically regarding acceptability, reliability, validity, and accuracy. A narrative systematic review was used to evaluate and synthesise the literature and answer the following research questions:

  1. 1.

    What available autism screening tools have been used with culturally and linguistically diverse populations?

  2. 2.

    Are the autism screening tools acceptable to parents of and service providers working with culturally and linguistically diverse populations?

  3. 3.

    What is the reliability, validity and accuracy of the autism screening tools in culturally and linguistically diverse populations?

Methods

Before initiating this review, a research protocol was formulated and officially registered with the University of York Centre for Reviews and Dissemination (PROSPERO; registration number: CRD42022367308).

Search strategy

Five databases (PubMed, Medline Complete, Scopus, PsychInfo and CINAHL Complete) were searched from inception to October 2022 to identify studies which reported on the psychometric properties and acceptability of autism screening tools delivered to CALD paediatric populations. The search terms included (autis* OR neurodevelop* OR asperg* OR developmental) AND (q-chat OR srs OR “quantitative checklist for autism in toddlers” OR “social responsiveness scale” OR m-chat OR “Modified Checklist for Autism in Toddlers” OR scq OR “social communication questionnaire” OR cars OR “Child Autism Rating Scale” OR “ASD screening”) AND (immigran* OR migrant OR *lingual OR ethnic* OR cultur* OR cald OR *racial OR racial* OR linguistic OR multicultural OR refugee OR aborigin* OR native* OR “first nations” OR indigenous OR “children of color” OR “children of colour” OR “people of color” OR “people of colour” OR cross-cultural OR cross-country OR latin OR latin* OR hispanic OR spanish OR black OR asian OR chinese OR mandarin OR arabic OR african OR indian OR subcontinental OR hindi OR french OR vietnamese).

Inclusion and exclusion criteria

Articles were included in the review if they met the following criteria: (1) evaluated an autism screening tool, (2) study participants were aged 0–17 years and 11 months, (3) published in a peer reviewed journal and (4) the population of interest was predominantly (> 70%) from a CALD community (people of non-English speaking background, as well as people born outside of the studied country (if it is one of the main English speaking countries, as below) and whose first language is not English).

Articles were excluded if they (1) included a screening tool to evaluate an intervention outcome only (i.e., there was no evaluation of the screening tool itself); (2) were not data-based (e.g., books, theoretical papers, editorials, reviews); (3) were unpublished dissertations/theses; or (4) were populations (> 30% of sample) from the list of main English speaking countries: the main English speaking countries identified by the Australian Bureau of Statistics are Australia, Canada, Republic of Ireland, New Zealand, South Africa, UK (England, Scotland, Wales, Northern Ireland), and USA. Records were also excluded if the English translation or full text article could not be located, if they utilised secondary screening tools, if they did not report results of the ASD subscale of the relevant screening tool or if a diagnostic tool was evaluated in place of a screening tool. Given the limited number of studies on this topic, studies with tools that were used with children outside the recommended age range were also included.

The inter-rater reliability for title/abstract and full-text screening of database searches were 82% and 88%, respectively.

Data extraction

The Cochrane Effective Practice and Organization of Care Review Qualitative Evidence Syntheses guidelines were used to guide data extraction. Data extraction of included studies was performed primarily by author EH with review by authors AH and PH using Excel spreadsheet software (Microsoft Excel, Microsoft Corporation). Data items extracted included country-based, ethnicity, study setting, study design, aims, population (including number, age and gender), measure being evaluated and study outcomes relevant to the review.

Assessment of methodological quality

Two reviewers (EH and PH) conducted separate evaluations of the study quality using either the Quality Assessment of Diagnostic Accuracy Studies Tool (QUADAS-2) [29] or the Mixed Methods Appraisal Tool (MMAT) [30]. The MMAT was applied to assess the quality of non-randomised studies whereas the QUADAS-2 was utilised to appraise diagnostic accuracy studies. Applying the MMAT to the studies requires consideration of five questions that assess (1) if the participants represent the target population, (2) if the measurements are appropriate, (3) if there are complete outcome data, (4) if confounders are accounted for and (5) if the intervention/exposure occurs as intended. Each outcome is assigned “yes”, “no”, or “can’t tell”. The QUADAS-2 allows quality appraisal of diagnostic accuracy studies based on (1) patient selection, (2) index text, (3) reference standard and (4) flow and timing. Within each category, reviewers may rate the risk of bias and applicability as “low”, “high”, or “unclear”. Appendix 1 defines the coding rules that were applied during the appraisal. Coding decisions were operationalised prior to performing the quality assessment. If any signalling question related to risk of bias resulted in a “no” response this was judged as “high risk”. If there were no “no” responses the response “yes” or “unclear” that was more frequent determined the ultimate risk of bias. In this way, if the “yes” response was dominant the study was deemed to have a “low: risk of bias in that Domain. Concerns regarding applicability for Domain 1–3 was determined based on discussion between reviewers EH and PH. Disagreement between reviewers was solved by reaching a consensus over discussion. A third reviewer (VE) was available to settle unresolved disputes if necessary. See Appendix 1 for further details regarding the rules applied during QUADAS-2 application.

The results for the quality assessments using QUADAS-2 may be found in Fig. 1. Table 1 details the findings of the MMAT appraisal.

Fig. 1
figure 1

Results for the quality assessments using QUADAS-2

Table 1 Results of quality appraisal for relevant studies using Mixed Methods Appraisal Tool (MMAT), version 2018

Data synthesis

Due to the high heterogeneity of the outcomes, a narrative synthesis approach was used in this review where results are consolidation from numerous studies primarily utilising textual descriptions to summarise and elucidate the synthesis findings.

Results

Literature search results

A total of 2310 citations were retrieved from the initial database search (674 from PsychInfo, 515 from PubMed, 507 from Scopus, 484 from Medline complete, and 130 from CINAHL Complete). Following the removal of duplicates, 1360 potentially relevant records remained. The articles were then screened using the inclusion and exclusion criteria. Title and abstract screening led to the exclusion of 1254 articles, resulting in 106 articles to undergo full text review. Following this, the full texts of 106 articles were reviewed, resulting in the exclusion of 55 articles, and ultimately, 51 articles were included and analysed in this review (Fig. 2).

Fig. 2
figure 2

PRISMA flow diagram of the literature selection process

Characteristics of included studies

The studies were conducted in 32 different countries. The delineation of number of studies per country studied and language studied are summarised in Tables 2 and 3 respectively. Table 4 presents the data extracted from each of the studies. It includes demographic data as well as reported classification measures and psychometric properties.

Table 2 Number of studies per country studied
Table 3 Number of studies per language studied
Table 4 Characteristics of included studies

Psychometric properties of the screening tools

All studies provided psychometric evaluation of the screening tool utilised using varying parameters. The most common measure of validity was reporting of the sensitivity and specificity. Interpretation of these based on consensus of the reviewers was based on rule of thumb where ≥ 0.9 = high, 0.6- 0.89 = moderate, < 0.6 = poor. Some studies also reported positive predictive value (PPV) and negative predictive value (NPV).

Reliability was most frequently documented using Cronbach’s α coefficient where ≥ 0.9 is excellent, 0.8–0.89 is good, 0.7–0.79 is acceptable, 0.6 -0.69 is questionable, 0.5–0.59 is poor and < 0.5 is unacceptable [79]. Less commonly reliability was reported using intraclass correlation coefficient (ICC), interrater reliability (r), omega (Ω) and theta (θ).

Screening tests

Thirteen different autism screening tools were found by this review to have been used with CALD populations.

Modified Checklist for Autism in Toddlers (M-CHAT)

The Modified Checklist for Autism in Toddlers (M-CHAT), created and validated in the United States by Robins et al. [80], is a 23-item questionnaire designed for screening children between the ages of 16–36 months for autism. It was used in 12 studies published between 2008 and 2021. It was translated into 8 languages and used in 10 countries. The M-CHAT is a modification of the original Checklist for Autism in Toddlers (CHAT) which was found to have poor sensitivity [81].

Twelve studies used a translated version of the M-CHAT. Five of these conducted the M-CHAT on children outside of the recommended age group, testing up to 10 years of age [36, 39, 41, 43, 67]. Three of these administered a Spanish M-CHAT [41,42,43] with varying translations appropriate to the region studied. Canal-Bedia et al. [42] reported some modifications made to increase acceptability in the setting of initial parental misunderstanding. This included the use of different toys and Spanish colloquialisms. The studies reported good validity scores with a sensitivity of 0.93–1, specificity of 0.63–0.98 and acceptable reliability α= 0.7.

Two studies were conducted on Asian participants. Wong [37] implemented a Chinese M-CHAT in Taiwan with moderate validity scores, sensitivity 77% and specificity 72.4% and good reliability α = 0.8. Seung [33] also found excellent reliability using the Korean M-CHAT α = 0.9. Both of these studies found the parents’ level of education to be relevant to their understanding of the screening tool. The latter reported that some parents had difficulty understanding some questions, answering “to the best of their ability”. Baduel et al. [38] validated the French M-CHAT and reported a moderate sensitivity of 0.67 and good specificity of 0.94–0.99. They provided extra training to the paediatricians and day-care staff conducting the screening but did not comment on acceptability.

Eldin et al. [36] conducted a cross sectional study across 9 Arabic-speaking countries, resulting in moderate validity scores, sensitivity 86% and specificity 80%. Perera et al. [40] examined toddlers using a Sinhala M-CHAT and found a low sensitivity of 25% but moderate specificity of 70%. Samadi et al. [39] conducted the M-CHAT in Kurdish and Persian in Iran and found good sensitivity 90.3% and moderate specificity of 80.7%, commenting on extensive workshops and educational sessions for those completing the assessment including parents.

Most of the studies had a high risk of bias in the patient selection (50%) and flow and timing (75%) QUADAS domains. The majority showed a low risk (63%) of bias in terms of the index test but many (38%) had unclear details related to the conduction and interpretation of the reference standard tests. All studies had low concern regarding applicability. Of the relevant studies, Seung [33] was rated using the MMAT and met the criteria for all domains except accounting for confounding variables.

Modified Checklist for Autism in Toddlers, Revised with Follow-Up (M-CHAT-R/F)

The original authors of the M-CHAT developed a revised version, the Modified Checklist for Autism in Toddlers, Revised with Follow-Up (M-CHAT-R/F) [82] in 2009 to reduce the burden of follow-up interviews on the screening process. It involves 20 items and a revised scoring system. It was studied by 8 of the articles from 2016–2023 in 7 languages.

Three studies on Turkish populations [50,51,52] all employed trained staff to administer the M-CHAT via interview increasing acceptability by parents using added explanations and demonstrations if necessary. They showed good validity, a sensitivity of 1 and specificity of 0.95 and reliability α  = 0.84—0.96.

Two Spanish studies [48, 49] reported validity related to sensitivity ranging from 0.79 – 1 and specificity ranging from 0.98—0.99. Guo [45] tested a Chinese population of 7928 toddlers and found a sensitivity and specificity of 0.96 and 0.87 respectively. The Portuguese version conducted in Brazil by Losapio et al. [47] found reasonable sensitivity but poor specificity (Table 2). Of note, this study extrapolated the use of the screening tool to children up to 6 years of age. The Malian study [46] found poor sensitivity but good specificity. Albanian and Serbian studies [31, 44] reported acceptable to excellent reliability (Cronbach’s α) of their tools ranging from 0.74 – 0.91 respectively. Within these studies, the Spanish studies made comments related to increasing sensitivity by using culturally appropriate examples and rewording some of the questions to increase clarity. Vorster [35] commented that in a South African population where the participants were presented with a Northern Soho (local language) and culturally adapted English version of this tool, there was higher acceptability and preference for the latter.

Most of these studies showed a high risk of bias in the patient selection (78%) and flow and timing (100%) QUADAS domains. The majority showed a low risk (78%) of bias in terms of the index test, but many (44%) had unclear details related to the reference standard. Carakovac et al. [31] and Vorster [35] were rated using the MMAT and met criteria for all domains except accounting for confounding variables. It was unclear if the former had complete data.

Quantitative Checklist for Autism in Toddlers (Q-CHAT)

The Quantitative Checklist for Autism in Toddlers (Q-CHAT) is a 25-item questionnaire developed for toddlers aged 18 to 30 months [83]. It is a modification of the M-CHAT where dichotomous yes/no responses were altered to ordinal responses on a 5-point scale (0—4). It was translated to 2 languages and applied in 3 studies each between 2019 and 2021.

The Italian version of the Q-CHAT was conducted on a non-clinical sample [69] reporting acceptable internal consistency Cronbach’s α = 0.68. In a separate study on a clinical sample [32], they found higher internal consistency, Cronbach’s α = 0.87 and moderate sensitivity 83% and specificity 78%. Both these studies collected extensive demographic data from the parents including education level. Stefanovic [70] tested Serbian toddlers with the Q-CHAT and found a high sensitivity of 96.2% and specificity of 81.9%. Ruta et al. [69] and Stefanovic et al. [70] were appraised using QUADAS-2 and showed a high risk of bias in the patient selection and flow and timing domains and unclear risk related to the index test. Ruta et al. [32] were more appropriately analysed using MMAT and rated “yes” in all domains.

Social Responsiveness Scale (SRS)

The Social Responsiveness Scale (SRS) is a 65-question screening measure for children aged 2.5 – 18 years [84]. The SRS was also popular and was used in 12 studies from 2008 to 2021 in countries. Four studies conducted in Chinese [53,54,55,56] showed good to excellent reliability scores with Cronbach’s α  = 0.871 – 0.94, as well as good validity measures sensitivity 0.93 – 0.97 and specificity 0.7—0.82.

Several other translated versions were tested in Asian populations including 2 Korean studies [61, 62], 1 Japanese study [60] and 1 Vietnamese study [64]. Overall, these showed fair reliability α = 0.721 – 0.88 and moderate validity scores, sensitivity 0.725 – 0.93 and specificity 0.667 – 98%. Cheon et al. [61] specified some modifications made to allow culturally appropriate clarification. Both the Mexican [63] and German [59] adaptations reported high reliability with the former also showing good validity measures. Similarly, the Finnish SRS [58] had very high sensitivity [1] and specificity (0.96).

In the quality appraisal, 83% of these studies indicated a high risk of bias in the patient selection. Many had unclear details regarding the index testing (83%) and flow and timing (50%). One study raised concerns regarding the applicability of the reference standard used [57].

Autism Spectrum Rating Scales (ASRS)

The Autism Spectrum Rating Scales (ASRS) developed by Goldstein in 2009 [85] is relatively longer comprising of 70 questions and applicable to children aged 2 – 18 years old. Zhou [65] conducted this test in Chinese on 1625 participants and found variable internal consistency (Cronbach’s α) of 0.585 to 0.929, high sensitivity of 94.2% and moderate specificity of 77%. In 2018, the team tested this on a Chinese kindergarten population and found a poorer sensitivity of 88.6% and better specificity of 84.5% (Zhou, 2018) [86]. This questionnaire was modified by Zhou 2017 and tested on different participants aged 6–12 years old resulting in similar sensitivity to their initial study of 93% but a high false positive rate with a PPV of 39.1% and specificity of 83.2%.

All studies indicated a high risk of bias in the patient selection and flow and timing domains as well as an unclear risk of bias related to the index test. There were low concerns related to the applicability within the studies.

Social Communication Questionnaire (SCQ)

The Social Communication Questionnaire (SCQ) is a 40-item yes/no response screening tool developed by Rutter et al. [87] aimed at children above the age of 4 years. The SCQ was translated to 4 different languages and studied from 2019–2022.

Four studies were found to use this test. Aldosari et al. [67] tested an Arabic version and found high reliability, moderate sensitivity (79.6%) and good specificity (96.6%). In the Greek population (Karaminis,2022) [66] the sensitivity ranged from 86.3%-88.7%, and the specificity ranged from 83.7% to 91.4%. Both of these studies mention removing references to English rhymes and making it culturally appropriate to increase acceptability. In a Chinese population, Liu et al. [68] found the test to be sensitive and specific in populations both under and over 4 years of age. This team collected data on parental education levels suggesting an impact on acceptability. Sangare et al. [46] also evaluated the SCQ in the Malian population and found similar sensitivity and specificity, 71% and 72% respectively.

All studies displayed a high risk of bias in the patient selection domain and 75% showed a high risk related to the flow and timing domains. They all indicated unclear details to ascertain risk related to the reference standard. There were low concerns related to the applicability among the various domains in all the studies.

Childhood Autism Spectrum Test (CAST)

The Childhood Autism Spectrum Test (CAST) is an instrument developed in the UK for screening for autism in children 4–11 years old [88]. It was translated to 2 languages and applied in 3 studies from 2013–2017.

Two studies [34, 71] tested the reliability of the Mandarin CAST on a Chinese primary school population, reporting it to be good with Spearman rho = 0.73 and theta = 0.89 respectively. A similar age group was tested using a Spanish translation [72] leading to a sensitivity score of 83.9%, a specificity of 92.5%, and an internal consistency \((a)\) of 0.826. This study discussed some cultural adaptations such as changing unfamiliar games to culturally familiar descriptions.

During the quality appraisal, two studies [71, 72] indicated a high risk of bias in the patient selection. The study by Sun et al. [34] was analysed using the MMAT and deemed “yes” to all questions except for if they accounted for confounders and also showed a lack of details regarding whether participants were representative of the target population.

Autism Spectrum Quotient (Child) (AQ-C)

The Autism Spectrum Quotient (AQ) (Child), AQ-C, is a 50-item parent-report questionnaire developed for children aged 4–11 years [89]. The Mandarin version completed on 4–10-year-old children [73] showed moderate reliability with an α coefficient of 0.765 and good validity with a sensitivity of 94% and specificity of 88.2%.

Indian Scale for Assessment of Autism (ISAA) and Indian Autism Screening Questionnaire (IASQ)

The Indian Scale for Assessment of Autism (ISAA) is described as a validated and mandated tool for autism assessment in India. The ISAA is a 40-item instrument with a 5-point rating scale [74]. All individuals, including health professionals, are required to undergo training prior to administration. It was studied by 2 of the publications [74, 75].

The Indian Autism Screening Questionnaire (IASQ) was derived from the ISAA and is a shorter, 10-question tool that requires comparatively brief training that can be done online. It was tested by 1 of the studies [75]. In a study population aged 3–17 years the ISAA was found to have a high validity score r = 0.77, internal consistency and reliability Cronbach’s α = 0.93 and good discriminant ability AUC = 0.93 [74]. The Indian Autism Screening Questionnaire (IASQ) on a similar aged population led to good sensitivity of 97% and specificity of 81% [75].

Autism Behaviour Checklist (ABC)

The Autism Behaviour Checklist (ABC) developed by Krug et al. [90], is a rating scale with 57 items for children aged 12 – 14 years. It was tested on a Mandarin population [76] aged 1.5–14 years leading to a moderate sensitivity of 80.45% but poor specificity of 58.67%.

Autism Spectrum Screening Questionnaire (ASSQ)

The ASSQ is a screening questionnaire developed by Ehlers and team in 1999 [91] with 27 different questions structured for carers of 7- to 16-year-old children. In a Finnish population of 8-year-old children in 2012, it showed sensitivity = 89% and specificity = 82% [77].

Hiva

Hiva (a word meaning “wish” in the Kurdish language) is a tool developed in Iran for its population consisting of the ten most commonly occurring autism symptoms identified by Iranian parents and professionals. It is based on the DSM-IV criteria and is aimed at children aged 3 – 11 years old. It was trialled in 1 study in 2015 in both Kurdish and Persian [39] in a population aged 2–5 years old. The scale had high validity measures (sensitivity = 100% and specificity 97%). The tool had poor internal consistency with Cronbach’s α= 0.58.

Toddler Autism Screening Questionnaire (TASQ)

The Toddler Autism Screening Questionnaire was developed by Tsai and team [78] for Taiwanese children based on a qualitative study where a child psychiatrist interviewed families of children with autism. The tool includes 18 questions requiring yes/no answers. In their studied population aged 18 – 26 months, Tsai et al. found their devised screening tool to have high validity scores (sensitivity = 100% and specificity 97%).

For this review, the quality appraisals of the above once-studied screening tools were combined for the purpose of reporting and analysis. These studies (CAST [88], AQ-C [89], ISAA [74], IASQ [75], ABC [76], Hiva [39], TASQ [78]) demonstrated varied ratings on quality appraisal. They predominantly indicated a high risk of bias related to patient selection and flow and timing (75%). Most indicated an unclear risk of bias related to the reference standard (50%). However, there were mixed findings regarding the index test domain, with 50% indicating a low risk of bias, 37% rating unclear risk and 13% measuring high risk. All studies indicated low concerns regarding applicability.

Acceptability

None of the studies included in this review presented data on the acceptability (parental and service provider) of the autism screening tools.

Discussion

This review examined the literature on screening tools used for autism in CALD populations. A total of 51 studies that examined 13 screening tools in CALD populations were used to ascertain the accuracy, reliability, validity, and acceptability. Although a variety of tools were identified, only some tools were implemented in more than one study. Of the total number of studies, 20 studies examined the reliability, validity and/or accuracy of the screening tool. Nineteen studies provided only validity scores and 12 studies presented only reliability measures.

The psychometric evaluations of the tools varied both in the properties evaluated and in the number of studies that assessed each screener. The performance varied from excellent to poor in terms of sensitivity and specificity. Table 5 highlights a description of the key findings. For example, the M-CHAT was studied in several different languages with good reliability and/or validity of the tool found when used in Spanish, Turkish, Chinese, Korean, French, and Arabic populations indicating its potential utility in these groups. The M-CHAT-R/F, the updated version of the M-CHAT, appears to have similar results and was found to be valid and reliable in Spanish, Chinese, Portuguese, Serbian, Malian, Northern Soho, and Albanian communities.

Table 5 Summary description of key findings

However, the M-CHAT’s performance was not universally robust. The tool’s sensitivity was as low as 25% in a Sinhalese demographic [40]. It is worth noting that this study [40] was the only study that employed the M-CHAT within a South Asian nation. It was found that despite a “rigorous translation process” first to Sinhalese and then back to English to ensure validity, some behaviours significant for the diagnosis of autism were not recognised as abnormal by the Sinhalese mothers. This highlights a disadvantage of applying a tool designed for one population to another population where it might not align with the cultural context. The tool’s contrasting accuracy in different settings underscores a critical research gap.

On the other hand Perera et al. [92] demonstrated that a pictorial autism screening tool yielded 88% sensitivity in differentiating autism from typical developing Sinhalese children. Hence, there may be merits of using alternative methods such as pictorial scales for identifying autism in diverse populations [93] where there is poor evidence base supporting conventional screening tools and presents an area for further studies.

Other factors may have also affected the success of the M-CHAT in some studies. Samadi et al. [39] reported good utility of the M-CHAT in Kurdish and Persian in Iran. However, this may be attributable to the conduction of extensive workshops and educational sessions for those completing the assessment including parents. This is additional to the standardised M-CHAT guidelines and can not only influence the extrapolation of these findings but also may not be feasible to replicate in ongoing clinical practice.

Importantly, the studies in this review did not consistently report cut-off scores for a positive screen, sensitivity, and specificity for these instruments. To ensure standardised screening efforts, it may be significant to establish and report culturally appropriate cut-off points as more studies are conducted in populations without screening histories. This may be an area where further research and validation studies are beneficial to determine the ideal cut-off for each population.

Various translations of the SCQ have proven reliable across populations and demonstrated validity specifically within Arabic, Greek and Malian populations and higher sensitivity within a Chinese demographic, despite the larger sample size (n = 819) and wide age range (2 – 18 years). Liu et al. [68] analysed subgroups of various ages and found the validity scores to be consistent in < 4 years and > 4 years. This is clinically significant as there are limited options for autism screening between 30 and 48 months of age, with the M-CHAT-R/F being the only validated tool in China at the time but limited to toddlers up to 30 months of age. Thus, the SCQ demonstrates remarkable versatility across varying age groups in addition to good reliability and validity. Further research would be beneficial to determine if these psychometric properties translate to other cultures.

Given the various autism screening tools analysed within a variety of countries in this study, an accurate comparison of the screening tools cannot be completed. Although generally it can be noted that the M-CHAT appeared to have the highest overall validity and reliability, its low sensitivity within Sinhalese populations highlights that CALD populations cannot be generalised.

This is furthered by the fact that the four screening tools included in this review that were devised specifically for their culture, namely IASQ/ISAA for Indian populations, Hiva for Kurdish and Persian cohorts and the TASQ for Taiwanese children demonstrated excellent sensitivity (0.97 – 1). This indicates not only the accuracy, but also the benefit of tailoring the screening tool for the particular culture. The methodologies behind the development of these tools vary. The ISAA [74] and IASQ [75] originated from a collaborative effort from a body of health professionals who devised questions based on the Indian population. Items in the TASQ [78] were based on a qualitative study where a child psychiatrist conducted interviews with Taiwanese families. The Hiva scale [39] was based on DSM 5 criteria but the questions were devised specifically for the Persian and Kurdish populations. This emphasises that the key factor is not only the method of tool development, which can vary, but the consistent element is that it was created in partnership with local stakeholders within the context of the specific population.

The performance of these tools created within their intended populations further supports the notion that the correct application of existing tools in different cultural and linguistic settings extends beyond just translation. It requires a comprehensive assessment of potential mismatches in language and concepts, followed by adjustments to ensure comprehension by the target population. A review by Soto et al. [8] emphasised that the objective of cultural adaptation is to achieve “functional equivalence” with the original version which, as demonstrated by the varying psychometric results in this review, is difficult to establish with only translation.

During the process of this review, some cultural adjustments were noted. Some examples include the replacement of the name “peek-a-boo” with a description of the game for Mexican families undertaking the Mexican M-CHAT [41] and the removal of references to British rhymes in the Arabic SCQ [67]. However, this review did not rigorously examine the cultural adaptation process for each study and its adherence to recommended guidelines [94]. This would be a beneficial area for further research as it may influence the quality of the psychometric outcomes. Soto et al. [8] did delve into this area and found that the details of adaptation methods were rarely reported with a notably large variation in the processes that were reported.

The ultimate diagnosis of autism is based on identifiable behavioural and social-emotional patterns highlighted in the DSM criteria. Nevertheless, the understanding of these behaviours can be shaped by diverse cultural contexts, leading to variations in identification [6]. Evaluating difficulties in social communication hinges on deviations from culturally defined norms, thus inviting differing interpretations. Consequently, assigning an impairment in this category to children across different cultures may lack uniformity, potentially disregarding significant cultural nuances in ASD assessments. Following from this arise questions regarding the cross-cultural sensitivity and validity of the DSM criteria which forms the basis of most screening tools. This may form a key area for future research and consideration of a revision of the criteria to reflect cultural differences.

Another finding of this review is a noticeable gap in the literature with a lack of studies exploring the acceptability of autism screening tools within CALD populations. The absence of acceptability studies impacts our understanding of the broader societal impact and is both academically and clinically relevant to ensure equitable and culturally competent screening [95]. Without a clear understanding of how these tools are received within CALD communities, there may be inadvertent use of methods that are linguistically inappropriate or culturally insensitive. This may lead to inaccurate diagnoses or delays in early intervention and support. It can also result in mistrust within the communities towards the healthcare system [96], exacerbating disparities.

Clinical implications

There are several clinical implications stemming from this review. While it appears to have been useful in several studies, the use of multiple stages of translation and explanation as well as the requirement of additional personnel, training and education in the assessment introduces an additional layer of complexity to the assessment process. This can increase the risk of misunderstandings between the healthcare provider and parent/carer, potentially affecting the accuracy of the diagnosis.

Moreover, the use of multiple stages within the screening (e.g., follow-up interviews and phone calls) and diagnosis process may lead to increased drop-out or decreased follow-up rate. This can potentially result in further delays in intervention. The additional steps are also relevant in terms of policymaking as they can consume a considerable amount of time and resources with negative financial implications.

Based on this review, due to the heterogeneity of data, there was no currently available screening tool that appeared universally perfect for all CALD populations. This review did inform that the M-CHAT, and its updated version M-CHAT-R/F, as well as the SRS were the most frequently translated and utilised, all with good validity and reliability. Whilst the tools devised specifically for its culture (IASQ/ISAA, Hiva and TASQ) performed excellently, if it is not feasible to devise a novel measure, the community may consider one of the three available tools with appropriate culturally appropriate amendments.

Highlighting the potential positive impact of the findings of this review on clinical practice and policy is crucial. By recognising the importance of cultural sensitivity in screening and diagnosis, as emphasised in the review, clinicians and policymakers can prioritise the development and implementation of culturally appropriate assessment methods. This, in turn, can lead to more accurate diagnoses and early interventions tailored to the needs of children with autism in culturally and linguistically diverse communities. By involving local stakeholders in the research process, such as community leaders and healthcare providers, we can ensure that these efforts result in meaningful improvements in the lives of children with autism in CALD communities.

Incorporating cultural sensitivity and community engagement into the development and validation of autism screening tools is essential for ensuring their effectiveness across diverse populations. One approach is to involve community members, including parents, caregivers, and local healthcare providers, in the design and validation process. By soliciting input from culturally diverse perspectives, screening tools can be adapted to better reflect the cultural norms and expectations of different communities. For example, rather than relying solely on standardised behavioural criteria, as aforementioned this may have different interpretations in different cultures, screening tools could incorporate culturally specific behaviours and communication styles that may indicate autism in certain populations. Additionally, providing training and resources to healthcare professionals on culturally competent assessment techniques, may it be outside of the screening process, can help improve the accuracy of autism diagnoses in culturally diverse settings. Ultimately, by integrating cultural sensitivity and community engagement into the development and validation process, we can ensure that autism screening tools are more inclusive and reflective of diverse experiences.

Strengths and limitations

One of the strengths of this review lies in its inclusivity of a wide range of studies from diverse languages and cultural backgrounds, each of which is distinct from the others. This allows for a more rounded understanding of the complexities and variations in autism in CALD children across a variety of cultures. The strength is enhanced by the use of a systematic approach with broad inclusion criteria and the inspection by two independent reviewers.

This review has several limitations. First, the inclusion of a wide range of studies from diverse cultural backgrounds, while a strength, also introduces a potential limitation in terms of the heterogeneity of the data. The heterogeneity extends to the age groups studied by the screening tools. The various tools target different age groups however some studies extrapolated the use of the tool to outside of the recommended age range potentially affecting the validity and reliability. Moreover, within this review the analysis of single-stage studies was combined with that of studies utilising multiple stages of screening which may have affected the resulting psychometric parameters. Variations in research methodologies, sample sizes, and cultural contexts make it challenging to draw uniform conclusions and comparisons across all studies. Further research may group these studies into more specific categories.

Of significance is that the majority of the studies in this review exhibited a high risk of bias during quality assessment, using the QUADAS-2 and several studies lacked sufficient detail in their MMAT evaluations. This raises concern regarding the credibility of the findings and whether they can be generalised to other settings and contexts. Another limitation of this study is the absence of an examination of grey literature (defined as “literature that is not formally published in sources such as books or journal articles” [97]) and reference lists which may have allowed for more complete data on autism screening in CALD communities. Additionally, language barriers and the exclusion of studies not available in English may limit the comprehensiveness of the review, potentially excluding valuable research conducted in non-English-speaking regions. Furthermore, while the review highlights the cultural diversity within the selected studies, it focused only on reliability and validity which was demonstrated by psychometric parameters rather than the relevance of items to specific cultural contexts and as mentioned above, the adaptation process.

Conclusion

This review critically examined autism screening tools that have been employed in CALD populations, with a focus on their validity, reliability, accuracy, and acceptability. Notably, no studies have explored the acceptability of these tools and further research is needed in this area. By systematically exploring how these tools are perceived, understood, and embraced by CALD communities, we can develop a better understanding of their effectiveness and potential limitations in these populations.

Nevertheless, the findings demonstrate the variability in psychometric properties across diverse populations. This highlights the diversity of the unique nuances, expressions, and interpretations of autism within different communities and the importance of recognising the heterogeneity in our approach to screening. Dependence on screening tools developed in Western contexts may result in biases and inaccuracies when applied to other settings.

The review illustrates the benefit of cultural sensitivity during screening and diagnosis as accuracy in these areas has profound implications for culturally appropriate assessment and early intervention with equitable access. Further research is needed to enhance the development of valid and reliable culturally specific autism screening tools with the involvement of local stakeholders.

Additionally, there should be continued efforts to address the ongoing stigma associated with autism in some cultural contexts. Cultural beliefs regarding neurodevelopmental disorders vary widely, leading to misconceptions and misunderstandings about autism often hindering discussion and support-seeking. Language barriers, limited access to culturally sensitive information, and disparities in healthcare resources further exacerbate the issue. To combat this stigma effectively, strategies must be tailored to specific cultural contexts. This may include community-based education programs, culturally sensitive awareness campaigns, and the involvement of community leaders and trusted figures in spreading accurate information about autism. Empowering individuals and families within CALD communities to openly discuss and seek assistance for autism can help reduce stigma and promote acceptance and inclusion, thereby facilitating earlier diagnoses and access to interventions.

Availability of data and materials

The data generated and/or analysed during the current study may be available from the corresponding author upon reasonable request.

Abbreviations

ABC:

Autism behaviour checklist

ADI-R:

Autism diagnostic interview - revised

ASSQ:

Autism spectrum screening questionnaire

AQ-C:

Autism spectrum quotient - child

ASD:

Autism spectrum disorder

ASRS:

Autism spectrum rating scales

CAST:

Childhood autism spectrum test

CALD:

Culturally and linguistically diverse

CHAT:

Checklist for autism in toddlers

IASQ:

Indian autism screening questionnaire

ISAA:

: Indian scale for assessment of autism

MMAT:

Mixed methods appraisal tool

M-CHAT:

Modified checklist for autism in toddlers

M-CHAT-R/F:

Modified checklist for autism in toddlers, revised with follow-up

PPV:

Positive predictive value

Q-CHAT:

Quantitative checklist for autism in toddlers

QUADAS-2:

Quality assessment of diagnostic accuracy studies tool

SCQ:

Social communication questionnaire

SRS:

Social responsiveness scale

TASQ:

Toddler autism screening questionnaire

USA:

United States of America

UK:

United Kingdom

References

  1. Diagnostic A. Statistical Manual of Mental Disorders (DSM IV). USA: Association of American Psychiatry; 2005.

    Google Scholar 

  2. Zeidan J, Fombonne E, Scorah J, Ibrahim A, Durkin MS, Saxena S, et al. Global prevalence of autism: A systematic review update. Autism Res. 2022;15(5):778–90.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Maenner MJ, Warren Z, Williams AR, Amoakohene E, Bakian AV, Bilder DA, et al. Prevalence and characteristics of autism spectrum disorder among children aged 8 years—Autism and Developmental Disabilities Monitoring Network, 11 sites, United States, 2020. MMWR Surveill Summ. 2023;72(2):1.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Pham TTL, Berecki-Gisolf J, Clapperton A, O’Brien KS, Liu S, Gibson K. Definitions of culturally and linguistically diverse (CALD): a literature review of epidemiological research in Australia. Int J Environ Res Public Health. 2021;18(2):737.

    Article  PubMed  PubMed Central  Google Scholar 

  5. Elsabbagh M, Divan G, Koh YJ, Kim YS, Kauchali S, Marcín C, et al. Global prevalence of autism and other pervasive developmental disorders. Autism Res. 2012;5(3):160–79.

    Article  PubMed  PubMed Central  Google Scholar 

  6. de Leeuw A, Happé F, Hoekstra RA. A conceptual framework for understanding the cultural and contextual factors on autism across the globe. Autism Res. 2020;13(7):1029–50.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Liu F, Scheeren AM, Grove R, Hoekstra RA, Wang K, Guo D, et al. Exploring cultural differences in autistic traits: A factor analytic study of children with autism in China and the Netherlands. J Autism Dev Disord. 2022;52(11):4750–62.

    Article  PubMed  Google Scholar 

  8. Soto S, Linas K, Jacobstein D, Biel M, Migdal T, Anthony BJ. A review of cultural adaptations of screening tools for autism spectrum disorders. Autism. 2015;19(6):646–61.

    Article  PubMed  Google Scholar 

  9. Matson JL, Worley JA, Fodstad JC, Chung K-M, Suh D, Jhin HK, et al. A multinational study examining the cross cultural differences in reported symptoms of autism spectrum disorders: Israel, South Korea, the United Kingdom, and the United States of America. Res Autism Spectrum Disord. 2011;5(4):1598–604.

    Article  Google Scholar 

  10. Akechi H, Senju A, Uibo H, Kikuchi Y, Hasegawa T, Hietanen JK. Attention to eye contact in the West and East: Autonomic responses and evaluative ratings. PLoS ONE. 2013;8(3):e59312.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Uono S, Hietanen JK. Eye contact perception in the west and east: A cross-cultural study. PLoS ONE. 2015;10(2):e0118094.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Zhang J, Wheeler JJ, Richey D. Cultural Validity in Assessment Instruments for Children with Autism from a Chinese Cultural Perspective. 2006.

  13. Norbury CF, Sparks A. Difference or disorder? Cultural issues in understanding neurodevelopmental disorders. Dev Psychol. 2013;49(1):45.

    Article  PubMed  Google Scholar 

  14. Sicorello M, Stevanov J, Ashida H, Hecht H. Effect of gaze on personal space: A Japanese-German cross-cultural study. J Cross Cult Psychol. 2019;50(1):8–21.

    Article  Google Scholar 

  15. Ametepee LK, Chitiyo M. What we know about autism in Africa: A brief research synthesis. J Int Assoc Special Educ. 2009;10(1):11–3.

    Google Scholar 

  16. St. Amant HG, Schrager SM, Peña-Ricardo C, Williams ME, Vanderbilt DL. Language barriers impact access to services for children with autism spectrum disorders. J Autism Dev Disord. 2018;48:333–40.

    Article  PubMed  Google Scholar 

  17. Eapen V, Walter A, Guan J, Descallar J, Axelsson E, Einfeld S, et al. Maternal help-seeking for child developmental concerns: Associations with socio-demographic factors. J Paediatr Child Health. 2017;53(10):963–9.

    Article  PubMed  Google Scholar 

  18. Ou L, Chen J, Hillman K. Health services utilisation disparities between English speaking and non-English speaking background Australian infants. BMC Public Health. 2010;10:1–13.

    Article  Google Scholar 

  19. Hussain A, John JR, Dissanayake C, Frost G, Girdler S, Karlov L, et al. Sociocultural factors associated with detection of autism among culturally and linguistically diverse communities in Australia. BMC Pediatr. 2023;23(1):1–11.

    Article  Google Scholar 

  20. Lewis AK. Cultural and linguistic diversity among children and families referred for diagnostic evaluation of developmental delay and disability: Implications for service delivery. J Policy Pract Intellect Disabil. 2021;18(2):113–9.

    Article  Google Scholar 

  21. Guerrero MGB, Sobotka SA. Understanding the barriers to receiving autism diagnoses for Hispanic and Latinx families. Pediatr Ann. 2022;51(4):e167–71.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Kodak T, Bergmann S. Autism spectrum disorder: characteristics, associated behaviors, and early intervention. Pediatr Clin. 2020;67(3):525–35.

    Google Scholar 

  23. Lupindo BM, Maw A, Shabalala N. Late diagnosis of autism: Exploring experiences of males diagnosed with autism in adulthood. Curr Psychol. 2023;42(28):24181–97.

    Article  Google Scholar 

  24. MacKinnon NJ, Emery V, Waller J, Ange B, Ambade P, Gunja M, et al. Mapping Health Disparities in 11 High-Income Nations. JAMA network open. 2023;6(7):e2322310-e.

    Article  Google Scholar 

  25. Dixon DR, Granpeesheh D, Tarbox J, Smith MN. Early detection of autism spectrum disorders. International handbook of autism and pervasive developmental disorders: Springer; 2011. p. 197–214.

    Google Scholar 

  26. Association AP. Diagnostic and statistical manual of mental disorders. Text revision. 2000.

  27. Berument SK, Rutter M, Lord C, Pickles A, Bailey A. Autism screening questionnaire: diagnostic validity. Br J Psychiatry. 1999;175(5):444–51.

    Article  CAS  PubMed  Google Scholar 

  28. Cibralic S, Hawker P, Khan F, Mendoza Diaz A, Woolfenden S, Murphy E, et al. Developmental screening tools used with first nations populations: a systematic review. Int J Environ Res Public Health. 2022;19(23):15627.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Whiting PF, Rutjes AW, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–36.

    Article  PubMed  Google Scholar 

  30. Hong QN, Fàbregues S, Bartlett G, Boardman F, Cargo M, Dagenais P, Gagnon MP, Griffiths F, Nicolau B, O’Cathain A, Rousseau MC. The Mixed Methods Appraisal Tool (MMAT) version 2018 for information professionals and researchers. Education for information. 2018;34(4):285–91.

  31. Carakovac M, Jovanovic J, Kalanj M, Rudic N, Aleksic-Hil O, Aleksic B, et al. Serbian language version of the modified checklist for autism in toddlers, revised, with follow-up: cross-cultural adaptation and assessment of reliability. Sci Rep. 2016;6:38222.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Ruta L, Arduino GM, Gagliano A, Apicella F, Leonardi E, Famà FI, et al. Psychometric properties, factor structure and cross-cultural validity of the quantitative CHecklist for Autism in Toddlers (Q-CHAT) in an Italian community setting. Res Autism Spectr Disord. 2019;64:39–48.

    Article  Google Scholar 

  33. Seung HK, Ji J, Kim SJ, Sung I, Youn YA, Hong G, et al. Examination of the Korean Modified Checklist of Autism in Toddlers: Item Response Theory. J Autism Dev Disord. 2015;45(9):2744–57.

    Article  PubMed  Google Scholar 

  34. Sun X, Allison C, Auyeung B, Matthews FE, Baron-Cohen S, Brayne C. The Mandarin Chinese version of the Childhood Autism Spectrum Test (CAST): Test–retest reliability. Res Dev Disabil. 2013;34(10):3267–75.

    Article  PubMed  Google Scholar 

  35. Vorster C, Kritzinger A, Coetser LE, van der Linde J. Preliminary reliability of south african adaptation and northern sotho translation of the modified checklist for autism in toddlers, revised with follow-up. South Afr J Commun Disord. 2021;68(1):1–7.

    Google Scholar 

  36. Eldin AS, Habib D, Noufal A, Farrag S, Bazaid K, Al-Sharbati M, et al. Use of M-CHAT for a multinational screening of young children with autism in the Arab countries. Int Rev Psychiatry. 2008;20(3):281–9.

    Article  Google Scholar 

  37. Wong YS, Yang CC, Stewart L, Chiang CH, Wu CC, Iao LS. Use of the Chinese version modified checklist for autism in toddlers in a high-risk sample in Taiwan. Res Autism Spectrum Disord. 2018;49:56–64.

    Article  Google Scholar 

  38. Baduel S, Guillon Q, Afzali MH, Foudon N, Kruck J, Rogé B. The French version of the Modified-Checklist for Autism in Toddlers (M-CHAT): A validation study on a French sample of 24 month-old children. J Autism Dev Disord. 2017;47(2):297–304.

    Article  PubMed  Google Scholar 

  39. Samadi SA, McConkey R. Screening for autism in Iranian preschoolers: Contrasting M-CHAT and a scale developed in Iran. J Autism Dev Disord. 2015;45(9):2908–16.

    Article  PubMed  Google Scholar 

  40. Perera H, Wijewardena K, Aluthwelage R. Screening of 18–24-month-old children for autism in a semi-urban community in Sri Lanka. J Trop Pediatr. 2009;55(6):402–5.

    Article  CAS  PubMed  Google Scholar 

  41. Albores-Gallo L, Roldán-Ceballos O, Villarreal-Valdes G, Betanzos-Cruz BX, Santos-Sánchez C, Martínez-Jaime MM, Lemus-Espinosa I, Hilton CL. M‐CHAT Mexican version validity and reliability and some cultural considerations. Int Sch Res Notices. 2012;2012(1):408694.

  42. Canal-Bedia R, García-Primo P, Martín-Cilleros MV, Santos-Borbujo J, Guisuraga-Fernández Z, Herráez-García L, et al. Modified checklist for autism in toddlers: cross-cultural adaptation and validation in Spain. J Autism Dev Disord. 2011;41(10):1342–51.

    Article  PubMed  Google Scholar 

  43. Jensen K, Noazin S, Bitterfeld L, ra, Carcelen A, Vargas-Cuentas NI, et al. Autism Detection in Children by Combined Use of Gaze Preference and the M-CHAT-R in a Resource-Scarce Setting. J Autism Dev Disord. 2021;51(3):994–1006.

  44. Brennan L, Fein D, Como A, Rathwell IC, Chen CM. Use of the Modified Checklist for Autism, Revised with Follow Up-Albanian to Screen for ASD in Albania. J Autism Dev Disord. 2016;46(11):3392–407.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Guo C, Luo M, Wang X, Huang S, Meng Z, Shao J, et al. Reliability and validity of the Chinese version of Modified Checklist for Autism in Toddlers, Revised, with Follow-Up (M-CHAT-R/F). J Autism Dev Disord. 2019;49(1):185–96.

    Article  PubMed  Google Scholar 

  46. Sangare M, Toure HB, Toure A, Karembe A, Dolo H, Coulibaly YI, et al. Validation of two parent-reported autism spectrum disorders screening tools M-CHAT-R and SCQ in Bamako. Mali ENeurologicalSci. 2019;15: 100188.

    Article  PubMed  Google Scholar 

  47. Losapio MF, Siquara GM, Lampreia C, Lázaro CP, Pondé MP. Translation into Brazilian Portuguese and validation of the M-CHAT-R/F scale for early screening of autism spectrum disorder. Revista Paulista de Pediatria. 2022;41:e2021262.

  48. Magán-Maganto M, Canal-Bedia R, Hernández-Fabián A, Bejarano-Martín Á, Fernández-Álvarez CJ, Martínez-Velarte M, et al. Spanish cultural validation of the modified checklist for autism in toddlers, revised. J Autism Dev Disord. 2020;50(7):2412–23.

    Article  PubMed  Google Scholar 

  49. Coelho-Medeiros ME, Bronstein J, Aedo K, Pereira JA, Arraño V, Perez CA, et al. M-CHAT-R/F Validation as a screening tool for early detection in children with autism spectrum disorder. Revista chilena de pediatria. 2019;90(5):492–9.

    PubMed  Google Scholar 

  50. Kara B, Mukaddes NM, Altınkaya I, Güntepe D, Gökçay G, Özmen M. Using the Modified Checklist for Autism in Toddlers in a well-child clinic in Turkey: Adapting the screening method based on culture and setting. Autism. 2014;18(3):331–8.

    Article  PubMed  Google Scholar 

  51. Kara Uzun A, ÇuhaciÇakir B, Uğur Ç, Paksoy Z. Evaluation of toddlers aged between 18 and 30 months for autism spectrum disorders using the turkish version of the modified checklist for autism in toddlers test: A cross-sectional study. Anadolu Psikiyatri Dergisi. 2019;20(2):196–203.

    Google Scholar 

  52. Kondolot M, Özmert EN, Öztop DB, Mazıcıoğlu MM, Gümüş H, Elmalı F. The modified checklist for autism in Turkish toddlers: A different cultural adaptation sample. Research in Autism Spectrum Disorders. 2016;21:121–7.

    Article  Google Scholar 

  53. Cen CQ, Liang YY, Chen QR, Chen KY, Deng HZ, Chen BY, Zou XB. Investigating the validation of the Chinese Mandarin version of the Social Responsiveness Scale in a Mainland China child population. BMC psychiatry. 2017;17:1–5.

  54. Gau SSF, Liu LT, Wu YY, Chiu YN, Tsai WC. Psychometric properties of the Chinese version of the Social Responsiveness Scale. Res Autism Spectrum Disord. 2013;7(2):349–60.

    Article  Google Scholar 

  55. Wang J, Lee LC, Chen YS, Hsu JW. Assessing autistic traits in a Taiwan preschool population: Cross-cultural validation of the social responsiveness scale (SRS). J Autism Dev Disord. 2012;42(11):2450–9.

    Article  PubMed  Google Scholar 

  56. Zhou B, Zhou H, Wu L, Zou X, Luo X, Fombonne E, et al. Assessing the accuracy of the modified chinese autism spectrum rating scale and social responsiveness scale for screening autism spectrum disorder in chinese children. Neurosci Bull. 2017;33(2):168–74.

    Article  PubMed  PubMed Central  Google Scholar 

  57. Tehrani-Doost M, Shahrivar Z, Torabi N, Ansari S, Haji-Esmaeelzadeh M, Saeed-Ahmadi S. Cross-cultural validation and normative data of the social responsiveness scale in a group of Iranian general child population. J Autism Dev Disord. 2020;50(7):2389–96.

    Article  PubMed  Google Scholar 

  58. Jussila K, Kuusikko-Gauffin S, Mattila ML, Loukusa S, Pollock-Wurman R, Joskitt L, et al. Cross-cultural differences in the Parent Rated Social Responsiveness Scale (SRS)? Evaluation of the Finnish version among high-functioning school aged males with and without autism spectrum disorder. Res Autism Spectrum Disord. 2015;9:38–44.

    Article  Google Scholar 

  59. Bölte S, Poustka F, Constantino JN. Assessing autistic traits: Cross-cultural validation of the Social Responsiveness Scale (SRS). Autism Res. 2008;1(6):354–63.

    Article  PubMed  Google Scholar 

  60. Kamio Y, Moriwaki A, Inada N. Utility of teacher-report assessments of autistic severity in Japanese school children. Autism Res Treat. 2013;2013: 373240.

    PubMed  PubMed Central  Google Scholar 

  61. Cheon KA, Park JI, Koh YJ, Song J, Hong HJ, Kim YK, et al. The Social Responsiveness Scale in relation to DSM IV and DSM5 ASD in Korean children. Autism Res. 2016;9(9):970–80.

    Article  PubMed  PubMed Central  Google Scholar 

  62. Chun J, Bong G, Han JH, Oh M, Yoo HJ. Validation of Social responsiveness scale for Korean preschool children with autism. Psychiatry Investig. 2021;18(9):831–40.

    Article  PubMed  PubMed Central  Google Scholar 

  63. Fombonne E, Marcin C, Bruno R, Tinoco CM, Marquez CD. Screening for autism in Mexico. Autism Res. 2012;5(3):180–9.

    Article  PubMed  Google Scholar 

  64. Nguyen PH, Ocansey ME, Miller M, Le DTK, Schmidt RJ, Prado EL. The reliability and validity of the Social Responsiveness Scale to measure autism symptomology in Vietnamese children. Autism Res. 2019;12(11):1706–18.

    Article  PubMed  PubMed Central  Google Scholar 

  65. Zhou H, Zhang L, Wu L, Zou X, Luo X, Xia K, et al. Validity and reliability analysis of the Chinese parent version of the Autism Spectrum Rating Scale (6–18 years). Psychiatry Res. 2015;230(2):255–61.

    Article  PubMed  Google Scholar 

  66. Karaminis T, Stavrakaki S. The psychometric properties of the greek version of the social communication questionnaire. Autism Research. 2022.

  67. Aldosari M, Fombonne E, Aldhalaan H, Ouda M, Elhag S, Alshammari H, et al. Validation of the Arabic version of the Social Communication Questionnaire. Autism. 2019;23(7):1655–62.

    Article  PubMed  PubMed Central  Google Scholar 

  68. Liu SY, Wang X, Chen Q, Chen JJ, Jin CK, Zhan XL, Guo CH, Li XH, Lin LZ, Jing J. The validity and reliability of the simplified Chinese version of the Social Communication Questionnaire. Autism Res. 2022;15(9):1732–41.

  69. Ruta L, Chiarotti F, Arduino GM, Apicella F, Leonardi E, Maggio R, Carrozza C, Chericoni N, Costanzo V, Turco N, Tartarisco G. Validation of the quantitative checklist for autism in toddlers in an Italian clinical sample of young children with autism and other developmental disorders. Front Psychiatry. 2019;10:488.

  70. Stevanović D. Quantitative Checklist for Autism in Toddlers (Q-CHAT): A psychometric study with Serbian Toddlers. Research in Autism Spectrum Disorders. 2021;83:101760.

  71. Sun X, Allison C, Auyeung B, Matthews FE, Norton S, Baron-Cohen S, et al. Psychometric properties of the Mandarin version of the Childhood Autism Spectrum Test (CAST): An exploratory study. J Autism Dev Disord. 2014;44(7):1565–76.

    Article  PubMed  PubMed Central  Google Scholar 

  72. Morales-Hidalgo P, Roigé-Castellví J, Vigil-Colet A, Canals SJ. The Childhood Autism Spectrum Test (CAST): Spanish adaptation and validation. Autism Res. 2017;10(9):1491–8.

    Article  PubMed  Google Scholar 

  73. Sun F, Dai M, Lin L, Sun X, Murray AL, Auyeung B, et al. Psychometric properties of the Chinese Version of Autism Spectrum Quotient-Children’s Version: A sex-specific analysis. Autism Res. 2019;12(2):303–15.

    Article  PubMed  Google Scholar 

  74. Chakraborty S, Thomas P, Bhatia T, Nimgaonkar VL, Deshp, e SN. Assessment of severity of autism using the Indian scale for assessment of autism. Indian J Psychol Med. 2015;37(2):169–74.

  75. Chakraborty S, Bhatia T, Antony N, Roy A, Shriharsh V, ana, et al. Comparing the Indian Autism Screening Questionnaire (IASQ) and the Indian Scale for Assessment of Autism (ISAA) with the Childhood Autism Rating Scale-Second Edition (CARS2) in Indian settings. PloS one. 2022;17(9):e0273780.

  76. Chu JH, Bian F, Yan RY, Li YL, Cui YH, Li Y. Comparison of diagnostic validity of two autism rating scales for suspected autism in a large Chinese sample. World J Clin Cases. 2022;10(4):1206–17.

    Article  PubMed  PubMed Central  Google Scholar 

  77. Mattila ML, Jussila K, Linna SL, Kielinen M, Bloigu R, Kuusikko-Gauffin S, et al. Validation of the Finnish Autism Spectrum Screening Questionnaire (ASSQ) for clinical settings and total population screening. J Autism Dev Disord. 2012;42(10):2162–80.

    Article  PubMed  Google Scholar 

  78. Tsai W-C, Soong W-T, Shyu Y-IL. Toddler Autism Screening Questionnaire: Development and potential clinical validity. Autism. 2012;16(4):340–9.

  79. Tavakol M, Dennick R. Making sense of Cronbach’s alpha. Int J Med Educ. 2011;2:53.

    Article  PubMed  PubMed Central  Google Scholar 

  80. Robins DL, Fein D, Barton ML, Green JA. The Modified Checklist for Autism in Toddlers: an initial study investigating the early detection of autism and pervasive developmental disorders. J Autism Dev Disord. 2001;31:131–44.

    Article  CAS  PubMed  Google Scholar 

  81. Baron-Cohen S, Allen J, Gillberg C. Can autism be detected at 18 months?: The needle, the haystack, and the CHAT. Br J Psychiatry. 1992;161(6):839–43.

    Article  CAS  PubMed  Google Scholar 

  82. Robins DL, Casagrande K, Barton M, Chen C-MA, Dumont-Mathieu T, Fein D. Validation of the modified checklist for autism in toddlers, revised with follow-up (M-CHAT-R/F). Pediatrics. 2014;133(1):37–45.

  83. Allison C, Baron-Cohen S, Wheelwright S, Charman T, Richler J, Pasco G, et al. The Q-CHAT (Quantitative CHecklist for Autism in Toddlers): a normally distributed quantitative measure of autistic traits at 18–24 months of age: preliminary report. J Autism Dev Disord. 2008;38:1414–25.

    Article  PubMed  Google Scholar 

  84. Constantino JN, Gruber CP. Social responsiveness scale: SRS-2. 2012.

  85. Goldstein S, Naglieri JA. Autism spectrum rating scales (ASRS): Multi-Health System North Tonawanda, NY; 2009.

  86. Zhou H, Li C, Luo X, Wu L, Huang Y, Zhang L, et al. Cross-Cultural Revision and Psychometric Properties of theChinese Version of the Autism Spectrum Rating Scale (2–5 Years). Frontiers in neurology. 2018;9:460.

    Article  PubMed  PubMed Central  Google Scholar 

  87. Rutter MB, A; Lord, C. Social communication questionnaire. Los Angeles, CA: Western Psychological Services. 2003.

  88. Scott FJ, Baron-Cohen S, Bolton P, Brayne C. The CAST (Childhood Asperger Syndrome Test) Preliminary development of a UK screen for mainstream primary-school-age children. Autism. 2002;6(1):9–31.

    Article  PubMed  Google Scholar 

  89. Auyeung B, Baron-Cohen S, Wheelwright S, Allison C. The autism spectrum quotient: Children’s version (AQ-Child). J Autism Dev Disord. 2008;38:1230–40.

    Article  PubMed  Google Scholar 

  90. Krug DA, Arick J, Almond P. Behavior checklist for identifying severely handicapped individuals with high levels of autistic behavior. Child psychology & psychiatry & allied disciplines. 1980.

  91. Ehlers S, Gillberg C, Wing L. A screening questionnaire for Asperger syndrome and other high-functioning autism spectrum disorders in school age children. J Autism Dev Disord. 1999;29:129–41.

    Article  CAS  PubMed  Google Scholar 

  92. Perera H, Jeewandara KC, Seneviratne S, Guruge C. Culturally adapted pictorial screening tool for autism spectrum disorder: A new approach. World J Clin Pediatr. 2017;6(1):45.

    Article  PubMed  PubMed Central  Google Scholar 

  93. Janvier YM, Coffield CN, Harris JF, Mandell DS, Cidav Z. The Developmental Check-In: Development and initial testing of an autism screening tool targeting young children from underserved communities. Autism. 2019;23(3):689–98.

    Article  PubMed  Google Scholar 

  94. Guillemin F, Bombardier C, Beaton D. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines. J Clin Epidemiol. 1993;46(12):1417–32.

    Article  CAS  PubMed  Google Scholar 

  95. Carlton J, Griffiths H, Horwood A, Mazzone P, Walker R, Simonsz H. Acceptability of childhood screening: a systematic narrative review. Public Health. 2021;193:126–38.

    Article  CAS  PubMed  Google Scholar 

  96. Esler DM, Johnston F, Thomas D. The acceptability of a depression screening tool in an urban, Aboriginal community-controlled health service. Aust N Z J Public Health. 2007;31(3):259–63.

    Article  PubMed  Google Scholar 

  97. Higgins JP, Green S. Cochrane handbook for systematic reviews of interventions. 2008.

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

EH, SC, PH, ADM, and JRJ assisted in the conceptualisation of the research question and design of the study; EH and PH conducted independent data extraction and quality assessment; EH drafted the manuscript under the supervision of SC, JRJ, and VE. All authors (EH, SC, PH, JRJ, AH, ADM and VE) reviewed and approved the final manuscript.

Corresponding author

Correspondence to Valsamma Eapen.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huda, E., Hawker, P., Cibralic, S. et al. Screening tools for autism in culturally and linguistically diverse paediatric populations: a systematic review. BMC Pediatr 24, 610 (2024). https://doi.org/10.1186/s12887-024-05067-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12887-024-05067-5

Keywords