Skip to main content
  • Research article
  • Open access
  • Published:

Measuring early childhood development in multiple contexts: the internal factor structure and reliability of the early Human Capability Index in seven low and middle income countries

Abstract

Background

The fourth year of the Sustainable Development Agenda era calls for countries to continue to invest not only in interventions and policies that will promote global equity and sustainability, but also in the monitoring systems required to track progress against these targets. A more pragmatic solution to measuring children’s early development in low and middle income countries in particular, is required. This study explores the psychometric properties of the early Human Capability Index (eHCI), a population measure of holistic development for children aged 3–5 years, designed with the vision of being flexible and feasible for use in low resource and capacity settings.

Methods

Utilizing data from seven low and middle income countries: Brazil (n = 1810), China (n = 11,421), Kiribati (n = 8339), Lao People’s Democratic Republic (n = 7493), Samoa (n = 12,191), Tonga (n = 6214), and Tuvalu (n = 549), analyses explored the internal factor structure and reliability of scores produced by the tool within each country.

Results

Confirmatory factor analyses and internal consistency coefficients demonstrated that after local adaptation, translation, and different implementation methods across countries, the eHCI maintained the same factor structure of nine theoretically-based developmental domains: Physical Health, Verbal Communication, Cultural Knowledge, Social and Emotional Skills, Perseverance, Approaches to Learning, Numeracy, Reading, and Writing.

Conclusions

Findings support the aims of the eHCI in being adaptable and applicable for use within a range of low and middle income countries to facilitate measurement and monitoring of children’s early development, as is required for the tracking of progress towards the Sustainable Development Agenda.

Peer Review reports

Background

Global endorsement of the United Nations Sustainable Development Goals (SDGs), an agenda for which the healthy development of children is central [1], together with burgeoning evidence regarding the value of investing in children’s early years [2], have highlighted the need for services and supports that provide children with the opportunity not only to survive, but to developmentally thrive [3]. In turn, the creation and utilization of instruments that measure such development in children has gained momentum. The early Human Capability Index (eHCI), a population measure designed to capture the holistic development of children aged 3–5 years, represents one such effort. This paper presents preliminary evidence of the psychometric properties of scores produced by the eHCI, and highlights how the tool could make an important contribution to the task of evaluating early childhood policies and programs as well as monitoring children’s development in the early years.

Tracking progress towards healthy child development

Ratified by United Nations member countries in 2015, the Sustainable Development Agenda specifies 17 goals and 169 targets to end poverty, mitigate inequality, and protect the planet for a better future [4]. The fourth year of the SDG era calls for countries to continue to invest not only in interventions and policies that will promote global equity and sustainability, but also in the monitoring systems required to track progress against these targets and thus identify those at risk of falling behind. Of particular relevance to early childhood development, SDG 4.2 states that by 2030, countries must ensure that all girls and boys have access to quality early childhood development, care and pre-primary education so that they are ready for primary education. To track progress against this target, countries are required to monitor (i) the percentage of children under 5 years of age who are developmentally on track in health, learning and psychosocial wellbeing, and (ii) national participation rates in early childhood education.

While many countries monitor national enrolment rates in early childhood education [5], few track the status of children’s early development. Measuring progress toward SDG 4.2 calls for population monitoring of children’s early health and development outcomes. Indeed, child development contributes to a number of SDG targets, including those related to health, gender equity, and poverty reduction, and thus global monitoring of children’s early development is key to supporting progress toward the broader Sustainable Development Agenda [6]. Faced with limitations in terms of the measurement instruments available as well as the resources and capacity required to implement such monitoring systems, tracking children’s health and development in low and middle income countries will be a challenge. In this, however, lies an important opportunity to promote and address the current obstacles associated with measuring children’s early development.

A call for new measurement solutions

Measurement of children’s development is influenced by culture, language, and theory. What are considered important aspects of and appropriate goals for children’s development, as well as what are deemed suitable assessment techniques to capture this information, can vary considerably across cultures and contexts [7,8,9]. Consequently, although tools need to capture aspects of child development that are important to outcomes throughout the life course, they should also be aligned with local culture and early learning and development frameworks, so that they not only accurately reflect children’s capabilities, but also produce information relevant to local policy and practice [1, 8].

A number of measurement initiatives are currently underway to monitor children’s early development at national, regional, and global levels. Some examples include the Early Development Instrument (EDI), the Early Childhood Development Index (ECDI), the Caregiver Reported Early Development Instrument (CREDI), the International Development and Early Learning Assessment (IDELA), the East Asia-Pacific Early Chid Development Scales (EAP-ECDS), the Malawi Developmental Assessment Tool (MDAT), the Measurement of Development and Early Learning (MODEL), and the Regional Project on Child Development Indicators (PRIDI) [3]. Various characteristics of the instruments in use however, including the cost of licensing fees, the level of enumerator training required prior to administration, the time they take to administer, how they are administered, and their applicability and adaptability within different contexts, constitute considerable barriers to their utilization. This is especially the case in contexts where resources and capacity are limited. To overcome these challenges, international leaders in early childhood have called for a more pragmatic and reliable solution to measuring children’s early development in low and middle income countries in particular. It was against this background that the eHCI was developed.

The early Human Capability Index

Designed to capture key aspects of holistic development in children aged 3–5 years, the eHCI was developed with the vision of being feasible for use in low resource and capacity settings while having the ability to capture change in children’s development over time [10]. The tool includes approximately 60 items (dependent upon country adaptation) spanning nine developmental domains (Physical Health, Verbal Communication, Cultural Knowledge, Social and Emotional Skills, Perseverance, Approaches to Learning, Numeracy, Reading, and Writing) and can be completed via adult report (e.g. by children’s caregivers, teachers, or early childhood practitioners) in less than 10 min. The eHCI requires minimal resources to be implemented; the tool is available for anyone to use free of charge, little enumerator training is required, and it can be completed quickly and easily by any adult who knows the child. Further, the tool was designed so that it can be easily adapted and utilized within diverse contexts for a range of purposes, including population monitoring, the evaluation of effects of early childhood policies and programs, as well as longitudinal studies seeking to predict children’s future capabilities.

Development of the eHCI

The eHCI was originally developed in 2013 for the purposes of evaluating a program designed to support children and families to be better prepared for school in Tonga [11]. Consultations were undertaken to understand locally, what good child development at school entry looks like, first establishing broad areas (i.e. domains) of development and then identifying specific items within these areas. On the basis of consultations and the child development literature, a draft instrument was developed and independently reviewed by child development experts. Once translated into Tongan, stakeholders originally consulted reviewed the instrument to ensure content and face validity. Piloting was then conducted to determine respondent understanding of items, efficiency of data collection methods, if scale distributions were discriminating between children as theoretically expected, as well as any floor or ceiling effects (i.e. if items were too hard or too easy) for the targeted range of children aged 3–5 years. After revisions based on findings from the pilot as well as a final review by local stakeholders, the eHCI was implemented nationally. Exploration of the psychometric properties of scores produced by the eHCI census in Tonga demonstrated adequate discriminant validity (comparison of mean scores across children grouped by demographic characteristics met theoretical expectations, i.e. older children scored higher than younger children, girls received slightly higher scores than boys, children of more educated mothers received higher scores than those of less educated mothers) and internal scale reliability (through Rasch analysis) [10].

Utilization of the eHCI

The eHCI has since been adapted and utilized to support a range of early childhood development projects in several low and middle income countries, predominantly across the Asia-Pacific region [12,13,14,15,16,17]. Similar to a number of tools designed to measure children’s development [18,19,20], development and adaptations of the eHCI in each new context were informed by a combination of both theoretical conceptualization as well as local expert consultation regarding the key aspects of children’s development that are predictive of future capabilities. Through these consultative processes, content and face validity of the instrument were established and adaptations and translations were ensured to be capturing the true intent of each item [21]. The internal factor structure of the instrument, however, is yet to be explored within multiple countries.

Evaluating the psychometric properties of scores produced by a measurement tool is fundamental for its future utilization and effectiveness [22]. An instrument measuring children’s development that lacks in reliability and validity could produce biased scores that lead to ill-informed decisions. With eHCI data now available in multiple countries, work is needed to explore the tool’s validity and reliability. An instrument with the properties of the eHCI that produces scores that are psychometrically robust and appropriate for use within diverse settings has potential for global applicability. Indeed, such a tool could better enable population monitoring of children’s development, as is required for SDG 4.2, particularly in low and middle income countries, with the ultimate goal of shaping services and policy to promote global equity of children’s health and development.

The current study

This research is a first step in working to establish the psychometric properties of scores produced by the eHCI within different cultures and contexts. Utilizing data previously collected from seven low and middle income countries, Brazil, China, Kiribati, Lao People’s Democratic Republic (PDR), Samoa, Tonga, and Tuvalu, analyses sought to explore the internal factor structure and reliability of scores produced by the tool within each country. Findings will be used to guide recommendations regarding the reporting of eHCI results moving forward.

Methods

Measures

The early Human Capability Index

Completed by an adult who knows the child, the eHCI includes approximately 60 items (dependent upon country adaptation) measuring children’s development across nine domains: Physical Health, Verbal Communication, Cultural Knowledge, Social and Emotional Skills, Perseverance, Approaches to Learning, Numeracy, Reading, and Writing. Response options to each item are binary (“yes”/“no” or “able”/“unable”). The majority of items are positively worded so that the “yes”/“able” responses are scored as 1, and the “no”/“unable” responses are scored as 0. A small number of items (ranging from 4 in Kiribati and Lao PDR to 6 in Tonga) are negatively worded and thus are reverse scored. Individual item scores in each domain are averaged so that children receive a score for each developmental domain ranging from 0 to 1, with higher scores indicative of better development.

The eHCI underwent a local adaptation process to ensure the tool’s content and face validity in each country. Thus, although many items are similar across different adaptations of the eHCI, some items and domains differ across countries. To illustrate, the Perseverance domain is measured by the same 4 items across all adaptations of the eHCI. In contrast, the Physical Health domain varies from 2 items in Brazil, 3 items in Samoa and China, 4 items in Tonga, and 5 items in Tuvalu and Kiribati, while the Laotian version of the eHCI does not capture physical health as a result of local expert consultation. Table 1 presents the Tongan eHCI (in English) and the percentage of children for whom respondents reported yes/able for each item, while Additional file 1: Tables S1, Additional file 2: Table S2, Additional file 3: Table S3, Additional file 4: Table S4, Additional file 5: Table S5 and Additional file 6: Table S6 present the same information for remaining countries, highlighting similarities and differences between adapted versions of the instrument.

Table 1 Tongan eHCI items and n (%) children for whom their caregiver/teacher reported yes/able

Demographic characteristics

In addition to measuring children’s development, the eHCI collects information about children’s demographic characteristics as well as relevant contextual information. Specifically, respondents provide information about children’s age, gender, and special needs status, and then dependent upon country adaptation, they also provide information about children’s height and weight, their mother’s highest level of education, whether the child has attended preschool, if there are reading materials (i.e. books) in the child’s home, and caregivers’ engagement in six different types of stimulating activities with their children in the home (e.g. reading a book, playing, counting etc). Variables presented in this manuscript include children’s age, gender, preschool attendance, as well as maternal education.

Data collection procedures

Data included in this manuscript were collected from seven countries between 2013 and 17, utilizing different sampling techniques and data collection methods. Contextual information regarding data collection procedures in each country are summarized in Table 2.

Table 2 Data collection contexts and procedures

Participants

Characteristics of each country sample are presented in Table 3. Samples ranged in size from 549 children in Tuvalu to 12,191 in Samoa, with children ranging in age from 2 to 6 years. Though the eHCI was designed to capture the development of children aged 3–5 years, the tool has also been used to collect data on children who fall slightly outside of this age range. This is a result of varied data collection purposes across countries. For instance, in Lao PDR, 2 year olds were included in data collection as this dataset serves as the baseline measure for a randomized control trial; younger children needed to be included at baseline to ensure they also fall into midline and endline data collections in years to come. Each country sample had a relatively even split of males and females; maternal education ranged from the majority of children with mothers who had never attended school (30.2%), started (27.3%), or finished primary school in Lao PDR (29.1%), to the majority of children with mothers who had completed secondary school (42.2%) or tertiary studies in Tonga (17.8%); while the percentage of children who had attended preschool ranged from 23.2% in Lao PDR to 100.0% in Brazil.

Table 3 Sample descriptive characteristics

Statistical analysis

First, confirmatory factor analyses (CFAs) were conducted separately for each country to determine the fit of eHCI data to the theoretical structure of the instrument (i.e. nine developmental domains, or eight domains in the case of Lao PDR). Next, CFAs were conducted separately for children aged 3, 4, and 5 years old in each country (as this was the age range consistent across all countries) to explore any variation in fit based on children’s age. Children with missing age data were excluded from this analysis (Brazil n = 2, China n = 56, Kiribati n = 884, Lao PDR n = 852, Tonga n = 53, Tuvalu n = 3). Additional CFAs were conducted for the Lao PDR sample stratified by maternal education, to explore if fit of data to the theoretical structure of the eHCI varied by respondent’s level of education. Specifically, the sample was split into two groups: low maternal education (i.e. no school, started primary, finished primary) and high maternal education (i.e. finished secondary, tertiary) and CFAs were conducted separately for each group. Children with missing maternal education data were excluded from this analysis (n = 5). This analysis was conducted for the Lao PDR sample only, as it was the sole sample for which data were available on the education level of all respondents.

Goodness-of-fit indices including χ2 (p > 0.05 indicates good fit; Brown, 2006), root mean square error of approximation (RMSEA; values ≤0.06 indicate good model fit while those between 0.06 and 0.08 indicate satisfactory fit; Hu & Bentler, [24]; Yu, [25]), the Comparative Fit Index and the Tucker-Lewis Fit Index (CFI and TLI, respectively; values ≥0.95 indicate good fit while values between 0.90 and 0.95 indicate satisfactory fit; Hu & Bentler, [24]; Yu, [25]; Schreiber, Nora, Stage, Barlow, & King, [26]), as well as standardized factor loadings (≥ 0.4 considered high and thus deemed a good indicator of the underlying construct; Costello & Osborne, [27]) were used to evaluate model fit. CFAs were conducted in Mplus [28] utilizing polychoric correlation matrices and the weighted least squares mean and variance adjusted (WLSMV) estimation method, both of which are deemed most appropriate for use with binary-type data such as that of the eHCI [29,30,31], as well as oblique (geomin) factor rotation which assumes correlations amongst factors [31].

The internal reliability of eHCI domains was also examined for each country, which is often conducted in conjunction with factor analysis to measure how interrelated a set of items are and thus how well they, collectively, measure the underlying construct of focus [32]. Although the majority of similar research assesses internal reliability using Cronbach’s alpha [19, 20], increasingly, ordinal reliability coefficients, specifically ordinal alpha, are deemed to be more appropriate in the case of evaluating the internal reliability of scales including items with binary response options in particular [33]. As such, ordinal alpha coefficients were calculated as well as Cronbach’s alpha (≥ 0.70 deemed acceptable for both coefficients; Bland & Altman, [34]; Gadermann et al., [33]) to allow for comparison with previous research. These analyses were conducted using the package ‘psych’ in R-Studio [35].

Results

Model fit indices yielded from a CFA for each country are presented in Table 4. Although a statistically significant χ2 indicates poor model fit, this test is sensitive to sample size and thus large samples are likely to yield significant χ2 values [24, 36]. Considering country samples were large, we instead rely more heavily on other fit indices. While χ2 values were statistically significant in all countries, RMSEA values indicated good model fit (i.e. ≤ 0.06) in each country with the exception of Samoa, for which the RMSEA value (0.07) indicated satisfactory fit. CFI and TLI values were ≥ 0.90 in all countries, again indicating satisfactory fit of eHCI data to the theoretical structure of the instrument (i.e. 9 developmental domains, or 8 domains in Lao PDR). Table 4 also presents model fit indices from CFAs conducted separately for 3, 4, and 5-year-old children, demonstrating relatively inconsistent results across countries. For instance, although model fit indices indicated better fit of data to the theoretical structure of the eHCI for 5-year-old children in Brazil and Samoa, slightly better fit was observed for 3-year-olds in Tonga and Tuvalu, relative to that of older children.

Table 4 Model fit indices from confirmatory factor analyses in each country

Standardized factor loadings yielded from full sample CFAs are presented in Table 5 for Tonga, and Additional file 7: Tables S7, Additional file 8: Table S8, Additional file 9: Table S9, Additional file 10: Table S10, Additional file 11: Table S11 and Additional file 12: Table S12 for remaining countries (factor loadings yielded from CFAs for 3, 4, and 5-year-old children are available from authors upon request). Items had factor loadings ≥0.40 across domains and countries with few exceptions. Items that form Numeracy, Reading, and Writing domains in particular had high factor loadings (≥ 0.80 on average) consistently across countries with few exceptions, while the strength of factor loadings for remaining developmental domains tended to be more varied across countries. For example, factor loadings for items in the Verbal Communication domain in China ranged from 0.52–0.90, while those in Lao PDR ranged from 0.73–0.94. Similarly, factor loadings for items in the Cultural Knowledge domain in Lao PDR ranged from 0.58–0.83, while those in Brazil ranged from 0.76–0.99. Reverse-scored items in Physical Health, Social and Emotional Skills, and Perseverance domains had weak factor loadings in all countries but Brazil. In contrast, Brazil was the only country in which some non-reverse-scored items had weak factor loadings. Specifically, Reading item 6 for which just under 2% of children were reported to be able to read complex sentences had a factor loading of 0.36, and Writing item 1 for which all but just under 2% of children were reported to be able to scribble on paper had a factor loading of 0.21.

Table 5 Factor loadings from confirmatory factor analysis in Tonga

Table 6 presents model fit indices for CFAs for low versus high maternal education in Lao PDR, with standardized factor loadings presented in Additional file 13: Tables S13 and Additional file 14: Table S14. RMSEA, CFI and TLI values indicated better fit of eHCI data to the theoretical structure of the instrument (i.e. 8 domains in Lao PDR) when respondents had a higher level of education. Factor loadings for reverse-scored items in Social and Emotional Skills and Perseverance domains, however, were weak across both education groups.

Table 6 Model fit indices from confirmatory factor analyses in Lao PDR grouped by maternal education

Finally, Table 7 presents internal consistency coefficients for eHCI domains in each country, demonstrating varied results across domains. The Numeracy domain had consistently high internal reliability across countries, with ordinal α ≥ .91, which is considered high [33]. Verbal Communication (ordinal α ≥ 0.87), Cultural Knowledge (ordinal α ≥ 0.79), Social and Emotional Skills (ordinal α ≥ 0.70), Approaches to Learning (ordinal α ≥ 0.86), Reading (ordinal α ≥ 0.87), and Writing (ordinal α ≥ 0.78) domains also yielded internal consistency coefficients deemed to be acceptable across countries. In contrast, the remaining two domains, Physical Health and Perseverance, demonstrated less than satisfactory internal reliability with ordinal α < 0.70 in all countries with the exception of Tuvalu and Kiribati on the Physical Health domain (ordinal α = 0.77 and 0.76, respectively) and Tuvalu (ordinal α = 0.75) and Brazil (ordinal α = 0.75) on the Perseverance domain.

Table 7 Internal reliability of eHCI domains

Discussion

The current study presents the psychometric properties of scores produced by the eHCI in seven low and middle income countries. Results demonstrated adequate fit of eHCI data to the theoretical structure of the instrument measuring children’s development across 9 domains (or 8 domains in the case of Lao PDR). Overall, findings lend support to the aims of the eHCI in being adaptable and applicable for use within a range of low and middle income countries to facilitate measurement of children’s early development [10].

Psychometric findings

Samples utilized in this research differed considerably across countries in terms of children’s demographic backgrounds, data collection methods and purposes, as well as sampling techniques and sizes. Although it might be argued that such differences present a challenge to exploring and comparing the validity and reliability of the eHCI within multiple contexts, this is the pragmatic nature in which the instrument was intended to be used; for a range of purposes and across varied contexts. As Yapa and Bärnighausen [37] discuss, the resource constraints that come with research in low and middle income countries are often the driving force behind creative solutions. As such, we argue that there is strength in that the eHCI was found to demonstrate a common underlying factor structure within the varied contexts in which the instrument has been implemented.

Numeracy, Reading, and Writing domains in particular were found to be working consistently across countries. Items that form these domains had high factor loadings and these scales had high internal reliability across countries. Similar results have been reported for other measures of children’s development, for instance, factor analyses of domains that constitute the EDI, a teacher-completed checklist measuring children’s holistic development in their first year of school, have demonstrated the Language and Cognitive Development domain (which captures children’s literacy and numeracy skills) to have the best fit across multiple countries [38]. Examination of items that form Numeracy, Reading, and Writing domains in the eHCI highlight that little adaptation of these items was required across countries. For instance, the Numeracy domain covers the same concepts of shape, colour, and number recognition, counting ability, and knowledge of numerical concepts such as time and weight, across countries. This might suggest these domains to be the more universal aspects of children’s development, indeed such skills have been demonstrated to be important predictors of outcomes throughout the life course [39, 40], and thus results are consistently strong across countries. Such skills are also arguably more easily observable (i.e. it is likely that a caregiver or teacher knows if a child can read or count, as opposed to whether they know if a child is always wanting to learn new things as measured in the Approaches to Learning domain, or if a child knows good from bad foods as captured in the Physical Health domain), which may also have had an influence on results.

In contrast, results across Physical Health, Verbal Communication, Cultural Knowledge, Social and Emotional Skills, Perseverance, and Approaches to Learning domains tended to demonstrate more variation across country samples. This is unsurprising considering the nature of the skills measured by these domains, including health and hygiene practices, verbal communication abilities, knowledge of culture and culturally acceptable behaviours, social interactions and emotional regulation, as well as how tasks are approached and the ability to complete them, which are aspects of development that would be considered to be more contextually and culturally specific. To illustrate, although Social and Emotional Skills was demonstrated to be one of the distinct domains that the eHCI captures within each country, variation in the strength of factor loadings of items that form this domain between Brazil and Lao PDR might be, in part, explained by cultural differences in social interactions and the expression of emotions between the two countries. Item factor loadings were lower in Lao PDR (on average around 0.65–0.70), an individualistic culture in which emotion is perceived to be experienced internally within an individual, whereas the same item loadings were higher in Brazil (on average around 0.80–0.85), a collectivist culture whereby emotions are thought to occur between people and thus are expressed openly [41]. These results could also be attributed to variation in methodological bias across countries [42]. For instance, acquiescence, the tendency to agree with statements, has been demonstrated to be more common in collectivist cultures [43]. Variation of results across countries in this way highlights the important influence of culture on the measurement of children’s development, and the need for tools to capture not only the aspects of children’s development that are important predictors of later outcomes, but to also be aligned with local culture in order to produce information that both accurately reflects children’s abilities and is relevant to local policy and practice [1, 8].

Reverse-scored items in the Physical Health, Social and Emotional, and Perseverance domains had weak factor loadings in all countries but Brazil, indicating that, compared to other items, they are poorer measures of the underlying constructs being measured by these domains. These results reflect initial findings from analyses conducted in Tonga following the development of the tool, with Rasch analyses indicating that a number of reverse-scored items did not fit the model well relative to other items [10]. It is possible that enumerators and/or respondents had difficulty in understanding and/or responding to these negatively worded items. Previous research has shown that reverse-scored items tend to load onto a factor separate to the construct they are intended to measure, that instead reflects aspects of item method [44, 45]. This was not observed in Brazil however, and this could be due to sample differences in this context relative to other countries. Specifically, Brazil was the only sample for which only preschool teachers completed the eHCI, as opposed to a combination of children’s caregivers and teachers, or caregivers only as in other samples. It might be that children’s caregivers and teachers do not respond to the eHCI in the same way, or that a minimum level of education or literacy is required to understand and respond to items. Indeed, results demonstrated better fit of data to the theoretical structure of the eHCI in Lao PDR amongst more educated caregivers who responded to the tool, compared to those less-educated. However, weak factor loadings for reverse-scored items were maintained when analyses were run separately for caregivers who had low versus high education. Together, results raise important questions regarding respondent reliability that need to be explored by future research.

When considering children’s age in determining how the instrument is operating, results were inconsistent across countries (i.e. in some countries best fit was observed for 3-year-olds and in others for 5-year-olds), and with the exception of Brazil, model fit indices did not vary in magnitude greatly, indicating that the eHCI appears to work relatively consistently across the age range of 3–5 years. Although these results provide some insight into how effective the eHCI is in measuring development across children of different ages, analyses focused on the discriminant validity of the tool, including if items capture a continuum of development for children of different ages, are needed to further explore this question.

Finally, internal reliability results for Physical Health and Perseverance domains indicated that items that form these domains, collectively, are not a good measure of these underlying constructs. It is possible that this is a result of reverse-scored items (which could be measuring constructs separate to that intended) making up a large proportion of these domains (i.e. one in two items in the Perseverance domain are reverse-scored across countries, while for the Physical Health domain this is the case for one in two items in Brazil, one in three items in Samoa and China, one in four items in Tonga, and one in five items in Tuvalu and Kiribati). Nevertheless, local adaptation of the tool in each country deemed all items important to children’s early development in their contexts, and thus it was not the intention of the current research to exclude items on the basis of psychometric results. An example is the first item in the Physical Health domain regarding children being frequently sick. Although we would not naturally assume that this item measures a child’s skills or capabilities (and subsequently it does not work well in the model), in the contexts of countries of focus whereby illness is common, it was deemed important for the eHCI to provide information regarding children’s experience of illness as one aspect of their holistic development.

Study limitations

When interpreting results of the study, it is important to be cognizant of three limitations. First, although the majority of countries studied utilized a census approach to data collection and thus are considered nationally representative, sampling strategies employed in Brazil, China, and Lao PDR (see Table 2) posit that results may differ if eHCI data were to be collected on nationally representative samples in these countries. Next, results indicate that reverse-scored items may not be operating as intended. Beyond analyses presented in this study, however, the information required to be able to explore this further (for example, insight into respondents’ understanding of reverse-scored items) is not currently available. Finally, it is important to reiterate that demonstrating consistent internal factor structure and reliability, as has been done in this study, is not complete evidence of a valid tool and must be considered together with results from additional psychometric analyses.

Study implications

Relative to other measures of early childhood development currently utilized, the eHCI requires minimal resources to be implemented. Initial psychometric results suggest that this has not come at the cost of the validity and reliability of the instrument. Demonstrating a consistent internal factor structure and reliability is one important aspect of the comprehensive evaluation of an instrument’s validity and reliability. Although not within the scope of the current study, additional work is underway to explore the extent to which eHCI domains can discriminate amongst children’s abilities by a range of demographic and contextual variables, are associated with scores on other measures of child development, show reliability amongst respondents, and are able to predict children’s future outcomes. A low-burden instrument that is both easily adaptable and psychometrically robust within multiple contexts in this way has potential utility internationally. Indeed, such a tool might better enable population monitoring of children’s development, as is required for the tracking of progress towards the Sustainable Development Agenda, particularly in low and middle income countries.

In terms of the reporting of and utilization of data produced by the eHCI, results suggest that eHCI data should continue to be reported across the instrument’s 9 theoretically-based developmental domains, or 8 domains in the case of Lao PDR. Reporting of children’s development across different areas of development in this way enables the identification of areas of both strength and need, and as a result can help to shape more targeted approaches to intervention or policy development. SDG 4.2 however, calls for the monitoring of children who are “developmentally on track”, a concept that, as with children’s development more broadly, is likely to vary across contexts. As such, if the eHCI is to be recommended to track progress against SDG 4.2 in future, research needs to not only work to further validate the instrument, but also determine how “developmentally on track” might be classified utilizing eHCI data.

Conclusion

Initial psychometric results demonstrate that scores produced by the eHCI, after processes of local adaptation, translation and implementation, maintained a similar factor structure of 9 theoretically-based developmental domains (or 8 domains in the case of Lao PDR) within a range of low and middle income countries. Future research is needed to build on these results and help to determine if the eHCI is able to fulfil its purpose of being a reliable, valid, and feasible tool which can help to facilitate the evaluation of early childhood policies and programs as well as measurement and monitoring of children’s development in the early years, particularly in low and middle income countries.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author (AS) on reasonable request.

Abbreviations

CFA:

Confirmatory Factor Analysis

CFI:

Comparative Fit Index

CREDI:

Caregiver Reported Early Development Instrument

ECDI:

Early Childhood Development Instrument

EDI:

Early Development Instrument

EHCI:

early Human Capability Index

IDELA:

International Development and Early Learning Assessment

MDAT:

Malawi Developmental Assessment Tool

MODEL:

Measurement of Development and Early Learning

PDR:

People’s Democratic Republic

PRIDI:

Regional Project on Child Development Indicators

RMSEA:

Root Mean Square Error of Approximation

SDG:

Sustainable Development Goal

TLI:

Tucker-Lewis Fit Index

WLSMV:

Weighted Least Squares Mean and Variance

References

  1. Raikes A, Yoshikawa H, Britto PR, Iruka I. Children, youth and developmental science in the 2015–2030 global sustainable development goals. Soc Res Child Dev Soc Policy Rep. 2017;30(3):1–23.

    Article  Google Scholar 

  2. Britto PR, Lye SJ, Proulx K, Yousafzai AK, Matthews SG, Vaivada T, et al. Nurturing care: promoting early childhood development. Lancet. 2017;389:91–102.

    Article  Google Scholar 

  3. Fernald L, Prado E, Kariger P, Raikes A. A toolkit for measuring early childhood development in low- and middle-income countries. Washington DC: World Bank Group; 2017.

    Google Scholar 

  4. United Nations General Assembly. Report of the open working Group of the General Assembly on sustainable development goals: New York City: UNGA; 2014. a/68/970.

  5. World Bank. Atlas of sustainable development goals 2018: world development indicators. Washington, DC: World Bank; 2018.

    Book  Google Scholar 

  6. Britto PR. Early moments matter for every child. New York: UNICEF; 2017.

    Google Scholar 

  7. Hambleton RK, Merenda PF, Speilberger CD. Adapting educational and psychological tests for cross cultural assessment. Mahwah: Lawrence Erlbaum Associates Inc.; 2005.

    Google Scholar 

  8. Keller H, Kärtner J. The cultural solution of universal developmental tasks. Gelfand M, Chiu C, Hong Y, editors. New York: Oxford University Press; 2013.

    Google Scholar 

  9. Rao N, Sun J, Becher Y. Assessing early development and learning across cultures: the East Asia Pacific-early child development scales. Assess Dev Matters. 2015;7(2):21–5.

    Google Scholar 

  10. Brinkman S, Thanh VB. Early Child Development in Tonga. Washington, DC: World Bank; 2017. Available from: http://documents.worldbank.org/curated/en/898671481187892018/pdf/110821-PUB-PUBLIC-pubdate-12-7-2016.pdf

    Google Scholar 

  11. MacDonald K, Brinkman S, Jarvie W, Machuca-Sierra M, McDonall K, Messaoud-Galuis S, et al. Pedagogy versus School Readiness: The Impact of a Randomized Reading Instruction Intervention and Community-Based Playgroup Intervention on Early Grade Reading Outcomes in Tonga. WPS 7944; Washington, D.C.: World Bank Group; 2017.

    Book  Google Scholar 

  12. Brinkman S, Sincovich A, Danchev P. The Status of Early Childhood Health and Development in Northern Lao PDR: Baseline Results from the Early Childhood Education Project. Washington, D.C: World Bank Group; 2016. Available from: http://documents.worldbank.org/curated/en/228221488446175113/The-status-of-early-childhood-health-and-development-in-northern-Lao-PDR

    Google Scholar 

  13. Brinkman S, Sincovich A, Than VB. Early Childhood Development in Samoa: Baseline results from the Samoan Early Human Capability Index. Washington, D.C: World Bank Group; 2017. Available from: http://documents.worldbank.org/curated/en/363761563248716398/pdf/Main-Report.pdf

    Book  Google Scholar 

  14. Brinkman S, Sincovich A, Thanh VB. The Status of Early Childhood Health and Development in Kiribati: Results from a population wide census. Washington, D.C: World Bank Group; 2017. Available from: http://documents.worldbank.org/curated/en/853791563251577440/pdf/The-Status-of-Early-Childhood-Health-and-Development-in-Kiribati-Results-from-a-Population-Wide-Census.pdf

    Book  Google Scholar 

  15. Brinkman S, Sincovich A, Thanh VB. How are Tuvalu's children developing? Evidence-based policy recommendations for better early childhood development in Tuvalu. Washington, D. C: World Bank Group; 2017. Available from: http://documents.worldbank.org/curated/en/373411563250298102/How-are-Tuvalu-s-children-developing-Evidence-based-policy-recommendations-for-better-early-childhood-development-of-Tuvaluan-children

    Google Scholar 

  16. Santos DD, Zanon C. The Brazilian experience with the eHCI: the adaptation process, reduction proceedings, and validation study: Laboratório de Estudos e Pesquisas em Economia Social; 2017.

  17. Zhao J, Brinkman SA, Song Y, Lu C, Young ME, Zhang Y, et al. Measuring Early Childhood Development with the Early Human Capability Index (eHCI): A Reliability and Validity Study in China. J Pediatr. under review.

  18. Janus M, Offord D. Development and psychometric properties of the early development instrument (EDI): a measure of children's school readiness. Can J Behav Sci. 2007;39:1–22.

    Article  Google Scholar 

  19. Rao N, Sun J, Richards B, Weber AM, Sincovich A, Darmsradt GL, et al. Assessing Diversity in Early Childhood Development in the East Asia-Pacific. Child Indic Res. 2018;12(1):235–54.

    Article  Google Scholar 

  20. Pisani L, Borisova I, Dowd AJ. Developing and validating the international development and early learning assessment (IDELA). Int J Educ Res. 2018;91:1–15.

    Article  Google Scholar 

  21. International Test Commission. The ITC Guidelines for Translating and Adapting Tests. 2nd ed; 2017.

    Google Scholar 

  22. Downing SM, Haladyna TM. Handbook of test development. Mahwah: Lawrence Erlbaum Associates; 2006.

    Google Scholar 

  23. World Bank. World Bank open data 2019 Available from: https://data.worldbank.org/.

    Google Scholar 

  24. Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Model Multidiscip J. 1999;6(1):1–55.

    Article  Google Scholar 

  25. Yu C. Evaluating cutoff criteria of model fit indices for latent variable models with binary and continuous outcomes. Los Angeles: University of California; 2002.

  26. Schreiber JB, Nora A, Stage FK, Barlow EA, King J. Reporting structural equation modeling and confirmatory factor analysis results: A review.The Journal of Educational Research. 2006;99:323–338.

    Article  Google Scholar 

  27. Costello AB, Osborne JW. Best practices in exploratory factor analysis: four recommendations for getting the most from your analysis. Pract Assess Res Eval. 2005;10(7):1–9.

    Google Scholar 

  28. Muthen LK, Muthen BO. Mplus Version 7.31. Los Angeles: Muthen & Muthen; 2012.

    Google Scholar 

  29. Flora DB, LaBrish C, Chalmers RP. Old and new ideas for data screening and assumption testing for exploratory and confirmatory factor analysis. Front Psychol. 2012;3(55):1–21.

    Google Scholar 

  30. Holgado-Tello FP, Chacón-Moscoso S, Barbero-García I, Vila-Abad E. Polychoric versus Pearson correlations in exploratory and confirmatory factor analysis of ordinal variables. Qual Quant. 2010;44(1):153–66.

    Article  Google Scholar 

  31. Brown TA. Confirmatory factor analysis for applied research. New York: Guildford Press; 2006.

    Google Scholar 

  32. Pett MA, Lackey NR, Sullivan JJ. Making sense of factor analysis: the use of factor analysis for instrument development in health care research. Thousand Oaks, California: Sage Publications; 2003.

    Book  Google Scholar 

  33. Gadermann AM, Guhn M, Zumbo BD. Estimating ordinal reliability for Likert-type and ordinal item response data: a conceptual, empirical, and practical guide. Pract Assess Res Eval. 2012;17(3):1–13.

    Google Scholar 

  34. Bland JM, Altman DG. Statistics notes: Cronbach's alpha. BMJ (Clin Res ed). 1997;314(7080):572.

    Article  CAS  Google Scholar 

  35. R Core Team. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2014.

    Google Scholar 

  36. Byrne BM. Structural equation modeling with Mplus: basic concepts, applications, and programming. New York: Taylor and Francis Group; 2012.

    Google Scholar 

  37. Yapa HM, Bärnighausen T. Implementation science in resource-poor countries and communities. Implement Sci. 2018;13(154):1–13.

    Google Scholar 

  38. Janus M, Brinkman S, Duku E. Validity and psychometric properties of the early development instrument in Canada, Australia, United States and Jamaica. Soc Indic Res. 2011;103(2):283–97.

    Article  Google Scholar 

  39. Brinkman S, Gregory T, Harris J, Hart B, Blackmore S, Janus M. Associations between the early development instrument at age 5 and Reading and numeracy skills at ages 8, 10 and 12: a prospective linked data study. Child Indic Res. 2013;6(4):695–708.

    Article  Google Scholar 

  40. Duncan G, Claessens A, Huston A, Pagani L, Engel M, Sexton H, et al. School readiness and later achievement. Dev Psychol. 2007;43(6):1428–46.

    Article  Google Scholar 

  41. Mesquita B. Emotions in collectivist and individualist contexts. J Pers Soc Psychol. 2001;80(1):68–74.

    Article  CAS  Google Scholar 

  42. He J, van de Vijver F. Bias and Equivalence in Cross-Cultural Research. Online Readings Psychol Cult. 2012;2(2):1–19.

  43. Harzing AW. Response styles in cross-national survey research: a 26-country study. Int J Cross-cult Manag. 2006;6:243–66.

    Article  Google Scholar 

  44. Swain SD, Weathers D, Niedrich RW. Assessing three sources of Misresponse to reversed Likert items. J Mark Res. 2008;45(1):116–31.

    Article  Google Scholar 

  45. Netemeyer RG, Bearden WO, Sharma S. Scaling procedures: issues and applications. Newbury Park: Sage Publications, Inc; 2003.

    Book  Google Scholar 

Download references

Acknowledgements

We wish to acknowledge the agencies who supported the collection of eHCI data which were utilized in the current study. Data collection was funded by the Fundação Maria Cecilia Souto Vidigal in Brazil, the China Development Research Foundation in China, the World Bank Group through the Early Childhood Education Project in Lao PDR, and the Global Partnership for Education through the Pacific Early Age Readiness and Learning (PEARL) Program in Kiribati, Samoa, Tonga, and Tuvalu.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

SAB developed the instrument and led adaptations within each country. CZ and DDS contributed to the acquisition of data. AS led the analysis and writing of the manuscript. SAB and TG contributed to the analysis plan and supported development of the manuscript. AS, TG, CZ, DDS, JL, and SAB contributed to the interpretation of results, revised the manuscript, and approved the final version of the manuscript for submission.

Corresponding author

Correspondence to Alanna Sincovich.

Ethics declarations

Ethics approval and consent to participate

The current study is a secondary analysis of pre-existing, de-identified data and thus was deemed exempt from requiring ethical review by the University of Adelaide Human Research Ethics Committee. Consent to participate was obtained from respondents verbally prior to data collection. Verbal consent was sought as this was the method through which all data were collected from respondents (i.e. enumerators asked respondents a series of questions verbally), and this was aligned with the various ethical and research approvals in each country.

Consent for publication

Not applicable.

Competing interests

AS and SAB are employed by the World Bank Group outside of the submitted research. The authors declare no other no other competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Brazil eHCI items and n (%) children for whom their caregiver/ teacher reported yes/able.

Additional file 2: Table S2.

China eHCI items and n (%) children for whom their caregiver/ teacher reported yes/able.

Additional file 3: Table S3.

Kiribati eHCI items and n (%) children for whom their caregiver/ teacher reported yes/able.

Additional file 4: Table S4.

Lao PDR eHCI items and n (%) children for whom their caregiver/ teacher reported yes/able.

Additional file 5: Table S5.

Samoa eHCI items and n (%) children for whom their caregiver/ teacher reported yes/able.

Additional file 6: Table S6.

Tuvalu eHCI items and n (%) children for whom their caregiver/ teacher reported yes/able.

Additional file 7: Table S7.

Factor loadings from confirmatory factor analysis in Brazil.

Additional file 8: Table S8

. Factor loadings from confirmatory factor analysis in China.

Additional file 9: Table S9.

Factor loadings from confirmatory factor analysis in Kiribati.

Additional file 10: Table S10.

Factor loadings from confirmatory factor analysis in Lao PDR.

Additional file 11: Table S11.

Factor loadings from confirmatory factor analysis in Samoa.

Additional file 12: Table S12.

Factor loadings from confirmatory factor analysis in Tuvalu.

Additional file 13: Table S13.

Factor loadings from confirmatory factor analysis in Lao PDR – low maternal education.

Additional file 14: Table S14.

Factor loadings from confirmatory factor analysis in Lao PDR – high maternal education.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sincovich, A., Gregory, T., Zanon, C. et al. Measuring early childhood development in multiple contexts: the internal factor structure and reliability of the early Human Capability Index in seven low and middle income countries. BMC Pediatr 19, 471 (2019). https://doi.org/10.1186/s12887-019-1852-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12887-019-1852-5

Keywords