The reliability and validity of outcome measures for atopic ... - Core

5 downloads 0 Views 214KB Size Report
ted. The ICCs and coefficient of variations (CV) means of the erythema ..... Hanifin JM, Thurston M, Omoto M, Cherill R, Tofte SJ, Graeber M. The eczema area ...
International Journal of Women's Dermatology 1 (2015) 150–154

Contents lists available at ScienceDirect

International Journal of Women's Dermatology

The reliability and validity of outcome measures for atopic dermatitis in patients with pigmented skin: A grey area☆,☆☆ C.Y. Zhao, MBBS, MMed (Clin Epi) a,d, A. Wijayanti d, M.C. Doria, MD, FPDS a,d, A.G. Harris a,d, S.V. Jain, BSc (Med) Hons d, K.N. Legaspi, MD, FPDS a,d, N.C. Dlova, MBChB, FCDerm, PhD c, M.G. Law, MA, MSc, PhD b, D.F. Murrell, MA, BMBCh, FAAD, FACD, MD, FRCP a,d,⁎ a

Department of Dermatology, St. George Hospital, Sydney, Australia Kirby Institute, University of New South Wales, Sydney, Australia c Derpartment of Dermatology, Nelson R. Mandela School of Medicine, University of KwaZulu-Natal, Durban, South Africa d Faculty of Medicine, University of New South Wales, Sydney, Australia b

a r t i c l e

i n f o

Article history: Received 1 February 2015 Received in revised form 17 May 2015 Accepted 18 May 2015 Keywords: Atopic dermatitis Outcome measures Skin of color

a b s t r a c t Background: Outcome measures for atopic dermatitis (AD) patients with pigmented skin have neither been developed nor validated. Objective: To compare the reliability and validity of four common AD outcome measures in patients with various levels of skin darkness. Method: The inter- and intra-rater reliability and construct validity of the EASI (Eczema Area and Severity Index), objective-SCORing Atopic Dermatitis (oSCORAD), Three Items Severity index (TIS) and Six Areas, Six Sites Atopic Dermatitis (SASSAD) were evaluated in 18 patients of various levels of skin darkness, using their full body photographs, by five trained clinicians. Results: The inter-rater reliability intraclass coefficient (ICCs) and 95% confidence intervals were poor for highly pigmented patients: EASI -.054(-.200 to .657), oSCORAD -.089(-.206 to .598), TIS -.21(-.24 to .147), SASSAD -.071(-.200 to .631); fair for mildly pigmented patients: EASI .464(.140-.839), oSCORAD .588(.265-.89), TIS.524(.200-.865), SASSAD .41(.045-.775); and fair to good for non-pigmented patients: EASI .64(.330-.908), oSCORAD .586(.263-.889), TIS .403(.09-.809), SASSAD .667(.358-.916). Erythema likely contributed to the inter-rater variability. Construct validity had significant correlations across all measures in non-pigmented patients, but no correlations in highly pigmented patients. Conclusion: AD outcome measures have poor reliability and validity in highly pigmented patients, with variations in erythema perception being a contributor. © 2015 The Authors. Published by Elsevier Inc. on behalf of Women's Dermatologic Society. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

Introduction Skin color contains vital diagnostic clues in dermatology, as it reflects the underlying pathological process. Patients with pigmented skin are a complex population in dermatology. Inflammatory conditions such as atopic dermatitis (AD) and psoriasis are more difficult to assess in these patients, putting them at risk for misdiagnosis or mistreatment. Many times, patients with skin of color have been excluded from clinical trials in dermatology due to the difficulties of assessing disease severity. There are many reasons to account for the difficulties in assessing pigmented patients with AD. Phenotypic variations secondary to genetic differences is one reason; for example, filaggrin-2 mutation variations ☆ Funding: This study was partly supported by the independent learning project (ILP) program and Honours program supervised by the authors M.G.L and D.F.M. ☆☆ Conflict of interest: The authors state no conflict of interest. ⁎ Corresponding author. E-mail address: [email protected] (D.F. Murrell).

in AD have been found in African-American patients, and have been associated with a more persistent disease course (Margolis et al., 2014; Torrelo, 2014). Another example is that pigmented skin has been shown to be less likely to develop erythema when exposed to irritants (Berardesca and Maibach, 2003). Also, cultural and environmental factors can change how the skin is cared for, leading to further heterogeneity in manifestation. In addition, the clinician’s perception of color may be distorted by the background skin pigmentation or be mistaken for post-inflammatory hyperpigmentation (Ahmad Fadzil et al., 2009). Furthermore, clinical experience with managing patients with skin of color is a contributing factor. As a result, considerable intrarater and interrater variations in assessing the patient can occur, while the validity is compromised by the clinical heterogeneity. AD is a common dermatological condition that can affect patients of all ethnicities and skin types. In the pediatric population, AD has comparably high prevalence of 17% in the United States, 14% in England, 24% in Japan, 17% in Korea, 17% in South Africa (mixed Caucasians and Blacks), 20% in Kenya, and 32% in Melbourne, Australia (Esamai et al.,

http://dx.doi.org/10.1016/j.ijwd.2015.05.002 2352-6475/© 2015 The Authors. Published by Elsevier Inc. on behalf of Women's Dermatologic Society. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

C.Y. Zhao et al. / International Journal of Women's Dermatology 1 (2015) 150–154

2002; Oh et al., 2004; Robertson et al., 2004; Shaw et al., 2011; Simpson et al., 2009; Sugiura et al., 1998; Zar et al., 2007). A systematic review in 2012 has also found that in Africa, Eastern Asia, Western Europe and parts of Northern Europe, trends in AD were mainly increasing (Deckers et al., 2012). The course of AD is typically chronic, requiring ongoing monitoring and an accurate assessment instrument. Also, many clinical trials have being conducted to evaluate interventions for AD, calling for a uniform outcome measure. Significant international efforts have been made to facilitate the standardization and validation of AD outcome measures. The Harmonising Core Outcome Measures (HOME) for eczema initiative has had several meetings (Chalmers et al., 2014; Schmitt and Williams, 2010; Schmitt et al., 2012), and confirmed that excoriation, erythema, edema, or papulation and lichenification are four essential components for the assessment of AD severity. A recent systematic review indicated that out of the 16 proposed outcome measures used in clinical trials, only the EASI (Eczema Area and Severity Index) and the SCORing Atopic Dermatitis (SCORAD) have received adequate validation (Schmitt et al., 2013). Meanwhile, a recent HOME consensus recommended the EASI alone as the optimal outcome measure (Chalmers et al., 2014). Despite all of the above progress, the assessment of AD in pigmented-skin patients is still a grey area requiring attention. In fact, a study has found the underreporting of patient’s skin type in clinical trials, with only 59.5% of the clinical trials published in the United States between 2000 and 2009 reporting the patient’s race or ethnicity (Hirano et al., 2012). Another systematic review showed that there is a dearth of studies demonstrating efficacy of systemic AD therapy in different racial and ethnic patient subsets in the United States (Bhattacharya and Silverberg, 2014). The aim of this study was to contribute to the expanding work in the standardization of AD outcome measures by addressing the issue of disease assessment in patients with pigmented skin. A prospective study was conducted to compare the interrater and intrarater reliability, as well as convergent construct validity, of the four most commonly used atopic dermatitis outcome measures in patients with various levels of skin darkness. This study also aimed to explore the underlying factors contributing to the variations, such as erythema. Materials and methods This prospective study was granted ethical approval from the South Eastern Sydney Health District Human Research Ethics Committee Northern Sector (reference: HREC/12/POWH/155).

151

patients were African-American. All assessors were required to attend a training lecture on the use of the EASI, oSCORAD, TIS, and SASSAD. Also, the assessors were required to attend a debriefing session prior to each scoring session to raise queries regarding the administration of these scoring systems. The assessors were completely blinded to the identity of the patients chosen. Scoring process The assessments were performed for 2-hour sessions, over 4 days. Each session was limited to 2 hours in length, to avoid assessor fatigue. Full-body photographs of the 18 patients were presented on a screen of at least 1.5 m by 1.5 m. Three patients with various levels of skin pigmentations, whose identities were unknown to the assessors, were also arranged by a separate investigator to have their photographs repeatedly shown at the end of the 18 patients for intrarater reliability testing. For each of the patients scored, the assessors were given four color-coded scoring sheets including the four measures. The assessors were given the time to view the photographs until they were satisfied with their scores. Each assessor was neither allowed to look at their own scores from the other outcome measures, nor another assessor’s scores. When any patients had minor body parts missing, which five patients did, all assessors were asked not to assess the particular missing body part across all scores. Data input All outcome measure scores were calculated by two separate study investigators. Data input was performed by one investigator, then separately double-checked by another investigator. The five patients with minor body parts missing from their photographs had their EASI, SASSAD, and oSCORAD’s total denominators reduced to reflect the exclusion of the corresponding body parts. Categorization of skin pigmentation levels Each patient’s skin pigmentation was scored by all assessors on a numerical scale of 0 to 10, ranging from 0 representing no pigmentation, to 10 representing the darkest level of pigmentation. The average of each patient’s pigmentation score across the five assessors was then used to categorize patients into three groups: nonpigmented (score range 0-3), mildly pigmented (score range 3.1-7) and highly pigmented (score range 7.1-10). These ranges were chosen as they divide into three approximately equal categories.

Outcome measures tested Statistical analysis The outcome measures evaluated in this study were chosen as these have been most frequently validated as per a systemic review published in 2013 (Schmitt et al., 2013). These include the EASI, SCORAD, of which has a clinician-reported only version called the objective SCORAD (oSCORAD), Three Items Severity index (TIS) and Six Areas, Six Sites Atopic Dermatitis (SASSAD; Berth-Jones, 1996; Hanifin et al., 2001; Stalder and Taieb, 1993; Wolkerstorfer et al., 1999). Participants and assessors The full-body photographs of 20 patients with AD were obtained from dermatology outpatient clinics from Sydney. Two patients were later excluded as they had more than two body parts missing from their full-body photographs. Five assessors participated in the scoring process (D.F.M, M.J.D, A.G.H, S.V.J, and K.L). All assessors were either qualified dermatologists or have been doing full-time dermatology research, and hence had been familiar with atopic dermatitis. Two (M.J.D and K.L) had trained in the Philippines and were used to darker-skinned patients. One had trained in North Carolina (D.F.M), where approximately one third of

All statistical analyses were performed using SPSS Version 22.0 (Armonk, NY; IBM Corp.). A professor of statistics (M.G.L) from the Kirby Institute of University of New South Wales provided help with choosing the most appropriate statistical tests. For reliability testing, both interrater and intrarater reliabilities were assessed by the Intraclass coefficient (ICC) with 95% confidence interval (CI), using a one-way random analysis variance model. When an ICC is below .40, the clinical correlation is poor; when it is between .40 and .59, the level of correlation is fair; when it is between .60 and .74, the level of correlation is good; and when it is between .75 and 1, the level of clinical significance is excellent (Cicchetti, 1994). Scatterplots were constructed to illustrate interrater differences across all outcome measures and skin types. To determine whether the erythema components contributed to the variability in reliability, the erythema component and the “total minus erythema component” of each outcome measure were separately inputted. The ICCs and coefficient of variations (CV) means of the erythema component and the “total minus erythema component” were then calculated. The null hypothesis is that the ICC for the erythema component

152

C.Y. Zhao et al. / International Journal of Women's Dermatology 1 (2015) 150–154

would be bigger than ICC for the “total minus erythema component,” while the CV means across all patients should be smaller. This would indicate that erythema did not contribute to the variability of the scores. To determine the significance of the difference in the ICCs, the CVs of the erythema components and the “total minus erythema components” were correlated using the paired T-test. For convergent construct validity, all outcome measures were correlated with each other, and the respective Spearman rho correlation coefficients were determined. Additional assessments by an overseas dermatologist Additional convergent construct validity testing was performed by a South African dermatologist with expertise in assessing atopic dermatitis in pigmented skin patients (N.C.D). The results were compared to the Australian dermatologists’ results to determine whether experience would improve the validity of outcome measures in highly pigmentedskin patients. Results Demographics Altogether, 18 patients were included in the final analysis for interrater reliability and convergent construct validity (Table 1). Of these, three were of Asian background, 11 were of Caucasian background, two were of African background, and four were of Indian background. Three patients were also used for intrarater reliability analysis (Table 2). Inter-rater reliability The interrater reliability ICCs (and their 95% CIs) for the highly pigmented patients were: TIS, -.21 (-.24 to .147); SASSAD, -.071 (-.2 to .631); EASI, -.054 (CI: -.2 to .657); and oSCORAD, -.089 (-.206 to .598), indicating very poor interrater reliabilities. The ICCs for the mildly pigmented patients were: TIS, .524 (.2-.865); SASSAD, .341 (.045.775); EASI, .464 (.14-.839); and oSCORAD, .588 (.265-.890), indicating fair interrater reliability. The ICCs for the nonpigmented patients were: TIS, .403 (.09-.809); SASSAD, .667 (.358-.916); EASI, .64 (.33.908); and oSCORAD, .586 (.263-.889), indicating fair interrater reliability for the TIS and oSCORAD, and good interrater reliability for the EASI and SASSAD. Interrater scatterplots showed that, in highly pigmented skin patients, all scores have poor interrater reliability regardless of disease severity. In mildly pigmented patients, all scores had poorer interrater reliability with increased disease severity. In nonpigmented patients, interrater reliability does not appear to be influenced by disease severity (Supplementary Fig. 1 and Table 3). Erythema’s contribution to variability The inter-rater reliability ICCs and coefficient of variations (CV) means of the erythema components and “total minus erythema

Table 2 Characteristics of the Patients for Intra-rater Reliability Testing (n = 3).

Age⁎ Sex Ethnicity Average pigmentation score

Patient 1

Patient 2

Patient 3

60 Female Caucasian 2.0

22 Female Asian 5.4

25 Female African 7.6

⁎ When the investigator was unable to ascertain what the patient’s age was when his/her photo was taken, an estimate of age was made by two separate investigators.

components” were compared against each another. For the highly pigmented patients, the EASI and oSCORAD had slight, but clinically significant, superior reliabilities when the erythema components were excluded (EASI: -.171 vs. -.072, p = .034; oSCORAD: -.230 vs. -.062, p = .04; Table 3). This superior reliability was not present in the TIS or SASSAD, or in any other pigmentation groups. These results suggest that the erythema components have likely contributed to the variability of EASI and oSCORAD in highly pigmented skin patients. However, given the poor ICCs of the “total minus erythema components,” other factors were likely to have also contributed to the variability. When comparing the CV means (an indicator of variability) in highly pigmented patients, there were higher CV means in the erythema components than the “total minus erythema components”, across all outcome measures (Table 4). The higher CV mean of the erythema component was not present in any other outcome measure or pigmentation groups, except for SASSAD in mildly pigmented patients. These results were also evident of erythema’s contribution to the variability of the outcome measures in highly pigmented patients. Intrarater reliability For the patients with highly pigmented skin, the intrarater reliability ICCs were poor in all scores except for the SASSAD (Table 5). For the patients with mildly pigmented skin, the intrarater reliability ICCs were poor in the EASI and oSCORAD, and fair for TIS and SASSAD. For the patients with nonpigmented skin, the intra-rater reliabilities of the oSCORAD and TIS were poor, but for EASI and SASSAD were good. Given the wide 95% CIs of the results, the findings were limited by poor power. However, these results may suggest that in patients with highly pigmented skin, intrarater unreliability is more likely to be unreliable. Convergent construct validity In highly pigmented patients, none of the scoring instruments significantly correlated with each other. In mildly pigmented patients, SASSAD is not statistically significantly correlated with any of the other scoring instruments. The correlations of the other three outcome instruments were EASI with TIS: .829 (p = .021), EASI with oSCORAD: .857 (p = .014), and oSCORAD with TIS: .919 (p = .003). In nonpigmented patients, the correlations of all four outcome measures were mostly statistically significant: EASI with TIS: .919 (p = .003), EASI with oSCORAD: 1.000 (p b .01), EASI with SASSAD: .786 (p = .036), oSCORAD with TIS: .919 (p = .003), oSCORAD with SASSAD: .786 (p = .036), and TIS with SASSAD: .793 (p = .033). These results

Table 1 Characteristics of the Total Patient Cohort (n = 18).

N (%) Age range (years) Mean age (years) Male (%) Female (%)

Highly pigmented⁎

Mildly pigmented

Nonpigmented

Overall

4 (22.2) 7-40 26.8 2 (50) 2 (50)

7 (38.9) 2-65 29.1 5 (71.4) 2 (28.6)

7 (38.9) 1-65 32.3 3 (42.9) 4 (57.1)

18 (100) 1-65 29.8 10 (55.6) 8 (44.4)

⁎ The pigmentation groups were derived from the mean pigmentation given by the assessors: nonpigmented (score range 0-3), mildly pigmented (score range 3.1-7), and highly pigmented (score range 7.1-10).

C.Y. Zhao et al. / International Journal of Women's Dermatology 1 (2015) 150–154

153

Table 3 Inter-rater Reliability ICCs of the EASI, OSCORAD, TIS, and SASSAD Total scores; the Erythema Component; and the “Total Minus Erythema Component” by Pigmentation Level. EASI Highly pigmented group ICCs (95% CI; n = 4) Total score .509 (.300-.731) Erythema -.171 (-.200 to .363) Total minus erythema -.072 (-.201 to .628) Mildly pigmented group ICCs (95% CI; n = 7) Total score .464 (.140-.839) Erythema .607 (.290-.896) Total minus erythema .358 (.570-.785) Nonpigmented group ICCs (95% CI; n = 7) Total score .640 (.330-.908) Erythema .749 (.470-.941) Total minus erythema .533 (.209-.869) Overall ICCs (95% CI; n = 18) Total score .431 (.223-.673) Erythema .609 (.409-.798) Total minus erythema .509 (.300-.731)

oSCORAD

TIS

SASSAD

-.089 (-.206 to .598) -.230 (-.245 to .015) -.062 (-.198 to .645)

-.210 (-.240 to .147) -.055 (-.200 to .656) -.229 (-.308 to .399)

-.071 (-.201 to .631) -.088 (-.206 to .599) -.144 (-.222 to .462)

.588 (.265-.890) .636 (.320-.906) .500 (.176-.855)

.524 (.199-.865) .603 (.281-.895) .413 (.100-.814)

.341 (.045-.775) .601 (.280-.895) .193 (-.051 to .670)

.586 (.263-.889) .423 (.108-.820) .560 (.235-.879)

.403 (.092-.809) .409 (.097-.812) .303 (.180-.751)

.667 (.358-.916) .783 (.526-.950) .459 (.139-.837)

.621 (.423-.805) .509 (.300-.731) .597 (.396-.790)

.480 (.270-.710) .568 (.363-.772) .436 (.228-.677)

.504 (.294-.747) .701 (.522-.853) .340 (.141-.598)

suggest the poorer convergent construct validity of all outcome measures in highly pigmented patients. Additional assessments by an overseas dermatologist The same 18 patients were assessed by the South African dermatologist (N.C.D). In highly pigmented patients, none of the scoring instruments significantly correlated, except for EASI with SASSAD (p = .051). In mildly pigmented patients, all outcome measures correlated well, but were limited by low power, as there were only two patients in this group (as per pigmentation score out of 10 rated by the South African dermatologist). In nonpigmented patients, the correlations of all four outcome measures were mostly statistically significant: EASI with TIS: .645 (p = .023), EASI with oSCORAD: .867 (p b .01), EASI with SASSAD: .967 (p b .01), oSCORAD with TIS: 1.000 (p b .01), oSCORAD with SASSAD: .872 (p b .01), and TIS with SASSAD: .648 (p = .023). These results were comparable to the Australian dermatologists, confirming the poorer convergent construct validity of all outcome measures in highly pigmented patients. Discussion To the authors’ knowledge, this is the first study to evaluate the commonest AD outcome measures in patients with pigmented skin. Overall, all AD outcome measures—the EASI, the oSCORAD, the TIS, and the SASSAD—may have poor reliability and validity in patients with very dark skin. All four measures had very poor interrater reliability when used in patients of highly pigmented skin with ICCs of b .4, regardless of disease severity. Thus, we would warn against the

Table 4 Coefficient of Variation (CV) Means of the Erythema Components Versus the “Total Minus Erythema Components” of the Four Outcome Measures by Pigmentation Level. EASI Highly pigmented group CVs (n = 4) Erythema .987 Total minus erythema .525 Mildly pigmented group CVs (n = 7) Erythema .359 Total minus erythema .659 Nonpigmented group CVs (n = 7) Erythema .578 Total minus erythema .746

oSCORAD

TIS

SASSAD

.961 .331

1.01 .499

.852 .427

.323 .283

.249 .377

.323 .471

.284 .526

.201 .807

.387 .647

performance of AD severity assessments by two different doctors, at different time points, in dark-skinned patients. All four measures also produced poorer results when used in mildly pigmented patients as compared to nonpigmented patients, with the level of variation increasing with disease severity. Also, all measures had poor intra-rater reliability in highly pigmented patients, although the power of the results was limited by the small sample size. The study suggested that erythema is a contributor to the inter-rater variations, with higher CV means of erythema components compared to the “total minus erythema components.” This finding echoed a previous study by Ben-Gashir and Hay (2002), which investigated the effect of including and excluding erythema assessment on the assessment of AD severity; when the erythema component of the oSCORAD was excluded and adjusted for the total, disease was found to be severer in Black children. Our study went into more breadth to evaluate this issue. Given that, in our study, the interrater ICCs of the “total minus erythema components” were also poor in the highly pigmented patients, other factors are likely have also contributed to the variability. The inferior reliability of all outcome measures in nonpigmented patients suggested by this study when compared to other studies is likely due to the fact that patients were virtually scored instead of in a routine clinic setting. Three-dimensional assessments for lichenification and papulation were likely compromised, as was the feeling of heat via palpation to distinguish between active inflammation and postinflammatory hyperpigmentation. Given the recent growth of teledermatology worldwide, this study highlighted that virtual scoring may be suboptimal for a thorough and accurate assessment of AD. The authors also acknowledge the limited number of patients used for this study, given the difficulties to obtain high-definition, full body photos of very dark patients. This was compensated for by using at least five trained clinicians to perform the assessments. Another limitation was that being a single-center study, the poorer reliability and validity may not be demonstrated in all other settings and locations. To compensate for this, a South African dermatologist experienced with N95% of her patients being Fitzpatrick skin types V-VI from Durban, South Africa, was invited to also score the same patients for convergent construct validity analyses. Regarding construct validity, all four measures were again inferior in highly pigmented patients, even when scored by a South African dermatologist with experience in assessing dark skin. This is suggestive of underlying heterogenicity of clinical signs in highly pigmented patients, which may not have been included in the four common AD outcome measures. From this study, a new outcome measure may be needed to take into account the manifestation and perception of erythema, as well as other clinical heterogeneity found in dark AD patients such as ichthyosis or prurigo nodularis (Vachiramon et al., 2012). A grey scale may more

154

C.Y. Zhao et al. / International Journal of Women's Dermatology 1 (2015) 150–154

Table 5 Intra-rater Reliability ICCs of the EASI, oSCORAD, TIS and SASSAD by Pigmentation Level.

Highly pigmented Mildly pigmented Nonpigmented

EASI

oSCORAD

TIS

SASSAD

.391 (-.528 to .911) .037 (-.746 to .819) .830 (.184-.980)

.075 (-.728 to .832) -.537 (-.922 to .476) -.151 (-.818 to .747)

-.434 (-.899 to .574) .429 (-.494 to .918) -.760 (-.922 to .122)

.787 (.064-.975) .458 (-.467 to .924) .699 (-.134 to .963)

accurately reflect the erythema shown in ethnic skin, while a heat or temperature scale may help to distinguish between inflammation and post-inflammatory hyperpigmentation. The development of the new outcome measure should be followed by a multicenter validation study in a routine clinical setting. In the field of dermatology, this study also has implications for the validation of psoriasis outcome measures, as well as patch test reading protocols, in patients with very dark skin. In conclusion, the study suggests that AD outcome measures have poor reliability and validity in patients with very dark skin. A new outcome measure may be needed to take into account the manifestation and perception of erythema and other clinical signs in these patients. Supplementary data to this article can be found online at http://dx. doi.org/10.1016/j.ijwd.2015.05.002.

Acknowledgments The authors would like to thank all the AD patients who participated in this study.

References Ahmad Fadzil MH, Ihtatho D, Mohd Affandi A, Hussein SH. Objective assessment of psoriasis erythema for PASI scoring. J Med Eng Technol 2009;33(7):516–24. Ben-Gashir MA, Hay RJ. Reliance on erythema scores may mask severe atopic dermatitis in black children compared with their white counterparts. Br J Dermatol 2002; 147(5):920–5. Berardesca E, Maibach H. Ethnic skin: overview of structure and function. J Am Acad Dermatol 2003;48(Suppl. 6):S139–42. Berth-Jones J. Six area, six sign atopic dermatitis (SASSAD) severity score: a simple system for monitoring disease activity in atopic dermatitis. Br J Dermatol 1996;135(Suppl. 48):25–30. Bhattacharya T, Silverberg JI. Efficacy of systemic treatments for atopic dermatitis in racial and ethnic minorities in the United States. JAMA Dermatol 2014;150(11):1232–4. Chalmers JR, Schmitt J, Apfelbacher C, Dohil M, Eichenfield LF, Simpson EL, et al. Report from the Third International Consensus Meeting to Harmonise Core Outcome Measures for Atopic Eczema/Dermatitis Clinical Trials (HOME). Br J Dermatol 2014; 171(6):1318–25. Cicchetti DV. Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology. Psychol Assess 1994;6(4):284. Deckers IA, McLean S, Linssen S, Mommers M, van Schayck CP, Sheikh A. Investigating international time trends in the incidence and prevalence of atopic eczema 1990-2010: a systematic review of epidemiological studies. PLoS One 2012;7(7):e39803.

Esamai F, Ayaya S, Nyandiko W. Prevalence of asthma, allergic rhinitis and dermatitis in primary school children in Uasin Gishu district, Kenya. East Afr Med J 2002;79(10): 514–8. Hanifin JM, Thurston M, Omoto M, Cherill R, Tofte SJ, Graeber M. The eczema area and severity index (EASI): assessment of reliability in atopic dermatitis. EASI Evaluator Group. Exp Dermatol 2001;10(1):11–8. Hirano SA, Murray SB, Harvey VM. Reporting, representation, and subgroup analysis of race and ethnicity in published clinical trials of atopic dermatitis in the United States between 2000 and 2009. Pediatr Dermatol 2012;29(6):749–55. Margolis DJ, Gupta J, Apter AJ, Ganguly T, Hoffstad O, Papadopoulous M, et al. Filaggrin-2 variation is associated with more persistent atopic dermatitis in African American subjects. J Allergy Clin Immunol 2014;133(3):784–9. Oh JW, Pyun BY, Choung JT, Ahn KM, Kim CH, Song W, et al. Epidemiological change of atopic dermatitis and food allergy in school-aged children in Korea between 1995 and 2000. J Korean Med Sci 2004;19(5):716–23. Robertson CF, Roberts MF, Kappers JH. Asthma prevalence in Melbourne schoolchildren: have we reached the peak? Med J Aust 2004;180(6):273–6. Schmitt J, Williams H. Harmonising Outcome Measures for Eczema (HOME). Report from the First International Consensus Meeting (HOME 1), 24 July 2010, Munich, Germany. Br J Dermatol 2010;163(6):1166–8. Schmitt J, Spuls P, Boers M, Thomas K, Chalmers J, Roekevisch E, et al. Towards global consensus on outcome measures for atopic eczema research: results of the HOME II meeting. Allergy 2012;67(9):1111–7. Schmitt J, Langan S, Deckert S, Svensson A, von Kobyetzki L, Thomas K, et al. Assessment of clinical signs of atopic dermatitis: a systematic review and recommendation. J Allergy Clin Immunol 2013;132(6):1337–47. Stalder JF, Taieb A. Severity scoring of atopic dermatitis: the SCORAD index. Consensus Report of the European Task Force on Atopic Dermatitis. Dermatology 1993;186(1): 23–31. Shaw TE, Currie GP, Koudelka CW, Simpson EL. Eczema prevalence in the United States: data from the 2003 National Survey of Children's Health. J Invest Dermatol 2011; 131(1):67–73. Simpson CR, Newton J, Hippisley-Cox J, Sheikh A. Trends in the epidemiology and prescribing of medication for eczema in England. J R Soc Med 2009;102(3):108–17. Sugiura H, Umemoto N, Dehuchi H, Murata Y, Tanaka K, Sawai T, et al. Prevalence of childhood and adolescent atopic dermatitis in a Japanese population: comparison with the disease frequency examined 20 years ago. Acta Derm Venereol 1998; 78(4):293–4. Torrelo A. Atopic dermatitis in different skin types. What is to know? J Eur Acad Dermatol Venereol 2014;28(Suppl. 3):2–4. Vachiramon V, Tey HL, Thompson AE, Yosipovitch G. Atopic dermatitis in African American children: addressing unmet needs of a common disease. Pediatr Dermatol 2012;29(4):395–402. Wolkerstorfer A, de Waard van der Spek FB, Glazenberg EJ, Mulder PG, Oranje AP. Scoring the severity of atopic dermatitis: three item severity score as a rough system for daily practice and as a pre-screening tool for studies. Acta Derm Venereol 1999;79(5): 356–9. Zar HJ, Ehrlich RI, Workman L, Weinberg EG. The changing prevalence of asthma, allergic rhinitis and atopic eczema in African adolescents from 1995 to 2002. Pediatr Allergy Immunol 2007;18(7):560–5.