Published in: BMC Medical Research Methodology 1/2019

Open Access 01-12-2019 | Research article

Explaining differential item functioning focusing on the crucial role of external information – an example from the measurement of adolescent mental health

Author: Curt Hagquist


Abstract

Background

An overarching objective in research comparing different sample groups is to ensure that the reported differences in outcomes are not affected by differences between groups in the functioning of the measurement instruments, i.e. the items have to work in the same way for the different sample groups to be compared. Lack of invariance across sample groups is commonly called Differential Item Functioning (DIF).
The DIF of an item can be accounted for by resolving (splitting) the item into group-specific items rather than deleting it. Resolving improves fit, retains the reliability and content provided by the item, and compensates for the DIF when estimating person parameters on the scale of the instrument. However, it destroys the invariance of the item’s parameter value across the groups. Whether a DIF item should be resolved depends on whether the source of the DIF is relevant or irrelevant to the content of the variable. The present paper shows how external information can be used to investigate whether the gender DIF found in the item “Stomach ache”, in a psychosomatic symptoms scale used among adolescents, may reflect abdominal pain caused by a biological factor: the girls’ menstrual periods.
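As an illustration of what resolving an item means at the data level, the following minimal sketch splits one column of a response matrix into one column per group, leaving the responses of the other groups missing. It is not the paper's procedure or software; the function name `resolve_item`, the toy responses and the group labels are all hypothetical.

```python
def resolve_item(responses, item, groups):
    """Split column `item` into one group-specific column per group.

    Each person keeps a response only in the column for their own group;
    the columns for the other groups are set to None (structurally missing).
    Hypothetical helper for illustration only.
    """
    labels = sorted(set(groups))
    resolved = []
    for row, group in zip(responses, groups):
        # All items except the one being resolved are kept unchanged.
        kept = [x for k, x in enumerate(row) if k != item]
        # One new column per group, filled only for members of that group.
        split = [row[item] if group == label else None for label in labels]
        resolved.append(kept + split)
    return resolved, labels


# Toy data (invented): four persons, three items, item 1 to be resolved.
responses = [[0, 2, 1], [1, 3, 0], [2, 4, 1], [0, 1, 3]]
gender = ["girl", "boy", "girl", "boy"]

resolved, labels = resolve_item(responses, item=1, groups=gender)
for row in resolved:
    print(row)
```

The resolved matrix has one more column than the original. The two new columns would then be estimated as separate items, which is why the item's parameter value is no longer invariant across the groups.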

Methods

Swedish data from the international Health Behaviour in School-aged Children study (HBSC) collected in 2005/06, 2009/10 and 2013/14 were used, comprising a total of 18,983 students in grades 5, 7 and 9. A composite measure of eight items of psychosomatic problems was analysed for DIF with respect to gender and menstrual periods using the Rasch model.
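For orientation, the idea of DIF within the Rasch model can be sketched in its simplest, dichotomous form. The notation below is an assumption made for illustration and is not taken from the paper, whose items may have more than two ordered response categories.

```latex
% Dichotomous Rasch model: person n with location \beta_n,
% item i with location \delta_i (symbols assumed for illustration).
\[
  P(X_{ni} = 1) = \frac{\exp(\beta_n - \delta_i)}{1 + \exp(\beta_n - \delta_i)}
\]
% Invariance requires the same \delta_i in every group g.
% Uniform DIF for item i means the item location depends on the group:
\[
  P(X_{ni} = 1 \mid g) = \frac{\exp(\beta_n - \delta_{ig})}{1 + \exp(\beta_n - \delta_{ig})},
  \qquad \delta_{i,\mathrm{girls}} \neq \delta_{i,\mathrm{boys}}
\]
```

Under this sketch, resolving the item corresponds to estimating the group-specific locations separately instead of a single common location.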

Results

The results support the hypothesis that the source of the gender DIF for the item “Stomach ache” is a gender-specific biological factor. In that case, the DIF should be resolved if the psychosomatic measure is not intended to tap information about abdominal pain caused by a gender-specific biological factor. In contrast, if the measure is intended to tap such information, the DIF should not be resolved.

Conclusions

The conceptualisation of the measure governs whether the item showing DIF should be resolved or not.
Metadata
Title
Explaining differential item functioning focusing on the crucial role of external information – an example from the measurement of adolescent mental health
Author
Curt Hagquist
Publication date
01-12-2019
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2019
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/s12874-019-0828-3
