Published in: BMC Medical Research Methodology 1/2011

Open Access 01-12-2011 | Research article

Imputation by the mean score should be avoided when validating a Patient Reported Outcomes questionnaire by a Rasch model in presence of informative missing data

Authors: Jean-Benoit Hardouin, Ronán Conroy, Véronique Sébille


Abstract

Background

Clinical scales built from patients' responses to a set of items (Patient Reported Outcomes, PRO) are increasingly validated with models based on Item Response Theory, and more specifically with a Rasch model. Missing data are frequent in validation samples. The aim of this paper is to compare sixteen methods for handling missing data (mainly based on simple imputation) in the context of the psychometric validation of a PRO by a Rasch model. The methods are compared on the main indexes used for validation by a Rasch model.

Methods

A simulation study was performed covering several scenarios, varying in particular whether the missing values were informative or not and the rate of missing data.
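To make the two kinds of missingness concrete, here is a minimal sketch of how such a scenario could be simulated. It is not the authors' simulation design: the function names, the standard normal abilities, the item difficulties, and the rule used for informative missingness (only unfavourable responses can go missing) are all assumptions chosen for illustration.

```python
import math
import random


def simulate_rasch(n_persons, difficulties, rng):
    """Draw 0/1 responses with P(X=1) = 1 / (1 + exp(-(theta - delta)))."""
    data = []
    for _ in range(n_persons):
        theta = rng.gauss(0.0, 1.0)  # person ability, assumed standard normal
        row = []
        for delta in difficulties:
            p = 1.0 / (1.0 + math.exp(-(theta - delta)))
            row.append(1 if rng.random() < p else 0)
        data.append(row)
    return data


def add_missing(data, rate, informative, rng):
    """Return a copy of data with some responses replaced by None.

    Non-informative: every response is deleted with probability `rate`.
    Informative (illustrative assumption): the deletion probability depends
    on the response itself; only unfavourable responses (0) can be deleted,
    each with probability min(1, 2 * rate).
    """
    masked = []
    for row in data:
        new_row = []
        for x in row:
            if informative:
                p_missing = min(1.0, 2.0 * rate) * (1 - x)
            else:
                p_missing = rate
            new_row.append(None if rng.random() < p_missing else x)
        masked.append(new_row)
    return masked


rng = random.Random(2011)
complete = simulate_rasch(500, [-1.0, -0.5, 0.0, 0.5, 1.0], rng)
masked_random = add_missing(complete, 0.2, informative=False, rng=rng)
masked_informative = add_missing(complete, 0.2, informative=True, rng=rng)
```

Under the informative rule the observed data over-represent favourable responses, which is the kind of situation in which an imputation method can distort the validation indexes.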

Results

Several imputation methods bias the psychometric indexes (generally, the imputation methods artificially improve the psychometric qualities of the scale). This is the case in particular for the method based on the Personal Mean Score (PMS), which is the most commonly used imputation method in practice.

Conclusions

Several imputation methods should be avoided, in particular PMS imputation. More generally, it is important to use an imputation method that considers both the ability of the patient (measured, for example, by his/her score) and the difficulty of the item (measured, for example, by its rate of favourable responses). Another recommendation is to always consider adding a random process to the imputation method, because such a process reduces the bias. Lastly, analysis performed without imputing the missing data (available case analysis) is an interesting alternative to simple imputation in this context.
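The contrast drawn in these recommendations can be sketched in a few lines of code. The sketch below is an illustration only, not one of the sixteen methods as the authors implemented them. It assumes dichotomous items coded 0/1 with missing values stored as `None`. The rounding rule for PMS and the "person mean + item mean - overall mean" probability are assumptions, and all function names are hypothetical.

```python
import random


def person_mean(row):
    """Mean of a person's observed responses (None if nothing was observed)."""
    obs = [x for x in row if x is not None]
    return sum(obs) / len(obs) if obs else None


def item_mean(data, j):
    """Rate of favourable responses to item j among observed responses."""
    obs = [row[j] for row in data if row[j] is not None]
    return sum(obs) / len(obs) if obs else None


def impute_pms(data):
    """Deterministic PMS-style imputation: fill with the rounded person mean.

    Uses only the person's level and ignores item difficulty.
    """
    out = []
    for row in data:
        pm = person_mean(row)
        fill = None if pm is None else (1 if pm >= 0.5 else 0)
        out.append([fill if x is None else x for x in row])
    return out


def impute_person_item_random(data, rng):
    """Stochastic imputation using both person level and item difficulty.

    Each missing response is drawn as a Bernoulli variable with probability
    person mean + item mean - overall mean, clipped to [0, 1].
    Cells with no usable information are left as None.
    """
    all_obs = [x for row in data for x in row if x is not None]
    overall = sum(all_obs) / len(all_obs) if all_obs else None
    n_items = len(data[0]) if data else 0
    item_means = [item_mean(data, j) for j in range(n_items)]
    out = []
    for row in data:
        pm = person_mean(row)
        new_row = []
        for j, x in enumerate(row):
            if x is not None or pm is None or item_means[j] is None:
                new_row.append(x)
                continue
            p = min(1.0, max(0.0, pm + item_means[j] - overall))
            new_row.append(1 if rng.random() < p else 0)  # random process
        out.append(new_row)
    return out


responses = [
    [1, 1, None, 0],
    [1, None, 0, 0],
    [1, 1, 1, None],
    [0, None, 0, 0],
]
pms_filled = impute_pms(responses)
random_filled = impute_person_item_random(responses, random.Random(0))
```

In the PMS version every missing response of a given person receives the same value whatever the item, which tends to make response patterns look more consistent than they are. The second version lets the imputed value vary with the item and adds random variation.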
Metadata
Title
Imputation by the mean score should be avoided when validating a Patient Reported Outcomes questionnaire by a Rasch model in presence of informative missing data
Authors
Jean-Benoit Hardouin
Ronán Conroy
Véronique Sébille
Publication date
01-12-2011
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2011
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/1471-2288-11-105
