Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2008

Open Access 01-12-2008 | Research article

Data management for prospective research studies using SAS®software

Authors: Robin L Kruse, David R Mehr

Published in: BMC Medical Research Methodology | Issue 1/2008

Login to get access

Abstract

Background

Maintaining data quality and integrity is important for research studies involving prospective data collection. Data must be entered, erroneous or missing data must be identified and corrected if possible, and an audit trail created.

Methods

Using as an example a large prospective study, the Missouri Lower Respiratory Infection (LRI) Project, we present an approach to data management predominantly using SAS software. The Missouri LRI Project was a prospective cohort study of nursing home residents who developed an LRI. Subjects were enrolled, data collected, and follow-ups occurred for over three years. Data were collected on twenty different forms. Forms were inspected visually and sent off-site for data entry. SAS software was used to read the entered data files, check for potential errors, apply corrections to data sets, and combine batches into analytic data sets. The data management procedures are described.

Results

Study data collection resulted in over 20,000 completed forms. Data management was successful, resulting in clean, internally consistent data sets for analysis. The amount of time required for data management was substantially underestimated.

Conclusion

Data management for prospective studies should be planned well in advance of data collection. An ongoing process with data entered and checked as they become available allows timely recovery of errors and missing data.
Appendix
Available only for authorised users
Literature
1.
go back to reference Chilvers CE, Fayers PM, Freedman LS, Greenwood RM, Machin D, Palmer N, Westlake AJ: Improving the quality of data in randomized clinical trials: the COMPACT computer package. COMPACT Steering Committee. Stat Med. 1988, 7 (11): 1165-1170. 10.1002/sim.4780071109.CrossRefPubMed Chilvers CE, Fayers PM, Freedman LS, Greenwood RM, Machin D, Palmer N, Westlake AJ: Improving the quality of data in randomized clinical trials: the COMPACT computer package. COMPACT Steering Committee. Stat Med. 1988, 7 (11): 1165-1170. 10.1002/sim.4780071109.CrossRefPubMed
2.
go back to reference Karrison T: Data editing in a clinical trial. Control Clin Trials. 1981, 2 (1): 15-29. 10.1016/0197-2456(81)90055-6.CrossRefPubMed Karrison T: Data editing in a clinical trial. Control Clin Trials. 1981, 2 (1): 15-29. 10.1016/0197-2456(81)90055-6.CrossRefPubMed
3.
go back to reference Tai BC, Seldrup J: A review of software for data management, design and analysis of clinical trials. Ann Acad Med Singapore. 2000, 29 (5): 576-581.PubMed Tai BC, Seldrup J: A review of software for data management, design and analysis of clinical trials. Ann Acad Med Singapore. 2000, 29 (5): 576-581.PubMed
4.
go back to reference DuChene AG, Hultgren DH, Neaton JD, Grambsch PV, Broste SK, Aus BM, Rasmussen WL: Forms control and error detection procedures used at the Coordinating Center of the Multiple Risk Factor Intervention Trial (MRFIT). Control Clin Trials. 1986, 7 (3 Suppl): 34S-45S. 10.1016/0197-2456(86)90158-3.CrossRefPubMed DuChene AG, Hultgren DH, Neaton JD, Grambsch PV, Broste SK, Aus BM, Rasmussen WL: Forms control and error detection procedures used at the Coordinating Center of the Multiple Risk Factor Intervention Trial (MRFIT). Control Clin Trials. 1986, 7 (3 Suppl): 34S-45S. 10.1016/0197-2456(86)90158-3.CrossRefPubMed
5.
go back to reference Grady D, Newman TB, Vittinghoff E: Data management. Designing clinical research: an epidemiologic approach. Edited by: Hulley SB. 2001, Philadelphia, PA: Williams & Wilkins, 247-257. Grady D, Newman TB, Vittinghoff E: Data management. Designing clinical research: an epidemiologic approach. Edited by: Hulley SB. 2001, Philadelphia, PA: Williams & Wilkins, 247-257.
6.
go back to reference van Es GA: Research practice and data management. Neth J Med. 1996, 48 (1): 38-44. 10.1016/0300-2977(95)00036-4.CrossRefPubMed van Es GA: Research practice and data management. Neth J Med. 1996, 48 (1): 38-44. 10.1016/0300-2977(95)00036-4.CrossRefPubMed
7.
go back to reference Hawkins BS, Singer SW: Design, development, and implementation of a data processing system for multiple controlled trials and epidemiologic studies. Control Clin Trials. 1986, 7 (2): 89-117. 10.1016/0197-2456(86)90027-9.CrossRefPubMed Hawkins BS, Singer SW: Design, development, and implementation of a data processing system for multiple controlled trials and epidemiologic studies. Control Clin Trials. 1986, 7 (2): 89-117. 10.1016/0197-2456(86)90027-9.CrossRefPubMed
8.
go back to reference Gassman JJ, Owen WW, Kuntz TE, Martin JP, Amoroso WP: Data quality assurance, monitoring, and reporting. Control Clin Trials. 1995, 16 (2 Suppl): 104S-136S. 10.1016/0197-2456(94)00095-K.CrossRefPubMed Gassman JJ, Owen WW, Kuntz TE, Martin JP, Amoroso WP: Data quality assurance, monitoring, and reporting. Control Clin Trials. 1995, 16 (2 Suppl): 104S-136S. 10.1016/0197-2456(94)00095-K.CrossRefPubMed
9.
go back to reference Hosking JD, Newhouse MM, Bagniewska A, Hawkins BS: Data collection and transcription. Control Clin Trials. 1995, 16 (2 Suppl): 66S-103S. 10.1016/0197-2456(94)00094-J.CrossRefPubMed Hosking JD, Newhouse MM, Bagniewska A, Hawkins BS: Data collection and transcription. Control Clin Trials. 1995, 16 (2 Suppl): 66S-103S. 10.1016/0197-2456(94)00094-J.CrossRefPubMed
10.
go back to reference Mehr DR, Binder EF, Kruse RL, Zweig SC, Madsen R, Popejoy L, D'Agostino RB: Predicting mortality from lower respiratory infection in nursing home residents: the Missouri LRI Study. JAMA. 2001, 286 (19): 2427-2436. 10.1001/jama.286.19.2427.CrossRefPubMed Mehr DR, Binder EF, Kruse RL, Zweig SC, Madsen R, Popejoy L, D'Agostino RB: Predicting mortality from lower respiratory infection in nursing home residents: the Missouri LRI Study. JAMA. 2001, 286 (19): 2427-2436. 10.1001/jama.286.19.2427.CrossRefPubMed
11.
go back to reference Mehr DR, Binder EF, Kruse RL, Zweig SC, Madsen R, D'Agostino RB: Clinical findings associated with radiographic pneumonia in nursing home residents. J Fam Pract. 2001, 50 (11): 931-937.PubMed Mehr DR, Binder EF, Kruse RL, Zweig SC, Madsen R, D'Agostino RB: Clinical findings associated with radiographic pneumonia in nursing home residents. J Fam Pract. 2001, 50 (11): 931-937.PubMed
12.
go back to reference The SAS System for Windows. 1996, Cary, NC: SAS Institute Inc. The SAS System for Windows. 1996, Cary, NC: SAS Institute Inc.
13.
go back to reference Aday LA, Cornelius LJ: Designing and conducting health surveys: a comprehensive guide. 2006, San Francisco: Jossey-Bass, 3 Aday LA, Cornelius LJ: Designing and conducting health surveys: a comprehensive guide. 2006, San Francisco: Jossey-Bass, 3
14.
go back to reference Babbs CF, Tacker MM: Writing a scientific paper prior to the research. Am J Emerg Med. 1985, 3 (4): 360-363. 10.1016/0735-6757(85)90065-8.CrossRefPubMed Babbs CF, Tacker MM: Writing a scientific paper prior to the research. Am J Emerg Med. 1985, 3 (4): 360-363. 10.1016/0735-6757(85)90065-8.CrossRefPubMed
16.
go back to reference Pinol A, Bergel E, Chaisiri K, Diaz E, Gandeh M: Managing data for a randomised controlled clinical trial: experience from the WHO Antenatal Care Trial. WHO Antenatal Care Trial Research Group. Paediatr Perinat Epidemiol. 1998, 12 (Suppl 2): 142-155. 10.1046/j.1365-3016.12.s2.2.x.CrossRefPubMed Pinol A, Bergel E, Chaisiri K, Diaz E, Gandeh M: Managing data for a randomised controlled clinical trial: experience from the WHO Antenatal Care Trial. WHO Antenatal Care Trial Research Group. Paediatr Perinat Epidemiol. 1998, 12 (Suppl 2): 142-155. 10.1046/j.1365-3016.12.s2.2.x.CrossRefPubMed
17.
go back to reference Pogash RM, Boehmer SJ, Forand PE, Dyer AM, Kunselman SJ: Data management procedures in the Asthma Clinical Research Network. Control Clin Trials. 2001, 22 (6 Suppl): 168S-180S. 10.1016/S0197-2456(01)00170-2.CrossRefPubMed Pogash RM, Boehmer SJ, Forand PE, Dyer AM, Kunselman SJ: Data management procedures in the Asthma Clinical Research Network. Control Clin Trials. 2001, 22 (6 Suppl): 168S-180S. 10.1016/S0197-2456(01)00170-2.CrossRefPubMed
18.
go back to reference Nyiendo J, Attwood M, Lloyd C, Ganger B, Haas M: Data management in practice-based research. J Manipulative Physiol Ther. 2002, 25 (1): 49-57. 10.1067/mmt.2002.120417.CrossRefPubMed Nyiendo J, Attwood M, Lloyd C, Ganger B, Haas M: Data management in practice-based research. J Manipulative Physiol Ther. 2002, 25 (1): 49-57. 10.1067/mmt.2002.120417.CrossRefPubMed
19.
go back to reference Cody RP: Cody's data cleaning techniques using SAS software. 1999, Cary, NC: SAS Institute Inc. Cody RP: Cody's data cleaning techniques using SAS software. 1999, Cary, NC: SAS Institute Inc.
20.
go back to reference Hogg RJ: Trials and tribulations of multicenter studies. Lessons learned from the experiences of the Southwest Pediatric Nephrology Study Group (SPNSG). Pediatr Nephrol. 1991, 5 (3): 348-351. 10.1007/BF00867501.CrossRefPubMed Hogg RJ: Trials and tribulations of multicenter studies. Lessons learned from the experiences of the Southwest Pediatric Nephrology Study Group (SPNSG). Pediatr Nephrol. 1991, 5 (3): 348-351. 10.1007/BF00867501.CrossRefPubMed
21.
go back to reference Swan SH, Brazil C, Drobnis EZ, Liu F, Kruse RL, Hatch M, Redmon JB, Wang C, Overstreet JW, The Study for Future Families Research Group: Geographic differences in semen quality of fertile U.S. males. Environ Health Perspect. 2003, 111 (4): 414-420.CrossRefPubMedPubMedCentral Swan SH, Brazil C, Drobnis EZ, Liu F, Kruse RL, Hatch M, Redmon JB, Wang C, Overstreet JW, The Study for Future Families Research Group: Geographic differences in semen quality of fertile U.S. males. Environ Health Perspect. 2003, 111 (4): 414-420.CrossRefPubMedPubMedCentral
22.
go back to reference Vinson DC, Maclure M, Reidinger C, Smith GS: A population-based case-crossover and case-control study of alcohol and the risk of injury. J Stud Alcohol. 2003, 64 (3): 358-366.CrossRefPubMed Vinson DC, Maclure M, Reidinger C, Smith GS: A population-based case-crossover and case-control study of alcohol and the risk of injury. J Stud Alcohol. 2003, 64 (3): 358-366.CrossRefPubMed
23.
go back to reference Galliher JM, Stewart TV, Pathak PK, Werner JJ, Dickinson LM, Hickner JM: Data collection outcomes comparing paper forms with PDA forms in an office-based patient survey. Ann Fam Med. 2008, 6 (2): 154-160. 10.1370/afm.762.CrossRefPubMedPubMedCentral Galliher JM, Stewart TV, Pathak PK, Werner JJ, Dickinson LM, Hickner JM: Data collection outcomes comparing paper forms with PDA forms in an office-based patient survey. Ann Fam Med. 2008, 6 (2): 154-160. 10.1370/afm.762.CrossRefPubMedPubMedCentral
24.
go back to reference McFadden ET, LoPresti F, Bailey LR, Clarke E, Wilkins PC: Approaches to data management. Control Clin Trials. 1995, 16 (2 Suppl): 30S-65S. 10.1016/0197-2456(94)00093-I.CrossRefPubMed McFadden ET, LoPresti F, Bailey LR, Clarke E, Wilkins PC: Approaches to data management. Control Clin Trials. 1995, 16 (2 Suppl): 30S-65S. 10.1016/0197-2456(94)00093-I.CrossRefPubMed
25.
go back to reference Braithwaite WR: Data field standards and the Health Insurance Portability and Accountability Act. Stat Med. 2001, 20 (9–10): 1323-1330. 10.1002/sim.669.CrossRefPubMed Braithwaite WR: Data field standards and the Health Insurance Portability and Accountability Act. Stat Med. 2001, 20 (9–10): 1323-1330. 10.1002/sim.669.CrossRefPubMed
26.
go back to reference Electronic Records; Electronic Signatures: Final Rule. 62 Fed. Reg. 13,429. 1997. Statute. Electronic Records; Electronic Signatures: Final Rule. 62 Fed. Reg. 13,429. 1997. Statute.
Metadata
Title
Data management for prospective research studies using SAS®software
Authors
Robin L Kruse
David R Mehr
Publication date
01-12-2008
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2008
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/1471-2288-8-61

Other articles of this Issue 1/2008

BMC Medical Research Methodology 1/2008 Go to the issue