Skip to main content
Top
Published in: Systematic Reviews 1/2024

Open Access 01-12-2024 | Research

Addressing the challenges of reconstructing systematic reviews datasets: a case study and a noisy label filter procedure

Authors: Rutger Neeleman, Cathalijn H. C. Leenaars, Matthijs Oud, Felix Weijdema, Rens van de Schoot

Published in: Systematic Reviews | Issue 1/2024

Login to get access

Abstract

Systematic reviews and meta-analyses typically require significant time and effort. Machine learning models have the potential to enhance screening efficiency in these processes. To effectively evaluate such models, fully labeled datasets—detailing all records screened by humans and their labeling decisions—are imperative. This paper presents the creation of a comprehensive dataset for a systematic review of treatments for Borderline Personality Disorder, as reported by Oud et al. (2018) for running a simulation study. The authors adhered to the PRISMA guidelines and published both the search query and the list of included records, but the complete dataset with all labels was not disclosed. We replicated their search and, facing the absence of initial screening data, introduced a Noisy Label Filter (NLF) procedure using active learning to validate noisy labels. Following the NLF application, no further relevant records were found. A simulation study employing the reconstructed dataset demonstrated that active learning could reduce screening time by 82.30% compared to random reading. The paper discusses potential causes for discrepancies, provides recommendations, and introduces a decision tree to assist in reconstructing datasets for the purpose of running simulation studies.
Appendix
Available only for authorised users
Literature
1.
go back to reference Akhter, S., Pauyo, T., & Khan, M. What is the difference between a systematic review and a meta-analysis? In V. Musahl, J. Karlsson, M. T. Hirschmann, O. R. Ayeni, R. G. Marx, J. L. Koh, & N. Nakamura (Eds.), Basic Methods Handbook for Clinical Orthopaedic Research: A Practical Guide and Case Based Research Approach (pp. 331–342). 2019; Springer. https://doi.org/10.1007/978-3-662-58254-1_37 Akhter, S., Pauyo, T., & Khan, M.  What is the difference between a systematic review and a meta-analysis? In V. Musahl, J. Karlsson, M. T. Hirschmann, O. R. Ayeni, R. G. Marx, J. L. Koh, & N. Nakamura (Eds.), Basic Methods Handbook for Clinical Orthopaedic Research: A Practical Guide and Case Based Research Approach (pp. 331–342). 2019; Springer. https://​doi.​org/​10.​1007/​978-3-662-58254-1_​37
3.
go back to reference Bateman, A. W., & Fonagy, P. Psychotherapy for severe personality disorder. Article did not do justice to available research data. BMJ (Clinical Research Ed.). 1999;319(7211):709–710; author reply 710–711. Bateman, A. W., & Fonagy, P.  Psychotherapy for severe personality disorder. Article did not do justice to available research data. BMJ (Clinical Research Ed.). 1999;319(7211):709–710; author reply 710–711.
5.
go back to reference Bloodgood, M., & Vijay-Shanker, K. A method for stopping active learning based on stabilizing predictions and the need for user-adjustable stopping. ArXiv Preprint ArXiv:1409.5165. 2014. Bloodgood, M., & Vijay-Shanker, K. A method for stopping active learning based on stabilizing predictions and the need for user-adjustable stopping. ArXiv Preprint ArXiv:1409.5165.  2014.
13.
go back to reference Embase. Emtree—Embase. embase.com. 2023. Embase. Emtree—Embase. embase.com. 2023.
14.
go back to reference Ferdinands G, Schram R, de Bruin J, Bagheri A, Oberski DL, Tummers L, Teijema JJ, van de Schoot R. Performance of active learning models for screening prioritization in systematic reviews: A simulation study into the Average Time to Discover relevant records. Syst Rev. 2023;12(1):100.CrossRefPubMedPubMedCentral Ferdinands G, Schram R, de Bruin J, Bagheri A, Oberski DL, Tummers L, Teijema JJ, van de Schoot R. Performance of active learning models for screening prioritization in systematic reviews: A simulation study into the Average Time to Discover relevant records. Syst Rev. 2023;12(1):100.CrossRefPubMedPubMedCentral
26.
go back to reference Nadort M, Arntz A, Smit JH, Giesen-Bloo J, Eikelenboom M, Spinhoven P, van Asselt T, Wensing M, van Dyck R. Implementation of outpatient schema therapy for borderline personality disorder with versus without crisis support by the therapist outside office hours: a randomized trial. Behav Res Ther. 2009;47(11):961–73. https://doi.org/10.1016/j.brat.2009.07.013.CrossRefPubMed Nadort M, Arntz A, Smit JH, Giesen-Bloo J, Eikelenboom M, Spinhoven P, van Asselt T, Wensing M, van Dyck R. Implementation of outpatient schema therapy for borderline personality disorder with versus without crisis support by the therapist outside office hours: a randomized trial. Behav Res Ther. 2009;47(11):961–73. https://​doi.​org/​10.​1016/​j.​brat.​2009.​07.​013.CrossRefPubMed
27.
go back to reference Neeleman, R., Oud, M., Weijdema, F., Leenaars, C., & Schoot, R. van de. Scripts, data and output to reproduce ‘Addressing the Challenges of Reconstructing Systematic Reviews Datasets: A Case Study and a Noisy Label Filter Procedure’. (2022). https://doi.org/10.17605/OSF.IO/PJR97 Neeleman, R., Oud, M., Weijdema, F., Leenaars, C., & Schoot, R. van de. Scripts, data and output to reproduce ‘Addressing the Challenges of Reconstructing Systematic Reviews Datasets: A Case Study and a Noisy Label Filter Procedure’. (2022).  https://​doi.​org/​10.​17605/​OSF.​IO/​PJR97
30.
go back to reference Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., … Moher, D. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ. 2021: n71. https://doi.org/10.1136/bmj.n71 Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., … Moher, D. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ. 2021: n71. https://​doi.​org/​10.​1136/​bmj.​n71
34.
go back to reference Teijema, J., Van de Schoot, R., Ferdinands, G., Lombaers, P., & De Bruin, J. ASReview Makita: A workflow generator for simulation studies using the command line interface of ASReview LAB (v0.7.1) [Computer software]. Zenodo. 2023. https://doi.org/10.5281/zenodo.8052176 Teijema, J., Van de Schoot, R., Ferdinands, G., Lombaers, P., & De Bruin, J. ASReview Makita: A workflow generator for simulation studies using the command line interface of ASReview LAB (v0.7.1) [Computer software]. Zenodo.  2023. https://​doi.​org/​10.​5281/​zenodo.​8052176
35.
go back to reference Van den Bosch LMC. Efficacy of dialectical behaviour therapy in the treatment of female borderline patients with and without substance abuse problems: Result of a Dutch study. Dialectische gedragstherapie bij Nederlandse vrouwen met een borderline persoonlijkheidsstoornis, met en zonder verslavingsproblemen. 2005;47(3):127–37. Van den Bosch LMC. Efficacy of dialectical behaviour therapy in the treatment of female borderline patients with and without substance abuse problems: Result of a Dutch study. Dialectische gedragstherapie bij Nederlandse vrouwen met een borderline persoonlijkheidsstoornis, met en zonder verslavingsproblemen. 2005;47(3):127–37.
39.
go back to reference Vlachos A. A stopping criterion for active learning. Comput Speech Lang. 2008;22(3):295–312.CrossRef Vlachos A. A stopping criterion for active learning. Comput Speech Lang. 2008;22(3):295–312.CrossRef
40.
go back to reference Yang, E., Lewis, D. D., & Frieder, O. Heuristic stopping rules for technology-assisted review. Proceedings of the 21st ACM Symposium on Document Engineering 2021:1–10. Yang, E., Lewis, D. D., & Frieder, O. Heuristic stopping rules for technology-assisted review. Proceedings of the 21st ACM Symposium on Document Engineering  2021:1–10.
Metadata
Title
Addressing the challenges of reconstructing systematic reviews datasets: a case study and a noisy label filter procedure
Authors
Rutger Neeleman
Cathalijn H. C. Leenaars
Matthijs Oud
Felix Weijdema
Rens van de Schoot
Publication date
01-12-2024
Publisher
BioMed Central
Published in
Systematic Reviews / Issue 1/2024
Electronic ISSN: 2046-4053
DOI
https://doi.org/10.1186/s13643-024-02472-w

Other articles of this Issue 1/2024

Systematic Reviews 1/2024 Go to the issue