Published in: Neuroradiology 4/2022

01-04-2022 | Magnetic Resonance Imaging | Functional Neuroradiology

Different FreeSurfer versions might generate different statistical outcomes in case–control comparison studies

Authors: Pavel Filip, Petr Bednarik, Lynn E. Eberly, Amir Moheet, Alena Svatkova, Heidi Grohn, Anjali F. Kumar, Elizabeth R. Seaquist, Silvia Mangia


Abstract

Purpose

Neuroimaging pipelines have long been known to generate mildly differing results depending on various factors, including software version. While these differences are generally considered acceptable and within the margin of reasonable error, little is known about their effect in common research scenarios such as inter-group comparisons between healthy controls and various pathological conditions. The aim of the presented study was to explore the differences in inferences and statistical significance in a model situation comparing volumetric parameters between healthy controls and type 1 diabetes patients using various FreeSurfer versions.

Methods

T1- and T2-weighted structural scans of healthy controls and type 1 diabetes patients were processed with FreeSurfer 5.3, FreeSurfer 5.3 HCP, FreeSurfer 6.0 and FreeSurfer 7.1, followed by inter-group statistical comparison using outputs of individual FreeSurfer versions.

Results

Worryingly, FreeSurfer 5.3 detected both cortical and subcortical volume differences among the preselected regions of interest, whereas newer versions such as FreeSurfer 5.3 HCP and FreeSurfer 6.0 reported only subcortical differences of lower magnitude, and FreeSurfer 7.1 failed to find any statistically significant inter-group differences.

Conclusion

Since group averages of individual FreeSurfer versions closely matched, in keeping with previous literature, the main origin of this disparity seemed to lie in substantially higher within-group variability in the model pathological condition. Ergo, until validation in common research scenarios such as case–control comparison studies is incorporated into the development process of new software suites, confirmatory analyses utilising similar software based on analogous, but not fully equivalent, principles might be considered as a supplement to careful quality control.
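The reasoning in the conclusion can be illustrated with a minimal sketch: if the difference between group means stays the same but the spread within one group grows, the test statistic shrinks and a previously significant result may disappear. The example uses Welch's t statistic computed from summary statistics, which is an assumption made for illustration only; the abstract does not state which statistical model the authors used. All numbers (means, standard deviations, group sizes) and the labels `version_a` and `version_b` are hypothetical and are not taken from the study.

```python
import math


def welch_t(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Return Welch's t statistic and Welch-Satterthwaite degrees of freedom
    for two independent groups described by mean, standard deviation and size."""
    var_a = sd_a ** 2 / n_a  # squared standard error of group A's mean
    var_b = sd_b ** 2 / n_b  # squared standard error of group B's mean
    t = (mean_a - mean_b) / math.sqrt(var_a + var_b)
    df = (var_a + var_b) ** 2 / (
        var_a ** 2 / (n_a - 1) + var_b ** 2 / (n_b - 1)
    )
    return t, df


# Hypothetical regional volumes in mm^3: identical group means in both
# scenarios, but the patient group is twice as variable in the second one.
version_a = welch_t(4200.0, 300.0, 30, 4000.0, 300.0, 30)
version_b = welch_t(4200.0, 300.0, 30, 4000.0, 600.0, 30)

for label, (t, df) in (("version_a", version_a), ("version_b", version_b)):
    print(f"{label}: t = {t:.2f}, df = {df:.1f}")
```

With these made-up inputs the first scenario gives a t statistic of about 2.58 and the second about 1.63. The two-sided 5% critical value is roughly 2.0 at these degrees of freedom, so only the first scenario would be called significant even though the mean difference is the same 200 mm^3 in both. The sketch ignores covariates and multiple-comparison correction, both of which a real case–control analysis would need to consider.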
Metadata
Title
Different FreeSurfer versions might generate different statistical outcomes in case–control comparison studies
Authors
Pavel Filip
Petr Bednarik
Lynn E. Eberly
Amir Moheet
Alena Svatkova
Heidi Grohn
Anjali F. Kumar
Elizabeth R. Seaquist
Silvia Mangia
Publication date
01-04-2022
Publisher
Springer Berlin Heidelberg
Published in
Neuroradiology / Issue 4/2022
Print ISSN: 0028-3940
Electronic ISSN: 1432-1920
DOI
https://doi.org/10.1007/s00234-021-02862-0
