01 Siemens MAGNETOM Trio by Image Editor

2016's top 100 journal articles Brain power

Validity of countless functional magnetic resonance imaging (fMRI) studies in doubt [2016’s top 100 journal articles]

Bruce Boyes5 Jan 2017

1,552 2 minutes read

This is part 6 of a miniseries reviewing selected papers from the top 100 most-discussed journal articles of 2016.

Functional magnetic resonance imaging (fMRI) has become a popular tool for understanding the human brain, with PubMed listing some 40,000 published papers. However, despite this popularity, the statistical methods used with fMRI have rarely been validated using real data.

International neuroimaging data sharing initiatives have now made it possible to evaluate statistical methods with real data, and a number of studies have started to do this. In one of these studies, a group of researchers analysed 1,484 resting-state fMRI datasets with one specific software package. They found a high degree of false positives, up to 70% compared with the expected 5%. However, it was not clear if this finding would propagate to group studies, or what the statistical validity of other fMRI software packages would be.

The same group of researchers sought to address these limitations in a new study¹ that conducted an evaluation of group inference with the three most common fMRI software packages. The paper reporting the study is article #62 of the top 100 most-discussed journal articles of 2016.

In the new study, 2,880,000 random group analyses were performed to compute the false-positive rates of the three fMRI software packages. The analyses comprised 1,000 one-sided random analyses repeated for 192 parameter combinations, three thresholding approaches, and the five tools in the three software packages.

The researchers found that the three software packages can produce “P values that are erroneous, being spuriously low and inflating statistical significance.” They state that:

This calls into question the validity of countless published fMRI studies based on parametric clusterwise inference. It is important to stress that we have focused on inferences corrected for multiple comparisons in each group analysis, yet some 40% of a sample of 241 recent fMRI papers did not report correcting for multiple comparisons, meaning that many group results in the fMRI literature suffer even worse false-positive rates than found here.

In response to their findings, the researchers advise that:

Due to lamentable archiving and data-sharing practices, it is unlikely that problematic analyses can be redone. Considering that it is now possible to evaluate common statistical methods using real fMRI data, the fMRI community should, in our opinion, focus on validation of existing methods.

They conclude their paper by highlighting the critical role that data sharing played in their work, and the need for study authors to share their statistical results and data:

Although our massive empirical study depended on shared data, it is disappointing that almost none of the published studies have shared their data, neither the original data nor even the 3D statistical maps. As no analysis method is perfect, and new problems and limitations will be certainly found in the future, we commend all authors to at least share their statistical results and ideally the full data.

This support for data sharing contrasts with the controversial views expressed by the editors of the New England Journal of Medicine (NEJM) in article #34 of the top 100 most-discussed journal articles of 2016.

Header image source: 01 Siemens MAGNETOM Trio by Image Editor is licensed by CC BY 2.0.

Reference:

Eklund, A., Nichols, T. E., & Knutsson, H. (2016). Cluster failure: why fMRI inferences for spatial extent have inflated false-positive rates. Proceedings of the National Academy of Sciences, 201602413. ↩

Rate this post

Also published on Medium.

Cookie	Duration	Description
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.

Bruce Boyes

Related Articles

Scientists rise up against statistical significance [Top 100 journal articles of 2019]

Statisticians respond to misuse and misinterpretation of “statistical significance” (p-values) in research