ABSTRACT
OBJECTIVE: Three leading neurobiological hypotheses about autism spectrum disorder (ASD) propose underconnectivity between brain regions, atypical function of the amygdala, and generally higher variability between individuals with ASD than between neurotypical individuals. Past work has often failed to generalize, because of small sample sizes, unquantified data quality, and analytic flexibility. This study addressed these limitations while testing the above three hypotheses, applied to amygdala functional connectivity. METHODS: In a comprehensive preregistered study, the three hypotheses were tested in a subset (N=488 after exclusions; N=212 with ASD) of the Autism Brain Imaging Data Exchange data sets. The authors analyzed resting-state functional connectivity (FC) from functional MRI data from two anatomically defined amygdala subdivisions, testing the three hypotheses with respect to magnitude, pattern similarity, and variability, across different anatomical scales ranging from whole brain to specific regions and networks. RESULTS: A Bayesian approach to hypothesis evaluation produced inconsistent evidence in ASD for atypical amygdala FC magnitude, strong evidence that the multivariate pattern of FC was typical, and no consistent evidence of increased interindividual variability in FC. The results strongly depended on analytic choices, including preprocessing pipeline for the neuroimaging data, anatomical specificity, and subject exclusions. CONCLUSIONS: A preregistered set of analyses found no reliable evidence for atypical functional connectivity of the amygdala in autism, contrary to leading hypotheses. Future studies should test an expanded set of hypotheses across multiple processing pipelines, collect deeper data per individual, and include a greater diversity of participants to ensure robust generalizability of findings on amygdala FC in ASD.
ABSTRACT
Neuroimaging research faces a crisis of reproducibility. With massive sample sizes and greater data complexity, this problem becomes more acute. Software that operates on imaging data defined using the Brain Imaging Data Structure (BIDS), known as BIDS Apps, has provided a substantial advance. However, even using BIDS Apps, a full audit trail of data processing is a necessary prerequisite for fully reproducible research. Obtaining a faithful record of the audit trail is challenging, especially for large datasets. Recently, the FAIRly big framework was introduced as a way to facilitate reproducible processing of large-scale data by leveraging DataLad, a version control system for data management. However, the current implementation of this framework was more of a proof of concept, and could not be immediately reused by other investigators for different use cases. Here we introduce the BIDS App Bootstrap (BABS), a user-friendly and generalizable Python package for reproducible image processing at scale. BABS facilitates the reproducible application of BIDS Apps to large-scale datasets. Leveraging DataLad and the FAIRly big framework, BABS tracks the full audit trail of data processing in a scalable way by automatically preparing all scripts necessary for data processing and version tracking on high performance computing (HPC) systems. Currently, BABS supports job submissions and audits on Sun Grid Engine (SGE) and Slurm HPCs with a parsimonious set of programs. To demonstrate its scalability, we applied BABS to data from the Healthy Brain Network (HBN; n=2,565). Taken together, BABS allows reproducible and scalable image processing and is broadly extensible via an open-source development model.
ABSTRACT
DataLad is a Python-based tool for the joint management of code, data, and their relationship, built on top of a versatile system for data logistics (git-annex) and the most popular distributed version control system (Git). It adapts principles of open-source software development and distribution to address the technical challenges of data management, data sharing, and digital provenance collection across the life cycle of digital objects. DataLad aims to make data management as easy as managing code. It streamlines procedures to consume, publish, and update data, for data of any size or type, and to link them as precisely versioned, lightweight dependencies. DataLad helps to make science more reproducible and FAIR (Wilkinson et al., 2016). It can capture complete and actionable process provenance of data transformations to enable automatic re-computation. The DataLad project (datalad.org) delivers a completely open, pioneering platform for flexible decentralized research data management (RDM) (Hanke, Pestilli, et al., 2021). It features a Python and a command-line interface, an extensible architecture, and does not depend on any centralized services but facilitates interoperability with a plurality of existing tools and services. In order to maximize its utility and target audience, DataLad is available for all major operating systems, and can be integrated into established workflows and environments with minimal friction.
ABSTRACT
There has been a recent major upsurge in concerns about reproducibility in many areas of science. Within the neuroimaging domain, one approach to promoting reproducibility is to target the re-executability of the publication. The information supporting such re-executability can enable the detailed examination of how an initial finding generalizes across changes in the processing approach and sampled population, in a controlled scientific fashion. ReproNim: A Center for Reproducible Neuroimaging Computation is a recently funded initiative that seeks to facilitate the "last mile" implementations of core re-executability tools in order to reduce the accessibility barrier and increase adoption of standards and best practices at the neuroimaging research laboratory level. In this report, we summarize the overall approach and tools we have developed in this domain.