Power and reproducibility in the external validation of brain-phenotype predictions.

Rosenblatt, Matthew; Tejavibulya, Link; Sun, Huili; Camp, Chris C; Khaitova, Milana; Adkinson, Brendan D; Jiang, Rongtao; Westwater, Margaret L; Noble, Stephanie; Scheinost, Dustin

Rosenblatt, Matthew; Tejavibulya, Link; Sun, Huili; Camp, Chris C; Khaitova, Milana; Adkinson, Brendan D; Jiang, Rongtao; Westwater, Margaret L; Noble, Stephanie; Scheinost, Dustin.

Afiliação

Rosenblatt M; Department of Biomedical Engineering, Yale University, New Haven, CT, USA. matthew.rosenblatt@yale.edu.
Tejavibulya L; Interdepartmental Neuroscience Program, Yale University, New Haven, CT, USA.
Sun H; Department of Biomedical Engineering, Yale University, New Haven, CT, USA.
Camp CC; Interdepartmental Neuroscience Program, Yale University, New Haven, CT, USA.
Khaitova M; Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA.
Adkinson BD; Interdepartmental Neuroscience Program, Yale University, New Haven, CT, USA.
Jiang R; Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA.
Westwater ML; Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA.
Noble S; Department of Radiology and Biomedical Imaging, Yale School of Medicine, New Haven, CT, USA.
Scheinost D; Department of Bioengineering, Northeastern University, Boston, MA, USA.

Nat Hum Behav ; 2024 Jul 31.

Article em En | MEDLINE | ID: mdl-39085406

ABSTRACT

ABSTRACT

Brain-phenotype predictive models seek to identify reproducible and generalizable brain-phenotype associations. External validation, or the evaluation of a model in external datasets, is the gold standard in evaluating the generalizability of models in neuroimaging. Unlike typical studies, external validation involves two sample sizes the training and the external sample sizes. Thus, traditional power calculations may not be appropriate. Here we ran over 900 million resampling-based simulations in functional and structural connectivity data to investigate the relationship between training sample size, external sample size, phenotype effect size, theoretical power and simulated power. Our analysis included a wide range of datasets the Healthy Brain Network, the Adolescent Brain Cognitive Development Study, the Human Connectome Project (Development and Young Adult), the Philadelphia Neurodevelopmental Cohort, the Queensland Twin Adolescent Brain Project, and the Chinese Human Connectome Project; and phenotypes age, body mass index, matrix reasoning, working memory, attention problems, anxiety/depression symptoms and relational processing. High effect size predictions achieved adequate power with training and external sample sizes of a few hundred individuals, whereas low and medium effect size predictions required hundreds to thousands of training and external samples. In addition, most previous external validation studies used sample sizes prone to low power, and theoretical power curves should be adjusted for the training sample size. Furthermore, model performance in internal validation often informed subsequent external validation performance (Pearson's r difference <0.2), particularly for well-harmonized datasets. These results could help decide how to power future external validation studies.

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Nat Hum Behav Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Nat Hum Behav Ano de publicação: 2024 Tipo de documento: Article