Pesquisa | Biblioteca Virtual em Saúde

1.

Reproducibility and Repeatability of US Shear-Wave and Transient Elastography in Nonalcoholic Fatty Liver Disease.

Pierce, Theodore T; Ozturk, Arinc; Sherlock, Sarah P; Moura Cunha, Guilherme; Wang, Xiaohong; Li, Qian; Hunt, David; Middleton, Michael S; Martin, Marian; Corey, Kathleen E; Edenbaum, Hannah; Shankar, Sudha S; Heymann, Helen; Kamphaus, Tania N; Calle, Roberto A; Covarrubias, Yesenia; Loomba, Rohit; Obuchowski, Nancy A; Sanyal, Arun J; Sirlin, Claude B; Fowler, Kathryn J; Samir, Anthony E.

Radiology ; 312(3): e233094, 2024 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-39254458

RESUMO

Background US shear-wave elastography (SWE) and vibration-controlled transient elastography (VCTE) enable assessment of liver stiffness, an indicator of fibrosis severity. However, limited reproducibility data restrict their use in clinical trials. Purpose To estimate SWE and VCTE measurement variability in nonalcoholic fatty liver disease (NAFLD) within and across systems to support clinical trial diagnostic enrichment and clinical interpretation of longitudinal liver stiffness. Materials and Methods This prospective, observational, cross-sectional study (March 2021 to November 2021) enrolled adults with NAFLD, stratified according to the Fibrosis-4 (FIB-4) index (≤1.3, >1.3 and <2.67, ≥2.67), at two sites to assess SWE with five US systems and VCTE with one system. Each participant underwent 12 elastography examinations over two separate days within 1 week, with each day's examinations conducted by a different operator. VCTE and SWE measurements were reported in units of meters per second. The primary end point was the different-day, different-operator reproducibility coefficient (RDCDDDO) pooled across systems for SWE and individually for VCTE. Secondary end points included system-specific RDCDDDO, same-day, same-operator repeatability coefficient (RCSDSO), and between-system same-day, same-operator reproducibility coefficient. The planned sample provided 80% power to detect a pooled RDCDDDO of less than 35%, the prespecified performance threshold. Results A total of 40 participants (mean age, 60 years ± 10 [SD]; 24 women) with low (n = 17), intermediate (n = 15), and high (n = 8) FIB-4 scores were enrolled. RDCDDDO was 30.7% (95% upper bound, 34.4%) for SWE and 35.6% (95% upper bound, 43.9%) for VCTE. SWE system-specific RDCDDDO varied from 24.2% to 34.3%. The RCSDSO was 21.0% for SWE (range, 13.9%-35.0%) and 19.6% for VCTE. The SWE between-system same-day, same-operator reproducibility coefficient was 52.7%. Conclusion SWE met the prespecified threshold, RDCDDDO less than 35%, with VCTE having a higher RDCDDDO. SWE variability was higher between different systems. These estimates advance liver US-based noninvasive test qualification by (a) defining expected variability, (b) establishing that serial examination variability is lower when performed with the same system, and (c) informing clinical trial design. ClinicalTrials.gov Identifier NCT04828551 © RSNA, 2024 Supplemental material is available for this article.

Assuntos

Técnicas de Imagem por Elasticidade , Hepatopatia Gordurosa não Alcoólica , Humanos , Técnicas de Imagem por Elasticidade/métodos , Hepatopatia Gordurosa não Alcoólica/diagnóstico por imagem , Feminino , Masculino , Reprodutibilidade dos Testes , Pessoa de Meia-Idade , Estudos Prospectivos , Estudos Transversais , Adulto , Fígado/diagnóstico por imagem , Idoso , Cirrose Hepática/diagnóstico por imagem

2.

Special Report on the Consensus QIBA Profile for Objective Analytical Validation of Non-calcified and High-risk Plaque and Other Biomarkers using Computed Tomography Angiography.

Buckler, Andrew J; Abbara, Suhny; Budoff, Matthew J; Carr, John Jeffrey; De Cecco, Carlo N; DeMarco, J Kevin; Ferencik, Maros; Figtree, Gemma A; Ikuta, Ichiro; Kolossváry, Márton; Konrad, Mathis; Lal, Brajesh K; Marques, Hugo; Moss, Alastair J; Obuchowski, Nancy A; van Beek, Edwin J R; Virmani, Renu; Williams, Michelle C; Saba, Luca; Joseph Schoepf, U.

Acad Radiol ; 2024 Jul 26.

Artigo em Inglês | MEDLINE | ID: mdl-39060206

RESUMO

RATIONALE AND OBJECTIVES: Evidence is building in support of the clinical utility of atherosclerotic plaque imaging by computed tomography angiography (CTA). There is increasing organized activity to embrace non-calcified plaque (NCP) as a formally defined biomarker for clinical trials, and high-risk plaque (HRP) for clinical care, as the most relevant measures for the field to advance and worthy of community efforts to validate. Yet the ability to assess the quantitative performance of any given specific solution to make these measurements or classifications is not available. Vendors use differing definitions, assessment metrics, and validation data sets to describe their offerings without clinician users having the capability to make objective assessments of accuracy and precision and how this affects diagnostic confidence. MATERIALS AND METHODS: The QIBA Profile for Atherosclerosis Biomarkers by CTA was created by the Quantitative Imaging Biomarkers Alliance (QIBA) to improve objectivity and decrease the variability of noninvasive plaque phenotyping. The Profile provides claims on the accuracy and precision of plaque measures individually and when combined. RESULTS: Individual plaque morphology measurements are evaluated in terms of bias (accuracy), slope (consistency of the bias across the measurement range, needed for measurements of change), and variability. The multiparametric plaque stability phenotype is evaluated in terms of agreement with expert pathologists. The Profile is intended for a broad audience, including those engaged in discovery science, clinical trials, and patient care. CONCLUSION: This report provides a rationale and overview of the Profile claims and how to comply with the Profile in research and clinical practice. SUMMARY STATEMENT: This article summarizes objective means to validate the analytical performance of non-calcified plaque (NCP), other emerging plaque morphology measurements, and multiparametric histology-defined high-risk plaque (HRP), as outlined in the QIBA Profile for Atherosclerosis Biomarkers by CTA.

3.

Artificial intelligence and radiologists in prostate cancer detection on MRI (PI-CAI): an international, paired, non-inferiority, confirmatory study.

Saha, Anindo; Bosma, Joeran S; Twilt, Jasper J; van Ginneken, Bram; Bjartell, Anders; Padhani, Anwar R; Bonekamp, David; Villeirs, Geert; Salomon, Georg; Giannarini, Gianluca; Kalpathy-Cramer, Jayashree; Barentsz, Jelle; Maier-Hein, Klaus H; Rusu, Mirabela; Rouvière, Olivier; van den Bergh, Roderick; Panebianco, Valeria; Kasivisvanathan, Veeru; Obuchowski, Nancy A; Yakar, Derya; Elschot, Mattijs; Veltman, Jeroen; Fütterer, Jurgen J; de Rooij, Maarten; Huisman, Henkjan.

Lancet Oncol ; 25(7): 879-887, 2024 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-38876123

RESUMO

BACKGROUND: Artificial intelligence (AI) systems can potentially aid the diagnostic pathway of prostate cancer by alleviating the increasing workload, preventing overdiagnosis, and reducing the dependence on experienced radiologists. We aimed to investigate the performance of AI systems at detecting clinically significant prostate cancer on MRI in comparison with radiologists using the Prostate Imaging-Reporting and Data System version 2.1 (PI-RADS 2.1) and the standard of care in multidisciplinary routine practice at scale. METHODS: In this international, paired, non-inferiority, confirmatory study, we trained and externally validated an AI system (developed within an international consortium) for detecting Gleason grade group 2 or greater cancers using a retrospective cohort of 10 207 MRI examinations from 9129 patients. Of these examinations, 9207 cases from three centres (11 sites) based in the Netherlands were used for training and tuning, and 1000 cases from four centres (12 sites) based in the Netherlands and Norway were used for testing. In parallel, we facilitated a multireader, multicase observer study with 62 radiologists (45 centres in 20 countries; median 7 [IQR 5-10] years of experience in reading prostate MRI) using PI-RADS (2.1) on 400 paired MRI examinations from the testing cohort. Primary endpoints were the sensitivity, specificity, and the area under the receiver operating characteristic curve (AUROC) of the AI system in comparison with that of all readers using PI-RADS (2.1) and in comparison with that of the historical radiology readings made during multidisciplinary routine practice (ie, the standard of care with the aid of patient history and peer consultation). Histopathology and at least 3 years (median 5 [IQR 4-6] years) of follow-up were used to establish the reference standard. The statistical analysis plan was prespecified with a primary hypothesis of non-inferiority (considering a margin of 0·05) and a secondary hypothesis of superiority towards the AI system, if non-inferiority was confirmed. This study was registered at ClinicalTrials.gov, NCT05489341. FINDINGS: Of the 10 207 examinations included from Jan 1, 2012, through Dec 31, 2021, 2440 cases had histologically confirmed Gleason grade group 2 or greater prostate cancer. In the subset of 400 testing cases in which the AI system was compared with the radiologists participating in the reader study, the AI system showed a statistically superior and non-inferior AUROC of 0·91 (95% CI 0·87-0·94; p<0·0001), in comparison to the pool of 62 radiologists with an AUROC of 0·86 (0·83-0·89), with a lower boundary of the two-sided 95% Wald CI for the difference in AUROC of 0·02. At the mean PI-RADS 3 or greater operating point of all readers, the AI system detected 6·8% more cases with Gleason grade group 2 or greater cancers at the same specificity (57·7%, 95% CI 51·6-63·3), or 50·4% fewer false-positive results and 20·0% fewer cases with Gleason grade group 1 cancers at the same sensitivity (89·4%, 95% CI 85·3-92·9). In all 1000 testing cases where the AI system was compared with the radiology readings made during multidisciplinary practice, non-inferiority was not confirmed, as the AI system showed lower specificity (68·9% [95% CI 65·3-72·4] vs 69·0% [65·5-72·5]) at the same sensitivity (96·1%, 94·0-98·2) as the PI-RADS 3 or greater operating point. The lower boundary of the two-sided 95% Wald CI for the difference in specificity (-0·04) was greater than the non-inferiority margin (-0·05) and a p value below the significance threshold was reached (p<0·001). INTERPRETATION: An AI system was superior to radiologists using PI-RADS (2.1), on average, at detecting clinically significant prostate cancer and comparable to the standard of care. Such a system shows the potential to be a supportive tool within a primary diagnostic setting, with several associated benefits for patients and radiologists. Prospective validation is needed to test clinical applicability of this system. FUNDING: Health~Holland and EU Horizon 2020.

Assuntos

Inteligência Artificial , Imageamento por Ressonância Magnética , Neoplasias da Próstata , Radiologistas , Humanos , Masculino , Neoplasias da Próstata/diagnóstico por imagem , Neoplasias da Próstata/patologia , Idoso , Estudos Retrospectivos , Pessoa de Meia-Idade , Gradação de Tumores , Países Baixos , Curva ROC

4.

¹⁸F-fluciclovine PET/CT to distinguish radiation necrosis from tumor progression for brain metastases treated with radiosurgery: results of a prospective pilot study.

Tom, Martin C; DiFilippo, Frank P; Jones, Stephen E; Suh, John H; Obuchowski, Nancy A; Smile, Timothy D; Murphy, Erin S; Yu, Jennifer S; Barnett, Gene H; Angelov, Lilyana; Mohammadi, Alireza M; Huang, Steve S; Wu, Guiyun; Johnson, Scott; Peereboom, David M; Stevens, Glen H J; Ahluwalia, Manmeet S; Chao, Samuel T.

J Neurooncol ; 163(3): 647-655, 2023 Jul.

Artigo em Inglês | MEDLINE | ID: mdl-37341842

RESUMO

PURPOSE: Distinguishing radiation necrosis from tumor progression among patients with brain metastases previously treated with stereotactic radiosurgery represents a common diagnostic challenge. We performed a prospective pilot study to determine whether PET/CT with 18F-fluciclovine, a widely available amino acid PET radiotracer, repurposed intracranially, can accurately diagnose equivocal lesions. METHODS: Adults with brain metastases previously treated with radiosurgery presenting with a follow-up tumor-protocol MRI brain equivocal for radiation necrosis versus tumor progression underwent an 18F-fluciclovine PET/CT of the brain within 30 days. The reference standard for final diagnosis consisted of clinical follow-up until multidisciplinary consensus or tissue confirmation. RESULTS: Of 16 patients imaged from 7/2019 to 11/2020, 15 subjects were evaluable with 20 lesions (radiation necrosis, n = 16; tumor progression, n = 4). Higher SUVmax statistically significantly predicted tumor progression (AUC = 0.875; p = 0.011). Lesion SUVmean (AUC = 0.875; p = 0.018), SUVpeak (AUC = 0.813; p = 0.007), and SUVpeak-to-normal-brain (AUC = 0.859; p = 0.002) also predicted tumor progression, whereas SUVmax-to-normal-brain (p = 0.1) and SUVmean-to-normal-brain (p = 0.5) did not. Qualitative visual scores were significant predictors for readers 1 (AUC = 0.750; p < 0.001) and 3 (AUC = 0.781; p = 0.045), but not for reader 2 (p = 0.3). Visual interpretations were significant predictors for reader 1 (AUC = 0.898; p = 0.012) but not for reader 2 (p = 0.3) or 3 (p = 0.2). CONCLUSIONS: In this prospective pilot study of patients with brain metastases previously treated with radiosurgery presenting with a contemporary MRI brain with a lesion equivocal for radiation necrosis versus tumor progression, 18F-fluciclovine PET/CT repurposed intracranially demonstrated encouraging diagnostic accuracy, supporting the pursuit of larger clinical trials which will be necessary to establish diagnostic criteria and performance.

Assuntos

Neoplasias Encefálicas , Radiocirurgia , Adulto , Humanos , Tomografia por Emissão de Pósitrons combinada à Tomografia Computadorizada/métodos , Radiocirurgia/efeitos adversos , Projetos Piloto , Estudos Prospectivos , Neoplasias Encefálicas/diagnóstico por imagem , Neoplasias Encefálicas/radioterapia , Neoplasias Encefálicas/etiologia , Necrose/diagnóstico por imagem , Necrose/etiologia

5.

Test-retest repeatability of ADC in prostate using the multi b-Value VERDICT acquisition.

Rogers, Harriet J; Singh, Saurabh; Barnes, Anna; Obuchowski, Nancy A; Margolis, Daniel J; Malyarenko, Dariya I; Chenevert, Thomas L; Shukla-Dave, Amita; Boss, Michael A; Punwani, Shonit.

Eur J Radiol ; 162: 110782, 2023 May.

Artigo em Inglês | MEDLINE | ID: mdl-37004362

RESUMO

PURPOSE: VERDICT (Vascular, Extracellular, Restricted Diffusion for Cytometry in Tumours) MRI is a multi b-value, variable diffusion time DWI sequence that allows generation of ADC maps from different b-value and diffusion time combinations. The aim was to assess precision of prostate ADC measurements from varying b-value combinations using VERDICT and determine which protocol provides the most repeatable ADC. MATERIALS AND METHODS: Forty-one men (median age: 67.7 years) from a prior prospective VERDICT study (April 2016-October 2017) were analysed retrospectively. Men who were suspected of prostate cancer and scanned twice using VERDICT were included. ADC maps were formed using 5b-value combinations and the within-subject standard deviations (wSD) were calculated per ADC map. Three anatomical locations were analysed per subject: normal TZ (transition zone), normal PZ (peripheral zone), and index lesions. Repeated measures ANOVAs showed which b-value range had the lowest wSD, Spearman correlation and generalized linear model regression analysis determined whether wSD was related to ADC magnitude and ROI size. RESULTS: The mean lesion ADC for b0b1500 had the lowest wSD in most zones (0.18-0.58x10-4 mm2/s). The wSD was unaffected by ADC magnitude (Lesion: p = 0.064, TZ: p = 0.368, PZ: p = 0.072) and lesion Likert score (p = 0.95). wSD showed a decrease with ROI size pooled over zones (p = 0.019, adjusted regression coefficient = -1.6x10-3, larger ROIs for TZ versus PZ versus lesions). ADC maps formed with a maximum b-value of 500 s/mm2 had the largest wSDs (1.90-10.24x10-4 mm2/s). CONCLUSION: ADC maps generated from b0b1500 have better repeatability in normal TZ, normal PZ, and index lesions.

Assuntos

Próstata , Neoplasias da Próstata , Masculino , Humanos , Idoso , Próstata/diagnóstico por imagem , Próstata/patologia , Estudos Prospectivos , Estudos Retrospectivos , Imagem de Difusão por Ressonância Magnética/métodos , Neoplasias da Próstata/diagnóstico por imagem , Neoplasias da Próstata/patologia

6.

Introduction to Multiparametric QIB Series.

Obuchowski, Nancy A; Hall, Timothy J.

Acad Radiol ; 30(2): 145-146, 2023 02.

Artigo em Inglês | MEDLINE | ID: mdl-36608957

7.

Multiparametric Data-driven Imaging Markers: Guidelines for Development, Application and Reporting of Model Outputs in Radiomics.

Wang, Xiaofeng; Pennello, Gene; deSouza, Nandita M; Huang, Erich P; Buckler, Andrew J; Barnhart, Huiman X; Delfino, Jana G; Raunig, David L; Wang, Lu; Guimaraes, Alexander R; Hall, Timothy J; Obuchowski, Nancy A.

Acad Radiol ; 30(2): 215-229, 2023 02.

Artigo em Inglês | MEDLINE | ID: mdl-36411153

RESUMO

This paper is the fifth in a five-part series on statistical methodology for performance assessment of multi-parametric quantitative imaging biomarkers (mpQIBs) for radiomic analysis. Radiomics is the process of extracting visually imperceptible features from radiographic medical images using data-driven algorithms. We refer to the radiomic features as data-driven imaging markers (DIMs), which are quantitative measures discovered under a data-driven framework from images beyond visual recognition but evident as patterns of disease processes irrespective of whether or not ground truth exists for the true value of the DIM. This paper aims to set guidelines on how to build machine learning models using DIMs in radiomics and to apply and report them appropriately. We provide a list of recommendations, named RANDAM (an abbreviation of "Radiomic ANalysis and DAta Modeling"), for analysis, modeling, and reporting in a radiomic study to make machine learning analyses in radiomics more reproducible. RANDAM contains five main components to use in reporting radiomics studies: design, data preparation, data analysis and modeling, reporting, and material availability. Real case studies in lung cancer research are presented along with simulation studies to compare different feature selection methods and several validation strategies.

Assuntos

Neoplasias Pulmonares , Imageamento por Ressonância Magnética Multiparamétrica , Humanos , Curva ROC , Imageamento por Ressonância Magnética Multiparamétrica/métodos , Diagnóstico por Imagem , Neoplasias Pulmonares/diagnóstico por imagem , Pulmão

8.

Multiparametric Quantitative Imaging in Risk Prediction: Recommendations for Data Acquisition, Technical Performance Assessment, and Model Development and Validation.

Huang, Erich P; Pennello, Gene; deSouza, Nandita M; Wang, Xiaofeng; Buckler, Andrew J; Kinahan, Paul E; Barnhart, Huiman X; Delfino, Jana G; Hall, Timothy J; Raunig, David L; Guimaraes, Alexander R; Obuchowski, Nancy A.

Acad Radiol ; 30(2): 196-214, 2023 02.

Artigo em Inglês | MEDLINE | ID: mdl-36273996

RESUMO

Combinations of multiple quantitative imaging biomarkers (QIBs) are often able to predict the likelihood of an event of interest such as death or disease recurrence more effectively than single imaging measurements can alone. The development of such multiparametric quantitative imaging and evaluation of its fitness of use differs from the analogous processes for individual QIBs in several key aspects. A computational procedure to combine the QIB values into a model output must be specified. The output must also be reproducible and be shown to have reasonably strong ability to predict the risk of an event of interest. Attention must be paid to statistical issues not often encountered in the single QIB scenario, including overfitting and bias in the estimates of model performance. This is the fourth in a five-part series on statistical methodology for assessing the technical performance of multiparametric quantitative imaging. Considerations for data acquisition are discussed and recommendations from the literature on methodology to construct and evaluate QIB-based models for risk prediction are summarized. The findings in the literature upon which these recommendations are based are demonstrated through simulation studies. The concepts in this manuscript are applied to a real-life example involving prediction of major adverse cardiac events using automated plaque analysis.

Assuntos

Diagnóstico por Imagem , Humanos , Diagnóstico por Imagem/métodos , Biomarcadores , Simulação por Computador

9.

A Framework for Evaluating the Technical Performance of Multiparameter Quantitative Imaging Biomarkers (mp-QIBs).

Obuchowski, Nancy A; Huang, Erich; deSouza, Nandita M; Raunig, David; Delfino, Jana; Buckler, Andrew; Hatt, Charles; Wang, Xiaofeng; Moskowitz, Chaya; Guimaraes, Alexander; Giger, Maryellen; Hall, Timothy J; Kinahan, Paul; Pennello, Gene.

Acad Radiol ; 30(2): 147-158, 2023 02.

Artigo em Inglês | MEDLINE | ID: mdl-36180328

RESUMO

Multiparameter quantitative imaging incorporates anatomical, functional, and/or behavioral biomarkers to characterize tissue, detect disease, identify phenotypes, define longitudinal change, or predict outcome. Multiple imaging parameters are sometimes considered separately but ideally are evaluated collectively. Often, they are transformed as Likert interpretations, ignoring the correlations of quantitative properties that may result in better reproducibility or outcome prediction. In this paper we present three use cases of multiparameter quantitative imaging: i) multidimensional descriptor, ii) phenotype classification, and iii) risk prediction. A fourth application based on data-driven markers from radiomics is also presented. We describe the technical performance characteristics and their metrics common to all use cases, and provide a structure for the development, estimation, and testing of multiparameter quantitative imaging. This paper serves as an overview for a series of individual articles on the four applications, providing the statistical framework for multiparameter imaging applications in medicine.

Assuntos

Diagnóstico por Imagem , Reprodutibilidade dos Testes , Diagnóstico por Imagem/métodos , Biomarcadores , Fenótipo

10.

The RSNA QIBA Profile for Amyloid PET as an Imaging Biomarker for Cerebral Amyloid Quantification.

Smith, Anne M; Obuchowski, Nancy A; Foster, Norman L; Klein, Gregory; Mozley, P David; Lammertsma, Adriaan A; Wahl, Richard L; Sunderland, John J; Vanderheyden, Jean-Luc; Benzinger, Tammie L S; Kinahan, Paul E; Wong, Dean F; Perlman, Eric S; Minoshima, Satoshi; Matthews, Dawn.

J Nucl Med ; 64(2): 294-303, 2023 02.

Artigo em Inglês | MEDLINE | ID: mdl-36137760

RESUMO

A standardized approach to acquiring amyloid PET images increases their value as disease and drug response biomarkers. Most 18F PET amyloid brain scans often are assessed only visually (per regulatory labels), with a binary decision indicating the presence or absence of Alzheimer disease amyloid pathology. Minimizing technical variance allows precise, quantitative SUV ratios (SUVRs) for early detection of ß-amyloid plaques and allows the effectiveness of antiamyloid treatments to be assessed with serial studies. Methods: The Quantitative Imaging Biomarkers Alliance amyloid PET biomarker committee developed and validated a profile to characterize and reduce the variability of SUVRs, increasing statistical power for these assessments. Results: On achieving conformance, sites can justify a claim that brain amyloid burden reflected by the SUVR is measurable to a within-subject coefficient of variation of no more than 1.94% when the same radiopharmaceutical, scanner, acquisition, and analysis protocols are used. Conclusion: This overview explains the claim, requirements, barriers, and potential future developments of the profile to achieve precision in clinical and research amyloid PET imaging.

Assuntos

Doença de Alzheimer , Processamento de Imagem Assistida por Computador , Humanos , Processamento de Imagem Assistida por Computador/métodos , Tomografia por Emissão de Pósitrons/métodos , Doença de Alzheimer/patologia , Peptídeos beta-Amiloides/metabolismo , Encéfalo/metabolismo , Biomarcadores , Amiloide/metabolismo , Compostos de Anilina

11.

Comparing the Accuracy of Diagnostic Tests When Disease Is Characterized by an Ordinal Scale.

Obuchowski, Nancy A.

Am J Epidemiol ; 192(4): 632-643, 2023 04 06.

Artigo em Inglês | MEDLINE | ID: mdl-36549904

RESUMO

In diagnostic medicine, the true disease status of a patient is often represented on an ordinal scale-for example, cancer stage (0, I, II, III, or IV) or coronary artery disease severity measured using the Coronary Artery Disease Reporting and Data System (CAD-RADS) scale (none, minimal, mild, moderate, severe, or occluded). With advances in quantitation of diagnostic images and in artificial intelligence (AI), both supervised and unsupervised algorithms are being developed to help physicians correctly grade disease. Most of the diagnostic accuracy literature deals with binary disease status (disease present or absent); however, tests diagnosing ordinal-scaled diseases should not be reduced to a binary status just to simplify diagnostic accuracy testing. In this paper, we propose different characterizations of ordinal-scale accuracy for different clinical use scenarios, along with methods for comparing tests. In the simplest scenario, just the proportion of correct grades is considered; other scenarios address the magnitude and direction of misgrading; and at the other extreme, a weighted accuracy measure with weights based on the relative costs of different types of misgrading is presented. The various scenarios are illustrated using a coronary artery disease example where the accuracy of AI algorithms in providing patients with the correct CAD-RADS grade is assessed.

Assuntos

Doença da Artéria Coronariana , Humanos , Angiografia Coronária/métodos , Inteligência Artificial , Algoritmos , Testes Diagnósticos de Rotina

12.

Imaging Service Navigators: An Approach Toward More Efficient and Effective Communications.

Subhas, Naveen; Johnson, Stefan; Caruso, Christine; Kollai, Elizabeth; Obuchowski, Nancy A; Mody, Rekha; Parker, H Joseph; Borkowski, Gregory P.

J Am Coll Radiol ; 20(1): 79-86, 2023 01.

Artigo em Inglês | MEDLINE | ID: mdl-36494062

RESUMO

PURPOSE: Many practices have implemented support services to assist radiologists with noninterpretive tasks; however, little research has been performed to assess the overall effect of these services. The purpose of this study was to evaluate the effect of a team of imaging service navigators (ISNs) incorporated into a practice on (1) number of communications, (2) time saved by radiologists, and (3) radiologist satisfaction with the service. METHODS: The numbers and types of reports dictated by radiologists were captured for 6-month periods before and after ISN implementation. Communication rates before and after implementation were then calculated. The amount of perceived time savings using the ISN team and satisfaction with the service were assessed through pre- and postimplementation surveys of participating radiologists. Mean and median time savings and satisfaction rates were calculated. RESULTS: The overall communication rate increased from 2.196% before ISNs to 3.278% after ISNs (49% increase; 95% confidence interval, 47%-52%). Communication rates increased among all communication subtypes (critical, urgent, routine, and actionable), with the highest increases in urgent (94%) and actionable (75%) findings. Before implementation, radiologists reported spending 39 min on average per day on communications tasks, with only 33% of radiologists indicating that the communication process was efficient. After implementation, radiologists reported mean time savings of 28 min (95% confidence interval, 19.9-35.1), and 82% of radiologists indicated a positive or highly positive view of the ISN service. CONCLUSIONS: After ISN implementation, communication rates increased and radiologists reported spending less time performing communications. Most radiologists were satisfied with the service.

Assuntos

Diagnóstico por Imagem , Radiologistas , Humanos , Comunicação , Inquéritos e Questionários , Satisfação Pessoal

13.

Correction to: Sex-based differences in left ventricular remodeling in patients with chronic aortic regurgitation: a multi-modality study.

Tower-Rader, Albree; Mathias, Isadora Sande; Obuchowski, Nancy A; Kocyigit, Duygu; Kumar, Yash; Donnellan, Eoin; Bolen, Michael; Phelan, Dermot; Flamm, Scott; Griffin, Brian; Cho, Leslie; Svensson, Lars G; Pettersson, Gosta; Popovic, Zoran; Kwon, Deborah H.

J Cardiovasc Magn Reson ; 24(1): 41, 2022 Jun 28.

Artigo em Inglês | MEDLINE | ID: mdl-35764994

14.

Meniscal Treatment as a Predictor of Worse Articular Cartilage Damage on MRI at 2 Years After ACL Reconstruction: The MOON Nested Cohort.

Altahawi, Faysal F; Reinke, Emily K; Briskin, Isaac; Cantrell, William A; Flanigan, David C; Fleming, Braden C; Huston, Laura J; Li, Xiaojuan; Oak, Sameer R; Obuchowski, Nancy A; Scaramuzza, Erica A; Winalski, Carl S; Zajichek, Alex; Spindler, Kurt P; Jones, Morgan H.

Am J Sports Med ; 50(4): 951-961, 2022 03.

Artigo em Inglês | MEDLINE | ID: mdl-35373606

RESUMO

BACKGROUND: Patients undergoing anterior cruciate ligament reconstruction (ACLR) are at an increased risk for posttraumatic osteoarthritis (PTOA). While we have previously shown that meniscal treatment with ACLR predicts more radiographic PTOA at 2 to 3 years postoperatively, there are a limited number of similar studies that have assessed cartilage directly with magnetic resonance imaging (MRI). HYPOTHESIS: Meniscal repair or partial meniscectomy at the time of ACLR independently predicts more articular cartilage damage on 2- to 3-year postoperative MRI compared with a healthy meniscus or a stable untreated tear. STUDY DESIGN: Cohort study; Level of evidence, 2. METHODS: A consecutive series of patients undergoing ACLR from 1 site within the prospective, nested Multicenter Orthopaedic Outcomes Network (MOON) cohort underwent bilateral knee MRI at 2 to 3 years postoperatively. Patients were aged <36 years without previous knee injuries, were injured while playing sports, and had no history of concomitant ligament surgery or contralateral knee surgery. MRI scans were graded by a board-certified musculoskeletal radiologist using the modified MRI Osteoarthritis Knee Score (MOAKS). A proportional odds logistic regression model was built to predict a MOAKS-based cartilage damage score (CDS) relative to the contralateral control knee for each compartment as well as for the whole knee, pooled by meniscal treatment, while controlling for sex, age, body mass index, baseline Marx activity score, and baseline operative cartilage grade. For analysis, meniscal injuries surgically treated with partial meniscectomy or meniscal repair were grouped together. RESULTS: The cohort included 60 patients (32 female; median age, 18.7 years). Concomitant meniscal treatment at the time of index ACLR was performed in 17 medial menisci (13 meniscal repair and 4 partial meniscectomy) and 27 lateral menisci (3 meniscal repair and 24 partial meniscectomy). Articular cartilage damage was worse in the ipsilateral reconstructed knee (P < .001). A meniscal injury requiring surgical treatment with ACLR predicted a worse CDS for medial meniscal treatment (medial compartment CDS: P = .005; whole joint CDS: P < .001) and lateral meniscal treatment (lateral compartment CDS: P = .038; whole joint CDS: P = .863). Other predictors of a worse relative CDS included age for the medial compartment (P < .001), surgically observed articular cartilage damage for the patellofemoral compartment (P = .048), and body mass index (P = .007) and age (P = .020) for the whole joint. CONCLUSION: A meniscal injury requiring surgical treatment with partial meniscectomy or meniscal repair at the time of ACLR predicted worse articular cartilage damage on MRI at 2 to 3 years after surgery. Further research is required to differentiate between the effects of partial meniscectomy and meniscal repair.

Assuntos

Lesões do Ligamento Cruzado Anterior , Cartilagem Articular , Menisco , Ortopedia , Adolescente , Adulto , Lesões do Ligamento Cruzado Anterior/diagnóstico por imagem , Lesões do Ligamento Cruzado Anterior/patologia , Lesões do Ligamento Cruzado Anterior/cirurgia , Cartilagem Articular/cirurgia , Estudos de Coortes , Feminino , Humanos , Imageamento por Ressonância Magnética/métodos , Menisco/diagnóstico por imagem , Menisco/cirurgia , Estudos Prospectivos

15.

Statistical practice and transparent reporting in the neurosciences: Preclinical motor behavioral experiments.

Hogue, Olivia; Harvey, Tucker; Crozier, Dena; Sonneborn, Claire; Postle, Abagail; Block-Beach, Hunter; Somasundaram, Eashwar; May, Francis J; Snyder Braun, Monica; Pasadyn, Felicia L; King, Khandi; Johnson, Casandra; Dolansky, Mary A; Obuchowski, Nancy A; Machado, Andre G; Baker, Kenneth B; Barnholtz-Sloan, Jill S.

PLoS One ; 17(3): e0265154, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-35312695

RESUMO

Longitudinal and behavioral preclinical animal studies generate complex data, which may not be well matched to statistical approaches common in this literature. Analyses that do not adequately account for complexity may result in overly optimistic study conclusions, with consequences for reproducibility and translational decision-making. Recent work interrogating methodological shortcomings in animal research has not yet comprehensively investigated statistical shortcomings in the analysis of complex longitudinal and behavioral data. To this end, the current cross-sectional meta-research study rigorously reviewed published mouse or rat controlled experiments for motor rehabilitation in three neurologic conditions to evaluate statistical choices and reporting. Medline via PubMed was queried in February 2020 for English-language articles published January 1, 2017- December 31, 2019. Included were articles that used rat or mouse models of stroke, Parkinson's disease, or traumatic brain injury, employed a therapeutic controlled experimental design to determine efficacy, and assessed at least one functional behavioral assessment or global evaluation of function. 241 articles from 99 journals were evaluated independently by a team of nine raters. Articles were assessed for statistical handling of non-independence, animal attrition, outliers, ordinal data, and multiplicity. Exploratory analyses evaluated whether transparency or statistical choices differed as a function of journal factors. A majority of articles failed to account for sources of non-independence in the data (74-93%) and/or did not analytically account for mid-treatment animal attrition (78%). Ordinal variables were often treated as continuous (37%), outliers were predominantly not mentioned (83%), and plots often concealed the distribution of the data (51%) Statistical choices and transparency did not differ with regards to journal rank or reporting requirements. Statistical misapplication can result in invalid experimental findings and inadequate reporting obscures errors. Clinician-scientists evaluating preclinical work for translational promise should be mindful of commonplace errors. Interventions are needed to improve statistical decision-making in preclinical behavioral neurosciences research.

Assuntos

Neurociências , Projetos de Pesquisa , Animais , Estudos Transversais , Camundongos , Ratos , Reprodutibilidade dos Testes

16.

Multireader Diagnostic Accuracy Imaging Studies: Fundamentals of Design and Analysis.

Obuchowski, Nancy A; Bullen, Jennifer.

Radiology ; 303(1): 26-34, 2022 04.

Artigo em Inglês | MEDLINE | ID: mdl-35166584

RESUMO

The design and analysis of multireader multicase (MRMC) studies are quite challenging. These studies differ from most medical studies because they need a reference standard and sampling from two populations (ie, reader and patient populations). They are quite expensive to conduct, requiring a good deal of readers' time for image interpretation. One common problem is the use of imperfect reference standards, often correlated with the test or tests being evaluated. Another common issue is oversimplification of the multidimensional MRMC data. In this study, the fundamentals of MRMC study design and analysis are reviewed. The goal is to provide investigators with a guide to the fundamentals of MRMC design and analysis, with references to more detailed discussions. In addition, readers are updated on newer areas of research, including correction for studies with multiple diagnostic accuracy end points and adjustment for location bias.

Assuntos

Diagnóstico por Imagem , Projetos de Pesquisa , Humanos , Curva ROC , Sensibilidade e Especificidade

17.

Sex-based differences in left ventricular remodeling in patients with chronic aortic regurgitation: a multi-modality study.

Tower-Rader, Albree; Mathias, Isadora Sande; Obuchowski, Nancy A; Kocyigit, Duygu; Kumar, Yash; Donnellan, Eoin; Bolen, Michael; Phelan, Dermot; Flamm, Scott; Griffin, Brian; Cho, Leslie; Svensson, Lars G; Pettersson, Gosta; Popovic, Zoran; Kwon, Deborah.

J Cardiovasc Magn Reson ; 24(1): 12, 2022 02 22.

Artigo em Inglês | MEDLINE | ID: mdl-35193584

RESUMO

BACKGROUND: Significant aortic regurgitation (AR) leads to left ventricular (LV) remodeling; however, little data exist regarding sex-based differences in LV remodeling in this setting. We sought to compare LV remodeling and AR severity, assessed by echocardiography and cardiovascular magnetic resonance (CMR), to discern sex-based differences. METHODS: Patients with ≥ moderate chronic AR by echocardiography who underwent CMR within 90 days between December 2005 and October 2015 were included. Nonlinear regression models were built to assess the effect of AR regurgitant fraction (RF) on LV remodeling. A generalized linear model and Bland Altman analyses were constructed to evaluate differences between CMR and echocardiography. Referral for surgical intervention based on symptoms and LV remodeling was evaluated. RESULTS: Of the 243 patients (48.3 ± 16.6 years, 58 (24%) female), 119 (49%) underwent surgical intervention with a primary indication of severe AR, 97 (82%) men, 22 (18%) women. Significant sex differences in LV remodeling emerged on CMR. Women demonstrated significantly smaller LV end-diastolic volume index (LVEDVI) (96.8 ml/m2 vs 125.6 ml/m2, p < 0.001), LV end-systolic volume index (LVESVI) (41.1 vs 54.5 ml/m2, p < 0.001), blunted LV dilation in the setting of increasing AR severity (LVEDVI p value < 0.001, LVESVI p value 0.011), and LV length indexed (8.32 vs 9.69 cm, p < 0.001). On Bland Altman analysis, a significant interaction with sex and LV diameters was evident, demonstrating a significant increase in the difference between CMR and echocardiography measurements as the LV enlarged in women: LVEDVI (p = 0.006), LVESVI (p < 0.001), such that echocardiographic measurements increasingly underestimated LV diameters in women as the LV enlarged. LV length was higher for males with a linear effect from RF (p < 0.001), with LV length increasing at a higher rate with increasing RF for males compared to females (two-way interaction with sex p = 0.005). Sphericity volume index was higher for men after adjusting for a relative wall thickness (p = 0.033). CONCLUSIONS: CMR assessment of chronic AR revealed significant sex differences in LV remodeling and significant echocardiographic underestimation of LV dilation, particularly in women. Defining optimal sex-based CMR thresholds for surgical referral should be further developed. TRIAL REGISTRATION: NA.

Assuntos

Insuficiência da Valva Aórtica , Insuficiência da Valva Aórtica/diagnóstico por imagem , Insuficiência da Valva Aórtica/cirurgia , Ecocardiografia , Feminino , Humanos , Masculino , Valor Preditivo dos Testes , Caracteres Sexuais , Função Ventricular Esquerda , Remodelação Ventricular

18.

Improving the Robustness of Diagnostic Accuracy Results By Asking Study Readers to Further Distinguish Subjects Who Appear to be Without the Condition Of Interest.

Bullen, Jennifer A; Koo, Chi Wan; Bogoni, Luca; Obuchowski, Nancy A.

Acad Radiol ; 29(4): 550-558, 2022 04.

Artigo em Inglês | MEDLINE | ID: mdl-34366278

RESUMO

RATIONALE AND OBJECTIVES: In diagnostic accuracy studies, cases in which a reader does not see the condition of interest are often given the same score for ROC analysis (e.g. confidence score of 0%). However, many of these cases can be further distinguished and doing so may result in more robust ROC results. MATERIALS AND METHODS: We examined two recent, real-world studies which used different procedures to encourage readers to further distinguish subjects who appear to be without the condition of interest. For each study, we calculated the results under two conditions. In the "zeros distinguished" (ZD) condition, we incorporated the confidence scores collected to further distinguish the normal-looking subjects. In the "zeros not distinguished" (ZND) condition, we disregarded these scores and simply gave the unit of analysis a score of zero whenever the reader did not identify the condition of interest in that unit. We compared the two conditions on (1) coverage of the ROC space and (2) discrepancy between parametric and nonparametric results. RESULTS: Compared to the ZND condition, coverage of the ROC space was improved in the ZD condition for all ROC curves in both studies. In the first study, there was a significant reduction in the discrepancy between parametric and nonparametric results (median discrepancy in ZND vs ZD condition: 0.033 vs 0.011, p = 0.012). A similar reduction was not seen in the second study, though the discrepancies were very low in both conditions (0.003 vs 0.006, p = 0.313). CONCLUSION: Prompting readers to further distinguish cases in which they do not see the condition of interest may result in more robust ROC results, though further exploration of this topic is warranted.

Assuntos

Curva ROC , Humanos

19.

How to show that a new imaging method can replace a standard method, when no reference standard is available?

Omoumi, Patrick; Obuchowski, Nancy A.

Eur Radiol ; 32(4): 2810-2812, 2022 04.

Artigo em Inglês | MEDLINE | ID: mdl-34796382

Assuntos

Padrões de Referência , Humanos

20.

Development, validation, qualification, and dissemination of quantitative MR methods: Overview and recommendations by the ISMRM quantitative MR study group.

Weingärtner, Sebastian; Desmond, Kimberly L; Obuchowski, Nancy A; Baessler, Bettina; Zhang, Yuxin; Biondetti, Emma; Ma, Dan; Golay, Xavier; Boss, Michael A; Gunter, Jeffrey L; Keenan, Kathryn E; Hernando, Diego.

Magn Reson Med ; 87(3): 1184-1206, 2022 03.

Artigo em Inglês | MEDLINE | ID: mdl-34825741

RESUMO

On behalf of the International Society for Magnetic Resonance in Medicine (ISMRM) Quantitative MR Study Group, this article provides an overview of considerations for the development, validation, qualification, and dissemination of quantitative MR (qMR) methods. This process is framed in terms of two central technical performance properties, i.e., bias and precision. Although qMR is confounded by undesired effects, methods with low bias and high precision can be iteratively developed and validated. For illustration, two distinct qMR methods are discussed throughout the manuscript: quantification of liver proton-density fat fraction, and cardiac T1 . These examples demonstrate the expansion of qMR methods from research centers toward widespread clinical dissemination. The overall goal of this article is to provide trainees, researchers, and clinicians with essential guidelines for the development and validation of qMR methods, as well as an understanding of necessary steps and potential pitfalls for the dissemination of quantitative MR in research and in the clinic.

Assuntos

Imageamento por Ressonância Magnética , Terapia com Prótons , Viés , Espectroscopia de Ressonância Magnética , Prótons , Reprodutibilidade dos Testes

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA