Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 234
Filtrar
1.
Lancet Oncol ; 25(7): 879-887, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38876123

RESUMO

BACKGROUND: Artificial intelligence (AI) systems can potentially aid the diagnostic pathway of prostate cancer by alleviating the increasing workload, preventing overdiagnosis, and reducing the dependence on experienced radiologists. We aimed to investigate the performance of AI systems at detecting clinically significant prostate cancer on MRI in comparison with radiologists using the Prostate Imaging-Reporting and Data System version 2.1 (PI-RADS 2.1) and the standard of care in multidisciplinary routine practice at scale. METHODS: In this international, paired, non-inferiority, confirmatory study, we trained and externally validated an AI system (developed within an international consortium) for detecting Gleason grade group 2 or greater cancers using a retrospective cohort of 10 207 MRI examinations from 9129 patients. Of these examinations, 9207 cases from three centres (11 sites) based in the Netherlands were used for training and tuning, and 1000 cases from four centres (12 sites) based in the Netherlands and Norway were used for testing. In parallel, we facilitated a multireader, multicase observer study with 62 radiologists (45 centres in 20 countries; median 7 [IQR 5-10] years of experience in reading prostate MRI) using PI-RADS (2.1) on 400 paired MRI examinations from the testing cohort. Primary endpoints were the sensitivity, specificity, and the area under the receiver operating characteristic curve (AUROC) of the AI system in comparison with that of all readers using PI-RADS (2.1) and in comparison with that of the historical radiology readings made during multidisciplinary routine practice (ie, the standard of care with the aid of patient history and peer consultation). Histopathology and at least 3 years (median 5 [IQR 4-6] years) of follow-up were used to establish the reference standard. The statistical analysis plan was prespecified with a primary hypothesis of non-inferiority (considering a margin of 0·05) and a secondary hypothesis of superiority towards the AI system, if non-inferiority was confirmed. This study was registered at ClinicalTrials.gov, NCT05489341. FINDINGS: Of the 10 207 examinations included from Jan 1, 2012, through Dec 31, 2021, 2440 cases had histologically confirmed Gleason grade group 2 or greater prostate cancer. In the subset of 400 testing cases in which the AI system was compared with the radiologists participating in the reader study, the AI system showed a statistically superior and non-inferior AUROC of 0·91 (95% CI 0·87-0·94; p<0·0001), in comparison to the pool of 62 radiologists with an AUROC of 0·86 (0·83-0·89), with a lower boundary of the two-sided 95% Wald CI for the difference in AUROC of 0·02. At the mean PI-RADS 3 or greater operating point of all readers, the AI system detected 6·8% more cases with Gleason grade group 2 or greater cancers at the same specificity (57·7%, 95% CI 51·6-63·3), or 50·4% fewer false-positive results and 20·0% fewer cases with Gleason grade group 1 cancers at the same sensitivity (89·4%, 95% CI 85·3-92·9). In all 1000 testing cases where the AI system was compared with the radiology readings made during multidisciplinary practice, non-inferiority was not confirmed, as the AI system showed lower specificity (68·9% [95% CI 65·3-72·4] vs 69·0% [65·5-72·5]) at the same sensitivity (96·1%, 94·0-98·2) as the PI-RADS 3 or greater operating point. The lower boundary of the two-sided 95% Wald CI for the difference in specificity (-0·04) was greater than the non-inferiority margin (-0·05) and a p value below the significance threshold was reached (p<0·001). INTERPRETATION: An AI system was superior to radiologists using PI-RADS (2.1), on average, at detecting clinically significant prostate cancer and comparable to the standard of care. Such a system shows the potential to be a supportive tool within a primary diagnostic setting, with several associated benefits for patients and radiologists. Prospective validation is needed to test clinical applicability of this system. FUNDING: Health~Holland and EU Horizon 2020.


Assuntos
Inteligência Artificial , Imageamento por Ressonância Magnética , Neoplasias da Próstata , Radiologistas , Humanos , Masculino , Neoplasias da Próstata/diagnóstico por imagem , Neoplasias da Próstata/patologia , Idoso , Estudos Retrospectivos , Pessoa de Meia-Idade , Gradação de Tumores , Países Baixos , Curva ROC
2.
Radiology ; 313(1): e233055, 2024 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-39377680

RESUMO

The apparent diffusion coefficient (ADC) provides a quantitative measure of water mobility that can be used to probe alterations in tissue microstructure due to disease or treatment. Establishment of the accepted level of variance in ADC measurements for each clinical application is critical for its successful implementation. The Diffusion-Weighted Imaging Biomarker Committee of the Quantitative Imaging Biomarkers Alliance (QIBA) has recently advanced the ADC Profile from the consensus to clinically feasible stage for the brain, liver, prostate, and breast. This profile distills multiple studies on ADC repeatability and describes detailed procedures to achieve stated performance claims on an observed ADC change within acceptable confidence limits. In addition to reviewing the current ADC Profile claims, this report has used recent literature to develop proposed updates for establishing metrology benchmarks for mean lesion ADC change that account for measurement variance. Specifically, changes in mean ADC exceeding 8% for brain lesions, 27% for liver lesions, 27% for prostate lesions, and 15% for breast lesions are claimed to represent true changes with 95% confidence. This report also discusses the development of the ADC Profile, highlighting its various stages, and describes the workflow essential to achieving a standardized implementation of advanced quantitative diffusion-weighted MRI in the clinic. The presented QIBA ADC Profile guidelines should enable successful clinical application of ADC as a quantitative imaging biomarker and ensure reproducible ADC measurements that can be used to confidently evaluate longitudinal changes and treatment response for individual patients.


Assuntos
Imagem de Difusão por Ressonância Magnética , Humanos , Imagem de Difusão por Ressonância Magnética/métodos , Masculino , Feminino , Biomarcadores , Reprodutibilidade dos Testes
3.
Radiology ; 312(3): e233094, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39254458

RESUMO

Background US shear-wave elastography (SWE) and vibration-controlled transient elastography (VCTE) enable assessment of liver stiffness, an indicator of fibrosis severity. However, limited reproducibility data restrict their use in clinical trials. Purpose To estimate SWE and VCTE measurement variability in nonalcoholic fatty liver disease (NAFLD) within and across systems to support clinical trial diagnostic enrichment and clinical interpretation of longitudinal liver stiffness. Materials and Methods This prospective, observational, cross-sectional study (March 2021 to November 2021) enrolled adults with NAFLD, stratified according to the Fibrosis-4 (FIB-4) index (≤1.3, >1.3 and <2.67, ≥2.67), at two sites to assess SWE with five US systems and VCTE with one system. Each participant underwent 12 elastography examinations over two separate days within 1 week, with each day's examinations conducted by a different operator. VCTE and SWE measurements were reported in units of meters per second. The primary end point was the different-day, different-operator reproducibility coefficient (RDCDDDO) pooled across systems for SWE and individually for VCTE. Secondary end points included system-specific RDCDDDO, same-day, same-operator repeatability coefficient (RCSDSO), and between-system same-day, same-operator reproducibility coefficient. The planned sample provided 80% power to detect a pooled RDCDDDO of less than 35%, the prespecified performance threshold. Results A total of 40 participants (mean age, 60 years ± 10 [SD]; 24 women) with low (n = 17), intermediate (n = 15), and high (n = 8) FIB-4 scores were enrolled. RDCDDDO was 30.7% (95% upper bound, 34.4%) for SWE and 35.6% (95% upper bound, 43.9%) for VCTE. SWE system-specific RDCDDDO varied from 24.2% to 34.3%. The RCSDSO was 21.0% for SWE (range, 13.9%-35.0%) and 19.6% for VCTE. The SWE between-system same-day, same-operator reproducibility coefficient was 52.7%. Conclusion SWE met the prespecified threshold, RDCDDDO less than 35%, with VCTE having a higher RDCDDDO. SWE variability was higher between different systems. These estimates advance liver US-based noninvasive test qualification by (a) defining expected variability, (b) establishing that serial examination variability is lower when performed with the same system, and (c) informing clinical trial design. ClinicalTrials.gov Identifier NCT04828551 © RSNA, 2024 Supplemental material is available for this article.


Assuntos
Técnicas de Imagem por Elasticidade , Hepatopatia Gordurosa não Alcoólica , Humanos , Técnicas de Imagem por Elasticidade/métodos , Hepatopatia Gordurosa não Alcoólica/diagnóstico por imagem , Feminino , Masculino , Reprodutibilidade dos Testes , Pessoa de Meia-Idade , Estudos Prospectivos , Estudos Transversais , Adulto , Fígado/diagnóstico por imagem , Idoso , Cirrose Hepática/diagnóstico por imagem
4.
J Vasc Interv Radiol ; 2024 Oct 08.
Artigo em Inglês | MEDLINE | ID: mdl-39389233

RESUMO

PURPOSE: To compare the safety and effectiveness of liver vein deprivation (LVD) and portal vein embolization (PVE) in patients scheduled to undergo liver resection. MATERIALS AND METHODS: This retrospective cohort study included 59 patients who underwent either PVE (n = 28) or LVD (n = 31) in preparation for liver resection. The primary outcome was percent change in future liver remnant volume (FLRV). Secondary endpoints were degree of hypertrophy (DH) and kinetic growth rate (KGR). RESULTS: Low baseline FLRV, time interval in days between the procedure and follow-up imaging (Ti) positively impacted the primary and secondary endpoints in both groups. Percent change in FLRV was higher in the LVD group (52.8% ± 5.3%) than in the PVE group (22.3% ± 3.0%; P <.001). DH was also higher in the LVD group (15.4% ± 1.7%) than in the PVE group (6.4% ± 0.9%; P <.001). KGR did not differ significantly between groups (LVD, 0.54%/d ± 0.06%/d; PVE, 0.35%/d ± 0.1%/d; P =0.239). When patients with baseline standardized FLRV > 35% were excluded from the analysis, LVD group demonstrated higher values than the PVE group in KGR (0.57%/d ± 0.06%/d vs 0.29%/d ± 0.05%/d; P <.001), percent change in FLRV (64.2% ± 6.0% vs 25.9% ± 4.3%; P <.001), and DH (15.4% ± 1.4% vs 6.6% ± 1.0%; P <.001). No adverse events were noted in either group. CONCLUSION: LVD appears to be safe and may be superior to PVE in inducing hypertrophy of FLR in patients scheduled to undergo surgical resection.

5.
AJR Am J Roentgenol ; 2024 Oct 02.
Artigo em Inglês | MEDLINE | ID: mdl-39356481

RESUMO

Prostate MRI has traditionally relied on qualitative interpretation. However, quantitative components hold the potential to markedly improve performance. The ADC from DWI is probably the most widely recognized quantitative MRI biomarker and has shown strong discriminatory value for clinically significant prostate cancer (csPCa) as well as for recurrent cancer after treatment. Advanced diffusion techniques, including intravoxel incoherent motion, diffusion kurtosis, diffusion tensor imaging, and specific implementations such as restriction spectrum imaging, purport even better discrimination, but are more technically challenging. The inherent T1 and T2 of tissue also provide diagnostic value, with more advanced techniques deriving luminal water imaging and hybrid-multidimensional MRI. Dynamic contrast-enhanced imaging, primarily using a modified Tofts model, also shows independent discriminatory value. Finally, quantitative size and shape features can be combined with the aforementioned techniques and be further refined using radiomics, texture analysis, and artificial intelligence. Which technique will ultimately find widespread clinical use will depend on validation across a myriad of platforms use-cases.

6.
J Appl Clin Med Phys ; 25(1): e14235, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-38059633

RESUMO

PURPOSE: The purpose of this investigation was to assess the effect of visceral adipose tissue volume (VA) on reader efficacy in diagnosing and characterizing small bowel Crohn's disease using lower exposure CT enterography (CTE). Secondarily, we investigated the effect of lower exposure and VA on reader diagnostic confidence. METHODS: Prospective paired investigation of 256 CTE, 129 with Crohn's disease, were reconstructed at 100% and simulated 50% and 30% exposure. The senior author provided the disease classification for the 129 patients with Crohn's disease. Patient VA was measured, and exams were evaluated by six readers for presence or absence of Crohn's disease and phenotype using a 0-10-point scale. Logistic regression models assessed the effect of VA on sensitivity and specificity. RESULTS: The effect of VA on sensitivity was significantly reduced at 30% exposure (odds radio [OR]: 1.00) compared to 100% exposure (OR: 1.12) (p = 0.048). There was no statistically significant difference among the exposures with respect to the effect of visceral fat on specificity (p = 0.159). The study readers' probability of agreement with the senior author on disease classification was 60%, 56%, and 53% at 100%, 50%, and 30% exposure, respectively (p = 0.004). When detecting low severity Crohn's disease, readers' mean sensitivity was 83%, 75%, and 74% at 100%, 50%, and 30% exposure, respectively (p = 0.002). In low severity disease, sensitivity also tended to increase as visceral fat increased (ORs per 1000 cm3 increase in visceral fat: 1.32, 1.31, and 1.18, p = 0.010, 0.016, and 0.100, at 100%, 50%, and 30% exposure). CONCLUSIONS: While the interaction is complex, VA plays a role in detecting and characterizing small bowel Crohn's disease when exposure is altered, particularly in low severity disease.


Assuntos
Doença de Crohn , Enteropatias , Humanos , Doença de Crohn/diagnóstico por imagem , Gordura Intra-Abdominal/diagnóstico por imagem , Estudos Prospectivos , Tomografia Computadorizada por Raios X/métodos
7.
Am J Epidemiol ; 192(4): 632-643, 2023 04 06.
Artigo em Inglês | MEDLINE | ID: mdl-36549904

RESUMO

In diagnostic medicine, the true disease status of a patient is often represented on an ordinal scale-for example, cancer stage (0, I, II, III, or IV) or coronary artery disease severity measured using the Coronary Artery Disease Reporting and Data System (CAD-RADS) scale (none, minimal, mild, moderate, severe, or occluded). With advances in quantitation of diagnostic images and in artificial intelligence (AI), both supervised and unsupervised algorithms are being developed to help physicians correctly grade disease. Most of the diagnostic accuracy literature deals with binary disease status (disease present or absent); however, tests diagnosing ordinal-scaled diseases should not be reduced to a binary status just to simplify diagnostic accuracy testing. In this paper, we propose different characterizations of ordinal-scale accuracy for different clinical use scenarios, along with methods for comparing tests. In the simplest scenario, just the proportion of correct grades is considered; other scenarios address the magnitude and direction of misgrading; and at the other extreme, a weighted accuracy measure with weights based on the relative costs of different types of misgrading is presented. The various scenarios are illustrated using a coronary artery disease example where the accuracy of AI algorithms in providing patients with the correct CAD-RADS grade is assessed.


Assuntos
Doença da Artéria Coronariana , Humanos , Angiografia Coronária/métodos , Inteligência Artificial , Algoritmos , Testes Diagnósticos de Rotina
8.
Radiology ; 309(1): e231092, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37815451

RESUMO

Background There is a need for reliable noninvasive methods for diagnosing and monitoring nonalcoholic fatty liver disease (NAFLD). Thus, the multidisciplinary Non-invasive Biomarkers of Metabolic Liver disease (NIMBLE) consortium was formed to identify and advance the regulatory qualification of NAFLD imaging biomarkers. Purpose To determine the different-day same-scanner repeatability coefficient of liver MRI biomarkers in patients with NAFLD at risk for steatohepatitis. Materials and Methods NIMBLE 1.2 is a prospective, observational, single-center short-term cross-sectional study (October 2021 to June 2022) in adults with NAFLD across a spectrum of low, intermediate, and high likelihood of advanced fibrosis as determined according to the fibrosis based on four factors (FIB-4) index. Participants underwent up to seven MRI examinations across two visits less than or equal to 7 days apart. Standardized imaging protocols were implemented with six MRI scanners from three vendors at both 1.5 T and 3 T, with central analysis of the data performed by an independent reading center (University of California, San Diego). Trained analysts, who were blinded to clinical data, measured the MRI proton density fat fraction (PDFF), liver stiffness at MR elastography (MRE), and visceral adipose tissue (VAT) for each participant. Point estimates and CIs were calculated using χ2 distribution and statistical modeling for pooled repeatability measures. Results A total of 17 participants (mean age, 58 years ± 8.5 [SD]; 10 female) were included, of which seven (41.2%), six (35.3%), and four (23.5%) participants had a low, intermediate, or high likelihood of advanced fibrosis, respectively. The different-day same-scanner mean measurements were 13%-14% for PDFF, 6.6 L for VAT, and 3.15 kPa for two-dimensional MRE stiffness. The different-day same-scanner repeatability coefficients were 0.22 L (95% CI: 0.17, 0.29) for VAT, 0.75 kPa (95% CI: 0.6, 0.99) for MRE stiffness, 1.19% (95% CI: 0.96, 1.61) for MRI PDFF using magnitude reconstruction, 1.56% (95% CI: 1.26, 2.07) for MRI PDFF using complex reconstruction, and 19.7% (95% CI: 15.8, 26.2) for three-dimensional MRE shear modulus. Conclusion This preliminary study suggests that thresholds of 1.2%-1.6%, 0.22 L, and 0.75 kPa for MRI PDFF, VAT, and MRE, respectively, should be used to discern measurement error from real change in patients with NAFLD. ClinicalTrials.gov registration no. NCT05081427 © RSNA, 2023 Supplemental material is available for this article. See also the editorial by Kozaka and Matsui in this issue.


Assuntos
Técnicas de Imagem por Elasticidade , Hepatopatia Gordurosa não Alcoólica , Adulto , Feminino , Humanos , Pessoa de Meia-Idade , Biomarcadores , Estudos Transversais , Técnicas de Imagem por Elasticidade/métodos , Fibrose , Fígado/diagnóstico por imagem , Fígado/patologia , Imageamento por Ressonância Magnética/métodos , Hepatopatia Gordurosa não Alcoólica/diagnóstico por imagem , Hepatopatia Gordurosa não Alcoólica/patologia , Estudos Prospectivos
9.
J Neurooncol ; 163(3): 647-655, 2023 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-37341842

RESUMO

PURPOSE: Distinguishing radiation necrosis from tumor progression among patients with brain metastases previously treated with stereotactic radiosurgery represents a common diagnostic challenge. We performed a prospective pilot study to determine whether PET/CT with 18F-fluciclovine, a widely available amino acid PET radiotracer, repurposed intracranially, can accurately diagnose equivocal lesions. METHODS: Adults with brain metastases previously treated with radiosurgery presenting with a follow-up tumor-protocol MRI brain equivocal for radiation necrosis versus tumor progression underwent an 18F-fluciclovine PET/CT of the brain within 30 days. The reference standard for final diagnosis consisted of clinical follow-up until multidisciplinary consensus or tissue confirmation. RESULTS: Of 16 patients imaged from 7/2019 to 11/2020, 15 subjects were evaluable with 20 lesions (radiation necrosis, n = 16; tumor progression, n = 4). Higher SUVmax statistically significantly predicted tumor progression (AUC = 0.875; p = 0.011). Lesion SUVmean (AUC = 0.875; p = 0.018), SUVpeak (AUC = 0.813; p = 0.007), and SUVpeak-to-normal-brain (AUC = 0.859; p = 0.002) also predicted tumor progression, whereas SUVmax-to-normal-brain (p = 0.1) and SUVmean-to-normal-brain (p = 0.5) did not. Qualitative visual scores were significant predictors for readers 1 (AUC = 0.750; p < 0.001) and 3 (AUC = 0.781; p = 0.045), but not for reader 2 (p = 0.3). Visual interpretations were significant predictors for reader 1 (AUC = 0.898; p = 0.012) but not for reader 2 (p = 0.3) or 3 (p = 0.2). CONCLUSIONS: In this prospective pilot study of patients with brain metastases previously treated with radiosurgery presenting with a contemporary MRI brain with a lesion equivocal for radiation necrosis versus tumor progression, 18F-fluciclovine PET/CT repurposed intracranially demonstrated encouraging diagnostic accuracy, supporting the pursuit of larger clinical trials which will be necessary to establish diagnostic criteria and performance.


Assuntos
Neoplasias Encefálicas , Radiocirurgia , Adulto , Humanos , Tomografia por Emissão de Pósitrons combinada à Tomografia Computadorizada/métodos , Radiocirurgia/efeitos adversos , Projetos Piloto , Estudos Prospectivos , Neoplasias Encefálicas/diagnóstico por imagem , Neoplasias Encefálicas/radioterapia , Neoplasias Encefálicas/etiologia , Necrose/diagnóstico por imagem , Necrose/etiologia
10.
Radiology ; 303(1): 26-34, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35166584

RESUMO

The design and analysis of multireader multicase (MRMC) studies are quite challenging. These studies differ from most medical studies because they need a reference standard and sampling from two populations (ie, reader and patient populations). They are quite expensive to conduct, requiring a good deal of readers' time for image interpretation. One common problem is the use of imperfect reference standards, often correlated with the test or tests being evaluated. Another common issue is oversimplification of the multidimensional MRMC data. In this study, the fundamentals of MRMC study design and analysis are reviewed. The goal is to provide investigators with a guide to the fundamentals of MRMC design and analysis, with references to more detailed discussions. In addition, readers are updated on newer areas of research, including correction for studies with multiple diagnostic accuracy end points and adjustment for location bias.


Assuntos
Diagnóstico por Imagem , Projetos de Pesquisa , Humanos , Curva ROC , Sensibilidade e Especificidade
11.
Magn Reson Med ; 87(3): 1184-1206, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-34825741

RESUMO

On behalf of the International Society for Magnetic Resonance in Medicine (ISMRM) Quantitative MR Study Group, this article provides an overview of considerations for the development, validation, qualification, and dissemination of quantitative MR (qMR) methods. This process is framed in terms of two central technical performance properties, i.e., bias and precision. Although qMR is confounded by undesired effects, methods with low bias and high precision can be iteratively developed and validated. For illustration, two distinct qMR methods are discussed throughout the manuscript: quantification of liver proton-density fat fraction, and cardiac T1 . These examples demonstrate the expansion of qMR methods from research centers toward widespread clinical dissemination. The overall goal of this article is to provide trainees, researchers, and clinicians with essential guidelines for the development and validation of qMR methods, as well as an understanding of necessary steps and potential pitfalls for the dissemination of quantitative MR in research and in the clinic.


Assuntos
Imageamento por Ressonância Magnética , Terapia com Prótons , Viés , Espectroscopia de Ressonância Magnética , Prótons , Reprodutibilidade dos Testes
12.
J Cardiovasc Magn Reson ; 24(1): 12, 2022 02 22.
Artigo em Inglês | MEDLINE | ID: mdl-35193584

RESUMO

BACKGROUND: Significant aortic regurgitation (AR) leads to left ventricular (LV) remodeling; however, little data exist regarding sex-based differences in LV remodeling in this setting. We sought to compare LV remodeling and AR severity, assessed by echocardiography and cardiovascular magnetic resonance (CMR), to discern sex-based differences. METHODS: Patients with ≥ moderate chronic AR by echocardiography who underwent CMR within 90 days between December 2005 and October 2015 were included. Nonlinear regression models were built to assess the effect of AR regurgitant fraction (RF) on LV remodeling. A generalized linear model and Bland Altman analyses were constructed to evaluate differences between CMR and echocardiography. Referral for surgical intervention based on symptoms and LV remodeling was evaluated. RESULTS: Of the 243 patients (48.3 ± 16.6 years, 58 (24%) female), 119 (49%) underwent surgical intervention with a primary indication of severe AR, 97 (82%) men, 22 (18%) women. Significant sex differences in LV remodeling emerged on CMR. Women demonstrated significantly smaller LV end-diastolic volume index (LVEDVI) (96.8 ml/m2 vs 125.6 ml/m2, p < 0.001), LV end-systolic volume index (LVESVI) (41.1 vs 54.5 ml/m2, p < 0.001), blunted LV dilation in the setting of increasing AR severity (LVEDVI p value < 0.001, LVESVI p value 0.011), and LV length indexed (8.32 vs 9.69 cm, p < 0.001). On Bland Altman analysis, a significant interaction with sex and LV diameters was evident, demonstrating a significant increase in the difference between CMR and echocardiography measurements as the LV enlarged in women: LVEDVI (p = 0.006), LVESVI (p < 0.001), such that echocardiographic measurements increasingly underestimated LV diameters in women as the LV enlarged. LV length was higher for males with a linear effect from RF (p < 0.001), with LV length increasing at a higher rate with increasing RF for males compared to females (two-way interaction with sex p = 0.005). Sphericity volume index was higher for men after adjusting for a relative wall thickness (p = 0.033). CONCLUSIONS: CMR assessment of chronic AR revealed significant sex differences in LV remodeling and significant echocardiographic underestimation of LV dilation, particularly in women. Defining optimal sex-based CMR thresholds for surgical referral should be further developed. TRIAL REGISTRATION: NA.


Assuntos
Insuficiência da Valva Aórtica , Insuficiência da Valva Aórtica/diagnóstico por imagem , Insuficiência da Valva Aórtica/cirurgia , Ecocardiografia , Feminino , Humanos , Masculino , Valor Preditivo dos Testes , Caracteres Sexuais , Função Ventricular Esquerda , Remodelação Ventricular
13.
Radiology ; 301(2): 423-432, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34491127

RESUMO

MRI-based cartilage compositional analysis shows biochemical and microstructural changes at early stages of osteoarthritis before changes become visible with structural MRI sequences and arthroscopy. This could help with early diagnosis, risk assessment, and treatment monitoring of osteoarthritis. Spin-lattice relaxation time constant in rotating frame (T1ρ) and T2 mapping are the MRI techniques best established for assessing cartilage composition. Only T2 mapping is currently commercially available, which is sensitive to water, collagen content, and orientation of collagen fibers, whereas T1ρ is more sensitive to proteoglycan content. Clinical application of cartilage compositional imaging is limited by high variability and suboptimal reproducibility of the biomarkers, which was the motivation for creating the Quantitative Imaging Biomarkers Alliance (QIBA) Profile for cartilage compositional imaging by the Musculoskeletal Biomarkers Committee of the QIBA. The profile aims at providing recommendations to improve reproducibility and to standardize cartilage compositional imaging. The QIBA Profile provides two complementary claims (summary statements of the technical performance of the quantitative imaging biomarkers that are being profiled) regarding the reproducibility of biomarkers. First, cartilage T1ρ and T2 values are measurable at 3.0-T MRI with a within-subject coefficient of variation of 4%-5%. Second, a measured increase or decrease in T1ρ and T2 of 14% or more indicates a minimum detectable change with 95% confidence. If only an increase in T1ρ and T2 values is expected (progressive cartilage degeneration), then an increase of 12% represents a minimum detectable change over time. The QIBA Profile provides recommendations for clinical researchers, clinicians, and industry scientists pertaining to image data acquisition, analysis, and interpretation and assessment procedures for T1ρ and T2 cartilage imaging and test-retest conformance. This special report aims to provide the rationale for the proposed claims, explain the content of the QIBA Profile, and highlight the future needs and developments for MRI-based cartilage compositional imaging for risk prediction, early diagnosis, and treatment monitoring of osteoarthritis.


Assuntos
Cartilagem Articular/diagnóstico por imagem , Joelho/diagnóstico por imagem , Imageamento por Ressonância Magnética/métodos , Osteoartrite do Joelho/diagnóstico por imagem , Humanos , Reprodutibilidade dos Testes
14.
Radiology ; 298(3): 640-651, 2021 03.
Artigo em Inglês | MEDLINE | ID: mdl-33464181

RESUMO

Background Proton density fat fraction (PDFF) estimated by using chemical shift-encoded (CSE) MRI is an accepted imaging biomarker of hepatic steatosis. This work aims to promote standardized use of CSE MRI to estimate PDFF. Purpose To assess the accuracy of CSE MRI methods for estimating PDFF by determining the linearity and range of bias observed in a phantom. Materials and Methods In this prospective study, a commercial phantom with 12 vials of known PDFF values were shipped across nine U.S. centers. The phantom underwent 160 independent MRI examinations on 27 1.5-T and 3.0-T systems from three vendors. Two three-dimensional CSE MRI protocols with minimal T1 bias were included: vendor and standardized. Each vendor's confounder-corrected complex or hybrid magnitude-complex based reconstruction algorithm was used to generate PDFF maps in both protocols. The Siemens reconstruction required a configuration change to correct for water-fat swaps in the phantom. The MRI PDFF values were compared with the known PDFF values by using linear regression with mixed-effects modeling. The 95% CIs were calculated for the regression slope (ie, proportional bias) and intercept (ie, constant bias) and compared with the null hypothesis (slope = 1, intercept = 0). Results Pooled regression slope for estimated PDFF values versus phantom-derived reference PDFF values was 0.97 (95% CI: 0.96, 0.98) in the biologically relevant 0%-47.5% PDFF range. The corresponding pooled intercept was -0.27% (95% CI: -0.50%, -0.05%). Across vendors, slope ranges were 0.86-1.02 (vendor protocols) and 0.97-1.0 (standardized protocol) at 1.5 T and 0.91-1.01 (vendor protocols) and 0.87-1.01 (standardized protocol) at 3.0 T. The intercept ranges (absolute PDFF percentage) were -0.65% to 0.18% (vendor protocols) and -0.69% to -0.17% (standardized protocol) at 1.5 T and -0.48% to 0.10% (vendor protocols) and -0.78% to -0.21% (standardized protocol) at 3.0 T. Conclusion Proton density fat fraction estimation derived from three-dimensional chemical shift-encoded MRI in a commercial phantom was accurate across vendors, imaging centers, and field strengths, with use of the vendors' product acquisition and reconstruction software. © RSNA, 2021 See also the editorial by Dyke in this issue.


Assuntos
Fígado Gorduroso/diagnóstico por imagem , Imageamento por Ressonância Magnética/métodos , Imagens de Fantasmas , Algoritmos , Biomarcadores , Humanos , Processamento de Imagem Assistida por Computador , Imageamento Tridimensional , Estudos Prospectivos , Prótons , Reprodutibilidade dos Testes , Estados Unidos
15.
Eur Radiol ; 31(10): 7566-7574, 2021 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-33768291

RESUMO

OBJECTIVES: Proton density fat fraction (PDFF) is a validated biomarker of tissue fat quantification. However, validation has been limited to single-center or multi-center series using non-FDA-approved software. Thus, we assess the bias, linearity, and long-term reproducibility of PDFF obtained using commercial PDFF packages from several vendors. METHODS: Over 35 months, 438 subjects and 16 volunteers from a multi-center observational trial underwent PDFF MRI measurements using a 3-T MR system from one of three different vendors or a 1.5-T system from one vendor. Fat-water phantom sets were measured as part of each subject's examination. Manual region-of-interest measurements on the %fat image, then cross-sectional bias, linearity, and long-term reproducibility were assessed. RESULTS: Three hundred ninety-two phantom measurements were evaluable (90%). Bias ranged from 2.4 to - 3.8% for the lowest to the highest weight %fat. Regression fits of PDFF against synthesis weight %fat showed negligible non-linear effects and a linear slope of 0.94 (95% confidence interval: 0.938, 0.947). We observed significant vendor (p < 0.001) and field strength (p < 0.001) differences in bias and longitudinal variability. When the results were pooled across sites, vendors, and field strengths, the estimated reproducibility coefficient was 6.93% (95% CI: 6.25%, 7.81%). CONCLUSIONS: This study demonstrated good linearity, accuracy, and reproducibility for all investigated manufacturers and field strengths. However, significant vendor-dependent and field strength-dependent bias were found. While longitudinal PDFF measurements may be made using different field strength or vendor MR systems, if the MR system is not the same, based on these results, only PDFF changes ≥ 7% can be considered a true difference. KEY POINTS: • Phantom fat fraction (PDFF) MRI measurements over 35 months demonstrated good linearity, accuracy, and reproducibility for the vendor systems investigated. • Non-linear effects were negligible (linear slope of 0.94) over 0-100% fat; however, significant vendor (p < 0.001) and field strength (p<0.001) differences in bias and longitudinal variability were identified. Bias ranged from 2.4 to - 3.8% for 0-100 weight% fat, respectively. • Measurement bias could affect the accuracy of PDFF in clinical use. As the reproducibility coefficient was 6.93%, only greater changes in % fat can be considered true differences when making longitudinal PDFF measurements on different MR systems.


Assuntos
Imageamento por Ressonância Magnética , Prótons , Estudos Transversais , Humanos , Fígado , Imagens de Fantasmas , Reprodutibilidade dos Testes
16.
Eur Radiol ; 31(8): 6001-6012, 2021 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-33492473

RESUMO

Existing quantitative imaging biomarkers (QIBs) are associated with known biological tissue characteristics and follow a well-understood path of technical, biological and clinical validation before incorporation into clinical trials. In radiomics, novel data-driven processes extract numerous visually imperceptible statistical features from the imaging data with no a priori assumptions on their correlation with biological processes. The selection of relevant features (radiomic signature) and incorporation into clinical trials therefore requires additional considerations to ensure meaningful imaging endpoints. Also, the number of radiomic features tested means that power calculations would result in sample sizes impossible to achieve within clinical trials. This article examines how the process of standardising and validating data-driven imaging biomarkers differs from those based on biological associations. Radiomic signatures are best developed initially on datasets that represent diversity of acquisition protocols as well as diversity of disease and of normal findings, rather than within clinical trials with standardised and optimised protocols as this would risk the selection of radiomic features being linked to the imaging process rather than the pathology. Normalisation through discretisation and feature harmonisation are essential pre-processing steps. Biological correlation may be performed after the technical and clinical validity of a radiomic signature is established, but is not mandatory. Feature selection may be part of discovery within a radiomics-specific trial or represent exploratory endpoints within an established trial; a previously validated radiomic signature may even be used as a primary/secondary endpoint, particularly if associations are demonstrated with specific biological processes and pathways being targeted within clinical trials. KEY POINTS: • Data-driven processes like radiomics risk false discoveries due to high-dimensionality of the dataset compared to sample size, making adequate diversity of the data, cross-validation and external validation essential to mitigate the risks of spurious associations and overfitting. • Use of radiomic signatures within clinical trials requires multistep standardisation of image acquisition, image analysis and data mining processes. • Biological correlation may be established after clinical validation but is not mandatory.


Assuntos
Radiologia , Tomografia Computadorizada por Raios X , Biomarcadores , Consenso , Humanos , Processamento de Imagem Assistida por Computador
17.
Clin Trials ; 18(2): 197-206, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33426918

RESUMO

BACKGROUND/AIMS: Quantitative imaging biomarkers have the potential to detect change in disease early and noninvasively, providing information about the diagnosis and prognosis of a patient, aiding in monitoring disease, and informing when therapy is effective. In clinical trials testing new therapies, there has been a tendency to ignore the variability and bias in quantitative imaging biomarker measurements. Unfortunately, this can lead to underpowered studies and incorrect estimates of the treatment effect. We illustrate the problem when non-constant measurement bias is ignored and show how treatment effect estimates can be corrected. METHODS: Monte Carlo simulation was used to assess the coverage of 95% confidence intervals for the treatment effect when non-constant bias is ignored versus when the bias is corrected for. Three examples are presented to illustrate the methods: doubling times of lung nodules, rates of change in brain atrophy in progressive multiple sclerosis clinical trials, and changes in proton-density fat fraction in trials for patients with nonalcoholic fatty liver disease. RESULTS: Incorrectly assuming that the measurement bias is constant leads to 95% confidence intervals for the treatment effect with reduced coverage (<95%); the coverage is especially reduced when the quantitative imaging biomarker measurements have good precision and/or there is a large treatment effect. Estimates of the measurement bias from technical performance validation studies can be used to correct the confidence intervals for the treatment effect. CONCLUSION: Technical performance validation studies of quantitative imaging biomarkers are needed to supplement clinical trial data to provide unbiased estimates of the treatment effect.


Assuntos
Ensaios Clínicos como Assunto , Diagnóstico por Imagem , Projetos de Pesquisa , Viés , Biomarcadores , Encéfalo/diagnóstico por imagem , Humanos , Pulmão/diagnóstico por imagem , Método de Monte Carlo , Esclerose Múltipla/diagnóstico por imagem
18.
J Ultrasound Med ; 40(3): 569-581, 2021 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-33410183

RESUMO

OBJECTIVES: To quantify the bias of shear wave speed (SWS) measurements between different commercial ultrasonic shear elasticity systems and a magnetic resonance elastography (MRE) system in elastic and viscoelastic phantoms. METHODS: Two elastic phantoms, representing healthy through fibrotic liver, were measured with 5 different ultrasound platforms, and 3 viscoelastic phantoms, representing healthy through fibrotic liver tissue, were measured with 12 different ultrasound platforms. Measurements were performed with different systems at different sites, at 3 focal depths, and with different appraisers. The SWS bias across the systems was quantified as a function of the system, site, focal depth, and appraiser. A single MRE research system was also used to characterize these phantoms using discrete frequencies from 60 to 500 Hz. RESULTS: The SWS from different systems had mean difference 95% confidence intervals of ±0.145 m/s (±9.6%) across both elastic phantoms and ± 0.340 m/s (±15.3%) across the viscoelastic phantoms. The focal depth and appraiser were less significant sources of SWS variability than the system and site. Magnetic resonance elastography best matched the ultrasonic SWS in the viscoelastic phantoms using a 140 Hz source but had a - 0.27 ± 0.027-m/s (-12.2% ± 1.2%) bias when using the clinically implemented 60-Hz vibration source. CONCLUSIONS: Shear wave speed reconstruction across different manufacturer systems is more consistent in elastic than viscoelastic phantoms, with a mean difference bias of < ±10% in all cases. Magnetic resonance elastographic measurements in the elastic and viscoelastic phantoms best match the ultrasound systems with a 140-Hz excitation but have a significant negative bias operating at 60 Hz. This study establishes a foundation for meaningful comparison of SWS measurements made with different platforms.


Assuntos
Técnicas de Imagem por Elasticidade , Biomarcadores , Elasticidade , Humanos , América do Norte , Imagens de Fantasmas
19.
Skeletal Radiol ; 50(4): 693-703, 2021 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-32948903

RESUMO

OBJECTIVE: To evaluate the feasibility of producing 2-dimensional (2D) virtual noncontrast images and 3-dimensional (3D) bone models from dual-energy computed tomography (DECT) arthrograms and to determine whether this is best accomplished using 190 keV virtual monoenergetic images (VMI) or virtual unenhanced (VUE) images. MATERIALS AND METHODS: VMI and VUE images were retrospectively reconstructed from patients with internal derangement of the shoulder or knee joint who underwent DECT arthrography between September 2017 and August 2019. A region of interest was placed in the area of brightest contrast, and the mean attenuation (in Hounsfield units [HUs]) was recorded. Two blinded musculoskeletal radiologists qualitatively graded the 2D images and 3D models using scores ranging from 0 to 3 (0 considered optimal). RESULTS: Twenty-six patients (mean age ± SD, 57.5 ± 16.8 years; 6 women) were included in the study. The contrast attenuation on VUE images (overall mean ± SD, 10.5 ± 16.4 HU; knee, 19.3 ± 10.7 HU; shoulder, 5.0 ± 17.2 HU) was significantly lower (p < 0.001 for all comparisons) than on VMI (overall mean ± SD, 107.7 ± 43.8 HU; knee, 104.6 ± 31.1 HU; shoulder, 109.6 ± 51.0 HU). The proportion of cases with optimal scores (0 or 1) was significantly higher with VUE than with VMI for both 2D and 3D images (p < 0.001). CONCLUSIONS: DECT arthrography can be used to produce 2D virtual noncontrast images and to generate 3D bone models. The VUE technique is superior to VMI in producing virtual noncontrast images.


Assuntos
Artrografia , Imagem Radiográfica a Partir de Emissão de Duplo Fóton , Estudos de Viabilidade , Feminino , Humanos , Interpretação de Imagem Radiográfica Assistida por Computador , Estudos Retrospectivos , Razão Sinal-Ruído , Tomografia Computadorizada por Raios X
20.
Skeletal Radiol ; 50(5): 955-965, 2021 May.
Artigo em Inglês | MEDLINE | ID: mdl-33037447

RESUMO

OBJECTIVE: To determine whether a simulated low-dose metal artifact reduction (MAR) CT technique is comparable with a clinical dose MAR technique for shoulder arthroplasty evaluation. MATERIALS AND METHODS: Two shoulder arthroplasties in cadavers and 25 shoulder arthroplasties in patients were scanned using a clinical dose (140 kVp, 300 qrmAs); cadavers were also scanned at half dose (140 kVp, 150 qrmAs). Images were reconstructed using a MAR CT algorithm at full dose and a noise-insertion algorithm simulating 50% dose reduction. For the actual and simulated half-dose cadaver scans, differences in SD for regions of interest were assessed, and streak artifact near the arthroplasty was graded by 3 blinded readers. Simulated half-dose scans were compared with full-dose scans in patients by measuring differences in implant position and by comparing readers' grades of periprosthetic osteolysis and muscle atrophy. RESULTS: The mean difference in SD between actual and simulated half-dose methods was 2.42 HU (95% CI [1.4, 3.4]). No differences in streak artifact grades were seen in 13/18 (72.2%) comparisons in cadavers. In patients, differences in implant position measurements were within 1° or 1 mm in 149/150 (99.3%) measurements. The inter-reader agreement rates were nearly identical when readers were using full-dose (77.3% [232/300] for osteolysis and 76.9% [173/225] for muscle atrophy) and simulated half-dose (76.7% [920/1200] for osteolysis and 74.0% [666/900] for muscle atrophy) scans. CONCLUSION: A simulated half-dose MAR CT technique is comparable both quantitatively and qualitatively with a standard-dose technique for shoulder arthroplasty evaluation, demonstrating that this technique could be used to reduce dose in arthroplasty imaging.


Assuntos
Artefatos , Tomografia Computadorizada por Raios X , Algoritmos , Artroplastia , Cadáver , Humanos , Metais , Imagens de Fantasmas
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa