Pesquisa | BVS Integralidade em Saúde

1.

THA-Net: A Deep Learning Solution for Next-Generation Templating and Patient-specific Surgical Execution.

Rouzrokh, Pouria; Khosravi, Bardia; Mickley, John P; Erickson, Bradley J; Taunton, Michael J; Wyles, Cody C.

J Arthroplasty ; 39(3): 727-733.e4, 2024 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-37619804

RESUMO

BACKGROUND: This study introduces THA-Net, a deep learning inpainting algorithm for simulating postoperative total hip arthroplasty (THA) radiographs from a single preoperative pelvis radiograph input, while being able to generate predictions either unconditionally (algorithm chooses implants) or conditionally (surgeon chooses implants). METHODS: The THA-Net is a deep learning algorithm which receives an input preoperative radiograph and subsequently replaces the target hip joint with THA implants to generate a synthetic yet realistic postoperative radiograph. We trained THA-Net on 356,305 pairs of radiographs from 14,357 patients from a single institution's total joint registry and evaluated the validity (quality of surgical execution) and realism (ability to differentiate real and synthetic radiographs) of its outputs against both human-based and software-based criteria. RESULTS: The surgical validity of synthetic postoperative radiographs was significantly higher than their real counterparts (mean difference: 0.8 to 1.1 points on 10-point Likert scale, P < .001), but they were not able to be differentiated in terms of realism in blinded expert review. Synthetic images showed excellent validity and realism when analyzed with already validated deep learning models. CONCLUSION: We developed a THA next-generation templating tool that can generate synthetic radiographs graded higher on ultimate surgical execution than real radiographs from training data. Further refinement of this tool may potentiate patient-specific surgical planning and enable technologies such as robotics, navigation, and augmented reality (an online demo of THA-Net is available at: https://demo.osail.ai/tha_net).

Assuntos

Artroplastia de Quadril , Aprendizado Profundo , Prótese de Quadril , Humanos , Artroplastia de Quadril/métodos , Articulação do Quadril/diagnóstico por imagem , Articulação do Quadril/cirurgia , Radiografia , Estudos Retrospectivos

2.

THA-AID: Deep Learning Tool for Total Hip Arthroplasty Automatic Implant Detection With Uncertainty and Outlier Quantification.

Rouzrokh, Pouria; Mickley, John P; Khosravi, Bardia; Faghani, Shahriar; Moassefi, Mana; Schulz, William R; Erickson, Bradley J; Taunton, Michael J; Wyles, Cody C.

J Arthroplasty ; 39(4): 966-973.e17, 2024 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-37770007

RESUMO

BACKGROUND: Revision total hip arthroplasty (THA) requires preoperatively identifying in situ implants, a time-consuming and sometimes unachievable task. Although deep learning (DL) tools have been attempted to automate this process, existing approaches are limited by classifying few femoral and zero acetabular components, only classify on anterior-posterior (AP) radiographs, and do not report prediction uncertainty or flag outlier data. METHODS: This study introduces Total Hip Arhtroplasty Automated Implant Detector (THA-AID), a DL tool trained on 241,419 radiographs that identifies common designs of 20 femoral and 8 acetabular components from AP, lateral, or oblique views and reports prediction uncertainty using conformal prediction and outlier detection using a custom framework. We evaluated THA-AID using internal, external, and out-of-domain test sets and compared its performance with human experts. RESULTS: THA-AID achieved internal test set accuracies of 98.9% for both femoral and acetabular components with no significant differences based on radiographic view. The femoral classifier also achieved 97.0% accuracy on the external test set. Adding conformal prediction increased true label prediction by 0.1% for acetabular and 0.7 to 0.9% for femoral components. More than 99% of out-of-domain and >89% of in-domain outlier data were correctly identified by THA-AID. CONCLUSIONS: The THA-AID is an automated tool for implant identification from radiographs with exceptional performance on internal and external test sets and no decrement in performance based on radiographic view. Importantly, this is the first study in orthopedics to our knowledge including uncertainty quantification and outlier detection of a DL model.

Assuntos

Artroplastia de Quadril , Aprendizado Profundo , Prótese de Quadril , Humanos , Incerteza , Acetábulo/cirurgia , Estudos Retrospectivos

3.

Quantifying Uncertainty in Deep Learning of Radiologic Images.

Faghani, Shahriar; Moassefi, Mana; Rouzrokh, Pouria; Khosravi, Bardia; Baffour, Francis I; Ringler, Michael D; Erickson, Bradley J.

Radiology ; 308(2): e222217, 2023 08.

Artigo em Inglês | MEDLINE | ID: mdl-37526541

RESUMO

In recent years, deep learning (DL) has shown impressive performance in radiologic image analysis. However, for a DL model to be useful in a real-world setting, its confidence in a prediction must also be known. Each DL model's output has an estimated probability, and these estimated probabilities are not always reliable. Uncertainty represents the trustworthiness (validity) of estimated probabilities. The higher the uncertainty, the lower the validity. Uncertainty quantification (UQ) methods determine the uncertainty level of each prediction. Predictions made without UQ methods are generally not trustworthy. By implementing UQ in medical DL models, users can be alerted when a model does not have enough information to make a confident decision. Consequently, a medical expert could reevaluate the uncertain cases, which would eventually lead to gaining more trust when using a model. This review focuses on recent trends using UQ methods in DL radiologic image analysis within a conceptual framework. Also discussed in this review are potential applications, challenges, and future directions of UQ in DL radiologic image analysis.

Assuntos

Aprendizado Profundo , Radiologia , Humanos , Incerteza , Processamento de Imagem Assistida por Computador

4.

A deep learning algorithm for detecting lytic bone lesions of multiple myeloma on CT.

Faghani, Shahriar; Baffour, Francis I; Ringler, Michael D; Hamilton-Cave, Matthew; Rouzrokh, Pouria; Moassefi, Mana; Khosravi, Bardia; Erickson, Bradley J.

Skeletal Radiol ; 52(1): 91-98, 2023 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-35980454

RESUMO

BACKGROUND: Whole-body low-dose CT is the recommended initial imaging modality to evaluate bone destruction as a result of multiple myeloma. Accurate interpretation of these scans to detect small lytic bone lesions is time intensive. A functional deep learning) algorithm to detect lytic lesions on CTs could improve the value of these CTs for myeloma imaging. Our objectives were to develop a DL algorithm and determine its performance at detecting lytic lesions of multiple myeloma. METHODS: Axial slices (2-mm section thickness) from whole-body low-dose CT scans of subjects with biochemically confirmed plasma cell dyscrasias were included in the study. Data were split into train and test sets at the patient level targeting a 90%/10% split. Two musculoskeletal radiologists annotated lytic lesions on the images with bounding boxes. Subsequently, we developed a two-step deep learning model comprising bone segmentation followed by lesion detection. Unet and "You Look Only Once" (YOLO) models were used as bone segmentation and lesion detection algorithms, respectively. Diagnostic performance was determined using the area under the receiver operating characteristic curve (AUROC). RESULTS: Forty whole-body low-dose CTs from 40 subjects yielded 2193 image slices. A total of 5640 lytic lesions were annotated. The two-step model achieved a sensitivity of 91.6% and a specificity of 84.6%. Lesion detection AUROC was 90.4%. CONCLUSION: We developed a deep learning model that detects lytic bone lesions of multiple myeloma on whole-body low-dose CTs with high performance. External validation is required prior to widespread adoption in clinical practice.

Assuntos

Aprendizado Profundo , Mieloma Múltiplo , Osteólise , Humanos , Mieloma Múltiplo/diagnóstico por imagem , Mieloma Múltiplo/patologia , Algoritmos , Tomografia Computadorizada por Raios X/métodos

5.

Developing and validating a national set of standards for undergraduate medical education using the WFME framework: the experience of an accreditation system in Iran.

Gandomkar, Roghayeh; Changiz, Tahereh; Omid, Athar; Alizadeh, Mahasti; Khazaei, Majid; Heidarzadah, Abtin; Rouzrokh, Pouria; Amini, Mitra; Honarpisheh, Hamid; Laripour, Reza; Abedi, Farshid; Sabet, Babak; Mirzazadeh, Azim.

BMC Med Educ ; 23(1): 379, 2023 May 24.

Artigo em Inglês | MEDLINE | ID: mdl-37226139

RESUMO

BACKGROUND: Defining standards is the first step toward quality assurance and improvement of educational programs. This study aimed at developing and validating a set of national standards for the Undergraduate Medical Education (UME) program through an accreditation system in Iran using the World Federation for Medical Education (WFME) framework. METHODS: The first draft of standards was prepared through consultative workshops with the participation of different UME program stakeholders. Subsequently, standards were sent to medical schools and UME directors were asked to complete a web-based survey. The content validity index at the item level (I-CVI) was computed using criteria including clarity, relevance, optimization and evaluability for each standard. Afterward, a full-day consultative workshop was held and a wide range of UME stakeholders across the country (n = 150) discussed the survey results and made corrections to standards. RESULTS: Analysis of survey results showed that relevance criteria had the best CVI as only 15 (13%) standards demonstrated CVI < 0.78. More than two-thirds (71%) and a half (55%) of standards showed CVI < 0.78 for optimization and evaluability criteria. The final set of UME national standards was structured in 9 areas, 24 sub-areas, 82 basic and 40 quality development standards, and 84 annotations. CONCLUSIONS: We developed and validated national standards as a framework to ensure the quality of UME training with input from UME stakeholders. We used WFME standards as a benchmark while addressing local requirements. The standards and participatory approach to developing standards may guide relevant institutions.

Assuntos

Educação de Graduação em Medicina , Educação Médica , Humanos , Irã (Geográfico) , Acreditação , Benchmarking

6.

A Deep Learning Tool for Automated Landmark Annotation on Hip and Pelvis Radiographs.

Mulford, Kellen L; Johnson, Quinn J; Mujahed, Tala; Khosravi, Bardia; Rouzrokh, Pouria; Mickley, John P; Taunton, Michael J; Wyles, Cody C.

J Arthroplasty ; 38(10): 2024-2031.e1, 2023 10.

Artigo em Inglês | MEDLINE | ID: mdl-37236288

RESUMO

BACKGROUND: Automatic methods for labeling and segmenting pelvis structures can improve the efficiency of clinical and research workflows and reduce the variability introduced with manual labeling. The purpose of this study was to develop a single deep learning model to annotate certain anatomical structures and landmarks on antero-posterior (AP) pelvis radiographs. METHODS: A total of 1,100 AP pelvis radiographs were manually annotated by 3 reviewers. These images included a mix of preoperative and postoperative images as well as a mix of AP pelvis and hip images. A convolutional neural network was trained to segment 22 different structures (7 points, 6 lines, and 9 shapes). Dice score, which measures overlap between model output and ground truth, was calculated for the shapes and lines structures. Euclidean distance error was calculated for point structures. RESULTS: Dice score averaged across all images in the test set was 0.88 and 0.80 for the shape and line structures, respectively. For the 7-point structures, average distance between real and automated annotations ranged from 1.9 mm to 5.6 mm, with all averages falling below 3.1 mm except for the structure labeling the center of the sacrococcygeal junction, where performance was low for both human and machine-produced labels. Blinded qualitative evaluation of human and machine produced segmentations did not reveal any drastic decrease in performance of the automatic method. CONCLUSION: We present a deep learning model for automated annotation of pelvis radiographs that flexibly handles a variety of views, contrasts, and operative statuses for 22 structures and landmarks.

Assuntos

Aprendizado Profundo , Humanos , Radiografia , Redes Neurais de Computação , Pelve/diagnóstico por imagem , Período Pós-Operatório

7.

Creating High Fidelity Synthetic Pelvis Radiographs Using Generative Adversarial Networks: Unlocking the Potential of Deep Learning Models Without Patient Privacy Concerns.

Khosravi, Bardia; Rouzrokh, Pouria; Mickley, John P; Faghani, Shahriar; Larson, A Noelle; Garner, Hillary W; Howe, Benjamin M; Erickson, Bradley J; Taunton, Michael J; Wyles, Cody C.

J Arthroplasty ; 38(10): 2037-2043.e1, 2023 10.

Artigo em Inglês | MEDLINE | ID: mdl-36535448

RESUMO

BACKGROUND: In this work, we applied and validated an artificial intelligence technique known as generative adversarial networks (GANs) to create large volumes of high-fidelity synthetic anteroposterior (AP) pelvis radiographs that can enable deep learning (DL)-based image analyses, while ensuring patient privacy. METHODS: AP pelvis radiographs with native hips were gathered from an institutional registry between 1998 and 2018. The data was used to train a model to create 512 × 512 pixel synthetic AP pelvis images. The network was trained on 25 million images produced through augmentation. A set of 100 random images (50/50 real/synthetic) was evaluated by 3 orthopaedic surgeons and 2 radiologists to discern real versus synthetic images. Two models (joint localization and segmentation) were trained using synthetic images and tested on real images. RESULTS: The final model was trained on 37,640 real radiographs (16,782 patients). In a computer assessment of image fidelity, the final model achieved an "excellent" rating. In a blinded review of paired images (1 real, 1 synthetic), orthopaedic surgeon reviewers were unable to correctly identify which image was synthetic (accuracy = 55%, Kappa = 0.11), highlighting synthetic image fidelity. The synthetic and real images showed equivalent performance when they were assessed by established DL models. CONCLUSION: This work shows the ability to use a DL technique to generate a large volume of high-fidelity synthetic pelvis images not discernible from real imaging by computers or experts. These images can be used for cross-institutional sharing and model pretraining, further advancing the performance of DL models without risk to patient data safety. LEVEL OF EVIDENCE: Level III.

Assuntos

Aprendizado Profundo , Humanos , Inteligência Artificial , Privacidade , Processamento de Imagem Assistida por Computador/métodos , Pelve/diagnóstico por imagem

8.

Frank Stinchfield Award: Creation of a Patient-Specific Total Hip Arthroplasty Periprosthetic Fracture Risk Calculator.

Wyles, Cody C; Maradit-Kremers, Hilal; Fruth, Kristin M; Larson, Dirk R; Khosravi, Bardia; Rouzrokh, Pouria; Johnson, Quinn J; Berry, Daniel J; Sierra, Rafael J; Taunton, Michael J; Abdel, Matthew P.

J Arthroplasty ; 38(7S): S2-S10, 2023 07.

Artigo em Inglês | MEDLINE | ID: mdl-36933678

RESUMO

BACKGROUND: Many risk factors have been described for periprosthetic femur fracture (PPFFx) following total hip arthroplasty (THA), yet a patient-specific risk assessment tool remains elusive. The purpose of this study was to develop a high-dimensional, patient-specific risk-stratification nomogram that allows dynamic risk modification based on operative decisions. METHODS: We evaluated 16,696 primary nononcologic THAs performed between 1998 and 2018. During a mean 6-year follow-up, 558 patients (3.3%) sustained a PPFFx. Patients were characterized by individual natural language processing-assisted chart review on nonmodifiable factors (demographics, THA indication, and comorbidities), and modifiable operative decisions (femoral fixation [cemented/uncemented], surgical approach [direct anterior, lateral, and posterior], and implant type [collared/collarless]). Multivariable Cox regression models and nomograms were developed with PPFFx as a binary outcome at 90 days, 1 year, and 5 years, postoperatively. RESULTS: Patient-specific PPFFx risk based on comorbid profile was wide-ranging from 0.4-18% at 90 days, 0.4%-20% at 1 year, and 0.5%-25% at 5 years. Among 18 evaluated patient factors, 7 were retained in multivariable analyses. The 4 significant nonmodifiable factors included the following: women (hazard ratio (HR) = 1.6), older age (HR = 1.2 per 10 years), diagnosis of osteoporosis or use of osteoporosis medications (HR = 1.7), and indication for surgery other than osteoarthritis (HR = 2.2 for fracture, HR = 1.8 for inflammatory arthritis, HR = 1.7 for osteonecrosis). The 3 modifiable surgical factors were included as follows: uncemented femoral fixation (HR = 2.5), collarless femoral implants (HR = 1.3), and surgical approach other than direct anterior (lateral HR = 2.9, posterior HR = 1.9). CONCLUSION: This patient-specific PPFFx risk calculator demonstrated a wide-ranging risk based on comorbid profile and enables surgeons to quantify risk mitigation based on operative decisions. LEVEL OF EVIDENCE: Level III, Prognostic.

Assuntos

Artroplastia de Quadril , Distinções e Prêmios , Fraturas do Fêmur , Prótese de Quadril , Fraturas Periprotéticas , Humanos , Feminino , Artroplastia de Quadril/efeitos adversos , Artroplastia de Quadril/métodos , Fraturas Periprotéticas/epidemiologia , Fraturas Periprotéticas/etiologia , Fraturas Periprotéticas/cirurgia , Prótese de Quadril/efeitos adversos , Reoperação , Fraturas do Fêmur/epidemiologia , Fraturas do Fêmur/etiologia , Fraturas do Fêmur/cirurgia , Fatores de Risco , Estudos Retrospectivos

9.

Reproducibility of Deep Learning Algorithms Developed for Medical Imaging Analysis: A Systematic Review.

Moassefi, Mana; Rouzrokh, Pouria; Conte, Gian Marco; Vahdati, Sanaz; Fu, Tianyuan; Tahmasebi, Aylin; Younis, Mira; Farahani, Keyvan; Gentili, Amilcare; Kline, Timothy; Kitamura, Felipe C; Huo, Yuankai; Kuanar, Shiba; Younis, Khaled; Erickson, Bradley J; Faghani, Shahriar.

J Digit Imaging ; 36(5): 2306-2312, 2023 10.

Artigo em Inglês | MEDLINE | ID: mdl-37407841

RESUMO

Since 2000, there have been more than 8000 publications on radiology artificial intelligence (AI). AI breakthroughs allow complex tasks to be automated and even performed beyond human capabilities. However, the lack of details on the methods and algorithm code undercuts its scientific value. Many science subfields have recently faced a reproducibility crisis, eroding trust in processes and results, and influencing the rise in retractions of scientific papers. For the same reasons, conducting research in deep learning (DL) also requires reproducibility. Although several valuable manuscript checklists for AI in medical imaging exist, they are not focused specifically on reproducibility. In this study, we conducted a systematic review of recently published papers in the field of DL to evaluate if the description of their methodology could allow the reproducibility of their findings. We focused on the Journal of Digital Imaging (JDI), a specialized journal that publishes papers on AI and medical imaging. We used the keyword "Deep Learning" and collected the articles published between January 2020 and January 2022. We screened all the articles and included the ones which reported the development of a DL tool in medical imaging. We extracted the reported details about the dataset, data handling steps, data splitting, model details, and performance metrics of each included article. We found 148 articles. Eighty were included after screening for articles that reported developing a DL model for medical image analysis. Five studies have made their code publicly available, and 35 studies have utilized publicly available datasets. We provided figures to show the ratio and absolute count of reported items from included studies. According to our cross-sectional study, in JDI publications on DL in medical imaging, authors infrequently report the key elements of their study to make it reproducible.

Assuntos

Inteligência Artificial , Diagnóstico por Imagem , Humanos , Estudos Transversais , Reprodutibilidade dos Testes , Algoritmos

10.

Development of a deep learning model for the histologic diagnosis of dysplasia in Barrett's esophagus.

Faghani, Shahriar; Codipilly, D Chamil; Moassefi, Mana; Rouzrokh, Pouria; Khosravi, Bardia; Agarwal, Siddharth; Dhaliwal, Lovekirat; Katzka, David A; Hagen, Catherine; Lewis, Jason; Leggett, Cadman L; Erickson, Bradley J; Iyer, Prasad G.

Gastrointest Endosc ; 96(6): 918-925.e3, 2022 12.

Artigo em Inglês | MEDLINE | ID: mdl-35718071

RESUMO

BACKGROUND AND AIMS: The risk of progression in Barrett's esophagus (BE) increases with development of dysplasia. There is a critical need to improve the diagnosis of BE dysplasia, given substantial interobserver disagreement among expert pathologists and overdiagnosis of dysplasia by community pathologists. We developed a deep learning model to predict dysplasia grade on whole-slide imaging. METHODS: We digitized nondysplastic BE (NDBE), low-grade dysplasia (LGD), and high-grade dysplasia (HGD) histology slides. Two expert pathologists confirmed all histology and digitally annotated areas of dysplasia. Training, validation, and test sets were created (by a random 70/20/10 split). We used an ensemble approach combining a "you only look once" model to identify regions of interest and histology class (NDBE, LGD, or HGD) followed by a ResNet101 model pretrained on ImageNet applied to the regions of interest. Diagnostic performance was determined for the whole slide. RESULTS: We included slides from 542 patients (164 NDBE, 226 LGD, and 152 HGD) yielding 8596 bounding boxes in the training set, 1946 bounding boxes in the validation set, and 840 boxes in the test set. When the ensemble model was used, sensitivity and specificity for LGD was 81.3% and 100%, respectively, and >90% for NDBE and HGD. The overall positive predictive value and sensitivity metric (calculated as F1 score) was .91 for NDBE, .90 for LGD, and 1.0 for HGD. CONCLUSIONS: We successfully trained and validated a deep learning model to accurately identify dysplasia on whole-slide images. This model can potentially help improve the histologic diagnosis of BE dysplasia and the appropriate application of endoscopic therapy.

Assuntos

Adenocarcinoma , Esôfago de Barrett , Aprendizado Profundo , Neoplasias Esofágicas , Humanos , Esôfago de Barrett/diagnóstico , Esôfago de Barrett/patologia , Neoplasias Esofágicas/patologia , Adenocarcinoma/patologia , Progressão da Doença , Hiperplasia

11.

Distribution and Correlates of Hip-Knee-Ankle Angle in Early Osteoarthritis and Preoperative Total Knee Arthroplasty Patients.

Ramazanian, Taghi; Yan, Shi; Rouzrokh, Pouria; Wyles, Cody C; O Byrne, Thomas J; Taunton, Michael J; Maradit Kremers, Hilal.

J Arthroplasty ; 37(6S): S170-S175, 2022 06.

Artigo em Inglês | MEDLINE | ID: mdl-35210147

RESUMO

BACKGROUND: Several studies have investigated the distribution of hip-knee-ankle (HKA) angle in healthy populations; however, few have evaluated this metric in patients undergoing total knee arthroplasty (TKA). The purpose of this study is to compare HKA angle distribution in early and advanced knee osteoarthritis (OA) patients. METHODS: Full limb radiographs were used to measure HKA angle for 983 subjects from the Osteoarthritis Initiative (OAI) cohort and 4,901 pre-TKA patients from an institutional cohort. Measurements were made using a previously validated deep learning algorithm. Linear regression models were used to determine the association of HKA alignment angle with patient characteristics. RESULTS: The mean ± standard deviation HKA angle was -1.3° ± 3.2° in the OAI cohort and -4.1° ± 6.1° in the pre-TKA cohort. In the OAI cohort, normal alignment (64%) was the most common knee alignment followed by varus (29%), and valgus (7%). In pre-TKA patients, the most common alignment was varus (62%), followed by normal (27%) and valgus (11%). In pre-TKA patients, mean HKA angle in primary knee OA, post-traumatic knee OA, and rheumatoid arthritis patients were -4.3° ± 6.1°, -3.2° ± 6.4°, and -2.9° ± 6.1°, respectively. HKA angle was strongly associated (P < .001) with gender and body mass index. CONCLUSION: TKA patients have a wider alignment distribution and more severe varus and valgus alignment than individuals "at risk" for knee OA from the OAI cohort. These epidemiologic findings improve our understanding of HKA angle distribution and its correlation with demographic characteristics in early and late-stage arthritis.

Assuntos

Artroplastia do Joelho , Osteoartrite do Joelho , Tornozelo/cirurgia , Articulação do Tornozelo/cirurgia , Humanos , Articulação do Joelho/diagnóstico por imagem , Articulação do Joelho/cirurgia , Osteoartrite do Joelho/cirurgia , Estudos Retrospectivos

12.

A Deep Learning Tool for Automated Radiographic Measurement of Acetabular Component Inclination and Version After Total Hip Arthroplasty.

Rouzrokh, Pouria; Wyles, Cody C; Philbrick, Kenneth A; Ramazanian, Taghi; Weston, Alexander D; Cai, Jason C; Taunton, Michael J; Lewallen, David G; Berry, Daniel J; Erickson, Bradley J; Maradit Kremers, Hilal.

J Arthroplasty ; 36(7): 2510-2517.e6, 2021 07.

Artigo em Inglês | MEDLINE | ID: mdl-33678445

RESUMO

BACKGROUND: Inappropriate acetabular component angular position is believed to increase the risk of hip dislocation after total hip arthroplasty. However, manual measurement of these angles is time consuming and prone to interobserver variability. The purpose of this study was to develop a deep learning tool to automate the measurement of acetabular component angles on postoperative radiographs. METHODS: Two cohorts of 600 anteroposterior (AP) pelvis and 600 cross-table lateral hip postoperative radiographs were used to develop deep learning models to segment the acetabular component and the ischial tuberosities. Cohorts were manually annotated, augmented, and randomly split to train-validation-test data sets on an 8:1:1 basis. Two U-Net convolutional neural network models (one for AP and one for cross-table lateral radiographs) were trained for 50 epochs. Image processing was then deployed to measure the acetabular component angles on the predicted masks for anatomical landmarks. Performance of the tool was tested on 80 AP and 80 cross-table lateral radiographs. RESULTS: The convolutional neural network models achieved a mean Dice similarity coefficient of 0.878 and 0.903 on AP and cross-table lateral test data sets, respectively. The mean difference between human-level and machine-level measurements was 1.35° (σ = 1.07°) and 1.39° (σ = 1.27°) for the inclination and anteversion angles, respectively. Differences of 5° or more between human-level and machine-level measurements were observed in less than 2.5% of cases. CONCLUSION: We developed a highly accurate deep learning tool to automate the measurement of angular position of acetabular components for use in both clinical and research settings. LEVEL OF EVIDENCE: III.

Assuntos

Artroplastia de Quadril , Aprendizado Profundo , Prótese de Quadril , Acetábulo/diagnóstico por imagem , Acetábulo/cirurgia , Artroplastia de Quadril/efeitos adversos , Prótese de Quadril/efeitos adversos , Humanos , Radiografia

13.

Deep Learning Artificial Intelligence Model for Assessment of Hip Dislocation Risk Following Primary Total Hip Arthroplasty From Postoperative Radiographs.

Rouzrokh, Pouria; Ramazanian, Taghi; Wyles, Cody C; Philbrick, Kenneth A; Cai, Jason C; Taunton, Michael J; Maradit Kremers, Hilal; Lewallen, David G; Erickson, Bradley J.

J Arthroplasty ; 36(6): 2197-2203.e3, 2021 06.

Artigo em Inglês | MEDLINE | ID: mdl-33663890

RESUMO

BACKGROUND: Dislocation is a common complication following total hip arthroplasty (THA), and accounts for a high percentage of subsequent revisions. The purpose of this study is to illustrate the potential of a convolutional neural network model to assess the risk of hip dislocation based on postoperative anteroposterior pelvis radiographs. METHODS: We retrospectively evaluated radiographs for a cohort of 13,970 primary THAs with 374 dislocations over 5 years of follow-up. Overall, 1490 radiographs from dislocated and 91,094 from non-dislocated THAs were included in the analysis. A convolutional neural network object detection model (YOLO-V3) was trained to crop the images by centering on the femoral head. A ResNet18 classifier was trained to predict subsequent hip dislocation from the cropped imaging. The ResNet18 classifier was initialized with ImageNet weights and trained using FastAI (V1.0) running on PyTorch. The training was run for 15 epochs using 10-fold cross validation, data oversampling, and augmentation. RESULTS: The hip dislocation classifier achieved the following mean performance (standard deviation): accuracy = 49.5 (4.1%), sensitivity = 89.0 (2.2%), specificity = 48.8 (4.2%), positive predictive value = 3.3 (0.3%), negative predictive value = 99.5 (0.1%), and area under the receiver operating characteristic curve = 76.7 (3.6%). Saliency maps demonstrated that the model placed the greatest emphasis on the femoral head and acetabular component. CONCLUSION: Existing prediction methods fail to identify patients at high risk of dislocation following THA. Our radiographic classifier model has high sensitivity and negative predictive value, and can be combined with clinical risk factor information for rapid assessment of risk for dislocation following THA. The model further suggests radiographic locations which may be important in understanding the etiology of prosthesis dislocation. Importantly, our model is an illustration of the potential of automated imaging artificial intelligence models in orthopedics. LEVEL OF EVIDENCE: Level III.

Assuntos

Artroplastia de Quadril , Aprendizado Profundo , Luxação do Quadril , Prótese de Quadril , Artroplastia de Quadril/efeitos adversos , Inteligência Artificial , Luxação do Quadril/diagnóstico por imagem , Luxação do Quadril/epidemiologia , Prótese de Quadril/efeitos adversos , Humanos , Estudos Retrospectivos , Fatores de Risco

14.

Invited Commentary: The Double-edged Sword of Bias in Medical Imaging Artificial Intelligence.

Rouzrokh, Pouria; Erickson, Bradley J.

Radiographics ; 44(5): e230243, 2024 May.

Artigo em Inglês | MEDLINE | ID: mdl-38635455

Assuntos

Inteligência Artificial , Diagnóstico por Imagem , Humanos , Radiografia , Viés

15.

Exploring the effect of domain-specific transfer learning for thyroid nodule classification.

Vahdati, Sanaz; Khosravi, Bardia; Rouzrokh, Pouria; Erickson, Bradley J.

J Am Coll Radiol ; 2024 Jun 26.

Artigo em Inglês | MEDLINE | ID: mdl-38942163

RESUMO

Thyroid nodule evaluation using ultrasound is dependent on radiologist experience, but deep learning (DL) models can improve intra-reader agreements. DL model development for medical imaging with small datasets can be challenging. Transfer learning is a technique used in the development of DL models to improve model performance in data-limited scenarios. Here, we investigate the impact of transfer learning with domain-specific RadImageNet dataset and non-medical ImageNet on the robustness of classifying thyroid nodules into benign and malignant. We retrospectively collected 822 ultrasound images of thyroid nodules of patients who underwent fine needle aspiration in our institute. We split our data and used 101 cases in a test set and 721 cases for cross-validation. A Resnet-18 model was trained to classify thyroid nodules into benign and malignant. Then, we trained the same model architecture with transferred weights from ImageNet and RadImageNet. The model without transfer learning for thyroid nodule classification achieved an AUROC of 0.69. The AUROC of our model after transfer learning with ImageNet pre-trained weights was 0.79. Our model achieved an AUROC of 0.83 from transfer learning of the RadImageNet pre-trained weights. The AUROC from the classification model without transfer learning significantly improved after transfer learning with ImageNet (p-value = 0.03) and RadImageNet transfer learning (p-value <0.01). There was a statistically significant distinction in performance between the model utilizing RadImageNet transfer learning and that employing ImageNet transfer learning (p-value <0.01). We demonstrate the potential of RadImageNet as a domain-specific source for transfer learning in thyroid nodule classification.

16.

Synthetically enhanced: unveiling synthetic data's potential in medical imaging research.

Khosravi, Bardia; Li, Frank; Dapamede, Theo; Rouzrokh, Pouria; Gamble, Cooper U; Trivedi, Hari M; Wyles, Cody C; Sellergren, Andrew B; Purkayastha, Saptarshi; Erickson, Bradley J; Gichoya, Judy W.

EBioMedicine ; 104: 105174, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38821021

RESUMO

BACKGROUND: Chest X-rays (CXR) are essential for diagnosing a variety of conditions, but when used on new populations, model generalizability issues limit their efficacy. Generative AI, particularly denoising diffusion probabilistic models (DDPMs), offers a promising approach to generating synthetic images, enhancing dataset diversity. This study investigates the impact of synthetic data supplementation on the performance and generalizability of medical imaging research. METHODS: The study employed DDPMs to create synthetic CXRs conditioned on demographic and pathological characteristics from the CheXpert dataset. These synthetic images were used to supplement training datasets for pathology classifiers, with the aim of improving their performance. The evaluation involved three datasets (CheXpert, MIMIC-CXR, and Emory Chest X-ray) and various experiments, including supplementing real data with synthetic data, training with purely synthetic data, and mixing synthetic data with external datasets. Performance was assessed using the area under the receiver operating curve (AUROC). FINDINGS: Adding synthetic data to real datasets resulted in a notable increase in AUROC values (up to 0.02 in internal and external test sets with 1000% supplementation, p-value <0.01 in all instances). When classifiers were trained exclusively on synthetic data, they achieved performance levels comparable to those trained on real data with 200%-300% data supplementation. The combination of real and synthetic data from different sources demonstrated enhanced model generalizability, increasing model AUROC from 0.76 to 0.80 on the internal test set (p-value <0.01). INTERPRETATION: Synthetic data supplementation significantly improves the performance and generalizability of pathology classifiers in medical imaging. FUNDING: Dr. Gichoya is a 2022 Robert Wood Johnson Foundation Harold Amos Medical Faculty Development Program and declares support from RSNA Health Disparities grant (#EIHD2204), Lacuna Fund (#67), Gordon and Betty Moore Foundation, NIH (NIBIB) MIDRC grant under contracts 75N92020C00008 and 75N92020C00021, and NHLBI Award Number R01HL167811.

Assuntos

Diagnóstico por Imagem , Curva ROC , Humanos , Diagnóstico por Imagem/métodos , Algoritmos , Radiografia Torácica/métodos , Processamento de Imagem Assistida por Computador/métodos , Bases de Dados Factuais , Área Sob a Curva , Modelos Estatísticos

17.

A Guideline for Open-Source Tools to Make Medical Imaging Data Ready for Artificial Intelligence Applications: A Society of Imaging Informatics in Medicine (SIIM) Survey.

Vahdati, Sanaz; Khosravi, Bardia; Mahmoudi, Elham; Zhang, Kuan; Rouzrokh, Pouria; Faghani, Shahriar; Moassefi, Mana; Tahmasebi, Aylin; Andriole, Katherine P; Chang, Peter; Farahani, Keyvan; Flores, Mona G; Folio, Les; Houshmand, Sina; Giger, Maryellen L; Gichoya, Judy W; Erickson, Bradley J.

J Imaging Inform Med ; 2024 Apr 01.

Artigo em Inglês | MEDLINE | ID: mdl-38558368

RESUMO

In recent years, the role of Artificial Intelligence (AI) in medical imaging has become increasingly prominent, with the majority of AI applications approved by the FDA being in imaging and radiology in 2023. The surge in AI model development to tackle clinical challenges underscores the necessity for preparing high-quality medical imaging data. Proper data preparation is crucial as it fosters the creation of standardized and reproducible AI models while minimizing biases. Data curation transforms raw data into a valuable, organized, and dependable resource and is a fundamental process to the success of machine learning and analytical projects. Considering the plethora of available tools for data curation in different stages, it is crucial to stay informed about the most relevant tools within specific research areas. In the current work, we propose a descriptive outline for different steps of data curation while we furnish compilations of tools collected from a survey applied among members of the Society of Imaging Informatics (SIIM) for each of these stages. This collection has the potential to enhance the decision-making process for researchers as they select the most appropriate tool for their specific tasks.

18.

Checklist for Reproducibility of Deep Learning in Medical Imaging.

Moassefi, Mana; Singh, Yashbir; Conte, Gian Marco; Khosravi, Bardia; Rouzrokh, Pouria; Vahdati, Sanaz; Safdar, Nabile; Moy, Linda; Kitamura, Felipe; Gentili, Amilcare; Lakhani, Paras; Kottler, Nina; Halabi, Safwan S; Yacoub, Joseph H; Hou, Yuankai; Younis, Khaled; Erickson, Bradley J; Krupinski, Elizabeth; Faghani, Shahriar.

J Imaging Inform Med ; 2024 Mar 14.

Artigo em Inglês | MEDLINE | ID: mdl-38483694

RESUMO

The application of deep learning (DL) in medicine introduces transformative tools with the potential to enhance prognosis, diagnosis, and treatment planning. However, ensuring transparent documentation is essential for researchers to enhance reproducibility and refine techniques. Our study addresses the unique challenges presented by DL in medical imaging by developing a comprehensive checklist using the Delphi method to enhance reproducibility and reliability in this dynamic field. We compiled a preliminary checklist based on a comprehensive review of existing checklists and relevant literature. A panel of 11 experts in medical imaging and DL assessed these items using Likert scales, with two survey rounds to refine responses and gauge consensus. We also employed the content validity ratio with a cutoff of 0.59 to determine item face and content validity. Round 1 included a 27-item questionnaire, with 12 items demonstrating high consensus for face and content validity that were then left out of round 2. Round 2 involved refining the checklist, resulting in an additional 17 items. In the last round, 3 items were deemed non-essential or infeasible, while 2 newly suggested items received unanimous agreement for inclusion, resulting in a final 26-item DL model reporting checklist derived from the Delphi process. The 26-item checklist facilitates the reproducible reporting of DL tools and enables scientists to replicate the study's results.

19.

The use of deep learning in medical imaging to improve spine care: A scoping review of current literature and clinical applications.

Constant, Caroline; Aubin, Carl-Eric; Kremers, Hilal Maradit; Garcia, Diana V Vera; Wyles, Cody C; Rouzrokh, Pouria; Larson, Annalise Noelle.

N Am Spine Soc J ; 15: 100236, 2023 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-37599816

RESUMO

Background: Artificial intelligence is a revolutionary technology that promises to assist clinicians in improving patient care. In radiology, deep learning (DL) is widely used in clinical decision aids due to its ability to analyze complex patterns and images. It allows for rapid, enhanced data, and imaging analysis, from diagnosis to outcome prediction. The purpose of this study was to evaluate the current literature and clinical utilization of DL in spine imaging. Methods: This study is a scoping review and utilized the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) methodology to review the scientific literature from 2012 to 2021. A search in PubMed, Web of Science, Embased, and IEEE Xplore databases with syntax specific for DL and medical imaging in spine care applications was conducted to collect all original publications on the subject. Specific data was extracted from the available literature, including algorithm application, algorithms tested, database type and size, algorithm training method, and outcome of interest. Results: A total of 365 studies (total sample of 232,394 patients) were included and grouped into 4 general applications: diagnostic tools, clinical decision support tools, automated clinical/instrumentation assessment, and clinical outcome prediction. Notable disparities exist in the selected algorithms and the training across multiple disparate databases. The most frequently used algorithms were U-Net and ResNet. A DL model was developed and validated in 92% of included studies, while a pre-existing DL model was investigated in 8%. Of all developed models, only 15% of them have been externally validated. Conclusions: Based on this scoping review, DL in spine imaging is used in a broad range of clinical applications, particularly for diagnosing spinal conditions. There is a wide variety of DL algorithms, database characteristics, and training methods. Future studies should focus on external validation of existing models before bringing them into clinical use.

20.

Machine Learning in Cardiovascular Imaging: A Scoping Review of Published Literature.

Rouzrokh, Pouria; Khosravi, Bardia; Vahdati, Sanaz; Moassefi, Mana; Faghani, Shahriar; Mahmoudi, Elham; Chalian, Hamid; Erickson, Bradley J.

Curr Radiol Rep ; 11(2): 34-45, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-36531124

RESUMO

Purpose of Review: In this study, we planned and carried out a scoping review of the literature to learn how machine learning (ML) has been investigated in cardiovascular imaging (CVI). Recent Findings: During our search, we found numerous studies that developed or utilized existing ML models for segmentation, classification, object detection, generation, and regression applications involving cardiovascular imaging data. We first quantitatively investigated the different aspects of study characteristics, data handling, model development, and performance evaluation in all studies that were included in our review. We then supplemented these findings with a qualitative synthesis to highlight the common themes in the studied literature and provided recommendations to pave the way for upcoming research. Summary: ML is a subfield of artificial intelligence (AI) that enables computers to learn human-like decision-making from data. Due to its novel applications, ML is gaining more and more attention from researchers in the healthcare industry. Cardiovascular imaging is an active area of research in medical imaging with lots of room for incorporating new technologies, like ML. Supplementary Information: The online version contains supplementary material available at 10.1007/s40134-022-00407-8.

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

Detalhe da pesquisa