RESUMO
The gold standard for enrollment and endpoint assessment in metabolic dysfunction-associated steatosis clinical trials is histologic assessment of a liver biopsy performed on glass slides. However, obtaining the evaluations from several expert pathologists on glass is challenging, as shipping the slides around the country or around the world is time-consuming and comes with the hazards of slide breakage. This study demonstrated that pathologic assessment of disease activity in steatohepatitis, performed using digital images on the AISight whole slide image management system, yields results that are comparable to those obtained using glass slides. The accuracy of scoring for steatohepatitis (nonalcoholic fatty liver disease activity score ≥4 with ≥1 for each feature and absence of atypical features suggestive of other liver disease) performed on the system was evaluated against scoring conducted on glass slides. Both methods were assessed for overall percent agreement with a consensus "ground truth" score (defined as the median score of a panel of three pathologists' glass slides). Each case was also read by three different pathologists, once on glass and once digitally with a minimum 2-week washout period between the modalities. It was demonstrated that the average agreement across three pathologists of digital scoring with ground truth was noninferior to the average agreement of glass scoring with ground truth [noninferiority margin: -0.05; difference: -0.001; 95% CI: (-0.027, 0.026); and p < 0.0001]. For each pathologist, there was a similar average agreement of digital and glass reads with glass ground truth (pathologist A, 0.843 and 0.849; pathologist B, 0.633 and 0.605; and pathologist C, 0.755 and 0.780). Here, we demonstrate that the accuracy of digital reads for steatohepatitis using digital images is equivalent to glass reads in the context of a clinical trial for scoring using the Clinical Research Network scoring system.
Assuntos
Hepatopatia Gordurosa não Alcoólica , Humanos , Hepatopatia Gordurosa não Alcoólica/patologia , Ensaios Clínicos como Assunto , Reprodutibilidade dos Testes , Biópsia , Fígado/patologia , Interpretação de Imagem Assistida por Computador/métodos , Variações Dependentes do ObservadorRESUMO
Clinical trials in metabolic dysfunction-associated steatohepatitis (MASH, formerly known as nonalcoholic steatohepatitis) require histologic scoring for assessment of inclusion criteria and endpoints. However, variability in interpretation has impacted clinical trial outcomes. We developed an artificial intelligence-based measurement (AIM) tool for scoring MASH histology (AIM-MASH). AIM-MASH predictions for MASH Clinical Research Network necroinflammation grades and fibrosis stages were reproducible (κ = 1) and aligned with expert pathologist consensus scores (κ = 0.62-0.74). The AIM-MASH versus consensus agreements were comparable to average pathologists for MASH Clinical Research Network scores (82% versus 81%) and fibrosis (97% versus 96%). Continuous scores produced by AIM-MASH for key histological features of MASH correlated with mean pathologist scores and noninvasive biomarkers and strongly predicted progression-free survival in patients with stage 3 (P < 0.0001) and stage 4 (P = 0.03) fibrosis. In a retrospective analysis of the ATLAS trial (NCT03449446), responders receiving study treatment showed a greater continuous change in fibrosis compared with placebo (P = 0.02). Overall, these results suggest that AIM-MASH may assist pathologists in histologic review of MASH clinical trials, reducing inter-rater variability on trial outcomes and offering a more sensitive and reproducible measure of patient responses.
Assuntos
Inteligência Artificial , Ensaios Clínicos como Assunto , Hepatopatia Gordurosa não Alcoólica , Humanos , Hepatopatia Gordurosa não Alcoólica/patologia , Hepatopatia Gordurosa não Alcoólica/tratamento farmacológico , Cirrose Hepática/patologia , Seleção de Pacientes , Determinação de Ponto Final , Feminino , Estudos Retrospectivos , Masculino , Automação , Hepatopatias/patologia , Reprodutibilidade dos TestesRESUMO
Clinical trials in nonalcoholic steatohepatitis (NASH) require histologic scoring for assessment of inclusion criteria and endpoints. However, guidelines for scoring key features have led to variability in interpretation, impacting clinical trial outcomes. We developed an artificial intelligence (AI)-based measurement (AIM) tool for scoring NASH histology (AIM-NASH). AIM-NASH predictions for NASH Clinical Research Network (CRN) grades of necroinflammation and stages of fibrosis aligned with expert consensus scores and were reproducible. Continuous scores produced by AIM-NASH for key histological features of NASH correlated with mean pathologist scores and with noninvasive biomarkers and strongly predicted patient outcomes. In a retrospective analysis of the ATLAS trial, previously unmet pathological endpoints were met when scored by the AIM-NASH algorithm alone. Overall, these results suggest that AIM-NASH may assist pathologists in histologic review of NASH clinical trials, reducing inter-rater variability on trial outcomes and offering a more sensitive and reproducible measure of patient therapeutic response.
RESUMO
To discover distinct immune responses promoting or inhibiting hepatocellular carcinoma (HCC), we perform a three-dimensional analysis of the immune cells, correlating immune cell types, interactions, and changes over time in an animal model displaying gender disparity in nonalcoholic fatty liver disease (NAFLD)-associated HCC. In response to a Western diet (WD), animals mount acute and chronic patterns of inflammatory cytokines, respectively. Tumor progression in males and females is associated with a predominant CD8+ > CD4+, Th1 > Th17 > Th2, NKT > NK, M1 > M2 pattern in the liver. A complete rescue of females from HCC is associated with an equilibrium Th1 = Th17 = Th2, NKT = NK, M1 = M2 pattern, while a partial rescue of males from HCC is associated with an equilibrium CD8+ = CD4+, NKT = NK and a semi-equilibrium Th1 = Th17 > Th2 but a sustained M1 > M2 pattern in the liver. Our data suggest that immunological pattern-recognition can explain immunobiology of HCC and guide immune modulatory interventions for the treatment of HCC in a gender-specific manner.
Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Hepatopatia Gordurosa não Alcoólica , Animais , Carcinoma Hepatocelular/patologia , Dieta Ocidental , Progressão da Doença , Feminino , Neoplasias Hepáticas/patologia , Masculino , Hepatopatia Gordurosa não Alcoólica/patologiaRESUMO
Manufacturers of pathology imaging devices and associated software engage regulatory affairs and clinical affairs (RACA) throughout the Total Product Life Cycle (TPLC) of regulated products. A number of manufacturers, pathologists, and end users are not familiar with how RACA involvement benefits each stage of the TPLC. RACA professionals are important contributors to product development and deployment strategies because these professionals maintain an understanding of the scientific, technical, and clinical aspects of biomedical product regulation, as well as the relevant knowledge of regulatory requirements, policies, and market trends for both local and global regulations and standards. Defining a regulatory and clinical strategy at the beginning of product design enables early evaluation of risks and provides assurance that the collected evidence supports the product's clinical claims (e.g., in a marketing application), its safe and effective use, and potential reimbursement strategies. It is recommended to involve RACA early and throughout the TPLC to assist with navigating changes in the regulatory environment and dynamic diagnostic market. Here we outline how various stakeholders can utilize RACA to navigate the nuanced landscape behind the development and use of clinical diagnostic products. Collectively, this work emphasizes the critical importance of RACA as an integral part of product development and, thereby, sustained innovation.
RESUMO
CONTEXT: Text-based reporting and manual arbitration for whole slide imaging (WSI) validation studies are labor intensive and do not allow for consistent, scalable, and repeatable data collection or analysis. OBJECTIVE: The objective of this study was to establish a method of data capture and analysis using standardized codified checklists and predetermined synoptic discordance tables and to use these methods in a pilot multisite validation study. METHODS AND STUDY DESIGN: Fifteen case report form checklists were generated from the College of American Pathology cancer protocols. Prior to data collection, all hypothetical pairwise comparisons were generated, and a level of harm was determined for each possible discordance. Four sites with four pathologists each generated 264 independent reads of 33 cases. Preestablished discordance tables were applied to determine site by site and pooled accuracy, intrareader/intramodality, and interreader intramodality error rates. RESULTS: Over 10,000 hypothetical pairwise comparisons were evaluated and assigned harm in discordance tables. The average difference in error rates between WSI and glass, as compared to ground truth, was 0.75% with a lower bound of 3.23% (95% confidence interval). Major discordances occurred on challenging cases, regardless of modality. The average inter-reader agreement across sites for glass was 76.5% (weighted kappa of 0.68) and for digital it was 79.1% (weighted kappa of 0.72). CONCLUSION: These results demonstrate the feasibility and utility of employing standardized synoptic checklists and predetermined discordance tables to gather consistent, comprehensive diagnostic data for WSI validation studies. This method of data capture and analysis can be applied in large-scale multisite WSI validations.