Search | Biblioteca Virtual em Saúde (Virtual Health Library)

1.

iIMPACT: integrating image and molecular profiles for spatial transcriptomics analysis.

Jiang, Xi; Wang, Shidan; Guo, Lei; Zhu, Bencong; Wen, Zhuoyu; Jia, Liwei; Xu, Lin; Xiao, Guanghua; Li, Qiwei.

Genome Biol ; 25(1): 147, 2024 06 06.

Article in English | MEDLINE | ID: mdl-38844966

ABSTRACT

Current clustering analysis of spatial transcriptomics data primarily relies on molecular information and fails to fully exploit the morphological features present in histology images, leading to compromised accuracy and interpretability. To overcome these limitations, we have developed a multi-stage statistical method called iIMPACT. It identifies and defines histology-based spatial domains based on AI-reconstructed histology images and spatial context of gene expression measurements, and detects domain-specific differentially expressed genes. Through multiple case studies, we demonstrate iIMPACT outperforms existing methods in accuracy and interpretability and provides insights into the cellular spatial organization and landscape of functional genes within spatial transcriptomics data.

Subjects

Gene Expression Profiling; Transcriptome; Gene Expression Profiling/methods; Humans; Cluster Analysis; Image Processing, Computer-Assisted/methods

2.

A critical assessment of using ChatGPT for extracting structured data from clinical notes.

Huang, Jingwei; Yang, Donghan M; Rong, Ruichen; Nezafati, Kuroush; Treager, Colin; Chi, Zhikai; Wang, Shidan; Cheng, Xian; Guo, Yujia; Klesse, Laura J; Xiao, Guanghua; Peterson, Eric D; Zhan, Xiaowei; Xie, Yang.

NPJ Digit Med ; 7(1): 106, 2024 May 01.

Article in English | MEDLINE | ID: mdl-38693429

ABSTRACT

Existing natural language processing (NLP) methods to convert free-text clinical notes into structured data often require problem-specific annotations and model training. This study aims to evaluate ChatGPT's capacity to extract information from free-text medical notes efficiently and comprehensively. We developed a large language model (LLM)-based workflow, utilizing a systems engineering methodology and a spiral "prompt engineering" process, and leveraging OpenAI's API for batch querying ChatGPT. We evaluated the effectiveness of this method using a dataset of more than 1000 lung cancer pathology reports and a dataset of 191 pediatric osteosarcoma pathology reports, comparing the ChatGPT-3.5 (gpt-3.5-turbo-16k) outputs with expert-curated structured data. ChatGPT-3.5 demonstrated the ability to extract pathological classifications with an overall accuracy of 89% in the lung cancer dataset, outperforming two traditional NLP methods. The performance is influenced by the design of the instructive prompt. Our case analysis shows that most misclassifications were due to the lack of highly specialized pathology terminology and erroneous interpretation of TNM staging rules. Reproducibility testing shows the relatively stable performance of ChatGPT-3.5 over time. In the pediatric osteosarcoma dataset, ChatGPT-3.5 accurately classified both grades and margin status, with accuracies of 98.6% and 100%, respectively. Our study shows the feasibility of using ChatGPT to process large volumes of clinical notes for structured information extraction without requiring extensive task-specific human annotation and model training. The results underscore the potential role of LLMs in transforming unstructured healthcare data into structured formats, thereby supporting research and aiding clinical decision-making.
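The comparison between model outputs and expert-curated structured data can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function name `field_accuracy`, the field names `grade` and `margin`, and the toy records are all hypothetical, and the case-insensitive exact match is an assumed scoring rule.

```python
from typing import Dict, List


def field_accuracy(predicted: List[Dict[str, str]],
                   curated: List[Dict[str, str]]) -> Dict[str, float]:
    """Per-field fraction of reports whose extracted value matches the curated value.

    Matching is case-insensitive and ignores surrounding whitespace
    (an assumption; the abstract does not specify the matching rule).
    """
    if len(predicted) != len(curated):
        raise ValueError("predicted and curated must have the same length")
    if not curated:
        return {}
    scores: Dict[str, float] = {}
    for field in curated[0]:
        hits = sum(
            1
            for p, c in zip(predicted, curated)
            if p.get(field, "").strip().lower() == c[field].strip().lower()
        )
        scores[field] = hits / len(curated)
    return scores


# Hypothetical toy records; field names and values are illustrative only.
curated = [
    {"grade": "high", "margin": "negative"},
    {"grade": "low", "margin": "positive"},
    {"grade": "high", "margin": "negative"},
    {"grade": "high", "margin": "negative"},
]
predicted = [
    {"grade": "High", "margin": "negative"},
    {"grade": "low", "margin": "positive"},
    {"grade": "low", "margin": "negative"},
    {"grade": "high", "margin": "Negative "},
]
scores = field_accuracy(predicted, curated)
print(scores)
```

Reporting accuracy per field rather than per report makes it easier to see which extraction targets drive the errors.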

3.

A Lung Cancer Mouse Model Database.

Cai, Ling; Gao, Ying; DeBerardinis, Ralph J; Acquaah-Mensah, George; Aidinis, Vassilis; Beane, Jennifer E; Biswal, Shyam; Chen, Ting; Concepcion-Crisol, Carla P; Grüner, Barbara M; Jia, Deshui; Jones, Robert; Kurie, Jonathan M; Lee, Min Gyu; Lindahl, Per; Lissanu, Yonathan; Lorz Lopez, Maria Corina; Martinelli, Rosanna; Mazur, Pawel K; Mazzilli, Sarah A; Mii, Shinji; Moll, Herwig; Moorehead, Roger; Morrisey, Edward E; Ng, Sheng Rong; Oser, Matthew G; Pandiri, Arun R; Powell, Charles A; Ramadori, Giorgio; Santos Lafuente, Mirentxu; Snyder, Eric; Sotillo, Rocio; Su, Kang-Yi; Taki, Tetsuro; Taparra, Kekoa; Xia, Yifeng; van Veen, Ed; Winslow, Monte M; Xiao, Guanghua; Rudin, Charles M; Oliver, Trudy G; Xie, Yang; Minna, John D.

bioRxiv ; 2024 May 14.

Article in English | MEDLINE | ID: mdl-38464291

ABSTRACT

Lung cancer, the leading cause of cancer mortality, exhibits diverse histological subtypes and genetic complexities. Numerous preclinical mouse models have been developed to study lung cancer, but data from these models are disparate, siloed, and difficult to compare in a centralized fashion. Here we established the Lung Cancer Mouse Model Database (LCMMDB), an extensive repository of 1,354 samples from 77 transcriptomic datasets covering 974 samples from genetically engineered mouse models (GEMMs), 368 samples from carcinogen-induced models, and 12 samples from a spontaneous model. Meticulous curation and collaboration with data depositors have produced a robust and comprehensive database, enhancing the fidelity of the genetic landscape it depicts. The LCMMDB aligns 859 tumors from GEMMs with human lung cancer mutations, enabling comparative analysis and revealing a pressing need to broaden the diversity of genetic aberrations modeled in GEMMs. Accompanying this resource, we developed a web application that offers researchers intuitive tools for in-depth gene expression analysis. With standardized reprocessing of gene expression data, the LCMMDB serves as a powerful platform for cross-study comparison and lays the groundwork for future research, aiming to bridge the gap between mouse models and human lung cancer for improved translational relevance.

4.

Deep convolutional neural network and IoT technology for healthcare.

Wassan, Sobia; Dongyan, Hu; Suhail, Beenish; Jhanjhi, N Z; Xiao, Guanghua; Ahmed, Suhail; Murugesan, Raja Kumar.

Digit Health ; 10: 20552076231220123, 2024.

Article in English | MEDLINE | ID: mdl-38250147

ABSTRACT

Background: Deep learning is an AI technology that trains computers to analyze data in an approach similar to the human brain. Deep learning algorithms can find complex patterns in images, text, audio, and other data types to provide accurate predictions and conclusions. Neural networks are another name for deep learning. A deep learning model consists of input, hidden, and output layers. Data enter through the input layer, pass through the hidden layers, and are then processed by the output layer. Deep learning has many advantages over traditional machine learning algorithms such as k-nearest neighbors, support vector machines, and regression approaches. Deep learning models can read more complex data than traditional machine learning methods. Objectives: This research aims to find the ideal number of hidden layers for the neural network and to compare different activation function variations. The article also thoroughly analyzes how various frameworks can be used to create a comparison or fast neural networks. The final goal of the article is to investigate all such innovative techniques that allow us to speed up the training of neural networks without losing accuracy. Methods: A sample data set from 2001 was collected from www.Kaggle.com. We can reduce the total number of layers in the deep learning model, which saves training time. To perform the ReLU activation, we make use of two fully connected layers. If the value being supplied is larger than zero, the ReLU activation outputs the input value directly; otherwise it returns 0. Results: We use multiple parameters to determine the most effective method to test how well our method works. In the next paragraph, we discuss how the calculation changes secret-shared values. By adopting 19 train set features, we train our reliable model to predict the (numerical) healthcare cost target feature.
We found that 0.89503 was the best choice because it gave us a good fit (R2) and let us set enough coefficients to 0. To develop our stable model with this set of parameters, we require 26 iterations. We obtain an R2 of 0.89503, an MSE of 0.01094, an RMSE of 0.10458, a mean residual deviance of 0.01094, a mean absolute error (MAE) of 0.07452, and a root mean squared log error (RMSLE) of 0.07207. After training the model on the train set, we applied the same parameters to the test set and obtained an R2 of 0.90707, MSE of 0.01045, RMSE of 0.10224, mean residual deviance of 0.01045, MAE of 0.06954, and RMSLE of 0.07051, validating our solution approach. The objective value of our secured model is higher than that of the scikit-learn model, although the former performs better on goodness-of-fit criteria. As a result, our protected model performs quite well, marginally outperforming the (very optimized) scikit-learn model. Using a backpropagation algorithm and stochastic gradient descent, deep learning develops artificial neural networks with several interconnected layers. The network can contain hidden layers of neurons with tanh, rectifier, and maxout activation functions. Modern features like momentum training, dropout, adaptive learning rate, rate annealing, and L1 or L2 regularization provide exceptional prediction performance. Each node trains a copy of the global model's parameters on its local data with multi-threading (asynchronously) and periodically contributes to the global model through model averaging across the network. The method is executed on a single-node local H2O cluster started by the operator. The operation is parallel despite there being only a single node involved. The number of threads may be adjusted in the settings menu under Preferences and General. By default, the optimal number of threads for the system is used. Successful predictions in the healthcare data sets are made using the H2O Deep Learning operator.
Because the label is binominal, a classification is performed. The Split Validation operator creates test and training datasets to evaluate the model. The default settings of the Deep Learning operator are used; in other words, two hidden layers are constructed, each containing 50 neurons. The accuracy measure is computed by connecting the labeled example set to a Performance (Binominal Classification) operator. Table 3 displays the Deep Learning Model, the labeled data, and the Performance Vector that resulted from the technique. Conclusions: Deep learning algorithms can be used to design systems that report data on patients and deliver warnings to medical applications or electronic health records if there are changes in the patient's health. This helps ensure that each patient receives the right care at the right time. A healthcare decision support system was presented using the Internet of Things and deep learning methods. In the proposed system, we examined the capability of integrating deep learning technology into automatic diagnosis and IoT capabilities for faster message exchange over the Internet. We have selected a suitable neural network structure (number of hidden layers and activation function classes) to construct the e-health system. In addition, the e-health system relied on data from doctors to train the neural network. In the validation method, the total evaluation of the proposed healthcare system for diagnostics provides dependability under various patient conditions. Based on evaluation and simulation findings, a feed-forward NN with two hidden layers whose neurons use the tanh function performs more effectively than other NNs. To overcome challenges, this study integrates artificial intelligence with IoT. This study aims to determine the NN's optimal layer counts and activation function variations.
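For reference, the standard ReLU definition and the relation between the reported MSE and RMSE values can be checked with a short sketch. The function names are illustrative and not taken from the article; only the numeric values in the final lines come from the abstract.

```python
import math


def relu(x: float) -> float:
    """Standard rectified linear unit: pass positive inputs through, clamp the rest to 0."""
    return x if x > 0.0 else 0.0


def rmse_from_mse(mse: float) -> float:
    """RMSE is by definition the square root of MSE."""
    return math.sqrt(mse)


print(relu(2.5))    # positive input is returned unchanged
print(relu(-1.0))   # non-positive input is mapped to 0.0

# Consistency check on the values reported in the abstract (train and test sets).
# The small gaps are expected because the abstract reports rounded figures.
train_gap = abs(rmse_from_mse(0.01094) - 0.10458)
test_gap = abs(rmse_from_mse(0.01045) - 0.10224)
print(train_gap, test_gap)
```

Both reported RMSE values agree with the square roots of the reported MSE values to within rounding.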

5.

MetaNorm: incorporating meta-analytic priors into normalization of NanoString nCounter data.

Barth, Jackson; Yang, Yuqiu; Xiao, Guanghua; Wang, Xinlei.

Bioinformatics ; 40(1)2024 01 02.

Article in English | MEDLINE | ID: mdl-38237909

ABSTRACT

MOTIVATION: Non-informative or diffuse prior distributions are widely employed in Bayesian data analysis to maintain objectivity. However, when meaningful prior information exists and can be identified, using an informative prior distribution to accurately reflect current knowledge may lead to superior outcomes and great efficiency. RESULTS: We propose MetaNorm, a Bayesian algorithm for normalizing NanoString nCounter gene expression data. MetaNorm is based on RCRnorm, a powerful method designed under an integrated series of hierarchical models that allow various sources of error to be explained by different types of probes in the nCounter system. However, a lack of accurate prior information, weak computational efficiency, and instability of estimates that sometimes occur weakens the approach despite its impressive performance. MetaNorm employs priors carefully constructed from a rigorous meta-analysis to leverage information from large public data. Combined with additional algorithmic enhancements, MetaNorm improves RCRnorm by yielding more stable estimation of normalized values, better convergence diagnostics and superior computational efficiency. AVAILABILITY AND IMPLEMENTATION: R Code for replicating the meta-analysis and the normalization function can be found at github.com/jbarth216/MetaNorm.

Subjects

Algorithms; Data Analysis; Bayes Theorem

6.

Deep Learning-Based H-Score Quantification of Immunohistochemistry-Stained Images.

Wen, Zhuoyu; Luo, Danni; Wang, Shidan; Rong, Ruichen; Evers, Bret M; Jia, Liwei; Fang, Yisheng; Daoud, Elena V; Yang, Shengjie; Gu, Zifan; Arner, Emily N; Lewis, Cheryl M; Solis Soto, Luisa M; Fujimoto, Junya; Behrens, Carmen; Wistuba, Ignacio I; Yang, Donghan M; Brekken, Rolf A; O'Donnell, Kathryn A; Xie, Yang; Xiao, Guanghua.

Mod Pathol ; 37(2): 100398, 2024 Feb.

Article in English | MEDLINE | ID: mdl-38043788

ABSTRACT

Immunohistochemistry (IHC) is a well-established and commonly used staining method for clinical diagnosis and biomedical research. In most IHC images, the target protein is conjugated with a specific antibody and stained using diaminobenzidine (DAB), resulting in a brown coloration, whereas hematoxylin serves as a blue counterstain for cell nuclei. The protein expression level is quantified through the H-score, calculated from DAB staining intensity within the target cell region. Traditionally, this process requires evaluation by 2 expert pathologists, which is both time consuming and subjective. To enhance the efficiency and accuracy of this process, we have developed an automatic algorithm for quantifying the H-score of IHC images. To characterize protein expression in specific cell regions, a deep learning model for region recognition was trained based on hematoxylin staining only, achieving pixel accuracy for each class ranging from 0.92 to 0.99. Within the desired area, the algorithm categorizes DAB intensity of each pixel as negative, weak, moderate, or strong staining and calculates the final H-score based on the percentage of each intensity category. Overall, this algorithm takes an IHC image as input and directly outputs the H-score within a few seconds, significantly enhancing the speed of IHC image analysis. This automated tool provides H-score quantification with precision and consistency comparable to experienced pathologists but at a significantly reduced cost during IHC diagnostic workups. It holds significant potential to advance biomedical research reliant on IHC staining for protein expression quantification.
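The final step described above, turning per-pixel intensity categories into an H-score, can be sketched as follows. The sketch uses the conventional H-score weighting (1 x % weak + 2 x % moderate + 3 x % strong, giving a range of 0 to 300). The intensity cut-offs in `THRESHOLDS` and the 0-to-1 intensity scale are hypothetical; the abstract does not state the values the authors use, and the deep learning region-recognition step is not reproduced here.

```python
from typing import Dict, Iterable

# Hypothetical cut-offs on a 0-1 DAB intensity scale (weak, moderate, strong).
THRESHOLDS = (0.2, 0.5, 0.8)


def categorize(intensity: float) -> str:
    """Assign one pixel's DAB intensity to a staining category."""
    weak, moderate, strong = THRESHOLDS
    if intensity < weak:
        return "negative"
    if intensity < moderate:
        return "weak"
    if intensity < strong:
        return "moderate"
    return "strong"


def h_score(intensities: Iterable[float]) -> float:
    """Conventional H-score from the percentage of pixels in each category."""
    counts: Dict[str, int] = {"negative": 0, "weak": 0, "moderate": 0, "strong": 0}
    total = 0
    for value in intensities:
        counts[categorize(value)] += 1
        total += 1
    if total == 0:
        raise ValueError("no pixels in the target region")
    pct = {name: 100.0 * n / total for name, n in counts.items()}
    return 1 * pct["weak"] + 2 * pct["moderate"] + 3 * pct["strong"]


# Four pixels, one per category: 25% weak, 25% moderate, 25% strong.
print(h_score([0.1, 0.3, 0.6, 0.9]))
```

In practice the input would be restricted to pixels inside the recognized target region, so that background tissue does not dilute the percentages.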

Subjects

Deep Learning; Humans; Immunohistochemistry; Hematoxylin/metabolism; Algorithms; Cell Nucleus/metabolism

7.

Mapping Cellular Interactions from Spatially Resolved Transcriptomics Data.

Zhu, James; Wang, Yunguan; Chang, Woo Yong; Malewska, Alicia; Napolitano, Fabiana; Gahan, Jeffrey C; Unni, Nisha; Zhao, Min; Yuan, Rongqing; Wu, Fangjiang; Yue, Lauren; Guo, Lei; Zhao, Zhuo; Chen, Danny Z; Hannan, Raquibul; Zhang, Siyuan; Xiao, Guanghua; Mu, Ping; Hanker, Ariella B; Strand, Douglas; Arteaga, Carlos L; Desai, Neil; Wang, Xinlei; Xie, Yang; Wang, Tao.

bioRxiv ; 2024 Jan 25.

Article in English | MEDLINE | ID: mdl-37781617

ABSTRACT

Cell-cell communication (CCC) is essential to how life forms and functions. However, accurate, high-throughput mapping of how expression of all genes in one cell affects expression of all genes in another cell has been made possible only recently, through the introduction of spatially resolved transcriptomics technologies (SRTs), especially those that achieve single-cell resolution. Significant challenges remain, however, in analyzing such highly complex data properly. Here, we introduce a Bayesian multi-instance learning framework, spacia, to detect CCCs from data generated by SRTs, by uniquely exploiting their spatial modality. We highlight spacia's power to overcome fundamental limitations of popular analytical tools for inference of CCCs, including loss of single-cell resolution, restriction to ligand-receptor relationships and prior interaction databases, high false positive rates, and, most importantly, the lack of consideration of the multiple-sender-to-one-receiver paradigm. We evaluated the fitness of spacia for all three commercialized single-cell resolution ST technologies: MERSCOPE/Vizgen, CosMx/Nanostring, and Xenium/10X. Spacia unveiled how endothelial cells, fibroblasts and B cells in the tumor microenvironment contribute to Epithelial-Mesenchymal Transition and lineage plasticity in prostate cancer cells. We deployed spacia in a set of pan-cancer datasets and showed that B cells also participate in PDL1/PD1 signaling in tumors. We demonstrated that a CD8+ T cell/PDL1 effectiveness signature derived from spacia analyses is associated with patient survival and response to immune checkpoint inhibitor treatments in 3,354 patients. We revealed differential spatial interaction patterns between γδ T cells and liver hepatocytes in healthy and cancerous contexts. Overall, spacia represents a notable step in advancing quantitative theories of cellular communications.

8.

Deep learning of cell spatial organizations identifies clinically relevant insights in tissue images.

Wang, Shidan; Rong, Ruichen; Zhou, Qin; Yang, Donghan M; Zhang, Xinyi; Zhan, Xiaowei; Bishop, Justin; Chi, Zhikai; Wilhelm, Clare J; Zhang, Siyuan; Pickering, Curtis R; Kris, Mark G; Minna, John; Xie, Yang; Xiao, Guanghua.

Nat Commun ; 14(1): 7872, 2023 Dec 11.

Article in English | MEDLINE | ID: mdl-38081823

ABSTRACT

Recent advancements in tissue imaging techniques have facilitated the visualization and identification of various cell types within physiological and pathological contexts. Despite the emergence of cell-cell interaction studies, there is a lack of methods for evaluating individual spatial interactions. In this study, we introduce Ceograph, a cell spatial organization-based graph convolutional network designed to analyze cell spatial organization (for example, the cell spatial distribution, morphology, proximity, and interactions) derived from pathology images. Ceograph identifies key cell spatial organization features by accurately predicting their influence on patient clinical outcomes. In patients with oral potentially malignant disorders, our model highlights reduced structural concordance and increased closeness in epithelial substrata as driving features for an elevated risk of malignant transformation. In lung cancer patients, Ceograph detects elongated tumor nuclei and diminished stroma-stroma closeness as biomarkers for insensitivity to EGFR tyrosine kinase inhibitors. With its potential to predict various clinical outcomes, Ceograph offers a deeper understanding of biological processes and supports the development of personalized therapeutic strategies.

Subjects

Deep Learning; Lung Neoplasms; Humans; Cell Communication; Cell Nucleus; Lung Neoplasms/diagnostic imaging

9.

Reconstructing Spatial Transcriptomics at the Single-cell Resolution with BayesDeep.

Jiang, Xi; Dong, Lei; Wang, Shidan; Wen, Zhuoyu; Chen, Mingyi; Xu, Lin; Xiao, Guanghua; Li, Qiwei.

bioRxiv ; 2023 Dec 08.

Article in English | MEDLINE | ID: mdl-38106214

ABSTRACT

Spatially resolved transcriptomics (SRT) techniques have revolutionized the characterization of molecular profiles while preserving spatial and morphological context. However, most next-generation sequencing-based SRT techniques are limited to measuring gene expression in a confined array of spots, capturing only a fraction of the spatial domain. Typically, these spots encompass gene expression from a few to hundreds of cells, underscoring a critical need for more detailed, single-cell resolution SRT data to enhance our understanding of biological functions within the tissue context. Addressing this challenge, we introduce BayesDeep, a novel Bayesian hierarchical model that leverages cellular morphological data from histology images, commonly paired with SRT data, to reconstruct SRT data at the single-cell resolution. BayesDeep effectively models count data from SRT studies via a negative binomial regression model. This model incorporates explanatory variables such as cell types and nuclei-shape information for each cell extracted from the paired histology image. A feature selection scheme is integrated to examine the association between the morphological and molecular profiles, thereby improving the model robustness. We applied BayesDeep to two real SRT datasets, successfully demonstrating its capability to reconstruct SRT data at the single-cell resolution. This advancement not only yields new biological insights but also significantly enhances various downstream analyses, such as pseudotime and cell-cell communication.
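A generic negative binomial regression, of the kind the abstract names, can be sketched in a few lines. This is a textbook formulation and not the BayesDeep implementation: the log link, the `size_factor` offset, the mean-dispersion parameterization, and all function and variable names are assumptions made for illustration.

```python
import math
from typing import Sequence


def expected_count(covariates: Sequence[float],
                   coefs: Sequence[float],
                   size_factor: float = 1.0) -> float:
    """Mean count under an assumed log link: mu = size_factor * exp(x . beta)."""
    linear = sum(x * b for x, b in zip(covariates, coefs))
    return size_factor * math.exp(linear)


def nb_log_pmf(y: int, mu: float, phi: float) -> float:
    """Log-probability of count y under a negative binomial with mean mu and
    dispersion phi (variance = mu + mu**2 / phi)."""
    return (
        math.lgamma(y + phi) - math.lgamma(phi) - math.lgamma(y + 1)
        + phi * math.log(phi / (phi + mu))
        + y * math.log(mu / (phi + mu))
    )


# Hypothetical cell-level covariates: [intercept, cell-type indicator, nucleus area (scaled)].
x = [1.0, 1.0, 0.4]
beta = [0.5, 0.8, -0.3]  # illustrative coefficients, not estimates from any dataset
mu = expected_count(x, beta)
print(mu, nb_log_pmf(3, mu, phi=2.0))
```

A coefficient fixed at zero removes its covariate from the mean, which is the effect a feature selection scheme has on this kind of model.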

10.

Osteosarcoma Explorer: A Data Commons With Clinical, Genomic, Protein, and Tissue Imaging Data for Osteosarcoma Research.

Yang, Donghan M; Zhou, Qinbo; Furman-Cline, Lauren; Cheng, Xian; Luo, Danni; Lai, Hongyin; Li, Yueqi; Jin, Kevin W; Yao, Bo; Leavey, Patrick J; Rakheja, Dinesh; Lo, Tammy; Hall, David; Barkauskas, Donald A; Shulman, David S; Janeway, Katherine; Khanna, Chand; Gorlick, Richard; Menzies, Christopher; Zhan, Xiaowei; Xiao, Guanghua; Skapek, Stephen X; Xu, Lin; Klesse, Laura J; Crompton, Brian D; Xie, Yang.

JCO Clin Cancer Inform ; 7: e2300104, 2023 Sep.

Article in English | MEDLINE | ID: mdl-37956387

ABSTRACT

PURPOSE: Osteosarcoma research advancement requires enhanced data integration across different modalities and sources. Current osteosarcoma research, encompassing clinical, genomic, protein, and tissue imaging data, is hindered by the siloed landscape of data generation and storage. MATERIALS AND METHODS: Clinical, molecular profiling, and tissue imaging data for 573 patients with pediatric osteosarcoma were collected from four public and institutional sources. A common data model incorporating standardized terminology was created to facilitate the transformation, integration, and load of source data into a relational database. On the basis of this database, a data commons accompanied by a user-friendly web portal was developed, enabling various data exploration and analytics functions. RESULTS: The Osteosarcoma Explorer (OSE) was released to the public in 2021. Leveraging a comprehensive and harmonized data set on the backend, the OSE offers a wide range of functions, including Cohort Discovery, Patient Dashboard, Image Visualization, and Online Analysis. Since its initial release, the OSE has experienced an increasing utilization by the osteosarcoma research community and provided solid, continuous user support. To our knowledge, the OSE is the largest (N = 573) and most comprehensive research data commons for pediatric osteosarcoma, a rare disease. This project demonstrates an effective framework for data integration and data commons development that can be readily applied to other projects sharing similar goals. CONCLUSION: The OSE offers an online exploration and analysis platform for integrated clinical, molecular profiling, and tissue imaging data of osteosarcoma. Its underlying data model, database, and web framework support continuous expansion onto new data modalities and sources.

Subjects

Data Management; Osteosarcoma; Child; Humans; Databases, Factual; Genomics; Osteosarcoma/diagnostic imaging; Osteosarcoma/genetics

11.

Artificial intelligence in mental healthcare: an overview and future perspectives.

Jin, Kevin W; Li, Qiwei; Xie, Yang; Xiao, Guanghua.

Br J Radiol ; 96(1150): 20230213, 2023 Oct.

Article in English | MEDLINE | ID: mdl-37698582

ABSTRACT

Artificial intelligence is disrupting the field of mental healthcare through applications in computational psychiatry, which leverages quantitative techniques to inform our understanding, detection, and treatment of mental illnesses. This paper provides an overview of artificial intelligence technologies in modern mental healthcare and surveys recent advances made by researchers, focusing on the nascent field of digital psychiatry. We also consider the ethical implications of artificial intelligence playing a greater role in mental healthcare.

Subjects

Mental Disorders; Mental Health Services; Psychiatry; Humans; Artificial Intelligence; Delivery of Health Care/methods; Mental Disorders/diagnosis; Mental Disorders/therapy

12.

Unsupervised domain adaptation for nuclei segmentation: Adapting from hematoxylin & eosin stained slides to immunohistochemistry stained slides using a curriculum approach.

Wang, Shidan; Rong, Ruichen; Gu, Zifan; Fujimoto, Junya; Zhan, Xiaowei; Xie, Yang; Xiao, Guanghua.

Comput Methods Programs Biomed ; 241: 107768, 2023 Nov.

Article in English | MEDLINE | ID: mdl-37619429

ABSTRACT

BACKGROUND AND OBJECTIVE: Unsupervised domain adaptation (UDA) is a powerful approach in tackling domain discrepancies and reducing the burden of laborious and error-prone pixel-level annotations for instance segmentation. However, the domain adaptation strategies utilized in previous instance segmentation models pool all the labeled/detected instances together to train the instance-level GAN discriminator, which neglects the differences among multiple instance categories. Such pooling prevents UDA instance segmentation models from learning categorical correspondence between source and target domains for accurate instance classification. METHODS: To tackle this challenge, we propose an Instance Segmentation CycleGAN (ISC-GAN) algorithm for UDA multiclass-instance segmentation. We conduct extensive experiments on the multiclass nuclei recognition task to transfer knowledge from hematoxylin and eosin to immunohistochemistry stained pathology images. Specifically, we fuse CycleGAN with Mask R-CNN to learn categorical correspondence with image-level domain adaptation and virtual supervision. Moreover, we utilize Curriculum Learning to separate the learning process into two steps: (1) learning segmentation only on labeled source data, and (2) learning target domain segmentation with paired virtual labels generated by ISC-GAN. The performance was further improved through experiments with other strategies, including Shared Weights, Knowledge Distillation, and Expanded Source Data. RESULTS: Compared with the baseline model and three UDA instance detection and segmentation models, ISC-GAN achieves state-of-the-art performance, with 39.1% average precision and 48.7% average recall. The source code of ISC-GAN is available at https://github.com/sdw95927/InstanceSegmentation-CycleGAN. CONCLUSION: ISC-GAN adapted knowledge from hematoxylin and eosin to immunohistochemistry stained pathology images, suggesting the potential for reducing the need for large annotated pathological image datasets in deep learning and computer vision tasks.

Subjects

Algorithms; Curriculum; Eosine Yellowish-(YS); Hematoxylin; Immunohistochemistry

13.

A Deep Learning Onion Peeling Approach to Measure Oral Epithelium Layer Number.

Zhang, Xinyi; Gleber-Netto, Frederico O; Wang, Shidan; Jin, Kevin W; Yang, Donghan M; Gillenwater, Ann M; Myers, Jeffrey N; Ferrarotto, Renata; Pickering, Curtis R; Xiao, Guanghua.

Cancers (Basel) ; 15(15)2023 Jul 31.

Article in English | MEDLINE | ID: mdl-37568707

ABSTRACT

Head and neck squamous cell carcinoma (HNSCC), specifically in the oral cavity (oral squamous cell carcinoma, OSCC), is a common, complex cancer that significantly affects patients' quality of life. Early diagnosis typically improves prognoses yet relies on pathologist examination of histology images that exhibit high inter- and intra-observer variation. The advent of deep learning has automated this analysis, notably with object segmentation. However, techniques for automated oral dysplasia diagnosis have been limited to shape or cell stain information, without addressing the diagnostic potential in counting the number of cell layers in the oral epithelium. Our study attempts to address this gap by combining the existing U-Net and HD-Staining architectures for segmenting the oral epithelium and introducing a novel algorithm that we call Onion Peeling for counting the epithelium layer number. Experimental results show a close correlation between our algorithmic and expert manual layer counts, demonstrating the feasibility of automated layer counting. We also show the clinical relevance of oral epithelial layer number to grading oral dysplasia severity through survival analysis. Overall, our study shows that automated counting of oral epithelium layers can represent a potential addition to the digital pathology toolbox. Model generalizability and accuracy could be improved further with a larger training dataset.

14.

ScopeViewer: A Browser-Based Solution for Visualizing Spatial Transcriptomics Data.

Luo, Danni; Robertson, Sophie; Zhan, Yuanchun; Rong, Ruichen; Wang, Shidan; Jiang, Xi; Yang, Sen; Palmer, Suzette; Jia, Liwei; Li, Qiwei; Xiao, Guanghua; Zhan, Xiaowei.

bioRxiv ; 2023 Jul 25.

Article in English | MEDLINE | ID: mdl-37546786

ABSTRACT

Motivation: Spatial transcriptomics (ST) enables a high-resolution interrogation of molecular characteristics within specific spatial contexts and tissue morphology. Despite its potential, visualization of ST data is a challenging task due to the complexities in handling, sharing and visualizing large image datasets together with molecular information. Results: We introduce ScopeViewer, a browser-based software designed to overcome these challenges. ScopeViewer offers the following functionalities: (1) It visualizes large image data and associated annotations at various zoom levels, allowing for intricate exploration of the data; (2) It enables dual interactive viewing of the original images along with their annotations, providing a comprehensive understanding of the context; (3) It displays spatial molecular features with optimized bandwidth, ensuring a smooth user experience; and (4) It bolsters data security by circumventing data transfers. Availability: ScopeViewer is available at: https://datacommons.swmed.edu/scopeviewer.

15.

Deep Learning of Cell Spatial Organizations Identifies Clinically Relevant Insights in Tissue Images.

Wang, Shidan; Rong, Ruichen; Yang, Donghan M; Zhang, Xinyi; Zhan, Xiaowei; Bishop, Justin; Wilhelm, Clare J; Zhang, Siyuan; Pickering, Curtis R; Kris, Mark G; Minna, John; Xie, Yang; Xiao, Guanghua.

Res Sq ; 2023 Jul 04.

Article in English | MEDLINE | ID: mdl-37461694

ABSTRACT

Recent advancements in tissue imaging techniques have facilitated the visualization and identification of various cell types within physiological and pathological contexts. Despite the emergence of cell-cell interaction studies, there is a lack of methods for evaluating individual spatial interactions. In this study, we introduce Ceograph, a novel cell spatial organization-based graph convolutional network designed to analyze cell spatial organization (i.e. the cell spatial distribution, morphology, proximity, and interactions) derived from pathology images. Ceograph identifies key cell spatial organization features by accurately predicting their influence on patient clinical outcomes. In patients with oral potentially malignant disorders, our model highlights reduced structural concordance and increased closeness in epithelial substrata as driving features for an elevated risk of malignant transformation. In lung cancer patients, Ceograph detects elongated tumor nuclei and diminished stroma-stroma closeness as biomarkers for insensitivity to EGFR tyrosine kinase inhibitors. With its potential to predict various clinical outcomes, Ceograph offers a deeper understanding of biological processes and supports the development of personalized therapeutic strategies.

16.

Image-based quantification of histological features as a function of spatial location using the Tissue Positioning System.

Rong, Ruichen; Wei, Yonglong; Li, Lin; Wang, Tao; Zhu, Hao; Xiao, Guanghua; Wang, Yunguan.

EBioMedicine ; 94: 104698, 2023 Aug.

Article in English | MEDLINE | ID: mdl-37453365

ABSTRACT

BACKGROUND: Tissues such as the liver lobule, kidney nephron, and intestinal gland exhibit intricate patterns of zonated gene expression corresponding to distinct cell types and functions. To quantitatively understand zonation, it is important to measure cellular or genetic features as a function of position along a zonal axis. While it is possible to manually count, characterize, and locate features in relation to the zonal axis, doing so is labor-intensive and difficult while maintaining precision and accuracy. METHODS: We addressed this challenge by developing a deep-learning-based quantification method called the "Tissue Positioning System" (TPS), which can automatically analyze zonation in the liver lobule as a model system. FINDINGS: By using algorithms that identified vessels, classified vessels, and segmented zones based on the relative position along the portal vein to central vein axis, TPS was able to spatially quantify gene expression in mice with zone-specific reporters. INTERPRETATION: TPS could discern expression differences between zonal reporter strains, ages, and disease states. TPS could also reveal the zonal distribution of cells previously thought to be positioned randomly. The design principles of TPS could be generalized to other tissues to explore the biology of zonation. FUNDING: CPRIT (RP190208, RP220614, RP230330) and NIH (P30CA142543, R01AA028791, R01CA251928, R01DK1253961, R01GM140012, 1R01GM141519, 1R01DE030656, 1U01CA249245). The Pollack Foundation, Simmons Comprehensive Cancer Center Cancer & Obesity Translational Pilot Award, and the Emerging Leader Award from the Mark Foundation For Cancer Research (#21-003-ELA).

Subjects

Hepatocytes, Liver, Mice, Animals, Hepatocytes/metabolism, Liver/metabolism, Biological Models, Post-Translational Protein Processing

17.

Dissecting molecular, pathological, and clinical features associated with tumor neural/neuroendocrine heterogeneity.

Cai, Ling; DeBerardinis, Ralph J; Xiao, Guanghua; Minna, John D; Xie, Yang.

iScience ; 26(6): 106983, 2023 Jun 16.

Article in English | MEDLINE | ID: mdl-37378310

ABSTRACT

Lineage plasticity, especially transdifferentiation between neural/neuroendocrine (NE) and non-NE lineage, has been observed in multiple cancer types and linked to increased tumor aggressiveness. However, existing NE/non-NE subtype classifications in various cancer types were established through ad hoc approaches in different studies, making it difficult to align findings across cancer types and extend investigations to new datasets. To address this issue, we developed a generalized strategy to generate quantitative NE scores and a web application to facilitate its implementation. We applied this method to nine datasets covering seven cancer types, including two neural cancers, two neuroendocrine cancers, and three non-NE cancers. Our analysis revealed significant NE inter-tumoral heterogeneity and identified strong associations between NE scores and molecular, histological, and clinical features, including prognosis in different cancer types. These results support the translational utility of NE scores. Overall, our work demonstrated a broadly applicable strategy for determining the NE properties of tumors.

18.

A Comparative Study of Neuroendocrine Heterogeneity in Small Cell Lung Cancer and Neuroblastoma.

Cai, Ling; DeBerardinis, Ralph J; Xie, Yang; Minna, John D; Xiao, Guanghua.

Mol Cancer Res ; 21(8): 795-807, 2023 Aug 01.

Article in English | MEDLINE | ID: mdl-37255415

ABSTRACT

Lineage plasticity has long been documented in both small cell lung cancer (SCLC) and neuroblastoma, two clinically distinct neuroendocrine (NE) cancers. In this study, we quantified the NE features of cancer as NE scores and performed a systematic comparison of SCLC and neuroblastoma. We found neuroblastoma and SCLC cell lines have highly similar molecular profiles and shared therapeutic sensitivity. In addition, NE heterogeneity was observed at both the inter- and intra-cell line levels. Surprisingly, we did not find a significant association between NE scores and overall survival in SCLC or neuroblastoma. We described many shared and unique NE score-associated features between SCLC and neuroblastoma, including dysregulation of Myc oncogenes, alterations in protein expression, metabolism, drug resistance, and selective gene dependencies. IMPLICATIONS: Our work establishes a reference for molecular changes and vulnerabilities associated with NE to non-NE transdifferentiation through mutual validation of SCLC and neuroblastoma samples.

Subjects

Neuroendocrine Carcinoma, Lung Neoplasms, Neuroblastoma, Small Cell Lung Carcinoma, Humans, Small Cell Lung Carcinoma/genetics, Lung Neoplasms/genetics, Neuroblastoma/genetics, Cell Line

19.

A comparative study of neuroendocrine heterogeneity in small cell lung cancer and neuroblastoma.

Cai, Ling; DeBerardinis, Ralph J; Xie, Yang; Minna, John D; Xiao, Guanghua.

Mol Cancer Res ; 2023 May 12.

Article in English | MEDLINE | ID: mdl-37171983

ABSTRACT

Lineage plasticity has long been documented in both small cell lung cancer (SCLC) and neuroblastoma (NBL), two clinically distinct neuroendocrine (NE) cancers. In this study, we quantified the NE features of cancer as NE scores and performed a systematic comparison of SCLC and NBL. We found NBL and SCLC cell lines have highly similar molecular profiles and shared therapeutic sensitivity. In addition, NE heterogeneity was observed at both the inter- and intra-cell line levels. Surprisingly, we did not find a significant association between NE scores and overall survival in SCLC or NBL. We described many shared and unique NE score-associated features between SCLC and NBL, including dysregulation of Myc oncogenes, alterations in protein expression, metabolism, drug resistance, and selective gene dependencies. Implications: Our work establishes a reference for molecular changes and vulnerabilities associated with NE to non-NE transdifferentiation through mutual validation of SCLC and NBL samples.

20.

Deep-Learning-Based Hepatic Ploidy Quantification Using H&E Histopathology Images.

Wen, Zhuoyu; Lin, Yu-Hsuan; Wang, Shidan; Fujiwara, Naoto; Rong, Ruichen; Jin, Kevin W; Yang, Donghan M; Yao, Bo; Yang, Shengjie; Wang, Tao; Xie, Yang; Hoshida, Yujin; Zhu, Hao; Xiao, Guanghua.

Genes (Basel) ; 14(4), 2023 Apr 16.

Article in English | MEDLINE | ID: mdl-37107679

ABSTRACT

Polyploidy, the duplication of the entire genome within a single cell, is a significant characteristic of cells in many tissues, including the liver. The quantification of hepatic ploidy typically relies on flow cytometry and immunofluorescence (IF) imaging, which are not widely available in clinical settings due to high financial and time costs. To improve accessibility for clinical samples, we developed a computational algorithm to quantify hepatic ploidy using hematoxylin-eosin (H&E) histopathology images, which are commonly obtained during routine clinical practice. Our algorithm uses a deep learning model to first segment and classify different types of cell nuclei in H&E images. It then determines cellular ploidy based on the relative distance between identified hepatocyte nuclei and determines nuclear ploidy using a fitted Gaussian mixture model. The algorithm can establish the total number of hepatocytes and their detailed ploidy information in a region of interest (ROI) on H&E images. This is the first successful attempt to automate ploidy analysis on H&E images. Our algorithm is expected to serve as an important tool for studying the role of polyploidy in human liver disease.

Subjects

Deep Learning, Humans, Eosine Yellowish-(YS), Hematoxylin, Liver, Ploidies, Polyploidy
