RESUMO
Mosquito transmission of dengue viruses to humans starts with infection of skin resident cells at the biting site. There is great interest in identifying transmission-enhancing factors in mosquito saliva in order to counteract them. Here we report the discovery of high levels of the anti-immune subgenomic flaviviral RNA (sfRNA) in dengue virus 2-infected mosquito saliva. We established that sfRNA is present in saliva using three different methods: northern blot, RT-qPCR and RNA sequencing. We next show that salivary sfRNA is protected in detergent-sensitive compartments, likely extracellular vesicles. In support of this hypothesis, we visualized viral RNAs in vesicles in mosquito saliva and noted a marked enrichment of signal from 3'UTR sequences, which is consistent with the presence of sfRNA. Furthermore, we show that incubation with mosquito saliva containing higher sfRNA levels results in higher virus infectivity in a human hepatoma cell line and human primary dermal fibroblasts. Transfection of 3'UTR RNA prior to DENV2 infection inhibited type I and III interferon induction and signaling, and enhanced viral replication. Therefore, we posit that sfRNA present in salivary extracellular vesicles is delivered to cells at the biting site to inhibit innate immunity and enhance dengue virus transmission.
Assuntos
Aedes , Culicidae , Dengue , Flavivirus , Animais , Humanos , Flavivirus/genética , RNA Subgenômico , Saliva/metabolismo , Regiões 3' não Traduzidas , Replicação Viral , RNA Viral/genética , RNA Viral/metabolismoRESUMO
There is a need for methods that can image chromosomes with genome-wide coverage, as well as greater genomic and optical resolution. We introduce OligoFISSEQ, a suite of three methods that leverage fluorescence in situ sequencing (FISSEQ) of barcoded Oligopaint probes to enable the rapid visualization of many targeted genomic regions. Applying OligoFISSEQ to human diploid fibroblast cells, we show how four rounds of sequencing are sufficient to produce 3D maps of 36 genomic targets across six chromosomes in hundreds to thousands of cells, implying a potential to image thousands of targets in only five to eight rounds of sequencing. We also use OligoFISSEQ to trace chromosomes at finer resolution, following the path of the X chromosome through 46 regions, with separate studies showing compatibility of OligoFISSEQ with immunocytochemistry. Finally, we combined OligoFISSEQ with OligoSTORM, laying the foundation for accelerated single-molecule super-resolution imaging of large swaths of, if not entire, human genomes.
Assuntos
Coloração Cromossômica/métodos , Cromossomos/química , Cromossomos/genética , Genoma Humano , Humanos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Sondas de Oligonucleotídeos , Mapeamento Físico do CromossomoRESUMO
Oligonucleotide (oligo)-based FISH has emerged as an important tool for the study of chromosome organization and gene expression and has been empowered by the commercial availability of highly complex pools of oligos. However, a dedicated bioinformatic design utility has yet to be created specifically for the purpose of identifying optimal oligo FISH probe sequences on the genome-wide scale. Here, we introduce OligoMiner, a rapid and robust computational pipeline for the genome-scale design of oligo FISH probes that affords the scientist exact control over the parameters of each probe. Our streamlined method uses standard bioinformatic file formats, allowing users to seamlessly integrate new and existing utilities into the pipeline as desired, and introduces a method for evaluating the specificity of each probe molecule that connects simulated hybridization energetics to rapidly generated sequence alignments using supervised machine learning. We demonstrate the scalability of our approach by performing genome-scale probe discovery in numerous model organism genomes and showcase the performance of the resulting probes with diffraction-limited and single-molecule superresolution imaging of chromosomal and RNA targets. We anticipate that this pipeline will make the FISH probe design process much more accessible and will more broadly facilitate the design of pools of hybridization probes for a variety of applications.
Assuntos
Genômica/métodos , Hibridização in Situ Fluorescente/métodos , Sondas de Oligonucleotídeos/química , Sondas de Oligonucleotídeos/genética , Animais , Arabidopsis , DNA/genética , DNA/metabolismo , Mineração de Dados , Humanos , Camundongos , Modelos Genéticos , Sondas de Oligonucleotídeos/metabolismoRESUMO
Chromosome organization is crucial for genome function. Here, we present a method for visualizing chromosomal DNA at super-resolution and then integrating Hi-C data to produce three-dimensional models of chromosome organization. Using the super-resolution microscopy methods of OligoSTORM and OligoDNA-PAINT, we trace 8 megabases of human chromosome 19, visualizing structures ranging in size from a few kilobases to over a megabase. Focusing on chromosomal regions that contribute to compartments, we discover distinct structures that, in spite of considerable variability, can predict whether such regions correspond to active (A-type) or inactive (B-type) compartments. Imaging through the depths of entire nuclei, we capture pairs of homologous regions in diploid cells, obtaining evidence that maternal and paternal homologous regions can be differentially organized. Finally, using restraint-based modeling to integrate imaging and Hi-C data, we implement a method-integrative modeling of genomic regions (IMGR)-to increase the genomic resolution of our traces to 10 kb.
Assuntos
Passeio de Cromossomo/métodos , Cromossomos Humanos Par 19/genética , Cromossomos Humanos Par 19/ultraestrutura , Modelos Genéticos , Células Cultivadas , Coloração Cromossômica/métodos , Estruturas Cromossômicas/química , Estruturas Cromossômicas/genética , Estruturas Cromossômicas/ultraestrutura , Cromossomos Humanos Par 19/química , Feminino , Corantes Fluorescentes , Humanos , Imageamento Tridimensional , Hibridização in Situ Fluorescente/métodos , Masculino , Sondas de Oligonucleotídeos , LinhagemRESUMO
We utilized single-molecule tethered particle motion (TPM) tracking, optimized for studying the behavior of short (0.922 µm) dsDNA molecules under shear flow conditions, in the proximity of a wall (surface). These experiments track the individual trajectories through a gold nanobead (40 nm in radius), attached to the loose end of the DNA molecules. Under such circumstances, local interactions with the wall become more pronounced, manifested through hydrodynamic interactions. To elucidate the mechanical mechanism that affects the statistics of the molecular trajectories of the tethered molecules, we estimate the resting diffusion coefficient of our system. Using this value and our measured data, we calculate the orthogonal distance of the extended DNA molecules from the surface. This calculation considers the hydrodynamic drag effect that emerges from the proximity of the molecule to the surface, using the Faxén correction factors. Our finding enables the construction of a scenario according to which the tension along the chain builds up with the applied shear force, driving the loose end of the DNA molecule away from the wall. With the extension from the wall, the characteristic times of the system decrease by three orders of magnitude, while the drag coefficients decay to a plateau value that indicates that the molecule still experiences hydrodynamic effects due to its proximity to the wall.
Assuntos
DNA/química , Resistência ao Cisalhamento , DifusãoRESUMO
PURPOSE: In conventional three-dimensional magnetic resonance elastography, motion encoding gradients (MEGs) synchronized to a mechanical excitation are applied separately in each direction to encode tissue displacement generated by the corresponding waves. This requires long acquisition times that introduce errors due to patient motion and may hinder clinical deployment of magnetic resonance elastography. In this article, a framework for MEGs sequence design is proposed to reduce scanning time and increase signal-to-noise ratio. THEORY AND METHODS: The approach is based on applying MEGs in all three directions simultaneously with varying parameters, and formulation of the problem as a linear estimation of the wave properties. Multidirectional MEGs sequences are derived by setting the problem in an experimental design framework. Such designs are implemented and evaluated on simulation and phantom data. RESULTS: Estimation error of the displacement using the proposed MEGs designs is reduced up to a factor of two in comparison with a unidirectional design with a same number of acquisitions. Alternatively, for the same error, scanning time is reduced up to a factor of three using the multidirectional designs. CONCLUSION: The proposed framework generalizes acquisition of magnetic resonance elastography, and allows quantification of design performance, and optimization-based derivation of designs.
Assuntos
Módulo de Elasticidade/fisiologia , Técnicas de Imagem por Elasticidade/métodos , Interpretação de Imagem Assistida por Computador/métodos , Armazenamento e Recuperação da Informação/métodos , Modelos Biológicos , Movimento/fisiologia , Simulação por Computador , Humanos , Aumento da Imagem/métodos , Movimento (Física) , Reprodutibilidade dos Testes , Sensibilidade e EspecificidadeRESUMO
The purpose of this work was (1) to develop a magnetic resonance elastography (MRE) system for imaging of the ex vivo human prostate and (2) to assess the diagnostic power of mono-frequency and multi-frequency MRE and diffusion weighted imaging (DWI) alone and combined as correlated with histopathology in a patient study. An electromagnetic driver was designed specifically for MRE studies in small-bore MR scanners. Ex vivo prostate specimens (post-fixation) of 14 patients who underwent radical prostatectomy were imaged with MRE at 7 T (nine cases had DWI). In six patients, the MRE examination was performed at three frequencies (600, 800, 1000 Hz) to extract the power-law exponent Gamma. The images were registered to wholemount pathology slides marked with the Gleason score. The areas under the receiver-operator-characteristic curves (AUC) were calculated. The methods were validated in a phantom study and it was demonstrated that (i) the driver does not interfere with the acquisition process and (ii) the driver can generate amplitudes greater than 100 µm for frequencies less than 1 kHz. In the quantitative study, cancerous tissue with Gleason score at least 3 + 3 was distinguished from normal tissue in the peripheral zone (PZ) with an average AUC of 0.75 (Gd ), 0.75 (Gl ), 0.70 (Gamma-Gd ), 0.68 (apparent diffusion coefficient, ADC), and 0.82 (Gd + Gl + ADC). The differentiation between PZ and central gland was modest for Gd (p < 0.07), Gl (p < 0.06) but not significant for Gamma (p < 0.2). A correlation of 0.4 kPa/h was found between the fixation time of the prostate specimen and the stiffness of the tissue, which could affect the diagnostic power results. DWI and MRE may provide complementary information; in fact MRE performed better than ADC in distinguishing normal from cancerous tissue in some cases. Multi-frequency (Gamma) analysis did not appear to improve the results. However, in light of the effect of tissue fixation, the clinical implication of our results may be inconclusive and more experiments are needed.
Assuntos
Imagem de Difusão por Ressonância Magnética/métodos , Técnicas de Imagem por Elasticidade/métodos , Neoplasias da Próstata/diagnóstico , Neoplasias da Próstata/patologia , Idoso , Área Sob a Curva , Biópsia , Módulo de Elasticidade , Humanos , Masculino , Pessoa de Meia-Idade , Imagens de Fantasmas , Próstata/patologia , Curva ROC , TransdutoresRESUMO
The purpose of this work was to assess trans-perineal prostate magnetic resonance elastography (MRE) for (1) repeatability in phantoms/volunteers and (2) diagnostic power as correlated with histopathology in prostate cancer patients. The three-dimensional (3D) displacement field was obtained using a fractionally encoded gradient echo sequence using a custom-made transducer. The repeatability of the method was assessed based on three repeat studies and by changing the driving frequency by 3% in studies on a phantom and six healthy volunteers. Subsequently, 11 patients were examined with MRE prior to radical prostatectomy. The areas under the receiver operating characteristic curves were calculated using a windowed voxel-to-voxel approach by comparing the 2D registered slides, masked with the Gleason score. For the repeatability study, the average intraclass correlation coefficient for elasticity images was 99% for repeat phantom studies, 98% for ±6 Hz phantom studies, 95% for volunteer repeat studies with 2 min acquisition time, 82% for ±2 Hz volunteer studies with 2 min acquisition time and 73% for repeat volunteer studies with 8 min acquisition time. For the patient study, the average elasticity was 8.2 ± 1.7 kPa in the prostate capsule, 7.5 ± 1.9 kPa in the peripheral zone (PZ), 9.7 ± 3.0 kPa in the central gland (CG) and 9.0 ± 3.4 kPa in the transition zone. In the patient study, cancerous tissue with Gleason score at least 3 + 3 was significantly (p < 0.05) different from normal tissue in 10 out of 11 cases with tumors in the PZ, and 6 out of 9 cases with tumors in the CG. However, the overall case-averaged area under the curve was 0.72 in the PZ and 0.67 in the CG. Cancerous tissue was not always stiffer than normal tissue. The inversion algorithm was sensitive to (i) vibration amplitude and displacement nodes and (ii) misalignment of the 3D wave field due to subject movement.
Assuntos
Técnicas de Imagem por Elasticidade/métodos , Imageamento por Ressonância Magnética/métodos , Neoplasias da Próstata/diagnóstico , Neoplasias da Próstata/patologia , Adulto , Idoso , Área Sob a Curva , Módulo de Elasticidade , Humanos , Masculino , Pessoa de Meia-Idade , Imagens de Fantasmas , Curva ROC , Reprodutibilidade dos TestesRESUMO
Nuclear organization impacts gene expression activity and cell phenotype. Our current understanding is mainly derived from ensemble-level sequencing studies that reflect the 3D genome structure of millions of cells. These approaches have provided invaluable details on the 3D organizations of the genome and their relation to other nuclear landmarks. However, they mostly lack the ability to provide multimodal information simultaneously at the single-cell level. In recent years, cutting-edge imaging technologies have risen to the challenge of simultaneously describing multiple components of the nuclear space at the single-cell level, paving the way for a deeper understanding of the genome structure-function relationship. This review will focus on the development and utilization of such technologies to gain a multi-component view of the nucleus at single-cell resolution, dissecting the complexity and heterogeneity of nuclear organization.
Assuntos
Núcleo Celular , Genoma , Núcleo Celular/metabolismo , Cromatina/metabolismoRESUMO
Nuclear compartments are prominent features of 3D chromatin organization, but sequencing depth limitations have impeded investigation at ultra fine-scale. CTCF loops are generally studied at a finer scale, but the impact of looping on proximal interactions remains enigmatic. Here, we critically examine nuclear compartments and CTCF loop-proximal interactions using a combination of in situ Hi-C at unparalleled depth, algorithm development, and biophysical modeling. Producing a large Hi-C map with 33 billion contacts in conjunction with an algorithm for performing principal component analysis on sparse, super massive matrices (POSSUMM), we resolve compartments to 500 bp. Our results demonstrate that essentially all active promoters and distal enhancers localize in the A compartment, even when flanking sequences do not. Furthermore, we find that the TSS and TTS of paused genes are often segregated into separate compartments. We then identify diffuse interactions that radiate from CTCF loop anchors, which correlate with strong enhancer-promoter interactions and proximal transcription. We also find that these diffuse interactions depend on CTCF's RNA binding domains. In this work, we demonstrate features of fine-scale chromatin organization consistent with a revised model in which compartments are more precise than commonly thought while CTCF loops are more protracted.
Assuntos
Cromatina , Elementos Facilitadores Genéticos , Cromatina/genética , Fator de Ligação a CCCTC/genética , Fator de Ligação a CCCTC/metabolismo , Elementos Facilitadores Genéticos/genética , Núcleo Celular/genética , Núcleo Celular/metabolismo , Regiões Promotoras GenéticasRESUMO
Three-dimensional protein structural data at the molecular level are pivotal for successful precision medicine. Such data are crucial not only for discovering drugs that act to block the active site of the target mutant protein but also for clarifying to the patient and the clinician how the mutations harbored by the patient work. The relative paucity of structural data reflects their cost, challenges in their interpretation, and lack of clinical guidelines for their utilization. Rapid technological advancements in experimental high-resolution structural determination increasingly generate structures. Computationally, modeling algorithms, including molecular dynamics simulations, are becoming more powerful, as are compute-intensive hardware, particularly graphics processing units, overlapping with the inception of the exascale era. Accessible, freely available, and detailed structural and dynamical data can be merged with big data to powerfully transform personalizedpharmacology. Here we review protein and emerging genome high-resolution data, along with means, applications, and examples underscoring their usefulness in precision medicine.
Assuntos
Big Data , Medicina de Precisão , Algoritmos , Genoma , Humanos , Medicina de Precisão/métodos , ProteínasRESUMO
HU is a highly conserved protein that is believed to play an important role in the architecture and dynamic compaction of bacterial DNA. Its ability to control DNA bending is crucial for functions such as transcription and replication. The effects of HU on the DNA structure have been studied so far mainly by single molecule methods that require us to apply stretching forces on the DNA and therefore may perturb the DNA-protein interaction. To overcome this hurdle, we study the effect of HU on the DNA structure without applying external forces by using an improved tethered particle motion method. By combining the results with DNA curvature analysis from atomic force microscopy measurements we find that the DNA consists of two different curvature distributions and the measured persistence length is determined by their interplay. As a result, the effective persistence length adopts a bimodal property that depends primarily on the HU concentration. The results can be explained according to a recently suggested model that distinguishes single protein binding from cooperative protein binding.
Assuntos
Proteínas de Bactérias/metabolismo , DNA Bacteriano/metabolismo , Proteínas de Ligação a DNA/metabolismo , Geobacillus stearothermophilus/metabolismo , Proteínas de Bactérias/ultraestrutura , Proteínas de Ligação a DNA/ultraestrutura , Microscopia de Força Atômica , MicroesferasRESUMO
Which signaling pathway and protein to select to mitigate the patient's expected drug resistance? The number of possibilities facing the physician is massive, and the drug combination should fit the patient status. Here, we briefly review current approaches and data and map an innovative patient-specific strategy to forecast drug resistance targets that centers on parallel (or redundant) proliferation pathways in specialized cells. It considers the availability of each protein in each pathway in the specific cell, its activating mutations, and the chromatin accessibility of its encoding gene. The construction of the resulting Proliferation Pathway Network Atlas will harness the emerging exascale computing and advanced artificial intelligence (AI) methods for therapeutic development. Merging the resulting set of targets, pathways, and proteins, with current strategies will augment the choice for the attending physicians to thwart resistance.
Assuntos
Inteligência Artificial , Sistemas Computacionais , Modelos Biológicos , Medicina de Precisão , HumanosRESUMO
Visual inspection of histopathology images of stained biopsy tissue by expert pathologists is the standard method for grading of prostate cancer (PCa). However, this process is time-consuming and subject to high inter-observer variability. Machine learning-based methods have the potential to improve efficient throughput of large volumes of slides while decreasing variability, but they are not easy to develop because they require substantial amounts of labeled training data. In this paper, we propose a deep learning-based classification technique and data augmentation methods for accurate grading of PCa in histopathology images in the presence of limited data. Our method combines the predictions of three separate convolutional neural networks (CNNs) that work with different patch sizes. This enables our method to take advantage of the greater amount of contextual information in larger patches as well as greater quantity of smaller patches in the labeled training data. The predictions produced by the three CNNs are combined using a logistic regression model, which is trained separately after the CNN training. To effectively train our models, we propose new data augmentation methods and empirically study their effects on the classification accuracy. The proposed method achieves an accuracy of [Formula: see text] in classifying cancerous patches versus benign patches and an accuracy of [Formula: see text] in classifying low-grade (i.e., Gleason grade 3) from high-grade (i.e., Gleason grades 4 and 5) patches. The agreement level of our automatic grading method with expert pathologists is within the range of agreement between pathologists. Our experiments indicate that data augmentation is necessary for achieving expert-level performance with deep learning-based methods. A combination of image-space augmentation and feature-space augmentation leads to the best results. Our study shows that well-designed and properly trained deep learning models can achieve PCa Gleason grading accuracy that is comparable to an expert pathologist.
Assuntos
Aprendizado Profundo , Interpretação de Imagem Assistida por Computador/métodos , Gradação de Tumores/métodos , Neoplasias da Próstata , Técnicas Histológicas , Humanos , Masculino , Próstata/patologia , Neoplasias da Próstata/diagnóstico , Neoplasias da Próstata/patologiaRESUMO
Histopathology of Hematoxylin and Eosin (H&E)-stained tissue obtained from biopsy is commonly used in prostate cancer (PCa) diagnosis. Automatic PCa classification of digitized H&E slides has been developed before, but no attempts have been made to classify PCa using additional tissue stains registered to H&E. In this paper, we demonstrate that using H&E, Ki67 and p63-stained (3-stain) tissue improves PCa classification relative to H&E alone. We also show that we can infer PCa-relevant Ki67 and p63 information from the H&E slides alone, and use it to achieve H&E-based PCa classification that is comparable to the 3-stain classification. Reported improvements apply to classifying benign vs. malignant tissue, and low grade (Gleason group 2) vs. high grade (Gleason groups 3,4,5) cancer. Specifically, we conducted four classification tasks using 333 tissue samples extracted from 231 radical prostatectomy patients: regression tree-based classification using either (i) 3-stain features, with a benign vs malignant area under the curve (AUC = 92.9%), or (ii) real H&E features and H&E features learned from Ki67 and p63 stains (AUC = 92.4%), as well as deep learning classification using either (iii) real 3-stain tissue patches (AUC = 94.3%) and (iv) real H&E patches and generated Ki67 and p63 patches (AUC = 93.0%) using a deep convolutional generative adversarial network. Classification performance was assessed with Monte Carlo cross validation and quantified in terms of the Area Under the Curve, Brier score, sensitivity, and specificity. Our results are interpretable and indicate that the standard H&E classification could be improved by mimicking other stain types.
Assuntos
Neoplasias da Próstata , Humanos , Antígeno Ki-67 , Masculino , Gradação de Tumores , Coloração e RotulagemRESUMO
Artificial intelligence (AI) - the ability of a machine to perform cognitive tasks to achieve a particular goal based on provided data - is revolutionizing and reshaping our health-care systems. The current availability of ever-increasing computational power, highly developed pattern recognition algorithms and advanced image processing software working at very high speeds has led to the emergence of computer-based systems that are trained to perform complex tasks in bioinformatics, medical imaging and medical robotics. Accessibility to 'big data' enables the 'cognitive' computer to scan billions of bits of unstructured information, extract the relevant information and recognize complex patterns with increasing confidence. Computer-based decision-support systems based on machine learning (ML) have the potential to revolutionize medicine by performing complex tasks that are currently assigned to specialists to improve diagnostic accuracy, increase efficiency of throughputs, improve clinical workflow, decrease human resource costs and improve treatment choices. These characteristics could be especially helpful in the management of prostate cancer, with growing applications in diagnostic imaging, surgical interventions, skills training and assessment, digital pathology and genomics. Medicine must adapt to this changing world, and urologists, oncologists, radiologists and pathologists, as high-volume users of imaging and pathology, need to understand this burgeoning science and acknowledge that the development of highly accurate AI-based decision-support applications of ML will require collaboration between data scientists, computer researchers and engineers.
Assuntos
Inteligência Artificial , Neoplasias da Próstata , Humanos , Aprendizado de Máquina , Masculino , Neoplasias da Próstata/diagnóstico , Neoplasias da Próstata/terapiaRESUMO
Eukaryotic DNA is highly organized within nuclei and this organization is important for genome function. Fluorescent in situ hybridization (FISH) approaches allow 3D architectures of genomes to be visualized. Scalable FISH technologies, which can be applied to whole animals, are needed to help unravel how genomic architecture regulates, or is regulated by, gene expression during development, growth, reproduction, and aging. Here, we describe a multiplexed DNA FISH Oligopaint library that targets the entire Caenorhabditis elegans genome at chromosome, three megabase, and 500 kb scales. We describe a hybridization strategy that provides flexibility to DNA FISH experiments by coupling a single primary probe synthesis reaction to dye conjugated detection oligos via bridge oligos, eliminating the time and cost typically associated with labeling probe sets for individual experiments. The approach allows visualization of genome organization at varying scales in all/most cells across all stages of development in an intact animal model system.
Assuntos
Caenorhabditis elegans/genética , DNA de Helmintos/genética , Variação Genética , Genoma Helmíntico , Hibridização in Situ Fluorescente/métodos , Animais , Regulação da Expressão GênicaRESUMO
Traumatic brain injury (TBI) is one of the leading causes of death and disability worldwide. Detailed studies of the microglial response after TBI require high throughput quantification of changes in microglial count and morphology in histological sections throughout the brain. In this paper, we present a fully automated end-to-end system that is capable of assessing microglial activation in white matter regions on whole slide images of Iba1 stained sections. Our approach involves the division of the full brain slides into smaller image patches that are subsequently automatically classified into white and grey matter sections. On the patches classified as white matter, we jointly apply functional minimization methods and deep learning classification to identify Iba1-immunopositive microglia. Detected cells are then automatically traced to preserve their complex branching structure after which fractal analysis is applied to determine the activation states of the cells. The resulting system detects white matter regions with 84% accuracy, detects microglia with a performance level of 0.70 (F1 score, the harmonic mean of precision and sensitivity) and performs binary microglia morphology classification with a 70% accuracy. This automated pipeline performs these analyses at a 20-fold increase in speed when compared to a human pathologist. Moreover, we have demonstrated robustness to variations in stain intensity common for Iba1 immunostaining. A preliminary analysis was conducted that indicated that this pipeline can identify differences in microglia response due to TBI. An automated solution to microglia cell analysis can greatly increase standardized analysis of brain slides, allowing pathologists and neuroscientists to focus on characterizing the associated underlying diseases and injuries.
Assuntos
Lesões Encefálicas Traumáticas/patologia , Encéfalo/patologia , Aprendizado Profundo , Processamento de Imagem Assistida por Computador/métodos , Microglia/patologia , Animais , Camundongos , Camundongos Endogâmicos C57BL , Substância Branca/patologiaRESUMO
Importance: Proper evaluation of the performance of artificial intelligence techniques in the analysis of digitized medical images is paramount for the adoption of such techniques by the medical community and regulatory agencies. Objectives: To compare several cross-validation (CV) approaches to evaluate the performance of a classifier for automatic grading of prostate cancer in digitized histopathologic images and compare the performance of the classifier when trained using data from 1 expert and multiple experts. Design, Setting, and Participants: This quality improvement study used tissue microarray data (333 cores) from 231 patients who underwent radical prostatectomy at the Vancouver General Hospital between June 27, 1997, and June 7, 2011. Digitized images of tissue cores were annotated by 6 pathologists for 4 classes (benign and Gleason grades 3, 4, and 5) between December 12, 2016, and October 5, 2017. Patches of 192 µm2 were extracted from these images. There was no overlap between patches. A deep learning classifier based on convolutional neural networks was trained to predict a class label from among the 4 classes (benign and Gleason grades 3, 4, and 5) for each image patch. The classification performance was evaluated in leave-patches-out CV, leave-cores-out CV, and leave-patients-out 20-fold CV. The analysis was performed between November 15, 2018, and January 1, 2019. Main Outcomes and Measures: The classifier performance was evaluated by its accuracy, sensitivity, and specificity in detection of cancer (benign vs cancer) and in low-grade vs high-grade differentiation (Gleason grade 3 vs grades 4-5). The statistical significance analysis was performed using the McNemar test. The agreement level between pathologists and the classifier was quantified using a quadratic-weighted κ statistic. Results: On 333 tissue microarray cores from 231 participants with prostate cancer (mean [SD] age, 63.2 [6.3] years), 20-fold leave-patches-out CV resulted in mean (SD) accuracy of 97.8% (1.2%), sensitivity of 98.5% (1.0%), and specificity of 97.5% (1.2%) for classifying benign patches vs cancerous patches. By contrast, 20-fold leave-patients-out CV resulted in mean (SD) accuracy of 85.8% (4.3%), sensitivity of 86.3% (4.1%), and specificity of 85.5% (7.2%). Similarly, 20-fold leave-cores-out CV resulted in mean (SD) accuracy of 86.7% (3.7%), sensitivity of 87.2% (4.0%), and specificity of 87.7% (5.5%). Results of McNemar tests showed that the leave-patches-out CV accuracy, sensitivity, and specificity were significantly higher than those for both leave-patients-out CV and leave-cores-out CV. Similar results were observed for classifying low-grade cancer vs high-grade cancer. When trained on a single expert, the overall agreement in grading between pathologists and the classifier ranged from 0.38 to 0.58; when trained using the majority vote among all experts, it was 0.60. Conclusions and Relevance: Results of this study suggest that in prostate cancer classification from histopathologic images, patch-wise CV and single-expert training and evaluation may lead to a biased estimation of classifier's performance. To allow reproducibility and facilitate comparison between automatic classification methods, studies in the field should evaluate their performance using patient-based CV and multiexpert data. Some of these conclusions may be generalizable to other histopathologic applications and to other applications of machine learning in medicine.
Assuntos
Inteligência Artificial , Interpretação de Imagem Assistida por Computador/métodos , Próstata , Neoplasias da Próstata , Algoritmos , Humanos , Masculino , Pessoa de Meia-Idade , Gradação de Tumores , Próstata/diagnóstico por imagem , Próstata/patologia , Neoplasias da Próstata/diagnóstico por imagem , Neoplasias da Próstata/patologia , Análise Serial de TecidosRESUMO
Genome organization involves cis and trans chromosomal interactions, both implicated in gene regulation, development, and disease. Here, we focus on trans interactions in Drosophila, where homologous chromosomes are paired in somatic cells from embryogenesis through adulthood. We first address long-standing questions regarding the structure of embryonic homolog pairing and, to this end, develop a haplotype-resolved Hi-C approach to minimize homolog misassignment and thus robustly distinguish trans-homolog from cis contacts. This computational approach, which we call Ohm, reveals pairing to be surprisingly structured genome-wide, with trans-homolog domains, compartments, and interaction peaks, many coinciding with analogous cis features. We also find a significant genome-wide correlation between pairing, transcription during zygotic genome activation, and binding of the pioneer factor Zelda. Our findings reveal a complex, highly structured organization underlying homolog pairing, first discovered a century ago in Drosophila. Finally, we demonstrate the versatility of our haplotype-resolved approach by applying it to mammalian embryos.