Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
1.
BMC Bioinformatics ; 25(1): 27, 2024 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-38225583

RESUMO

BACKGROUND: The recent development of high-throughput sequencing has created a large collection of multi-omics data, which enables researchers to better investigate cancer molecular profiles and cancer taxonomy based on molecular subtypes. Integrating multi-omics data has been proven to be effective for building more precise classification models. Most current multi-omics integrative models use either an early fusion in the form of concatenation or late fusion with a separate feature extractor for each omic, which are mainly based on deep neural networks. Due to the nature of biological systems, graphs are a better structural representation of bio-medical data. Although few graph neural network (GNN) based multi-omics integrative methods have been proposed, they suffer from three common disadvantages. One is most of them use only one type of connection, either inter-omics or intra-omic connection; second, they only consider one kind of GNN layer, either graph convolution network (GCN) or graph attention network (GAT); and third, most of these methods have not been tested on a more complex classification task, such as cancer molecular subtypes. RESULTS: In this study, we propose a novel end-to-end multi-omics GNN framework for accurate and robust cancer subtype classification. The proposed model utilizes multi-omics data in the form of heterogeneous multi-layer graphs, which combine both inter-omics and intra-omic connections from established biological knowledge. The proposed model incorporates learned graph features and global genome features for accurate classification. We tested the proposed model on the Cancer Genome Atlas (TCGA) Pan-cancer dataset and TCGA breast invasive carcinoma (BRCA) dataset for molecular subtype and cancer subtype classification, respectively. The proposed model shows superior performance compared to four current state-of-the-art baseline models in terms of accuracy, F1 score, precision, and recall. The comparative analysis of GAT-based models and GCN-based models reveals that GAT-based models are preferred for smaller graphs with less information and GCN-based models are preferred for larger graphs with extra information.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Neoplasias , Conhecimento , Aprendizagem , Redes Neurais de Computação , Neoplasias/genética
2.
Comput Med Imaging Graph ; 113: 102341, 2024 04.
Artigo em Inglês | MEDLINE | ID: mdl-38277769

RESUMO

Breast cancer continues to be a significant cause of mortality among women globally. Timely identification and precise diagnosis of breast abnormalities are critical for enhancing patient prognosis. In this study, we focus on improving the early detection and accurate diagnosis of breast abnormalities, which is crucial for improving patient outcomes and reducing the mortality rate of breast cancer. To address the limitations of traditional screening methods, a novel unsupervised feature correlation network was developed to predict maps indicating breast abnormal variations using longitudinal 2D mammograms. The proposed model utilizes the reconstruction process of current year and prior year mammograms to extract tissue from different areas and analyze the differences between them to identify abnormal variations that may indicate the presence of cancer. The model incorporates a feature correlation module, an attention suppression gate, and a breast abnormality detection module, all working together to improve prediction accuracy. The proposed model not only provides breast abnormal variation maps but also distinguishes between normal and cancer mammograms, making it more advanced compared to the state-of-the-art baseline models. The results of the study show that the proposed model outperforms the baseline models in terms of Accuracy, Sensitivity, Specificity, Dice score, and cancer detection rate.


Assuntos
Neoplasias da Mama , Mamografia , Feminino , Humanos , Mamografia/métodos , Neoplasias da Mama/diagnóstico por imagem , Prognóstico
3.
Cancers (Basel) ; 14(21)2022 Oct 29.
Artigo em Inglês | MEDLINE | ID: mdl-36358753

RESUMO

Breast cancer is among the most common and fatal diseases for women, and no permanent treatment has been discovered. Thus, early detection is a crucial step to control and cure breast cancer that can save the lives of millions of women. For example, in 2020, more than 65% of breast cancer patients were diagnosed in an early stage of cancer, from which all survived. Although early detection is the most effective approach for cancer treatment, breast cancer screening conducted by radiologists is very expensive and time-consuming. More importantly, conventional methods of analyzing breast cancer images suffer from high false-detection rates. Different breast cancer imaging modalities are used to extract and analyze the key features affecting the diagnosis and treatment of breast cancer. These imaging modalities can be divided into subgroups such as mammograms, ultrasound, magnetic resonance imaging, histopathological images, or any combination of them. Radiologists or pathologists analyze images produced by these methods manually, which leads to an increase in the risk of wrong decisions for cancer detection. Thus, the utilization of new automatic methods to analyze all kinds of breast screening images to assist radiologists to interpret images is required. Recently, artificial intelligence (AI) has been widely utilized to automatically improve the early detection and treatment of different types of cancer, specifically breast cancer, thereby enhancing the survival chance of patients. Advances in AI algorithms, such as deep learning, and the availability of datasets obtained from various imaging modalities have opened an opportunity to surpass the limitations of current breast cancer analysis methods. In this article, we first review breast cancer imaging modalities, and their strengths and limitations. Then, we explore and summarize the most recent studies that employed AI in breast cancer detection using various breast imaging modalities. In addition, we report available datasets on the breast-cancer imaging modalities which are important in developing AI-based algorithms and training deep learning models. In conclusion, this review paper tries to provide a comprehensive resource to help researchers working in breast cancer imaging analysis.

4.
Med Phys ; 49(6): 3654-3669, 2022 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-35271746

RESUMO

PURPOSE: Automatic detection of very small and nonmass abnormalities from mammogram images has remained challenging. In clinical practice for each patient, radiologists commonly not only screen the mammogram images obtained during the examination, but also compare them with previous mammogram images to make a clinical decision. To design an artificial intelligence (AI) system to mimic radiologists for better cancer detection, in this work we proposed an end-to-end enhanced Siamese convolutional neural network to detect breast cancer using previous year and current year mammogram images. METHODS: The proposed Siamese-based network uses high-resolution mammogram images and fuses features of pairs of previous year and current year mammogram images to predict cancer probabilities. The proposed approach is developed based on the concept of one-shot learning that learns the abnormal differences between current and prior images instead of abnormal objects, and as a result can perform better with small sample size data sets. We developed two variants of the proposed network. In the first model, to fuse the features of current and previous images, we designed an enhanced distance learning network that considers not only the overall distance, but also the pixel-wise distances between the features. In the other model, we concatenated the features of current and previous images to fuse them. RESULTS: We compared the performance of the proposed models with those of some baseline models that use current images only (ResNet and VGG) and also use current and prior images (long short-term memory [LSTM] and vanilla Siamese) in terms of accuracy, sensitivity, precision, F1 score, and area under the curve (AUC). Results show that the proposed models outperform the baseline models and the proposed model with the distance learning network performs the best (accuracy: 0.92, sensitivity: 0.93, precision: 0.91, specificity: 0.91, F1: 0.92 and AUC: 0.95). CONCLUSIONS: Integrating prior mammogram images improves automatic cancer classification, specially for very small and nonmass abnormalities. For classification models that integrate current and prior mammogram images, using an enhanced and effective distance learning network can advance the performance of the models.


Assuntos
Neoplasias da Mama , Inteligência Artificial , Neoplasias da Mama/diagnóstico por imagem , Feminino , Humanos , Aprendizado de Máquina , Mamografia/métodos , Redes Neurais de Computação
5.
Adv Exp Med Biol ; 1361: 55-74, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35230683

RESUMO

Copy number variation (CNV), which is deletion and multiplication of segments of a genome, is an important genomic alteration that has been associated with many diseases including cancer. In cancer, CNVs are mostly somatic aberrations that occur during cancer evolution. Advances in sequencing technologies and arrival of next-generation sequencing data (whole-genome sequencing and whole-exome sequencing or targeted sequencing) have opened up an opportunity to detect CNVs with higher accuracy and resolution. Many computational methods have been developed for somatic CNV detection, which is a challenging task due to complexity of cancer sequencing data, high level of noise and biases in the sequencing process, and big data nature of sequencing data. Nevertheless, computational detection of CNV in sequencing data has resulted in the discovery of actionable cancer-specific CNVs to be used to guide cancer therapeutics, contributing to significant progress in precision oncology. In this chapter, we start by introducing CNVs. Then, we discuss the main approaches and methods developed for detecting somatic CNV for next-generation sequencing data, along with its challenges. Finally, we describe the overall workflow for CNV detection and introduce the most common publicly available software tools developed for somatic CNV detection and analysis.


Assuntos
Variações do Número de Cópias de DNA , Neoplasias , Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Neoplasias/genética , Medicina de Precisão , Software
6.
Med Image Anal ; 71: 102049, 2021 07.
Artigo em Inglês | MEDLINE | ID: mdl-33901993

RESUMO

The relatively recent reintroduction of deep learning has been a revolutionary force in the interpretation of diagnostic imaging studies. However, the technology used to acquire those images is undergoing a revolution itself at the very same time. Digital breast tomosynthesis (DBT) is one such technology, which has transformed the field of breast imaging. DBT, a form of three-dimensional mammography, is rapidly replacing the traditional two-dimensional mammograms. These parallel developments in both the acquisition and interpretation of breast images present a unique case study in how modern AI systems can be designed to adapt to new imaging methods. They also present a unique opportunity for co-development of both technologies that can better improve the validity of results and patient outcomes. In this review, we explore the ways in which deep learning can be best integrated into breast cancer screening workflows using DBT. We first explain the principles behind DBT itself and why it has become the gold standard in breast screening. We then survey the foundations of deep learning methods in diagnostic imaging, and review the current state of research into AI-based DBT interpretation. Finally, we present some of the limitations of integrating AI into clinical practice and the opportunities these present in this burgeoning field.


Assuntos
Neoplasias da Mama , Aprendizado Profundo , Mama/diagnóstico por imagem , Neoplasias da Mama/diagnóstico por imagem , Detecção Precoce de Câncer , Feminino , Humanos , Mamografia
7.
BMC Bioinformatics ; 21(Suppl 1): 192, 2020 Dec 09.
Artigo em Inglês | MEDLINE | ID: mdl-33297952

RESUMO

BACKGROUND: Automatic segmentation and localization of lesions in mammogram (MG) images are challenging even with employing advanced methods such as deep learning (DL) methods. We developed a new model based on the architecture of the semantic segmentation U-Net model to precisely segment mass lesions in MG images. The proposed end-to-end convolutional neural network (CNN) based model extracts contextual information by combining low-level and high-level features. We trained the proposed model using huge publicly available databases, (CBIS-DDSM, BCDR-01, and INbreast), and a private database from the University of Connecticut Health Center (UCHC). RESULTS: We compared the performance of the proposed model with those of the state-of-the-art DL models including the fully convolutional network (FCN), SegNet, Dilated-Net, original U-Net, and Faster R-CNN models and the conventional region growing (RG) method. The proposed Vanilla U-Net model outperforms the Faster R-CNN model significantly in terms of the runtime and the Intersection over Union metric (IOU). Training with digitized film-based and fully digitized MG images, the proposed Vanilla U-Net model achieves a mean test accuracy of 92.6%. The proposed model achieves a mean Dice coefficient index (DI) of 0.951 and a mean IOU of 0.909 that show how close the output segments are to the corresponding lesions in the ground truth maps. Data augmentation has been very effective in our experiments resulting in an increase in the mean DI and the mean IOU from 0.922 to 0.951 and 0.856 to 0.909, respectively. CONCLUSIONS: The proposed Vanilla U-Net based model can be used for precise segmentation of masses in MG images. This is because the segmentation process incorporates more multi-scale spatial context, and captures more local and global context to predict a precise pixel-wise segmentation map of an input full MG image. These detected maps can help radiologists in differentiating benign and malignant lesions depend on the lesion shapes. We show that using transfer learning, introducing augmentation, and modifying the architecture of the original model results in better performance in terms of the mean accuracy, the mean DI, and the mean IOU in detecting mass lesion compared to the other DL and the conventional models.


Assuntos
Processamento de Imagem Assistida por Computador/métodos , Mamografia , Redes Neurais de Computação , Automação , Bases de Dados Factuais , Humanos
8.
BMC Cancer ; 20(1): 197, 2020 Mar 12.
Artigo em Inglês | MEDLINE | ID: mdl-32164626

RESUMO

BACKGROUND: BRCA1/2 germline mutation related cancers are candidates for new immune therapeutic interventions. This study was a hypothesis generating exploration of genomic data collected at diagnosis for 19 patients. The prominent tumor mutation burden (TMB) in hereditary breast and ovarian cancers in this cohort was not correlated with high global immune activity in their microenvironments. More information is needed about the relationship between genomic instability, phenotypes and immune microenvironments of these hereditary tumors in order to find appropriate markers of immune activity and the most effective anticancer immune strategies. METHODS: Mining and statistical analyses of the original DNA and RNA sequencing data and The Cancer Genome Atlas data were performed. To interpret the data, we have used published literature and web available resources such as Gene Ontology, The Cancer immunome Atlas and the Cancer Research Institute iAtlas. RESULTS: We found that BRCA1/2 germline related breast and ovarian cancers do not represent a unique phenotypic identity, but they express a range of phenotypes similar to sporadic cancers. All breast and ovarian BRCA1/2 related tumors are characterized by high homologous recombination deficiency (HRD) and low aneuploidy. Interestingly, all sporadic high grade serous ovarian cancers (HGSOC) and most of the subtypes of triple negative breast cancers (TNBC) also express a high degree of HRD. CONCLUSIONS: TMB is not associated with the magnitude of the immune response in hereditary BRCA1/2 related breast and ovarian cancers or in sporadic TNBC and sporadic HGSOC. Hereditary tumors express phenotypes as heterogenous as sporadic tumors with various degree of "BRCAness" and various characteristics of the immune microenvironments. The subtyping criteria developed for sporadic tumors can be applied for the classification of hereditary tumors and possibly also characterization of their immune microenvironment. A high HRD score may be a good candidate biomarker for response to platinum, and potentially PARP-inhibition. TRIAL REGISTRATION: Phase I Study of the Oral PI3kinase Inhibitor BKM120 or BYL719 and the Oral PARP Inhibitor Olaparib in Patients With Recurrent TNBC or HGSOC (NCT01623349), first posted on June 20, 2012. The design and the outcome of the clinical trial is not in the scope of this study.


Assuntos
Proteína BRCA1/genética , Proteína BRCA2/genética , Cistadenocarcinoma Seroso/genética , Perfilação da Expressão Gênica/métodos , Síndrome Hereditária de Câncer de Mama e Ovário/genética , Neoplasias Ovarianas/genética , Neoplasias de Mama Triplo Negativas/genética , Cistadenocarcinoma Seroso/patologia , Mineração de Dados , Feminino , Instabilidade Genômica , Mutação em Linhagem Germinativa , Síndrome Hereditária de Câncer de Mama e Ovário/patologia , Recombinação Homóloga , Humanos , Neoplasias Ovarianas/patologia , Análise de Sequência de RNA , Neoplasias de Mama Triplo Negativas/patologia , Microambiente Tumoral , Sequenciamento Completo do Genoma
9.
Artigo em Inglês | MEDLINE | ID: mdl-30222580

RESUMO

Copy number variation (CNV) is a type of genomic/genetic variation that plays an important role in phenotypic diversity, evolution, and disease susceptibility. Next generation sequencing (NGS) technologies have created an opportunity for more accurate detection of CNVs with higher resolution. However, efficient and precise detection of CNVs remains challenging due to high levels of noise and biases, data heterogeneity, and the "big data" nature of NGS data. Sequence coverage (readcount) data are mostly used for detecting CNVs, specially for whole exome sequencing data. Readcount data are contaminated with several types of biases and noise that hinder accurate detection of CNVs. In this work, we introduce a novel preprocessing pipeline for reducing noise and biases to improve the detection accuracy of CNVs in heterogeneous NGS data, such as cancer whole exome sequencing data. We have employed several normalization methods to reduce readcount's biases that are due to GC content of reads, read alignment problems, and sample impurity. We have also developed a novel efficient and effective smoothing approach based on Taut String to reduce noise and increase CNV detection power. Using simulated and real data we showed that employing the proposed preprocessing pipeline significantly improves the accuracy of CNV detection.


Assuntos
Variações do Número de Cópias de DNA/genética , Sequenciamento do Exoma/métodos , Genômica/métodos , Genoma Humano/genética , Humanos , Neoplasias/genética , Processamento de Sinais Assistido por Computador
10.
J Cancer Res Clin Oncol ; 146(2): 503-514, 2020 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-31745703

RESUMO

PURPOSE: Fusion genes can be therapeutically relevant if they result in constitutive activation of oncogenes or repression of tumor suppressors. However, the prevalence and role of fusion genes in female cancers remain largely unexplored. Here, we investigate the fusion gene landscape in triple-negative breast cancer (TNBC) and high-grade serous ovarian cancer (HGSOC), two subtypes of female cancers with high molecular similarity but limited treatment options at present. METHODS: RNA-seq was utilized to identify fusion genes in a cohort of 18 TNBC and HGSOC patients treated with the PI3K inhibitor buparlisib and the PARP inhibitor olaparib in a phase I clinical trial (NCT01623349). Differential gene expression analysis was performed to assess the function of fusion genes in silico. Finally, these findings were correlated with the reported clinical outcomes. RESULTS: A total of 156 fusion genes was detected, whereof 44/156 (28%) events occurred in more than one patient. Low recurrence across samples indicated that the majority of fusion genes were private passenger events. The long non-coding RNA MALAT1 was involved in 97/156 (62%) fusion genes, followed in prevalence by MUC16, FOXP1, WWOX and XIST. Gene expression of FOXP1 was significantly elevated in patients with vs. without FOXP1 fusion (P= 0.02). From a clinical perspective, FOXP1 fusions were associated with a favorable overall survival. CONCLUSIONS: In summary, this study provides the first characterization of fusion genes in a cohort of TNBC and HGSOC patients. An improved mechanistic understanding of fusion genes will support the future identification of innovative therapeutic approaches for these challenging diseases.


Assuntos
Cistadenocarcinoma Seroso/genética , Fusão Gênica , Neoplasias Ovarianas/genética , Neoplasias de Mama Triplo Negativas/genética , Adulto , Idoso , Aminopiridinas/administração & dosagem , Aminopiridinas/efeitos adversos , Protocolos de Quimioterapia Combinada Antineoplásica/efeitos adversos , Ensaios Clínicos Fase I como Assunto , Cistadenocarcinoma Seroso/tratamento farmacológico , Feminino , Fatores de Transcrição Forkhead/genética , Perfilação da Expressão Gênica , Humanos , Pessoa de Meia-Idade , Morfolinas/administração & dosagem , Morfolinas/efeitos adversos , Neoplasias Ovarianas/tratamento farmacológico , Ftalazinas/administração & dosagem , Ftalazinas/efeitos adversos , Piperazinas/administração & dosagem , Piperazinas/efeitos adversos , RNA-Seq/métodos , Proteínas Repressoras/genética , Neoplasias de Mama Triplo Negativas/tratamento farmacológico
11.
Mol Cancer Res ; 17(12): 2492-2507, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31537618

RESUMO

The major obstacle in successfully treating triple-negative breast cancer (TNBC) is resistance to cytotoxic chemotherapy, the mainstay of treatment in this disease. Previous preclinical models of chemoresistance in TNBC have suffered from a lack of clinical relevance. Using a single high dose chemotherapy treatment, we developed a novel MDA-MB-436 cell-based model of chemoresistance characterized by a unique and complex morphologic phenotype, which consists of polyploid giant cancer cells giving rise to neuron-like mononuclear daughter cells filled with smaller but functional mitochondria and numerous lipid droplets. This resistant phenotype is associated with metabolic reprogramming with a shift to a greater dependence on fatty acids and oxidative phosphorylation. We validated both the molecular and histologic features of this model in a clinical cohort of primary chemoresistant TNBCs and identified several metabolic vulnerabilities including a dependence on PLIN4, a perilipin coating the observed lipid droplets, expressed both in the TNBC-resistant cells and clinical chemoresistant tumors treated with neoadjuvant doxorubicin-based chemotherapy. These findings thus reveal a novel mechanism of chemotherapy resistance that has therapeutic implications in the treatment of drug-resistant cancer. IMPLICATIONS: These findings underlie the importance of a novel morphologic-metabolic phenotype associated with chemotherapy resistance in TNBC, and bring to light novel therapeutic targets resulting from vulnerabilities in this phenotype, including the expression of PLIN4 essential for stabilizing lipid droplets in resistant cells.


Assuntos
Reprogramação Celular/efeitos dos fármacos , Regulação Neoplásica da Expressão Gênica/efeitos dos fármacos , Perilipina-4/genética , Neoplasias de Mama Triplo Negativas/tratamento farmacológico , Antineoplásicos/farmacologia , Apoptose/efeitos dos fármacos , Linhagem Celular Tumoral , Proliferação de Células/efeitos dos fármacos , Reprogramação Celular/genética , Doxorrubicina/farmacologia , Resistencia a Medicamentos Antineoplásicos/genética , Feminino , Regulação Neoplásica da Expressão Gênica/genética , Humanos , Gotículas Lipídicas/efeitos dos fármacos , Redes e Vias Metabólicas/efeitos dos fármacos , Neoplasias de Mama Triplo Negativas/genética , Neoplasias de Mama Triplo Negativas/patologia
12.
BMC Bioinformatics ; 20(Suppl 11): 281, 2019 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-31167642

RESUMO

BACKGROUND: The limitations of traditional computer-aided detection (CAD) systems for mammography, the extreme importance of early detection of breast cancer and the high impact of the false diagnosis of patients drive researchers to investigate deep learning (DL) methods for mammograms (MGs). Recent breakthroughs in DL, in particular, convolutional neural networks (CNNs) have achieved remarkable advances in the medical fields. Specifically, CNNs are used in mammography for lesion localization and detection, risk assessment, image retrieval, and classification tasks. CNNs also help radiologists providing more accurate diagnosis by delivering precise quantitative analysis of suspicious lesions. RESULTS: In this survey, we conducted a detailed review of the strengths, limitations, and performance of the most recent CNNs applications in analyzing MG images. It summarizes 83 research studies for applying CNNs on various tasks in mammography. It focuses on finding the best practices used in these research studies to improve the diagnosis accuracy. This survey also provides a deep insight into the architecture of CNNs used for various tasks. Furthermore, it describes the most common publicly available MG repositories and highlights their main features and strengths. CONCLUSIONS: The mammography research community can utilize this survey as a basis for their current and future studies. The given comparison among common publicly available MG repositories guides the community to select the most appropriate database for their application(s). Moreover, this survey lists the best practices that improve the performance of CNNs including the pre-processing of images and the use of multi-view images. In addition, other listed techniques like transfer learning (TL), data augmentation, batch normalization, and dropout are appealing solutions to reduce overfitting and increase the generalization of the CNN models. Finally, this survey identifies the research challenges and directions that require further investigations by the community.


Assuntos
Aprendizado Profundo , Mamografia/métodos , Redes Neurais de Computação , Neoplasias da Mama/diagnóstico por imagem , Neoplasias da Mama/patologia , Bases de Dados Factuais , Feminino , Humanos , Processamento de Imagem Assistida por Computador , Publicações , Inquéritos e Questionários
13.
BMC Bioinformatics ; 19(Suppl 11): 361, 2018 Oct 22.
Artigo em Inglês | MEDLINE | ID: mdl-30343665

RESUMO

BACKGROUND: Due to recent advances in sequencing technologies, sequence-based analysis has been widely applied to detecting copy number variations (CNVs). There are several techniques for identifying CNVs using next generation sequencing (NGS) data, however methods employing depth of coverage or read depth (RD) have recently become a main technique to identify CNVs. The main assumption of the RD-based CNV detection methods is that the readcount value at a specific genomic location is correlated with the copy number at that location. However, readcount data's noise and biases distort the association between the readcounts and copy numbers. For more accurate CNV identification, these biases and noise need to be mitigated. In this work, to detect CNVs more precisely and efficiently we propose a novel denoising method based on the total variation approach and the Taut String algorithm. RESULTS: To investigate the performance of the proposed denoising method, we computed sensitivities, false discovery rates and specificities of CNV detection when employing denoising, using both simulated and real data. We also compared the performance of the proposed denoising method, Taut String, with that of the commonly used approaches such as moving average (MA) and discrete wavelet transforms (DWT) in terms of sensitivity of detecting true CNVs and time complexity. The results show that Taut String works better than DWT and MA and has a better power to identify very narrow CNVs. The ability of Taut String denoising in preserving CNV segments' breakpoints and narrow CNVs increases the detection accuracy of segmentation algorithms, resulting in higher sensitivities and lower false discovery rates. CONCLUSIONS: In this study, we proposed a new denoising method for sequence-based CNV detection based on a signal processing technique. Existing CNV detection algorithms identify many false CNV segments and fail in detecting short CNV segments due to noise and biases. Employing an effective and efficient denoising method can significantly enhance the detection accuracy of the CNV segmentation algorithms. Advanced denoising methods from the signal processing field can be employed to implement such algorithms. We showed that non-linear denoising methods that consider sparsity and piecewise constant characteristics of CNV data result in better performance in CNV detection.


Assuntos
Variações do Número de Cópias de DNA/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Algoritmos , Neoplasias da Mama/genética , Simulação por Computador , Feminino , Genômica , Humanos , Processamento de Sinais Assistido por Computador , Fatores de Tempo , Análise de Ondaletas
14.
Cancer Causes Control ; 29(3): 305-314, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-29427260

RESUMO

PURPOSE: The purpose of the study was to assess the feasibility of quantifying long-term trends in breast tumor DNA copy number variation (CNV) profiles. METHODS: We evaluated CNV profiles in formalin-fixed paraffin-embedded (FFPE) tumor specimens from 30 randomly selected Kaiser Permanente Northern California health plan women members diagnosed with breast cancer from 1950 to 2010. Assays were conducted for five cases per decade who had available tumor blocks and pathology reports. RESULTS: As compared to the tumors from the 1970s to 2000s, the older tumors dating back to the 1950s and 1960s were much more likely to (1) fail quality control, and (2) have fewer CNV events (average 23 and 31 vs. 58 to 69), fewer CNV genes (average 5.1 and 3.7k vs. 8.1 to 10.3k), shorter CNV length (average 2,440 and 3,300k vs. 5,740 to 9,280k), fewer high frequency Del genes (37 and 25% vs. 54 to 76%), and fewer high frequency high_Amp genes (20% vs. 56 to 73%). On average, assay interpretation took an extra 60 min/specimen for cases from the 1960s versus 20 min/specimen for the most recent tumors. CONCLUSIONS: Assays conducted in the mid-2010s for CNVs may be feasible for FFPE tumor specimens dating back to the 1980s, but less feasible for older specimens.


Assuntos
Neoplasias da Mama/genética , Variações do Número de Cópias de DNA , DNA , Manejo de Espécimes , Feminino , Formaldeído , Humanos , Inclusão em Parafina , Fatores de Tempo , Fixação de Tecidos
15.
BMC Bioinformatics ; 18(1): 286, 2017 May 31.
Artigo em Inglês | MEDLINE | ID: mdl-28569140

RESUMO

BACKGROUND: Recently copy number variation (CNV) has gained considerable interest as a type of genomic/genetic variation that plays an important role in disease susceptibility. Advances in sequencing technology have created an opportunity for detecting CNVs more accurately. Recently whole exome sequencing (WES) has become primary strategy for sequencing patient samples and study their genomics aberrations. However, compared to whole genome sequencing, WES introduces more biases and noise that make CNV detection very challenging. Additionally, tumors' complexity makes the detection of cancer specific CNVs even more difficult. Although many CNV detection tools have been developed since introducing NGS data, there are few tools for somatic CNV detection for WES data in cancer. RESULTS: In this study, we evaluated the performance of the most recent and commonly used CNV detection tools for WES data in cancer to address their limitations and provide guidelines for developing new ones. We focused on the tools that have been designed or have the ability to detect cancer somatic aberrations. We compared the performance of the tools in terms of sensitivity and false discovery rate (FDR) using real data and simulated data. Comparative analysis of the results of the tools showed that there is a low consensus among the tools in calling CNVs. Using real data, tools show moderate sensitivity (~50% - ~80%), fair specificity (~70% - ~94%) and poor FDRs (~27% - ~60%). Also, using simulated data we observed that increasing the coverage more than 10× in exonic regions does not improve the detection power of the tools significantly. CONCLUSIONS: The limited performance of the current CNV detection tools for WES data in cancer indicates the need for developing more efficient and precise CNV detection methods. Due to the complexity of tumors and high level of noise and biases in WES data, employing advanced novel segmentation, normalization and de-noising techniques that are designed specifically for cancer data is necessary. Also, CNV detection development suffers from the lack of a gold standard for performance evaluation. Finally, developing tools with user-friendly user interfaces and visualization features can enhance CNV studies for a broader range of users.


Assuntos
Variações do Número de Cópias de DNA , Exoma , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Neoplasias/genética , Software , Algoritmos , Feminino , Genoma Humano , Humanos , Análise de Sequência de DNA/métodos
16.
J Cancer Res Clin Oncol ; 143(7): 1255-1262, 2017 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-28247034

RESUMO

PURPOSE: Resistance to trastuzumab therapy is linked to phosphoinositol 3-kinase (PI3K) pathway activation. One key downstream effector and regulator of this pathway is the mechanistic target of rapamycin (mTOR). In 2011, a phase I/II study evaluated the combination of trastuzumab and everolimus (a mTOR inhibitor) for treatment of Her2-positive metastatic breast cancer (MBC) for patients who had progressed on trastuzumab-based therapy. METHODS: We retrospectively analyzed GeneChip microarray data from 22 of 47 patients included in the study. RESULTS: Using an unbiased approach, we found that mutations in BRAF, EGFR and KIT are significantly more common in this heavily treated population when compared with the cohort of invasive breast carcinoma patients in The Cancer Genome Atlas (TCGA). Furthermore, 10 out of 22 patients had PIK3CA mutations (45.4%) but PI3KCA status was not predictive of PFS in our cohort. Finally, the use of OncoScantm has allowed us to detected mutations in five genes that have not been shown to be mutated in TCGA subset of Her-2 overexpressing breast cancer: CTNNB1, HRAS, KRAS, NF2 and SMARCB1. CONCLUSION: Mutational burden in heavily treated trastuzumab-resistant Her2-positive metastatic breast cancer is highly variable and not directly correlated with outcome. Activation of the MAPK/ERK pathway through mutations in EGFR, BRAF or KIT may mediate resistance to trastuzumab.


Assuntos
Neoplasias da Mama/genética , Análise Mutacional de DNA , Resistencia a Medicamentos Antineoplásicos/genética , Antineoplásicos/uso terapêutico , Neoplasias da Mama/mortalidade , Intervalo Livre de Doença , Feminino , Humanos , Análise de Sequência com Séries de Oligonucleotídeos , Receptor ErbB-2/genética , Estudos Retrospectivos , Trastuzumab/uso terapêutico
17.
BMC Genomics ; 17(1): 638, 2016 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-27526849

RESUMO

BACKGROUND: With advances in technologies, huge amounts of multiple types of high-throughput genomics data are available. These data have tremendous potential to identify new and clinically valuable biomarkers to guide the diagnosis, assessment of prognosis, and treatment of complex diseases, such as cancer. Integrating, analyzing, and interpreting big and noisy genomics data to obtain biologically meaningful results, however, remains highly challenging. Mining genomics datasets by utilizing advanced computational methods can help to address these issues. RESULTS: To facilitate the identification of a short list of biologically meaningful genes as candidate drivers of anti-cancer drug resistance from an enormous amount of heterogeneous data, we employed statistical machine-learning techniques and integrated genomics datasets. We developed a computational method that integrates gene expression, somatic mutation, and copy number aberration data of sensitive and resistant tumors. In this method, an integrative method based on module network analysis is applied to identify potential driver genes. This is followed by cross-validation and a comparison of the results of sensitive and resistance groups to obtain the final list of candidate biomarkers. We applied this method to the ovarian cancer data from the cancer genome atlas. The final result contains biologically relevant genes, such as COL11A1, which has been reported as a cis-platinum resistant biomarker for epithelial ovarian carcinoma in several recent studies. CONCLUSIONS: The described method yields a short list of aberrant genes that also control the expression of their co-regulated genes. The results suggest that the unbiased data driven computational method can identify biologically relevant candidate biomarkers. It can be utilized in a wide range of applications that compare two conditions with highly heterogeneous datasets.


Assuntos
Antineoplásicos/uso terapêutico , Mineração de Dados , Neoplasias Ovarianas/tratamento farmacológico , Biomarcadores Tumorais/genética , Biomarcadores Tumorais/metabolismo , Cisplatino/uso terapêutico , Análise por Conglomerados , Colágeno Tipo XI/genética , Colágeno Tipo XI/metabolismo , Variações do Número de Cópias de DNA , Bases de Dados Genéticas , Resistencia a Medicamentos Antineoplásicos , Feminino , Regulação Neoplásica da Expressão Gênica , Genômica , Humanos , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/patologia
18.
EBioMedicine ; 9: 148-160, 2016 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-27333046

RESUMO

Whether the human tumor virus, Epstein-Barr Virus (EBV), promotes breast cancer remains controversial and a potential mechanism has remained elusive. Here we show that EBV can infect primary mammary epithelial cells (MECs) that express the receptor CD21. EBV infection leads to the expansion of early MEC progenitor cells with a stem cell phenotype, activates MET signaling and enforces a differentiation block. When MECs were implanted as xenografts, EBV infection cooperated with activated Ras and accelerated the formation of breast cancer. Infection in EBV-related tumors was of a latency type II pattern, similar to nasopharyngeal carcinoma (NPC). A human gene expression signature for MECs infected with EBV, termed EBVness, was associated with high grade, estrogen-receptor-negative status, p53 mutation and poor survival. In 11/33 EBVness-positive tumors, EBV-DNA was detected by fluorescent in situ hybridization for the viral LMP1 and BXLF2 genes. In an analysis of the TCGA breast cancer data EBVness correlated with the presence of the APOBEC mutational signature. We conclude that a contribution of EBV to breast cancer etiology is plausible, through a mechanism in which EBV infection predisposes mammary epithelial cells to malignant transformation, but is no longer required once malignant transformation has occurred.


Assuntos
Transformação Celular Neoplásica , Herpesvirus Humano 4/patogenicidade , Neoplasias/patologia , Animais , Técnicas de Cultura de Células , Diferenciação Celular , Células Cultivadas , Análise por Conglomerados , DNA Viral/genética , DNA Viral/metabolismo , Intervalo Livre de Doença , Células Epiteliais/citologia , Células Epiteliais/transplante , Células Epiteliais/virologia , Transição Epitelial-Mesenquimal , Herpesvirus Humano 4/genética , Herpesvirus Humano 4/metabolismo , Humanos , Immunoblotting , Imuno-Histoquímica , Hibridização In Situ , Camundongos , Camundongos Endogâmicos NOD , Camundongos SCID , Neoplasias/metabolismo , Neoplasias/mortalidade , Interferência de RNA , RNA Interferente Pequeno/metabolismo , Reação em Cadeia da Polimerase em Tempo Real , Receptores de Complemento 3d/metabolismo , Fator de Transcrição STAT3/metabolismo , Transdução de Sinais , Taxa de Sobrevida , Transcriptoma , Transplante Heterólogo , Proteína Supressora de Tumor p53/genética , Proteína Supressora de Tumor p53/metabolismo , Proteínas da Matriz Viral/antagonistas & inibidores , Proteínas da Matriz Viral/genética , Proteínas da Matriz Viral/metabolismo
19.
Bioinformatics ; 32(4): 533-41, 2016 Feb 15.
Artigo em Inglês | MEDLINE | ID: mdl-26515818

RESUMO

MOTIVATION: A major goal of biomedical research is to identify molecular features associated with a biological or clinical class of interest. Differential expression analysis has long been used for this purpose; however, conventional methods perform poorly when applied to data with high within class heterogeneity. RESULTS: To address this challenge, we developed EMDomics, a new method that uses the Earth mover's distance to measure the overall difference between the distributions of a gene's expression in two classes of samples and uses permutations to obtain q-values for each gene. We applied EMDomics to the challenging problem of identifying genes associated with drug resistance in ovarian cancer. We also used simulated data to evaluate the performance of EMDomics, in terms of sensitivity and specificity for identifying differentially expressed gene in classes with high within class heterogeneity. In both the simulated and real biological data, EMDomics outperformed competing approaches for the identification of differentially expressed genes, and EMDomics was significantly more powerful than conventional methods for the identification of drug resistance-associated gene sets. EMDomics represents a new approach for the identification of genes differentially expressed between heterogeneous classes and has utility in a wide range of complex biomedical conditions in which sample classes show within class heterogeneity. AVAILABILITY AND IMPLEMENTATION: The R package is available at http://www.bioconductor.org/packages/release/bioc/html/EMDomics.html.


Assuntos
Biomarcadores Tumorais/genética , Resistencia a Medicamentos Antineoplásicos/genética , Perfilação da Expressão Gênica/métodos , Neoplasias Ovarianas/genética , Software , Antineoplásicos/farmacologia , Bases de Dados Factuais , Feminino , Regulação Neoplásica da Expressão Gênica/efeitos dos fármacos , Redes Reguladoras de Genes , Humanos , Neoplasias Ovarianas/tratamento farmacológico , Sensibilidade e Especificidade
20.
BMC Genomics ; 15: 579, 2014 Jul 09.
Artigo em Inglês | MEDLINE | ID: mdl-25011954

RESUMO

BACKGROUND: Chromosomal breakage followed by faulty DNA repair leads to gene amplifications and deletions in cancers. However, the mere assessment of the extent of genomic changes, amplifications and deletions may reduce the complexity of genomic data observed by array comparative genomic hybridization (array CGH). We present here a novel approach to array CGH data analysis, which focuses on putative breakpoints responsible for rearrangements within the genome. RESULTS: We performed array comparative genomic hybridization in 29 primary tumors from high risk patients with breast cancer. The specimens were flow sorted according to ploidy to increase tumor cell purity prior to array CGH. We describe the number of chromosomal breaks as well as the patterns of breaks on individual chromosomes in each tumor. There were differences in chromosomal breakage patterns between the 3 clinical subtypes of breast cancers, although the highest density of breaks occurred at chromosome 17 in all subtypes, suggesting a particular proclivity of this chromosome for breaks. We also observed chromothripsis affecting various chromosomes in 41% of high risk breast cancers. CONCLUSIONS: Our results provide a new insight into the genomic complexity of breast cancer. Genomic instability dependent on chromosomal breakage events is not stochastic, targeting some chromosomes clearly more than others. We report a much higher percentage of chromothripsis than described previously in other cancers and this suggests that massive genomic rearrangements occurring in a single catastrophic event may shape many breast cancer genomes.


Assuntos
Neoplasias da Mama/genética , Quebra Cromossômica , Instabilidade Genômica/genética , Neoplasias da Mama/patologia , Cromossomos Humanos/genética , Hibridização Genômica Comparativa , Predisposição Genética para Doença/genética , Genômica , Humanos , Pessoa de Meia-Idade , Gradação de Tumores
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA