Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
Bioinformatics ; 34(8): 1329-1335, 2018 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-29186294

RESUMO

Motivation: With the development of high-throughput techniques, RNA-sequencing (RNA-seq) is becoming increasingly popular as an alternative for gene expression analysis, such as RNAs profiling and classification. Identifying which type of diseases a new patient belongs to with RNA-seq data has been recognized as a vital problem in medical research. As RNA-seq data are discrete, statistical methods developed for classifying microarray data cannot be readily applied for RNA-seq data classification. Witten proposed a Poisson linear discriminant analysis (PLDA) to classify the RNA-seq data in 2011. Note, however, that the count datasets are frequently characterized by excess zeros in real RNA-seq or microRNA sequence data (i.e. when the sequence depth is not enough or small RNAs with the length of 18-30 nucleotides). Therefore, it is desired to develop a new model to analyze RNA-seq data with an excess of zeros. Results: In this paper, we propose a Zero-Inflated Poisson Logistic Discriminant Analysis (ZIPLDA) for RNA-seq data with an excess of zeros. The new method assumes that the data are from a mixture of two distributions: one is a point mass at zero, and the other follows a Poisson distribution. We then consider a logistic relation between the probability of observing zeros and the mean of the genes and the sequencing depth in the model. Simulation studies show that the proposed method performs better than, or at least as well as, the existing methods in a wide range of settings. Two real datasets including a breast cancer RNA-seq dataset and a microRNA-seq dataset are also analyzed, and they coincide with the simulation results that our proposed method outperforms the existing competitors. Availability and implementation: The software is available at http://www.math.hkbu.edu.hk/∼tongt. Contact: xwan@comp.hkbu.edu.hk or tongt@hkbu.edu.hk. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de RNA/métodos , Software , Neoplasias da Mama/genética , Análise Discriminante , Feminino , Humanos , MicroRNAs
2.
J Comput Biol ; 24(11): 1099-1111, 2017 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-28414553

RESUMO

High-throughput techniques bring novel tools and also statistical challenges to genomic research. Identification of which type of diseases a new patient belongs to has been recognized as an important problem. For high-dimensional small sample size data, the classical discriminant methods suffer from the singularity problem and are, therefore, no longer applicable in practice. In this article, we propose a geometric diagonalization method for the regularized discriminant analysis. We then consider a bias correction to further improve the proposed method. Simulation studies show that the proposed method performs better than, or at least as well as, the existing methods in a wide range of settings. A microarray dataset and an RNA-seq dataset are also analyzed and they demonstrate the superiority of the proposed method over the existing competitors, especially when the number of samples is small or the number of genes is large. Finally, we have developed an R package called "GDRDA" which is available upon request.


Assuntos
Algoritmos , Biomarcadores Tumorais/genética , Neoplasias da Mama/genética , Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Simulação por Computador , Análise Discriminante , Feminino , Humanos
3.
Cancer Immunol Immunother ; 66(6): 717-729, 2017 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-28246881

RESUMO

Non-Hodgkin lymphoma (NHL) is an incurable lymphoproliferative cancer, and patients with NHL have a poor prognosis. The present study explored the regulatory mechanism of expression and possible roles of the immunosuppressive B7-H4 molecule in human NHL. For functional studies, NHL-reactive T cell lines were generated via the isolation of allogeneic CD3+ T cells from healthy donors and repeated in vitro stimulation with irradiated NHL cells isolated from patients. B7-H4 was found to be distributed in NHL cells and tissues, and its surface protein expression levels were further upregulated by the incubation of NHL cells with interleukin (IL)-6, IL-10, or interferon-γ. Additionally, the supernatants of tumor-associated macrophages (tMφs) upregulated B7-H4 surface expression by producing IL-6 and IL-10. B7-H4 expressed in NHL cells inhibited the cytotoxic activity of NHL-reactive T cells. Conversely, the inhibition of B7-H4 in NHL cells promoted T cell immunity and sensitized NHL cells to cytolysis. Furthermore, tMφs induced B7-H4 promoted NHL cell evasion of the T cell immune response. In conclusion, this study shows that NHL-expressed B7-H4 is an important immunosuppressive factor that inhibits host anti-tumor immunity to NHL. Targeting tumor-expressed B7-H4 may thus provide a new treatment strategy for NHL patients.


Assuntos
Interleucina-10/metabolismo , Interleucina-6/metabolismo , Linfoma não Hodgkin/imunologia , Linfoma não Hodgkin/metabolismo , Macrófagos/imunologia , Linfócitos T Reguladores/imunologia , Evasão Tumoral , Inibidor 1 da Ativação de Células T com Domínio V-Set/metabolismo , Comunicação Celular/imunologia , Humanos , Linfoma não Hodgkin/patologia , Células Tumorais Cultivadas
4.
BioData Min ; 7: 15, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25285156

RESUMO

BACKGROUND: Next generation sequencing technologies are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key to analyzing massive and complex sequencing data. In order to derive gene expression measures and compare these measures across samples or libraries, we first need to normalize read counts to adjust for varying sample sequencing depths and other potentially technical effects. RESULTS: In this paper, we develop a normalization method based on iterating median of M-values (IMM) for detecting the differentially expressed (DE) genes. Compared to a previous approach TMM, the IMM method improves the accuracy of DE detection. Simulation studies show that the IMM method outperforms other methods for the sample normalization. We also look into the real data and find that the genes detected by IMM but not by TMM are much more accurate than the genes detected by TMM but not by IMM. What's more, we discovered that gene UNC5C is highly associated with kidney cancer and so on.

5.
BMC Genomics ; 15: 868, 2014 Oct 06.
Artigo em Inglês | MEDLINE | ID: mdl-25286960

RESUMO

BACKGROUND: Aberrant DNA methylation is a hallmark of many cancers. Classically there are two types of endometrial cancer, endometrioid adenocarcinoma (EAC), or Type I, and uterine papillary serous carcinoma (UPSC), or Type II. However, the whole genome DNA methylation changes in these two classical types of endometrial cancer is still unknown. RESULTS: Here we described complete genome-wide DNA methylome maps of EAC, UPSC, and normal endometrium by applying a combined strategy of methylated DNA immunoprecipitation sequencing (MeDIP-seq) and methylation-sensitive restriction enzyme digestion sequencing (MRE-seq). We discovered distinct genome-wide DNA methylation patterns in EAC and UPSC: 27,009 and 15,676 recurrent differentially methylated regions (DMRs) were identified respectively, compared with normal endometrium. Over 80% of DMRs were in intergenic and intronic regions. The majority of these DMRs were not interrogated on the commonly used Infinium 450K array platform. Large-scale demethylation of chromosome X was detected in UPSC, accompanied by decreased XIST expression. Importantly, we discovered that the majority of the DMRs harbored promoter or enhancer functions and are specifically associated with genes related to uterine development and disease. Among these, abnormal methylation of transposable elements (TEs) may provide a novel mechanism to deregulate normal endometrium-specific enhancers derived from specific TEs. CONCLUSIONS: DNA methylation changes are an important signature of endometrial cancer and regulate gene expression by affecting not only proximal promoters but also distal enhancers.


Assuntos
Neoplasias do Endométrio/genética , Neoplasias do Endométrio/fisiopatologia , Elementos Facilitadores Genéticos/genética , Regiões Promotoras Genéticas/genética , Neoplasias Uterinas/genética , Neoplasias Uterinas/fisiopatologia , Proteínas Adaptadoras de Transdução de Sinal/genética , Família Aldeído Desidrogenase 1 , Carcinoma Papilar/genética , Carcinoma Papilar/metabolismo , Cromossomos Humanos X , Ilhas de CpG , DNA (Citosina-5-)-Metiltransferases/genética , DNA (Citosina-5-)-Metiltransferases/metabolismo , Metilação de DNA , Elementos de DNA Transponíveis/genética , Feminino , Humanos , Fator 4 Semelhante a Kruppel , Fatores de Transcrição Kruppel-Like/genética , Proteína 1 Homóloga a MutL , Proteínas Nucleares/genética , Polimorfismo de Nucleotídeo Único , RNA Longo não Codificante/genética , Retinal Desidrogenase/genética , Análise de Sequência de DNA
6.
BMC Bioinformatics ; 10: 146, 2009 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-19445669

RESUMO

BACKGROUND: Time-course microarray experiments produce vector gene expression profiles across a series of time points. Clustering genes based on these profiles is important in discovering functional related and co-regulated genes. Early developed clustering algorithms do not take advantage of the ordering in a time-course study, explicit use of which should allow more sensitive detection of genes that display a consistent pattern over time. Peddada et al. 1 proposed a clustering algorithm that can incorporate the temporal ordering using order-restricted statistical inference. This algorithm is, however, very time-consuming and hence inapplicable to most microarray experiments that contain a large number of genes. Its computational burden also imposes difficulty to assess the clustering reliability, which is a very important measure when clustering noisy microarray data. RESULTS: We propose a computationally efficient information criterion-based clustering algorithm, called ORICC, that also takes account of the ordering in time-course microarray experiments by embedding the order-restricted inference into a model selection framework. Genes are assigned to the profile which they best match determined by a newly proposed information criterion for order-restricted inference. In addition, we also developed a bootstrap procedure to assess ORICC's clustering reliability for every gene. Simulation studies show that the ORICC method is robust, always gives better clustering accuracy than Peddada's method and saves hundreds of times computational time. Under some scenarios, its accuracy is also better than some other existing clustering methods for short time-course microarray data, such as STEM 2 and Wang et al. 3. It is also computationally much faster than Wang et al. 3. CONCLUSION: Our ORICC algorithm, which takes advantage of the temporal ordering in time-course microarray experiments, provides good clustering accuracy and is meanwhile much faster than Peddada's method. Moreover, the clustering reliability for each gene can also be assessed, which is unavailable in Peddada's method. In a real data example, the ORICC algorithm identifies new and interesting genes that previous analyses failed to reveal.


Assuntos
Análise por Conglomerados , Perfilação da Expressão Gênica/métodos , Modelos Genéticos , Modelos Estatísticos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Algoritmos , Neoplasias da Mama , Simulação por Computador , Bases de Dados Factuais , Feminino , Genes , Humanos , Projetos de Pesquisa
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA