Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 73
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Cell ; 181(7): 1612-1625.e13, 2020 06 25.
Artigo em Inglês | MEDLINE | ID: mdl-32497499

RESUMO

Responses to anti-PD-1 immunotherapy occur but are infrequent in bladder cancer. The specific T cells that mediate tumor rejection are unknown. T cells from human bladder tumors and non-malignant tissue were assessed with single-cell RNA and paired T cell receptor (TCR) sequencing of 30,604 T cells from 7 patients. We find that the states and repertoires of CD8+ T cells are not distinct in tumors compared with non-malignant tissues. In contrast, single-cell analysis of CD4+ T cells demonstrates several tumor-specific states, including multiple distinct states of regulatory T cells. Surprisingly, we also find multiple cytotoxic CD4+ T cell states that are clonally expanded. These CD4+ T cells can kill autologous tumors in an MHC class II-dependent fashion and are suppressed by regulatory T cells. Further, a gene signature of cytotoxic CD4+ T cells in tumors predicts a clinical response in 244 metastatic bladder cancer patients treated with anti-PD-L1.


Assuntos
Linfócitos T CD4-Positivos/metabolismo , Neoplasias da Bexiga Urinária/genética , Neoplasias da Bexiga Urinária/metabolismo , Antígeno B7-H1/genética , Antígeno B7-H1/imunologia , Biomarcadores Farmacológicos/análise , Linfócitos T CD4-Positivos/imunologia , Linfócitos T CD8-Positivos/imunologia , Linfócitos T CD8-Positivos/metabolismo , Regulação Neoplásica da Expressão Gênica/genética , Genes MHC da Classe II , Humanos , Inibidores de Checkpoint Imunológico/farmacologia , Imunoterapia , Linfócitos do Interstício Tumoral , Receptor de Morte Celular Programada 1/genética , Receptores de Antígenos de Linfócitos T/genética , Análise de Célula Única/métodos , Linfócitos T Reguladores , Neoplasias da Bexiga Urinária/imunologia
2.
Nat Methods ; 19(4): 429-440, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35396482

RESUMO

Evaluating metagenomic software is key for optimizing metagenome interpretation and focus of the Initiative for the Critical Assessment of Metagenome Interpretation (CAMI). The CAMI II challenge engaged the community to assess methods on realistic and complex datasets with long- and short-read sequences, created computationally from around 1,700 new and known genomes, as well as 600 new plasmids and viruses. Here we analyze 5,002 results by 76 program versions. Substantial improvements were seen in assembly, some due to long-read data. Related strains still were challenging for assembly and genome recovery through binning, as was assembly quality for the latter. Profilers markedly matured, with taxon profilers and binners excelling at higher bacterial ranks, but underperforming for viruses and Archaea. Clinical pathogen detection results revealed a need to improve reproducibility. Runtime and memory usage analyses identified efficient programs, including top performers with other metrics. The results identify challenges and guide researchers in selecting methods for analyses.


Assuntos
Metagenoma , Metagenômica , Archaea/genética , Metagenômica/métodos , Reprodutibilidade dos Testes , Análise de Sequência de DNA , Software
3.
Brief Bioinform ; 24(4)2023 07 20.
Artigo em Inglês | MEDLINE | ID: mdl-37291798

RESUMO

The ability to identify and track T-cell receptor (TCR) sequences from patient samples is becoming central to the field of cancer research and immunotherapy. Tracking genetically engineered T cells expressing TCRs that target specific tumor antigens is important to determine the persistence of these cells and quantify tumor responses. The available high-throughput method to profile TCR repertoires is generally referred to as TCR sequencing (TCR-Seq). However, the available TCR-Seq data are limited compared with RNA sequencing (RNA-Seq). In this paper, we have benchmarked the ability of RNA-Seq-based methods to profile TCR repertoires by examining 19 bulk RNA-Seq samples across 4 cancer cohorts including both T-cell-rich and T-cell-poor tissue types. We have performed a comprehensive evaluation of the existing RNA-Seq-based repertoire profiling methods using targeted TCR-Seq as the gold standard. We also highlighted scenarios under which the RNA-Seq approach is suitable and can provide comparable accuracy to the TCR-Seq approach. Our results show that RNA-Seq-based methods are able to effectively capture the clonotypes and estimate the diversity of TCR repertoires, as well as provide relative frequencies of clonotypes in T-cell-rich tissues and low-diversity repertoires. However, RNA-Seq-based TCR profiling methods have limited power in T-cell-poor tissues, especially in highly diverse repertoires of T-cell-poor tissues. The results of our benchmarking provide an additional appealing argument to incorporate RNA-Seq into the immune repertoire screening of cancer patients as it offers broader knowledge into the transcriptomic changes that exceed the limited information provided by TCR-Seq.


Assuntos
Benchmarking , Neoplasias , Humanos , Receptores de Antígenos de Linfócitos T/genética , Linfócitos T , Neoplasias/genética , Análise de Sequência de RNA
5.
Brief Bioinform ; 23(4)2022 07 18.
Artigo em Inglês | MEDLINE | ID: mdl-35753701

RESUMO

Advances in whole-genome sequencing (WGS) promise to enable the accurate and comprehensive structural variant (SV) discovery. Dissecting SVs from WGS data presents a substantial number of challenges and a plethora of SV detection methods have been developed. Currently, evidence that investigators can use to select appropriate SV detection tools is lacking. In this article, we have evaluated the performance of SV detection tools on mouse and human WGS data using a comprehensive polymerase chain reaction-confirmed gold standard set of SVs and the genome-in-a-bottle variant set, respectively. In contrast to the previous benchmarking studies, our gold standard dataset included a complete set of SVs allowing us to report both precision and sensitivity rates of the SV detection methods. Our study investigates the ability of the methods to detect deletions, thus providing an optimistic estimate of SV detection performance as the SV detection methods that fail to detect deletions are likely to miss more complex SVs. We found that SV detection tools varied widely in their performance, with several methods providing a good balance between sensitivity and precision. Additionally, we have determined the SV callers best suited for low- and ultralow-pass sequencing data as well as for different deletion length categories.


Assuntos
Benchmarking , Genoma Humano , Animais , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Camundongos , Sequenciamento Completo do Genoma/métodos
6.
Brief Bioinform ; 22(6)2021 11 05.
Artigo em Inglês | MEDLINE | ID: mdl-34346485

RESUMO

Estimating cell type composition of blood and tissue samples is a biological challenge relevant in both laboratory studies and clinical care. In recent years, a number of computational tools have been developed to estimate cell type abundance using gene expression data. Although these tools use a variety of approaches, they all leverage expression profiles from purified cell types to evaluate the cell type composition within samples. In this study, we compare 12 cell type quantification tools and evaluate their performance while using each of 10 separate reference profiles. Specifically, we have run each tool on over 4000 samples with known cell type proportions, spanning both immune and stromal cell types. A total of 12 of these represent in vitro synthetic mixtures and 300 represent in silico synthetic mixtures prepared using single-cell data. A final 3728 clinical samples have been collected from the Framingham cohort, for which cell populations have been quantified using electrical impedance cell counting. When tools are applied to the Framingham dataset, the tool Estimating the Proportions of Immune and Cancer cells (EPIC) produces the highest correlation, whereas Gene Expression Deconvolution Interactive Tool (GEDIT) produces the lowest error. The best tool for other datasets is varied, but CIBERSORT and GEDIT most consistently produce accurate results. We find that optimal reference depends on the tool used, and report suggested references to be used with each tool. Most tools return results within minutes, but on large datasets runtimes for CIBERSORT can exceed hours or even days. We conclude that deconvolution methods are capable of returning high-quality results, but that proper reference selection is critical.


Assuntos
Transcriptoma , Algoritmos , Biologia Computacional/métodos , Simulação por Computador , Perfilação da Expressão Gênica/métodos , Humanos
7.
Crit Care ; 27(1): 486, 2023 12 08.
Artigo em Inglês | MEDLINE | ID: mdl-38066613

RESUMO

BACKGROUND: Sepsis is a highly heterogeneous syndrome, which has hindered the development of effective therapies. This has prompted investigators to develop a precision medicine approach aimed at identifying biologically homogenous subgroups of patients with septic shock and critical illnesses. Transcriptomic analysis can identify subclasses derived from differences in underlying pathophysiological processes that may provide the basis for new targeted therapies. The goal of this study was to elucidate pathophysiological pathways and identify pediatric septic shock subclasses based on whole blood RNA expression profiles. METHODS: The subjects were critically ill children with cardiopulmonary failure who were a part of a prospective randomized insulin titration trial to treat hyperglycemia. Genome-wide expression profiling was conducted using RNA sequencing from whole blood samples obtained from 46 children with septic shock and 52 mechanically ventilated noninfected controls without shock. Patients with septic shock were allocated to subclasses based on hierarchical clustering of gene expression profiles, and we then compared clinical characteristics, plasma inflammatory markers, cell compositions using GEDIT, and immune repertoires using Imrep between the two subclasses. RESULTS: Patients with septic shock depicted alterations in innate and adaptive immune pathways. Among patients with septic shock, we identified two subtypes based on gene expression patterns. Compared with Subclass 2, Subclass 1 was characterized by upregulation of innate immunity pathways and downregulation of adaptive immunity pathways. Subclass 1 had significantly worse clinical outcomes despite the two classes having similar illness severity on initial clinical presentation. Subclass 1 had elevated levels of plasma inflammatory cytokines and endothelial injury biomarkers and demonstrated decreased percentages of CD4 T cells and B cells and less diverse T cell receptor repertoires. CONCLUSIONS: Two subclasses of pediatric septic shock patients were discovered through genome-wide expression profiling based on whole blood RNA sequencing with major biological and clinical differences. Trial Registration This is a secondary analysis of data generated as part of the observational CAF-PINT ancillary of the HALF-PINT study (NCT01565941). Registered March 29, 2012.


Assuntos
Sepse , Choque Séptico , Criança , Humanos , Perfilação da Expressão Gênica , Estudos Prospectivos , Sepse/genética , Choque Séptico/terapia , Transcriptoma , Ensaios Clínicos Controlados Aleatórios como Assunto , Estudos Observacionais como Assunto
8.
Nucleic Acids Res ; 49(17): e102, 2021 09 27.
Artigo em Inglês | MEDLINE | ID: mdl-34214168

RESUMO

Rapidly evolving RNA viruses continuously produce minority haplotypes that can become dominant if they are drug-resistant or can better evade the immune system. Therefore, early detection and identification of minority viral haplotypes may help to promptly adjust the patient's treatment plan preventing potential disease complications. Minority haplotypes can be identified using next-generation sequencing, but sequencing noise hinders accurate identification. The elimination of sequencing noise is a non-trivial task that still remains open. Here we propose CliqueSNV based on extracting pairs of statistically linked mutations from noisy reads. This effectively reduces sequencing noise and enables identifying minority haplotypes with the frequency below the sequencing error rate. We comparatively assess the performance of CliqueSNV using an in vitro mixture of nine haplotypes that were derived from the mutation profile of an existing HIV patient. We show that CliqueSNV can accurately assemble viral haplotypes with frequencies as low as 0.1% and maintains consistent performance across short and long bases sequencing platforms.


Assuntos
Algoritmos , Biologia Computacional/métodos , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Infecções por Vírus de RNA/diagnóstico , Vírus de RNA/genética , COVID-19/diagnóstico , COVID-19/virologia , Frequência do Gene , Infecções por HIV/diagnóstico , Infecções por HIV/virologia , HIV-1/genética , Humanos , Mutação , Polimorfismo de Nucleotídeo Único , Infecções por Vírus de RNA/virologia , Reprodutibilidade dos Testes , SARS-CoV-2/genética , Sensibilidade e Especificidade
9.
PLoS Biol ; 17(6): e3000333, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-31220077

RESUMO

Developing new software tools for analysis of large-scale biological data is a key component of advancing modern biomedical research. Scientific reproduction of published findings requires running computational tools on data generated by such studies, yet little attention is presently allocated to the installability and archival stability of computational software tools. Scientific journals require data and code sharing, but none currently require authors to guarantee the continuing functionality of newly published tools. We have estimated the archival stability of computational biology software tools by performing an empirical analysis of the internet presence for 36,702 omics software resources published from 2005 to 2017. We found that almost 28% of all resources are currently not accessible through uniform resource locators (URLs) published in the paper they first appeared in. Among the 98 software tools selected for our installability test, 51% were deemed "easy to install," and 28% of the tools failed to be installed at all because of problems in the implementation. Moreover, for papers introducing new software, we found that the number of citations significantly increased when authors provided an easy installation process. We propose for incorporation into journal policy several practical solutions for increasing the widespread installability and archival stability of published bioinformatics software.


Assuntos
Biologia Computacional/métodos , Disseminação de Informação/métodos , Armazenamento e Recuperação da Informação/métodos , Pesquisa Biomédica , Bases de Dados Factuais , Humanos , Internet , Software/tendências
10.
Exp Dermatol ; 30(11): 1711-1716, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34036652

RESUMO

BACKGROUND: Human papillomavirus (HPV) infection is known to promote the development of mucosal squamous cell carcinoma (mSCC), including pathologically high-grade lesions, but its role in cutaneous squamous cell carcinoma (cuSCC) remains unclear, particularly in lesions that are considered high risk. OBJECTIVE: We aimed to determine whether enhanced HPV transcriptional activity can be detected in high-risk cuSCC samples compared with low-grade SCC samples or normal skin. METHODS: We performed RNA sequencing of cuSCC across 23 risk-stratified skin lesions. A subset of samples was tested for the presence of HPV DNA. High-quality, non-human reads from each sample group were used for viral analysis using Microbiome Coverage Profiler. RESULTS: None of the samples analysed had detectable expression of HPV RNA, while 64% of samples tested positive for HPV DNA. All samples were found to have expression of human endogenous retrovirus, and multiple samples showed expression of other viruses. CONCLUSIONS: Viral and prophage gene expression can be monitored in cuSCC or normal skin biopsies, yet no sample in our study showed evidence of active HPV gene expression despite evidence of HPV genome presence. This suggests HPV transcription does not play a role in differentiating high-risk cuSCCs from low-risk cuSCCs or normal skin.


Assuntos
Carcinoma de Células Escamosas/patologia , Carcinoma de Células Escamosas/virologia , Expressão Gênica , Papillomaviridae/genética , Infecções por Papillomavirus/complicações , Infecções por Papillomavirus/patologia , Neoplasias Cutâneas/patologia , Neoplasias Cutâneas/virologia , Idoso , Biópsia , Sondas de DNA de HPV , Feminino , Humanos , Masculino , Medição de Risco
11.
BMC Biol ; 18(1): 92, 2020 07 28.
Artigo em Inglês | MEDLINE | ID: mdl-32723395

RESUMO

An amendment to this paper has been published and can be accessed via the original article.

12.
BMC Biol ; 18(1): 37, 2020 04 07.
Artigo em Inglês | MEDLINE | ID: mdl-32264902

RESUMO

Metagenomics studies leverage genomic reference databases to generate discoveries in basic science and translational research. However, current microbial studies use disparate reference databases that lack consistent standards of specimen inclusion, data preparation, taxon labelling and accessibility, hindering their quality and comprehensiveness, and calling for the establishment of recommendations for reference genome database assembly. Here, we analyze existing fungal and bacterial databases and discuss guidelines for the development of a master reference database that promises to improve the quality and quantity of omics research.


Assuntos
Bactérias/genética , Bases de Dados Genéticas/normas , Fungos/genética , Metagenômica/normas , Metagenômica/instrumentação
13.
J Biol Chem ; 294(37): 13530-13544, 2019 09 13.
Artigo em Inglês | MEDLINE | ID: mdl-31243103

RESUMO

Our recent single-cell transcriptomic analysis has demonstrated that heterogeneous transcriptional activity attends molecular transition from the nascent to terminally differentiated fiber cells in the developing mouse lens. To understand the role of transcriptional heterogeneity in terminal differentiation and the functional phenotype (transparency) of this tissue, here we present a single-cell analysis of the developing lens, in a transgenic paradigm of an inherited pathology, known as the lamellar cataract. Cataracts hinder transmission of light into the eye. Lamellar cataract is the most prevalent bilateral childhood cataract. In this disease of early infancy, initially, the opacities remain confined to a few fiber cells, thus presenting an opportunity to investigate early molecular events that lead to cataractogenesis. We used a previously established paradigm that faithfully recapitulates this disease in transgenic mice. About 500 single fiber cells, manually isolated from a 2-day-old transgenic lens were interrogated individually for the expression of all known 17 crystallins and 78 other relevant genes using a Biomark HD (Fluidigm). We find that fiber cells from spatially and developmentally discrete regions of the transgenic (cataract) lens show remarkable absence of the heterogeneity of gene expression. Importantly, the molecular variability of cortical fiber cells, the hallmark of the WT lens, is absent in the transgenic cataract, suggesting absence of specific cell-type(s). Interestingly, we find a repetitive pattern of gene activity in progressive states of differentiation in the transgenic lens. This molecular dysfunction portends pathology much before the physical manifestations of the disease.


Assuntos
Catarata/genética , Cristalinas/genética , Animais , Animais Geneticamente Modificados , Catarata/metabolismo , Diferenciação Celular/genética , Cristalinas/metabolismo , Proteínas do Olho/genética , Proteínas do Olho/metabolismo , Feminino , Perfilação da Expressão Gênica/métodos , Humanos , Cristalino/metabolismo , Masculino , Camundongos , Camundongos Endogâmicos C57BL , RNA Mensageiro/genética , Retina/embriologia , Análise de Célula Única/métodos , Transcriptoma/genética
16.
BMC Genomics ; 20(Suppl 5): 423, 2019 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-31167634

RESUMO

BACKGROUND: High throughput sequencing has spurred the development of metagenomics, which involves the direct analysis of microbial communities in various environments such as soil, ocean water, and the human body. Many existing methods based on marker genes or k-mers have limited sensitivity or are too computationally demanding for many users. Additionally, most work in metagenomics has focused on bacteria and archaea, neglecting to study other key microbes such as viruses and eukaryotes. RESULTS: Here we present a method, MiCoP (Microbiome Community Profiling), that uses fast-mapping of reads to build a comprehensive reference database of full genomes from viruses and eukaryotes to achieve maximum read usage and enable the analysis of the virome and eukaryome in each sample. We demonstrate that mapping of metagenomic reads is feasible for the smaller viral and eukaryotic reference databases. We show that our method is accurate on simulated and mock community data and identifies many more viral and fungal species than previously-reported results on real data from the Human Microbiome Project. CONCLUSIONS: MiCoP is a mapping-based method that proves more effective than existing methods at abundance profiling of viruses and eukaryotes in metagenomic samples. MiCoP can be used to detect the full diversity of these communities. The code, data, and documentation are publicly available on GitHub at: https://github.com/smangul1/MiCoP .


Assuntos
Biologia Computacional/métodos , Fungos/genética , Marcadores Genéticos , Metagenômica/métodos , Microbiota , Análise de Sequência de DNA/métodos , Vírus/genética , Algoritmos , Fungos/classificação , Genoma Fúngico , Genoma Viral , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Vírus/classificação
17.
Mol Biol Evol ; 35(11): 2736-2750, 2018 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-30169787

RESUMO

As are most non-European populations, the Han Chinese are relatively understudied in population and medical genetics studies. From low-coverage whole-genome sequencing of 11,670 Han Chinese women we present a catalog of 25,057,223 variants, including 548,401 novel variants that are seen at least 10 times in our data set. Individuals from this data set came from 24 out of 33 administrative divisions across China (including 19 provinces, 4 municipalities, and 1 autonomous region), thus allowing us to study population structure, genetic ancestry, and local adaptation in Han Chinese. We identified previously unrecognized population structure along the East-West axis of China, demonstrated a general pattern of isolation-by-distance among Han Chinese, and reported unique regional signals of admixture, such as European influences among the Northwestern provinces of China. Furthermore, we identified a number of highly differentiated, putatively adaptive, loci (e.g., MTHFR, ADH7, and FADS, among others) that may be driven by immune response, climate, and diet in the Han Chinese. Finally, we have made available allele frequency estimates stratified by administrative divisions across China in the Geography of Genetic Variant browser for the broader community. By leveraging the largest currently available genetic data set for Han Chinese, we have gained insights into the history and population structure of the world's largest ethnic group.


Assuntos
Povo Asiático/genética , Evolução Biológica , Variação Genética , Adaptação Biológica , Animais , Estudos de Casos e Controles , China , Transtorno Depressivo Maior/genética , Feminino , Frequência do Gene , Humanos , Homem de Neandertal/genética , Filogeografia , Seleção Genética
19.
Bioinformatics ; 30(12): i329-37, 2014 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-24932001

RESUMO

MOTIVATION: Next-generation sequencing technologies sequence viruses with ultra-deep coverage, thus promising to revolutionize our understanding of the underlying diversity of viral populations. While the sequencing coverage is high enough that even rare viral variants are sequenced, the presence of sequencing errors makes it difficult to distinguish between rare variants and sequencing errors. RESULTS: In this article, we present a method to overcome the limitations of sequencing technologies and assemble a diverse viral population that allows for the detection of previously undiscovered rare variants. The proposed method consists of a high-fidelity sequencing protocol and an accurate viral population assembly method, referred to as Viral Genome Assembler (VGA). The proposed protocol is able to eliminate sequencing errors by using individual barcodes attached to the sequencing fragments. Highly accurate data in combination with deep coverage allow VGA to assemble rare variants. VGA uses an expectation-maximization algorithm to estimate abundances of the assembled viral variants in the population. RESULTS on both synthetic and real datasets show that our method is able to accurately assemble an HIV viral population and detect rare variants previously undetectable due to sequencing errors. VGA outperforms state-of-the-art methods for genome-wide viral assembly. Furthermore, our method is the first viral assembly method that scales to millions of sequencing reads. AVAILABILITY: Our tool VGA is freely available at http://genetics.cs.ucla.edu/vga/


Assuntos
Algoritmos , Genoma Viral , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genômica/métodos , HIV/genética , Análise de Sequência de DNA , Software
20.
BMC Genomics ; 15 Suppl 5: S7, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25082147

RESUMO

BACKGROUND: High throughput RNA sequencing (RNA-Seq) can generate whole transcriptome information at the single transcript level providing a powerful tool with multiple interrelated applications including transcriptome reconstruction and quantification. The sequences of novel transcripts can be reconstructed from deep RNA-Seq data, but this is computationally challenging due to sequencing errors, uneven coverage of expressed transcripts, and the need to distinguish between highly similar transcripts produced by alternative splicing. Another challenge in transcriptomic analysis comes from the ambiguities in mapping reads to transcripts. RESULTS: We present MaLTA, a method for simultaneous transcriptome assembly and quantification from Ion Torrent RNA-Seq data. Our approach explores transcriptome structure and incorporates a maximum likelihood model into the assembly and quantification procedure. A new version of the IsoEM algorithm suitable for Ion Torrent RNA-Seq reads is used to accurately estimate transcript expression levels. The MaLTA-IsoEM tool is publicly available at: http://alan.cs.gsu.edu/NGS/?q=malta CONCLUSIONS: Experimental results on both synthetic and real datasets show that Ion Torrent RNA-Seq data can be successfully used for transcriptome analyses. Experimental results suggest increased transcriptome assembly and quantification accuracy of MaLTA-IsoEM solution compared to existing state-of-the-art approaches.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de RNA/métodos , Transcriptoma , Algoritmos , Processamento Alternativo , Humanos , Funções Verossimilhança , Alinhamento de Sequência , Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA