RESUMO
Prostate tumours are highly variable in their response to therapies, but clinically available prognostic factors can explain only a fraction of this heterogeneity. Here we analysed 200 whole-genome sequences and 277 additional whole-exome sequences from localized, non-indolent prostate tumours with similar clinical risk profiles, and carried out RNA and methylation analyses in a subset. These tumours had a paucity of clinically actionable single nucleotide variants, unlike those seen in metastatic disease. Rather, a significant proportion of tumours harboured recurrent non-coding aberrations, large-scale genomic rearrangements, and alterations in which an inversion repressed transcription within its boundaries. Local hypermutation events were frequent, and correlated with specific genomic profiles. Numerous molecular aberrations were prognostic for disease recurrence, including several DNA methylation events, and a signature comprised of these aberrations outperformed well-described prognostic biomarkers. We suggest that intensified treatment of genomically aggressive localized prostate cancer may improve cure rates.
Assuntos
Genoma Humano/genética , Genômica , Mutação , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia , Cromotripsia , Variações do Número de Cópias de DNA , Metilação de DNA , Exoma/genética , Humanos , Masculino , Metástase Neoplásica/genética , Prognóstico , Neoplasias de Próstata Resistentes à Castração/genética , Neoplasias de Próstata Resistentes à Castração/patologia , RecidivaRESUMO
BACKGROUND: It is extremely common to need to select a subset of reads from a BAM file based on their specific properties. Typically, a user unpacks the BAM file to a text stream using SAMtools, parses and filters the lines using AWK, then repacks them using SAMtools. This process is tedious and error-prone. In particular, when working with many columns of data, mix-ups are common and the bit field containing the flags is unintuitive. There are several libraries for reading BAM files, such as Bio-SamTools for Perl and pysam for Python. Both allow access to the BAM's read information and can filter reads, but require substantial boilerplate code; this is high overhead for mostly ad hoc filtering. RESULTS: We have created a query language that gathers reads using a collection of predicates and common logical connectives. Queries run faster than equivalents and can be compiled to native code for embedding in larger programs. CONCLUSIONS: BAMQL provides a user-friendly, powerful and performant way to extract subsets of BAM files for ad hoc analyses or integration into applications. The query language provides a collection of predicates beyond those in SAMtools, and more flexible connectives.
Assuntos
Software , Sequência de Bases , Cromossomos/genética , Internet , Mitocôndrias/genética , Interface Usuário-ComputadorRESUMO
BACKGROUND: Illumina paired-end reads are used to analyse microbial communities by targeting amplicons of the 16S rRNA gene. Publicly available tools are needed to assemble overlapping paired-end reads while correcting mismatches and uncalled bases; many errors could be corrected to obtain higher sequence yields using quality information. RESULTS: PANDAseq assembles paired-end reads rapidly and with the correction of most errors. Uncertain error corrections come from reads with many low-quality bases identified by upstream processing. Benchmarks were done using real error masks on simulated data, a pure source template, and a pooled template of genomic DNA from known organisms. PANDAseq assembled reads more rapidly and with reduced error incorporation compared to alternative methods. CONCLUSIONS: PANDAseq rapidly assembles sequences and scales to billions of paired-end reads. Assembly of control libraries showed a 4-50% increase in the number of assembled sequences over naïve assembly with negligible loss of "good" sequence.
Assuntos
Bactérias/isolamento & purificação , Metagenômica , Software , Bactérias/genética , RNA Bacteriano/genética , RNA Ribossômico 16S/genéticaRESUMO
Soil pH is an important determinant of microbial community composition and diversity, yet few studies have characterized the specific effects of pH on individual bacterial taxa within bacterial communities, both abundant and rare. We collected composite soil samples over 2 years from an experimentally maintained pH gradient ranging from 4.5 to 7.5 from the Craibstone Experimental Farm (Craibstone, Scotland). Extracted nucleic acids were characterized by bacterial and group-specific denaturing gradient gel electrophoresis and next-generation sequencing of bacterial 16S rRNA genes. Both methods demonstrated comparable and reproducible shifts within higher taxonomic bacterial groups (e.g. Acidobacteria, Alphaproteobacteria, Verrucomicrobia, and Gammaproteobacteria) across the pH gradient. In addition, we used non-negative matrix factorization (NMF) for the first time on 16S rRNA gene data to identify positively interacting (i.e. co-occurring) operational taxonomic unit (OTU) clusters (i.e. 'components'), with abundances that correlated strongly with pH, and sample year to a lesser extent. All OTUs identified by NMF were visualized within principle coordinate analyses of UNIFRAC distances and subjected to taxonomic network analysis (SSUnique), which plotted OTU abundance and similarity against established taxonomies. Most pH-dependent OTUs identified here would not have been identified by previous methodologies for microbial community profiling and were unrelated to known lineages.
Assuntos
Bactérias/classificação , Microbiologia do Solo , Solo/química , Agricultura , Bactérias/genética , Eletroforese em Gel de Gradiente Desnaturante , Genes Bacterianos , Genes de RNAr , Concentração de Íons de Hidrogênio , Dados de Sequência Molecular , Reação em Cadeia da Polimerase , RNA Bacteriano/genética , RNA Ribossômico 16S/genética , Escócia , Análise de Sequência de DNA/métodosRESUMO
BACKGROUND: Although high-throughput sequencing of small subunit rRNA genes has revolutionized our understanding of microbial ecosystems, these technologies generate data at depths that benefit from automated analysis. Here we present AXIOME (Automation, eXtension, and Integration Of Microbial Ecology), a highly flexible and extensible management tool for popular microbial ecology analysis packages that promotes reproducibility and customization in microbial research. FINDINGS: AXIOME streamlines and manages analysis of small subunit (SSU) rRNA marker data in QIIME and mothur. AXIOME also implements features including the PAired-eND Assembler for Illumina sequences (PANDAseq), non-negative matrix factorization (NMF), multi-response permutation procedures (MRPP), exploring and recovering phylogenetic novelty (SSUnique) and indicator species analysis. AXIOME has a companion graphical user interface (GUI) and is designed to be easily extended to facilitate customized research workflows. CONCLUSIONS: AXIOME is an actively developed, open source project written in Vala and available from GitHub (http://neufeld.github.com/axiome) and as a Debian package. Axiometic, a GUI companion tool is also freely available (http://neufeld.github.com/axiometic). Given that data analysis has become an important bottleneck for microbial ecology studies, the development of user-friendly computational tools remains a high priority. AXIOME represents an important step in this direction by automating multi-step bioinformatic analyses and enabling the customization of procedures to suit the diverse research needs of the microbial ecology community.
RESUMO
Ammonia-oxidizing archaea (AOA) outnumber ammonia-oxidizing bacteria (AOB) in many terrestrial and aquatic environments. Although nitrification is the primary function of aquarium biofilters, very few studies have investigated the microorganisms responsible for this process in aquaria. This study used quantitative real-time PCR (qPCR) to quantify the ammonia monooxygenase (amoA) and 16S rRNA genes of Bacteria and Thaumarchaeota in freshwater aquarium biofilters, in addition to assessing the diversity of AOA amoA genes by denaturing gradient gel electrophoresis (DGGE) and clone libraries. AOA were numerically dominant in 23 of 27 freshwater biofilters, and in 12 of these biofilters AOA contributed all detectable amoA genes. Eight saltwater aquaria and two commercial aquarium nitrifier supplements were included for comparison. Both thaumarchaeal and bacterial amoA genes were detected in all saltwater samples, with AOA genes outnumbering AOB genes in five of eight biofilters. Bacterial amoA genes were abundant in both supplements, but thaumarchaeal amoA and 16S rRNA genes could not be detected. For freshwater aquaria, the proportion of amoA genes from AOA relative to AOB was inversely correlated with ammonium concentration. DGGE of AOA amoA genes revealed variable diversity across samples, with nonmetric multidimensional scaling (NMDS) indicating separation of freshwater and saltwater fingerprints. Composite clone libraries of AOA amoA genes revealed distinct freshwater and saltwater clusters, as well as mixed clusters containing both freshwater and saltwater amoA gene sequences. These results reveal insight into commonplace residential biofilters and suggest that aquarium biofilters may represent valuable biofilm microcosms for future studies of AOA ecology.