1.
Nat Biotechnol ; 41(7): 915-918, 2023 Jul.
Article in English | MEDLINE | ID: mdl-36593406

ABSTRACT

Annotating newly sequenced genomes and determining alternative isoforms from long-read RNA data are complex and incompletely solved problems. Here we present IsoQuant, a computational tool using intron graphs that accurately reconstructs transcripts both with and without reference genome annotation. For novel transcript discovery, IsoQuant reduces the false-positive rate fivefold and 2.5-fold for Oxford Nanopore reference-based or reference-free mode, respectively. IsoQuant also improves performance for Pacific Biosciences data.


Subject(s)
High-Throughput Nucleotide Sequencing , RNA , Protein Isoforms/genetics , Sequence Analysis, RNA , Genome , Sequence Analysis, DNA
2.
Front Microbiol ; 13: 981458, 2022.
Article in English | MEDLINE | ID: mdl-36386613

ABSTRACT

While metagenome sequencing may provide insights into the genome sequences and composition of microbial communities, metatranscriptome analysis can be useful for studying the functional activity of a microbiome. RNA-Seq data make it possible to determine which genes are active in the community and how their expression levels depend on external conditions. Although the field of metatranscriptomics is relatively young, the number of projects related to metatranscriptome analysis increases every year and the scope of its applications expands. However, several problems complicate metatranscriptome analysis: the complexity of microbial communities, the wide dynamic range of transcriptome expression and, importantly, the lack of high-quality computational methods for assembling meta-RNA sequencing data. These factors degrade the contiguity and completeness of metatranscriptome assemblies, thereby affecting downstream analysis. Here we present MetaGT, a pipeline for de novo assembly of metatranscriptomes, which is based on the idea of combining metatranscriptomic and metagenomic data sequenced from the same sample. MetaGT assembles metatranscriptomic contigs and fills in missing regions based on their alignments to the metagenome assembly. This approach makes it possible to overcome the described complexities, obtain complete RNA sequences, and additionally estimate their abundances. Using various publicly available real and simulated datasets, we demonstrate that MetaGT yields significant improvement in coverage and completeness of metatranscriptome assemblies compared to existing methods that do not exploit metagenomic data. The pipeline is implemented in NextFlow and is freely available from https://github.com/ablab/metaGT.

3.
Front Microbiol ; 12: 613791, 2021.
Article in English | MEDLINE | ID: mdl-33833738

ABSTRACT

Metagenomics is a segment of conventional microbial genomics dedicated to the sequencing and analysis of combined genomic DNA of entire environmental samples. The most critical step of metagenomic data analysis is the reconstruction of individual genes and genomes of the microorganisms in the communities using metagenomic assemblers, computational programs that put together small fragments of sequenced DNA generated by sequencing instruments. Here, we describe the challenges of metagenomic assembly and a wide spectrum of applications in which metagenomic assemblies were used to better understand the ecology and evolution of microbial ecosystems, and present one of the most efficient microbial assemblers, SPAdes, which was upgraded to become applicable for metagenomics.

4.
Stand Genomic Sci ; 12: 57, 2017.
Article in English | MEDLINE | ID: mdl-28943998

ABSTRACT

Dethiobacter alkaliphilus strain AHT1(T) is an anaerobic, sulfidogenic, moderately salt-tolerant alkaliphilic chemolithotroph isolated from hypersaline soda lake sediments in northeastern Mongolia. It is a Gram-positive bacterium with low GC content, within the phylum Firmicutes. Here we report its draft genome sequence, which consists of 34 contigs with a total sequence length of 3.12 Mbp. D. alkaliphilus strain AHT1(T) was sequenced by the Joint Genome Institute (JGI) as part of the Community Science Program due to its relevance to bioremediation and biotechnological applications.

5.
Stand Genomic Sci ; 11(1): 67, 2016.
Article in English | MEDLINE | ID: mdl-27617057

ABSTRACT

Desulfurivibrio alkaliphilus strain AHT2(T) is a strictly anaerobic sulfidogenic haloalkaliphile isolated from a composite sediment sample of eight hypersaline alkaline lakes in the Wadi al Natrun valley in the Egyptian Libyan Desert. D. alkaliphilus AHT2(T) is Gram-negative and belongs to the family Desulfobulbaceae within the Deltaproteobacteria. Here we report its genome sequence, which contains a 3.10 Mbp chromosome. D. alkaliphilus AHT2(T) is adapted to survive under highly alkaline and moderately saline conditions and is therefore relevant to the biotechnology industry and to the study of life under extreme conditions. For these reasons, D. alkaliphilus AHT2(T) was sequenced by the DOE Joint Genome Institute as part of the Community Science Program.

6.
Stand Genomic Sci ; 11: 2, 2016.
Article in English | MEDLINE | ID: mdl-26744606

ABSTRACT

Methanospirillum hungatei strain JF1 (DSM 864) is a methane-producing archaeon and is the type species of the genus Methanospirillum, which belongs to the family Methanospirillaceae within the order Methanomicrobiales. Its genome was selected for sequencing due to its ability to utilize hydrogen and carbon dioxide and/or formate as a sole source of energy. Ecologically, M. hungatei functions as the hydrogen- and/or formate-using partner with many species of syntrophic bacteria. Its morphology is distinct from other methanogens with the ability to form long chains of cells (up to 100 µm in length), which are enclosed within a sheath-like structure, and terminal cells with polar flagella. The genome of M. hungatei strain JF1 is the first completely sequenced genome of the family Methanospirillaceae, and it has a circular genome of 3,544,738 bp containing 3,239 protein coding and 68 RNA genes. The large genome of M. hungatei JF1 suggests the presence of unrecognized biochemical/physiological properties that likely extend to the other Methanospirillaceae and include the ability to form the unusual sheath-like structure and to successfully interact with syntrophic bacteria.

7.
FEBS J ; 282(23): 4515-37, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26367132

ABSTRACT

The ascomycete Geotrichum candidum is a versatile and efficient decay fungus that is involved, for example, in biodeterioration of compact discs; notably, the 3C strain was previously shown to degrade filter paper and cotton more efficiently than several industrial enzyme preparations. Glycoside hydrolase (GH) family 7 cellobiohydrolases (CBHs) are the primary constituents of industrial cellulase cocktails employed in biomass conversion, and feature tunnel-enclosed active sites that enable processive hydrolytic cleavage of cellulose chains. Understanding the structure-function relationships defining the activity and stability of GH7 CBHs is thus of keen interest. Accordingly, we report the comprehensive characterization of the GH7 CBH secreted by G. candidum (GcaCel7A). The bimodular cellulase consists of a family 1 cellulose-binding module (CBM) and linker connected to a GH7 catalytic domain that shares 64% sequence identity with the archetypal industrial GH7 CBH of Hypocrea jecorina (HjeCel7A). GcaCel7A shows activity on Avicel cellulose similar to HjeCel7A, with less product inhibition, but has a lower temperature optimum (50 °C versus 60-65 °C, respectively). Five crystal structures, with and without bound thio-oligosaccharides, show conformational diversity of tunnel-enclosing loops, including a form with partial tunnel collapse at subsite -4 not reported previously in GH7. Also, the first O-glycosylation site in a GH7 crystal structure is reported, on a loop where the glycan probably influences loop contacts across the active site and interactions with the cellulose surface. The GcaCel7A structures indicate higher loop flexibility than HjeCel7A, in accordance with sequence modifications. However, GcaCel7A retains small fluctuations in molecular simulations, suggesting high processivity and low endo-initiation probability, similar to HjeCel7A.
DATABASE: Structural data are available in the Protein Data Bank under the accession numbers 5AMP, 4ZZV, 4ZZW, 4ZZT, and 4ZZU. The Geotrichum candidum GH family 7 cellobiohydrolase nucleotide sequence is available in GenBank under accession number KJ958925. ENZYMES: Glycoside hydrolase family 7 reducing end acting cellobiohydrolase.


Subject(s)
Cellulose 1,4-beta-Cellobiosidase , Geotrichum/enzymology , Molecular Dynamics Simulation , Amino Acid Sequence , Cellulose 1,4-beta-Cellobiosidase/chemistry , Cellulose 1,4-beta-Cellobiosidase/genetics , Cellulose 1,4-beta-Cellobiosidase/metabolism , Hydrogen-Ion Concentration , Kinetics , Molecular Sequence Data , Protein Conformation , Sequence Alignment , Temperature
8.
Stand Genomic Sci ; 10: 48, 2015.
Article in English | MEDLINE | ID: mdl-26380636

ABSTRACT

Bacteroides barnesiae Lan et al. 2006 is a species of the genus Bacteroides, which belongs to the family Bacteroidaceae. Strain BL2(T) is of interest because it was isolated from the gut of a chicken and because of the growing awareness that the anaerobic microbiota of the caecum benefits the host and may impact poultry farming. The 3,621,509 bp long genome with its 3,059 protein-coding and 97 RNA genes is a part of the Genomic Encyclopedia of Type Strains, Phase I: the one thousand microbial genomes (KMG) project.

9.
Genome Announc ; 3(2)2015 Mar 12.
Article in English | MEDLINE | ID: mdl-25767232

ABSTRACT

Desulfovibrio carbinoliphilus subsp. oakridgensis FW-101-2B is an anaerobic, organic acid/alcohol-oxidizing, sulfate-reducing δ-proteobacterium. FW-101-2B was isolated from contaminated groundwater at The Field Research Center at Oak Ridge National Lab after in situ stimulation for heavy metal-reducing conditions. The genome will help elucidate the metabolic potential of sulfate-reducing bacteria during uranium reduction.

10.
BMC Genomics ; 15: 308, 2014 Apr 25.
Article in English | MEDLINE | ID: mdl-24767249

ABSTRACT

BACKGROUND: Tuberculosis (TB) poses a worldwide threat due to advancing multidrug-resistant strains and deadly co-infections with human immunodeficiency virus. Today large amounts of Mycobacterium tuberculosis whole genome sequencing data are being assessed broadly, and yet there exists no comprehensive online resource that connects M. tuberculosis genome variants with geographic origin, with drug resistance or with clinical outcome. DESCRIPTION: Here we describe a broadly inclusive unifying Genome-wide Mycobacterium tuberculosis Variation (GMTV) database (http://mtb.dobzhanskycenter.org) that catalogues genome variations of M. tuberculosis strains collected across Russia. GMTV contains a broad spectrum of data derived from different sources and related to M. tuberculosis molecular biology, epidemiology, TB clinical outcome, year and place of isolation, and drug resistance profiles, and displays the variants across the genome using a dedicated genome browser. The GMTV database, which includes 1084 genomes and over 69,000 SNP or Indel variants, can be queried about M. tuberculosis genome variation and putative associations with drug resistance, geographical origin, and clinical stages and outcomes. CONCLUSIONS: Implementation of GMTV tracks the pattern of changes of M. tuberculosis strains in different geographical areas, facilitates disease gene discoveries associated with drug resistance or different clinical sequelae, and automates comparative genomic analyses among M. tuberculosis strains.


Subject(s)
Databases, Genetic , Genetic Variation , Genome, Bacterial , Mycobacterium tuberculosis/genetics , Tuberculosis/epidemiology , Humans , Tuberculosis/microbiology
11.
Stand Genomic Sci ; 7(1): 91-106, 2012 Oct 10.
Article in English | MEDLINE | ID: mdl-23450070

ABSTRACT

Syntrophobacter fumaroxidans strain MPOB(T) is the best-studied species of the genus Syntrophobacter. The species is of interest because of its anaerobic syntrophic lifestyle, its involvement in the conversion of propionate to acetate, H2 and CO2 during the overall degradation of organic matter, and its release of products that serve as substrates for other microorganisms. The strain is able to ferment fumarate in pure culture to CO2 and succinate, and is also able to grow as a sulfate reducer with propionate as an electron donor. This is the first complete genome sequence of a member of the genus Syntrophobacter and a member genus in the family Syntrophobacteraceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,990,251 bp long genome with its 4,098 protein-coding and 81 RNA genes is a part of the Microbial Genome Program (MGP) and the Genomes to Life (GTL) Program project.

12.
J Bacteriol ; 193(16): 4268-9, 2011 Aug.
Article in English | MEDLINE | ID: mdl-21685289

ABSTRACT

Desulfovibrio alaskensis G20 (formerly Desulfovibrio desulfuricans G20) is a Gram-negative mesophilic sulfate-reducing bacterium (SRB), known to corrode ferrous metals and to reduce toxic radionuclides and metals such as uranium and chromium to sparingly soluble and less toxic forms. We present the 3.7-Mb genome sequence to provide insights into its physiology.


Subject(s)
Desulfovibrio/classification , Desulfovibrio/genetics , Genome, Bacterial , Base Sequence , Desulfovibrio/physiology , Molecular Sequence Data
14.
BMC Genomics ; 11: 680, 2010 Nov 30.
Article in English | MEDLINE | ID: mdl-21118570

ABSTRACT

BACKGROUND: Succinate is produced petrochemically from maleic anhydride to satisfy a small specialty chemical market. If succinate could be produced fermentatively at a price competitive with that of maleic anhydride, though, it could replace maleic anhydride as the precursor of many bulk chemicals, transforming a multi-billion dollar petrochemical market into one based on renewable resources. Actinobacillus succinogenes naturally converts sugars and CO2 into high concentrations of succinic acid as part of a mixed-acid fermentation. Efforts are ongoing to maximize carbon flux to succinate to achieve an industrial process. RESULTS: Described here is the 2.3 Mb A. succinogenes genome sequence with emphasis on A. succinogenes's potential for genetic engineering, its metabolic attributes and capabilities, and its lack of pathogenicity. The genome sequence contains 1,690 DNA uptake signal sequence repeats and a nearly complete set of natural competence proteins, suggesting that A. succinogenes is capable of natural transformation. A. succinogenes lacks a complete tricarboxylic acid cycle as well as a glyoxylate pathway, and it appears to be able to transport and degrade about twenty different carbohydrates. The genomes of A. succinogenes and its closest known relative, Mannheimia succiniciproducens, were compared for the presence of known Pasteurellaceae virulence factors. Both species appear to lack the virulence traits of toxin production, sialic acid and choline incorporation into lipopolysaccharide, and utilization of hemoglobin and transferrin as iron sources. Perspectives are also given on the conservation of A. succinogenes genomic features in other sequenced Pasteurellaceae. CONCLUSIONS: Both A. succinogenes and M. succiniciproducens genome sequences lack many of the virulence genes used by their pathogenic Pasteurellaceae relatives. The lack of pathogenicity of these two succinogens is an exciting prospect, because comparisons with pathogenic Pasteurellaceae could lead to a better understanding of Pasteurellaceae virulence. The fact that the A. succinogenes genome encodes uptake and degradation pathways for a variety of carbohydrates reflects the variety of carbohydrate substrates available in the rumen, A. succinogenes's natural habitat. It also suggests that many different carbon sources can be used as feedstock for succinate production by A. succinogenes.


Subject(s)
Actinobacillus/genetics , Genome, Bacterial/genetics , Industrial Microbiology , Succinic Acid/metabolism , Actinobacillus/metabolism , Actinobacillus/pathogenicity , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Base Sequence , Cell Membrane/metabolism , Iron/metabolism , Metabolic Networks and Pathways/genetics , Molecular Sequence Data , Phylogeny , Prophages/genetics , RNA, Ribosomal, 16S/genetics , Repetitive Sequences, Nucleic Acid/genetics , Virulence/genetics
15.
Environ Microbiol ; 12(8): 2289-301, 2010 Aug.
Article in English | MEDLINE | ID: mdl-21966920

ABSTRACT

Syntrophomonas wolfei is a specialist, evolutionarily adapted for syntrophic growth with methanogens and other hydrogen- and/or formate-using microorganisms. This slow-growing anaerobe has three putative ribosome RNA operons, each of which has 16S rRNA and 23S rRNA genes of different length and multiple 5S rRNA genes. The genome also contains 10 RNA-directed, DNA polymerase genes. Genomic analysis shows that S. wolfei relies solely on the reduction of protons, bicarbonate or unsaturated fatty acids to re-oxidize reduced cofactors. Syntrophomonas wolfei lacks the genes needed for aerobic or anaerobic respiration and has an exceptionally limited ability to create ion gradients. An ATP synthase and a pyrophosphatase were the only systems detected capable of creating an ion gradient. Multiple homologues for β-oxidation genes were present even though S. wolfei uses a limited range of fatty acids from four to eight carbons in length. Syntrophomonas wolfei, other syntrophic metabolizers with completed genomic sequences, and thermophilic anaerobes known to produce high molar ratios of hydrogen from glucose have genes to produce H2 from NADH by an electron bifurcation mechanism. Comparative genomic analysis also suggests that formate production from NADH may involve electron bifurcation. A membrane-bound, iron-sulfur oxidoreductase found in S. wolfei and Syntrophus aciditrophicus may be uniquely involved in reverse electron transport during syntrophic fatty acid metabolism. The genome sequence of S. wolfei reveals several core reactions that may be characteristic of syntrophic fatty acid metabolism and illustrates how biological systems produce hydrogen from thermodynamically difficult reactions.


Subject(s)
Genome, Bacterial , Gram-Positive Endospore-Forming Rods/genetics , Gram-Positive Endospore-Forming Rods/metabolism , Hydrogen/metabolism , DNA, Bacterial/genetics , Fatty Acids/metabolism , Formates/metabolism , Oxidation-Reduction , RNA, Ribosomal/genetics , Sequence Analysis, DNA
16.
Int J Bioinform Res Appl ; 5(4): 458-77, 2009.
Article in English | MEDLINE | ID: mdl-19640832

ABSTRACT

We have developed a new method for frameshift detection, a combination of ab initio and alignment-based algorithms, that can serve as a useful tool for sequencing quality control in next-generation sequencing. We evaluated the method's accuracy on test sets of annotated genomic sequences with artificial frameshifts in protein coding regions. These tests have shown that the new method performs comparably to the earlier developed FrameD. On sets of sequences produced by 454 pyrosequencing, with sequence errors recovered by Sanger re-sequencing, the accuracy of the method was shown to hold at the same level.


Subject(s)
Frameshift Mutation , Genome, Bacterial , Genomics/methods , Open Reading Frames , Algorithms , Sequence Alignment