Results 1 - 15 of 15
1.
F1000Res ; 10, 2021.
Article in English | MEDLINE | ID: mdl-35999898

ABSTRACT

Threats to global biodiversity are increasingly recognised by scientists and the public as a critical challenge. Molecular sequencing technologies offer means to catalogue, explore, and monitor the richness and biogeography of life on Earth. However, exploiting their full potential requires tools that connect biodiversity infrastructures and resources. As a research infrastructure developing services and technical solutions that help integrate and coordinate life science resources across Europe, ELIXIR is a key player. To identify opportunities, highlight priorities, and aid strategic thinking, here we survey approaches by which molecular technologies help inform understanding of biodiversity. We detail example use cases to highlight how DNA sequencing is: resolving taxonomic issues; increasing knowledge of marine biodiversity; helping understand how agriculture and biodiversity are critically linked; and playing an essential role in ecological studies. Together with examples of national biodiversity programmes, the use cases show where progress is being made but also highlight common challenges and opportunities for future enhancement of underlying technologies and services that connect molecular and wider biodiversity domains. Based on emerging themes, we propose key recommendations to guide future funding for biodiversity research: biodiversity and bioinformatic infrastructures need to collaborate closely and strategically; taxonomic efforts need to be aligned and harmonised across domains; metadata needs to be standardised and common data management approaches widely adopted; current approaches need to be scaled up dramatically to address the anticipated explosion of molecular data; bioinformatics support for biodiversity research needs to be enabled and sustained; training for end users of biodiversity research infrastructures needs to be prioritised; and community initiatives need to be proactive and focused on enabling solutions.
For sequencing data to deliver their full potential they must be connected to knowledge: together, molecular sequence data collection initiatives and biodiversity research infrastructures can advance global efforts to prevent further decline of Earth's biodiversity.


Subjects
Biodiversity , Biological Science Disciplines , Computational Biology , Europe
2.
Microorganisms ; 7(11), 2019 Oct 26.
Article in English | MEDLINE | ID: mdl-31717754

ABSTRACT

Brettanomyces naardenensis is a spoilage yeast with potential for biotechnological applications for production of innovative beverages with low alcohol content and high attenuation degree. Here, we present the first annotated genome of B. naardenensis CBS 7540. The genome of B. naardenensis CBS 7540 was assembled into 76 contigs, totaling 11,283,072 nucleotides. In total, 5168 protein-coding sequences were annotated. The study provides functional genome annotation, phylogenetic analysis, and discusses genetic determinants behind notable stress tolerance and biotechnological potential of B. naardenensis.

3.
PLoS One ; 14(5): e0215077, 2019.
Article in English | MEDLINE | ID: mdl-31042716

ABSTRACT

Here, we present the genome of the industrial ethanol production strain Brettanomyces bruxellensis CBS 11270. The nuclear genome was found to be diploid, containing four chromosomes with sizes ranging from 2.2 to 4.0 Mbp. A 75 Kbp mitochondrial genome was also identified. Comparing the homologous chromosomes, we detected that 0.32% of nucleotides were polymorphic, i.e. formed single nucleotide polymorphisms (SNPs); 40.6% of these were found in coding regions (i.e. 0.13% of all nucleotides formed SNPs and were in coding regions). In addition, 8,538 indels were found. The total number of protein-coding genes was 4,897; of these, 4,284 were annotated on chromosomes, and the mitochondrial genome contained 18 protein-coding genes. Additionally, 595 annotated genes were on contigs not associated with chromosomes. A number of genes were duplicated, most of them as tandem repeats, including a six-gene cluster located on chromosome 3. There were also examples of interchromosomal gene duplications, including a duplication of a six-gene cluster, which was found on both chromosomes 1 and 4. Gene copy number analysis suggested loss of heterozygosity for 372 genes. This may reflect adaptation to relatively harsh but constant conditions of continuous fermentation. Analysis of gene topology showed that most of these losses occurred in clusters of more than one gene, the largest cluster comprising 33 genes. Comparative analysis against the wine isolate CBS 2499 revealed 88,534 SNPs and 8,133 indels. Moreover, when the scaffolds of the CBS 2499 genome assembly were aligned against the chromosomes of CBS 11270, many of them aligned completely, some had chunks aligned to different chromosomes, and some were in fact rearranged. Our findings indicate a highly dynamic genome within the species B. bruxellensis and a tendency towards reduction of gene number in long-term continuous cultivation.
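The percentages and gene counts quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is only a reader's sanity check on the published figures; the variable names are illustrative and do not come from the paper.

```python
# Figures quoted in the abstract; variable names are illustrative only.
snp_fraction = 0.0032   # 0.32% of nucleotides are polymorphic (SNPs)
coding_share = 0.406    # 40.6% of those SNPs lie in coding regions

# Share of all nucleotides that are SNPs located in coding regions.
coding_snp_fraction = snp_fraction * coding_share
print(f"{coding_snp_fraction:.2%}")  # prints 0.13%

# Gene counts by location: chromosomes, mitochondrial genome, unplaced contigs.
chromosomal, mitochondrial, unplaced = 4284, 18, 595
total_genes = chromosomal + mitochondrial + unplaced
print(total_genes)  # prints 4897
```

Both results agree with the abstract: 0.13% of nucleotides are coding-region SNPs, and the three gene categories sum to the reported total of 4,897.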


Subjects
Brettanomyces/metabolism , Fungal Chromosomes/genetics , Ethanol/metabolism , Mitochondria/genetics , Brettanomyces/genetics , Contig Mapping , Molecular Evolution , Gene Dosage , Genetic Variation , Genome Size , Molecular Sequence Annotation , Phylogeny , Whole Genome Sequencing/methods
4.
mBio ; 9(6), 2018 Nov 20.
Article in English | MEDLINE | ID: mdl-30459191

ABSTRACT

The continental subsurface is suggested to contain a significant part of the earth's total biomass. However, due to the difficulty of sampling, the deep subsurface is still one of the least understood ecosystems. Therefore, microorganisms inhabiting this environment might profoundly influence the global nutrient and energy cycles. In this study, in situ fixed RNA transcripts from two deep continental groundwaters from the Äspö Hard Rock Laboratory (a Baltic Sea-influenced water with a residence time of <20 years, defined as "modern marine," and an "old saline" groundwater with a residence time of thousands of years) were subjected to metatranscriptome sequencing. Although small subunit (SSU) rRNA gene and mRNA transcripts aligned to all three domains of life, supporting activity within these community subsets, the data also suggested that the groundwaters were dominated by bacteria. Many of the SSU rRNA transcripts grouped within newly described candidate phyla or could not be mapped to known branches on the tree of life, suggesting that a large portion of the active biota in the deep biosphere remains unexplored. Despite the extremely oligotrophic conditions, mRNA transcripts revealed a diverse range of metabolic strategies that were carried out by multiple taxa in the modern marine water that is fed by organic carbon from the surface. In contrast, the carbon dioxide- and hydrogen-fed old saline water with a residence time of thousands of years predominantly showed the potential to carry out translation. This suggested these cells were active, but waiting until an energy source episodically becomes available. IMPORTANCE: A newly designed sampling apparatus was used to fix RNA under in situ conditions in the deep continental biosphere and benchmarks a strategy for deep biosphere metatranscriptomic sequencing. This apparatus enabled the identification of active community members and the processes they carry out in this extremely oligotrophic environment.
This work presents for the first time evidence of eukaryotic, archaeal, and bacterial activity in two deep subsurface crystalline rock groundwaters from the Äspö Hard Rock Laboratory with different depths and geochemical characteristics. The findings highlight differences between organic carbon-fed shallow communities and carbon dioxide- and hydrogen-fed old saline waters. In addition, the data reveal a large portion of uncharacterized microorganisms, as well as the important role of candidate phyla in the deep biosphere, but also the disparity in microbial diversity when using standard microbial 16S rRNA gene amplification versus the large unknown portion of the community identified with unbiased metatranscriptomes.


Subjects
Extreme Environments , Groundwater/microbiology , Microbiota/genetics , Transcriptome , Water Microbiology , Archaea/genetics , Bacteria/genetics , Biodiversity , Gene Expression Profiling , rRNA Genes , Phylogeny , Messenger RNA/genetics , 16S Ribosomal RNA/genetics , Seawater/microbiology , DNA Sequence Analysis , Silicon Dioxide
5.
Proc Natl Acad Sci U S A ; 115(46): E10970-E10978, 2018 Nov 13.
Article in English | MEDLINE | ID: mdl-30373829

ABSTRACT

The Populus genus is one of the major plant model systems, but genomic resources have thus far primarily been available for poplar species, and primarily Populus trichocarpa (Torr. & Gray), which was the first tree with a whole-genome assembly. To further advance evolutionary and functional genomic analyses in Populus, we produced genome assemblies and population genetics resources of two aspen species, Populus tremula L. and Populus tremuloides Michx. The two aspen species have distributions spanning the Northern Hemisphere, where they are keystone species supporting a wide variety of dependent communities and produce a diverse array of secondary metabolites. Our analyses show that the two aspens share a similar genome structure and a highly conserved gene content with P. trichocarpa but display substantially higher levels of heterozygosity. Based on population resequencing data, we observed widespread positive and negative selection acting on both coding and noncoding regions. Furthermore, patterns of genetic diversity and molecular evolution in aspen are influenced by a number of features, such as expression level, coexpression network connectivity, and regulatory variation. To maximize the community utility of these resources, we have integrated all presented data within the PopGenIE web resource (PopGenIE.org).


Subjects
Populus/genetics , Biological Evolution , Plant DNA/genetics , Molecular Evolution , Genetic Variation , Population Genetics/methods , Plant Genome , Genomics , Linkage Disequilibrium/genetics , Phylogeny , Genetic Selection/genetics , DNA Sequence Analysis/methods , Trees/genetics
6.
F1000Res ; 7, 2018.
Article in English | MEDLINE | ID: mdl-29568489

ABSTRACT

As a part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to facilitate researchers getting started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects from start to finish of a general assembly and annotation project. Intrinsic properties of genomes are discussed, as is the importance of using high quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR).

7.
PeerJ ; 5: e3702, 2017.
Article in English | MEDLINE | ID: mdl-28879061

ABSTRACT

Whole genome sequencing (WGS) is a very valuable resource to understand the evolutionary history of poorly known species. However, in organisms with large genomes, such as most amphibians, WGS is still excessively challenging, and transcriptome sequencing (RNA-seq) represents a cost-effective tool to explore genome-wide variability. Non-model organisms do not usually have a reference genome, and the transcriptome must be assembled de novo. We used RNA-seq to obtain the transcriptomic profile for Oreobates cruralis, a poorly known South American direct-developing frog. In total, 550,871 transcripts were assembled, corresponding to 422,999 putative genes. Of those, we identified 23,500, 37,349, 38,120 and 45,885 genes present in the Pfam, EggNOG, KEGG and GO databases, respectively. Interestingly, our results suggested that genes related to immune system and defense mechanisms are abundant in the transcriptome of O. cruralis. We also present a pipeline to assist with pre-processing, assembling, evaluating and functionally annotating a de novo transcriptome from RNA-seq data of non-model organisms. Our pipeline guides the inexperienced user in an intuitive way through all the necessary steps to build de novo transcriptome assemblies using readily available software and is freely available at: https://github.com/biomendi/TRANSCRIPTOME-ASSEMBLY-PIPELINE/wiki.

8.
J Proteomics ; 129: 98-107, 2015 Nov 03.
Article in English | MEDLINE | ID: mdl-26381203

ABSTRACT

The increasing number of bacterial genomes in combination with reproducible quantitative proteome measurements provides new opportunities to explore how genetic differences modulate proteome composition and virulence. It is challenging to combine genome and proteome data as the underlying genome influences the proteome. We present a strategy to facilitate the integration of genome data from several genetically similar bacterial strains with data-independent analysis mass spectrometry (DIA-MS) for rapid interrogation of the combined data sets. The strategy relies on the construction of a composite genome combining all genetic data in a compact format, which can accommodate the fusion with quantitative peptide and protein information determined via DIA-MS. We demonstrate the method by combining data sets from whole genome sequencing, shotgun MS and DIA-MS from 34 clinical isolates of Streptococcus pyogenes. The data structure allows for fast exploration of the data showing that undetected proteins are on average more amenable to amino acid substitution than expressed proteins. We identified several significantly differentially expressed proteins between invasive and non-invasive strains. The work underlines how integration of whole genome sequencing with accurately quantified proteomes can further advance the interpretation of the relationship between genomes, proteomes and virulence. This article is part of a Special Issue entitled: Computational Proteomics.


Subjects
Bacterial Proteins/genetics , Bacterial Genome/genetics , Proteome/genetics , Proteomics/methods , DNA Sequence Analysis/methods , Streptococcus pyogenes/genetics , Chromosome Mapping/methods , Humans , Mass Spectrometry
9.
PLoS One ; 10(9): e0139080, 2015.
Article in English | MEDLINE | ID: mdl-26413905

ABSTRACT

After performing de novo transcript assembly of >1 billion RNA-Sequencing reads obtained from 22 samples of different Norway spruce (Picea abies) tissues that were not surface sterilized, we found that assembled sequences captured a mix of plant, lichen, and fungal transcripts. The latter were likely expressed by endophytic and epiphytic symbionts, indicating that these organisms were present, alive, and metabolically active. Here, we show that these serendipitously sequenced transcripts need not be considered merely as contamination, as is common, but that they provide insight into the plant's phyllosphere. Notably, we could classify these transcripts as originating predominantly from Dothideomycetes and Leotiomycetes species, with functional annotation of gene families indicating active growth and metabolism, with particular regard to glucose intake and processing, as well as gene regulation.


Subjects
Fungi/genetics , Picea/genetics , Picea/microbiology , Transcriptome/genetics , Base Composition/genetics , Fungal Gene Expression Regulation , Plant Gene Expression Regulation , Messenger RNA/genetics , Messenger RNA/metabolism
10.
Environ Microbiol ; 17(2): 496-513, 2015 Feb.
Article in English | MEDLINE | ID: mdl-25142400

ABSTRACT

Xeromyces bisporus can grow on sugary substrates down to 0.61, an extremely low water activity. Its genome size is approximately 22 Mb. Gene clusters encoding secondary metabolites were conspicuously absent; secondary metabolites were not detected experimentally. Thus, in its 'dry' but nutrient-rich environment, X. bisporus appears to have relinquished abilities for combative interactions. Elements to sense/signal osmotic stress, e.g. HogA pathway, were present in X. bisporus. However, transcriptomes at optimal (~0.89) versus low aw (0.68) revealed differential expression of only a few stress-related genes; among these, certain (not all) steps for glycerol synthesis were upregulated. Xeromyces bisporus increased glycerol production during hypo- and hyper-osmotic stress, and much of its wet weight comprised water and rinsable solutes; leaked solutes may form a protective slime. Xeromyces bisporus and other food-borne moulds increased membrane fatty acid saturation as water activity decreased. Such modifications did not appear to be transcriptionally regulated in X. bisporus; however, genes modulating sterols, phospholipids and the cell wall were differentially expressed. Xeromyces bisporus was previously proposed to be a 'chaophile', preferring solutes that disorder biomolecular structures. Both X. bisporus and the closely related xerophile, Xerochrysium xerophilum, with low membrane unsaturation indices, could represent a phylogenetic cluster of 'chaophiles'.


Subjects
Ascomycota/genetics , Ascomycota/metabolism , Glycerol/metabolism , Physiological Adaptation/genetics , Ascomycota/isolation & purification , Gene Expression Profiling , Fungal Genome/genetics , Multigene Family , Osmotic Pressure , Phylogeny , Water
11.
BMC Bioinformatics ; 15: 227, 2014 Jun 30.
Article in English | MEDLINE | ID: mdl-24976580

ABSTRACT

BACKGROUND: Genomic duplications constitute major events in the evolution of species, allowing paralogous copies of genes to take on fine-tuned biological roles. The orthology relationships between copies across multiple genomes can be resolved unambiguously by synteny, i.e. the conserved order of genomic sequences. However, a comprehensive analysis of duplication events and their contributions to evolution would require all-to-all genome alignments, whose number grows as N² with the number of available genomes, N. RESULTS: Here, we introduce Kraken, software that omits the all-to-all requirement by recursively traversing a graph of pairwise alignments and dynamically re-computing orthology. Kraken scales linearly with the number of targeted genomes, N, which allows for including large numbers of genomes in analyses. We first evaluated the method on the set of 12 Drosophila genomes, finding that orthologous correspondence computed indirectly through a graph of multiple synteny maps comes at minimal cost in terms of sensitivity, but reduces overall computational runtime by an order of magnitude. We then used the method on three well-annotated mammalian genomes, human, mouse, and rat, and show that up to 93% of protein coding transcripts have unambiguous pairwise orthologous relationships across the genomes. On a nucleotide level, 70 to 83% of exons match exactly at both splice junctions, and up to 97% on at least one junction. We last applied Kraken to an RNA-sequencing dataset from multiple vertebrates and diverse tissues, where we confirmed that brain-specific gene family members, i.e. one-to-many or many-to-many homologs, are more highly correlated across species than single-copy (i.e. one-to-one homologous) genes.
Not limited to protein coding genes, Kraken also identifies thousands of newly identified transcribed loci, likely non-coding RNAs that are consistently transcribed in human, chimpanzee and gorilla, and maintain significant correlation of expression levels across species. CONCLUSIONS: Kraken is a computational genome coordinate translator that facilitates cross-species comparisons, distinguishes orthologs from paralogs, and does not require costly all-to-all whole genome mappings. Kraken is freely available under LGPL from http://github.com/nedaz/kraken.
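The scaling argument in this abstract can be illustrated with a small sketch. It is not Kraken's implementation: the function names are hypothetical, and a plain breadth-first search stands in for the recursive graph traversal the abstract describes. It only shows why all-to-all alignment grows quadratically and how a route between two genomes can be found through a sparse set of pairwise alignments.

```python
from collections import deque

def all_to_all(n: int) -> int:
    """Pairwise alignments needed when every genome is aligned to every other."""
    return n * (n - 1) // 2

def chained(n: int) -> int:
    """Minimum pairwise alignments that still connect all n genomes."""
    return n - 1

def translation_path(alignments, source, target):
    """Shortest chain of genomes linking source to target (hypothetical helper).

    `alignments` is an iterable of (genome_a, genome_b) pairs for which a
    pairwise alignment exists. Returns a list of genome names, or None.
    """
    graph = {}
    for a, b in alignments:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    queue = deque([[source]])
    seen = {source}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        for nxt in sorted(graph.get(path[-1], ())):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None

print(all_to_all(12), chained(12))  # prints 66 11
print(translation_path([("human", "mouse"), ("mouse", "rat")], "human", "rat"))
```

For the 12 Drosophila genomes mentioned in the abstract, all-to-all alignment would need 66 pairwise comparisons, whereas a connected chain needs only 11; coordinates for an unaligned pair such as human and rat would be translated via the intermediate genome on the returned path.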


Subjects
Genomics/methods , Software , Animals , Chromosome Mapping , Drosophila melanogaster/genetics , Molecular Evolution , Genome/genetics , Humans , Mice , Molecular Sequence Annotation , Rats , Synteny/genetics , Genetic Transcription
12.
BMC Genomics ; 14: 347, 2013 May 24.
Article in English | MEDLINE | ID: mdl-23706020

ABSTRACT

BACKGROUND: Phenomena such as incomplete lineage sorting, horizontal gene transfer, gene duplication and subsequent sub- and neo-functionalisation can result in distinct local phylogenetic relationships that are discordant with species phylogeny. In order to assess the possible biological roles for these subdivisions, they must first be identified and characterised, preferably on a large scale and in an automated fashion. RESULTS: We developed Saguaro, a combination of a Hidden Markov Model (HMM) and a Self Organising Map (SOM), to characterise local phylogenetic relationships among aligned sequences using cacti, matrices of pair-wise distance measures. While the HMM determines the genomic boundaries from aligned sequences, the SOM hypothesises new cacti in an unsupervised and iterative fashion based on the regions that were modelled least well by existing cacti. After testing the software on simulated data, we demonstrate the utility of Saguaro by testing two different data sets: (i) 181 Dengue virus strains, and (ii) 5 primate genomes. Saguaro identifies regions under lineage-specific constraint for the first set, and genomic segments that we attribute to incomplete lineage sorting in the second dataset. Intriguingly for the primate data, Saguaro also classified an additional ~3% of the genome as most incompatible with the expected species phylogeny. A substantial fraction of these regions was found to overlap genes associated with both the innate and adaptive immune systems. CONCLUSIONS: Saguaro detects distinct cacti describing local phylogenetic relationships without requiring any a priori hypotheses. 
We have successfully demonstrated Saguaro's utility with two contrasting data sets, one containing many members with short sequences (Dengue viral strains: n = 181, genome size = 10,700 nt), and the other with few members but complex genomes (related primate species: n = 5, genome size = 3 Gb), suggesting that the software is applicable to a wide variety of experimental populations. Saguaro is written in C++, runs on the Linux operating system, and can be downloaded from http://saguarogw.sourceforge.net/.
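The abstract defines a cactus as a matrix of pairwise distance measures among aligned sequences. As a rough illustration of that data structure only, the sketch below builds such a matrix using the proportion of differing sites (p-distance). The choice of p-distance and the function names are assumptions made for this example; the abstract does not state which distance measure Saguaro uses, and the HMM and SOM steps are not shown.

```python
def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Columns where either sequence has a gap ('-') are skipped.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    return sum(x != y for x, y in compared) / len(compared)

def distance_matrix(seqs):
    """Symmetric matrix of pairwise p-distances for a list of aligned sequences."""
    return [[p_distance(a, b) for b in seqs] for a in seqs]

# Toy alignment of three sequences over four sites.
matrix = distance_matrix(["ACGT", "ACGA", "TCGA"])
for row in matrix:
    print(row)
```

In the toy alignment, the first and third sequences differ at two of four sites (distance 0.5), while each neighbouring pair differs at one site (distance 0.25). A method like the one described would compare many such matrices computed over different genomic regions.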


Subjects
Genomics/methods , Algorithms , Animals , Dengue Virus/genetics , Disease Outbreaks , Humans , Immunity/genetics , Markov Chains , Genetic Models , Phylogeny , Primates/genetics , Primates/immunology , Software , Species Specificity
13.
IMA Fungus ; 4(2): 229-41, 2013 Dec.
Article in English | MEDLINE | ID: mdl-24563835

ABSTRACT

On the basis of a study of ITS sequences, Vidal et al. (Rev. Iber. Micol. 17: 22, 2000) recommended that the genus Chrysosporium be restricted to species belonging to Onygenales. Using nrLSU genes, we studied the majority of clades examined by Vidal et al. and showed that currently accepted species in Chrysosporium phylogenetically belong in six clades in three orders. Surprisingly, the xerophilic species of Chrysosporium, long thought to be a single grouping away from the majority of Chrysosporium species, occupy two clades, one in Leotiales, the other in Eurotiales. Species accepted in Leotiales are related to the sexual genus Bettsia. One is the type species B. alvei, together with related asexual strains classified as C. farinicola; the second is C. fastidium, transferred to Bettsia as B. fastidia. Species in the Eurotiales are transferred to Xerochrysium gen. nov., where the accepted species are X. xerophilum and X. dermatitidis, the correct name for C. inops on transfer to Xerochrysium. All accepted species are extreme xerophiles, found in dried and concentrated foods.

14.
Fungal Biol ; 115(11): 1100-11, 2011 Nov.
Article in English | MEDLINE | ID: mdl-22036289

ABSTRACT

The filamentous ascomycete Xeromyces bisporus is an extreme xerophile able to grow down to a water activity of 0.62. We have inferred the phylogenetic position of Xeromyces in relation to other xerophilic and xerotolerant fungi in the order Eurotiales. Using nrDNA and beta-tubulin sequences, we show that it is more closely related to the xerophilic foodborne species of the genus Chrysosporium than to the genus Monascus. The taxonomy of X. bisporus and Monascus is discussed. Based on physiological, morphological, and phylogenetic distinctiveness, we suggest that Xeromyces should be retained as a separate genus.


Subjects
Eurotiales/classification , Eurotiales/genetics , Genetic Variation , Phylogeny , Water/metabolism , Eurotiales/metabolism , Fungal Proteins/genetics , Molecular Sequence Data
15.
Mol Phylogenet Evol ; 34(2): 334-54, 2005 Feb.
Article in English | MEDLINE | ID: mdl-15619446

ABSTRACT

The biologically interesting ant-plant association, myrmecophytism, occurs in ca. 140 of the 11,000 species and 22 of the 630 genera of the coffee family (Rubiaceae). These myrmecophytic Rubiaceae species are predominantly distributed in Southeast Asia, especially the Malesian region, with comparatively few species in mainland Africa and the Neotropics. The mostly Southeast Asian genus Neonauclea s.s. is one of the three Rubiaceae genera with extensive radiation of myrmecophytes and also the most speciose genus of the tribe Naucleeae s.l. We perform parsimony phylogenetic analyses of Neonauclea s.s., previously resolved as paraphyletic, and its allied genera using both ETS and ITS sequencing data to test: (1) the paraphyly of Neonauclea s.s.; (2) the phylogenetic relationships within the Ludekia-Myrmeconauclea-Neonauclea complex; and (3) the evolution of myrmecophytism within the complex. The earlier proposed paraphyly of Neonauclea s.s. appears to be the result of the combined effects of parallel substitutions in Metadina trichotoma and the sampled ITS putative pseudogenes of Neonauclea longipedunculata and losses of some synapomorphies of Neonauclea s.s. in the latter. The analyses present strong support for the monophyly of Myrmeconauclea and Neonauclea s.s. and their sister-group relationships. Our findings additionally favor the hypothesis of multiple origins of myrmecophytism in the Bornean Neonauclea, which have independently been exploited by at least three Cladomyrma ant species. Furthermore, we interpret the low levels of variation in both the ETS and ITS sequences as indication of a recent and rapid radiation for Neonauclea s.s. (with 65 species) and a recent and slow radiation for Myrmeconauclea (with three species). We argue that the rapid diversification of Neonauclea s.s. is partly associated with the nature of its fruits and its ability to colonize a wide range of habitats.
We postulate that both ecological and geographical events may have been responsible for the radiation of the non-myrmecophytic Neonauclea species. Finally, we argue that the acquisition of the pseudo-multiple fruits and long-tailed seeds has allowed Myrmeconauclea to specialize on rheophytic habitats but its narrow ecological tolerance may have hindered its speciation.


Subjects
Biological Evolution , Phylogeny , Rubiaceae/genetics , Animals , Ants/physiology , Rubiaceae/physiology