Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
1.
Bioinformatics ; 38(1): 257-260, 2021 12 22.
Artigo em Inglês | MEDLINE | ID: mdl-34293115

RESUMO

MOTIVATION: DNA and RNA modifications can now be identified using nanopore sequencing. However, we currently lack a flexible software to efficiently encode, store, analyze and visualize DNA and RNA modification data. RESULTS: Here, we present ModPhred, a versatile toolkit that facilitates DNA and RNA modification analysis from nanopore sequencing reads in a user-friendly manner. ModPhred integrates probabilistic DNA and RNA modification information within the FASTQ and BAM file formats, can be used to encode multiple types of modifications simultaneously, and its output can be easily coupled to genomic track viewers, facilitating the visualization and analysis of DNA and RNA modification information in individual reads in a simple and computationally efficient manner. AVAILABILITY AND IMPLEMENTATION: ModPhred is available at https://github.com/novoalab/modPhred, is implemented in Python3, and is released under an MIT license. Docker images with all dependencies preinstalled are also provided. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Sequenciamento por Nanoporos , Nanoporos , Análise de Sequência de DNA/métodos , Software , DNA , RNA , Sequenciamento de Nucleotídeos em Larga Escala
2.
Nucleic Acids Res ; 48(W1): W553-W557, 2020 07 02.
Artigo em Inglês | MEDLINE | ID: mdl-32343307

RESUMO

Inferring homology relationships across genes in different species is a central task in comparative genomics. Therefore, a large number of resources and methods have been developed over the years. Some public databases include phylogenetic trees of homologous gene families which can be used to further differentiate homology relationships into orthology and paralogy. MetaPhOrs is a web server that integrates phylogenetic information from different sources to provide orthology and paralogy relationships based on a common phylogeny-based predictive algorithm and associated with a consistency-based confidence score. Here we describe the latest version of the web server which includes major new implementations and provides orthology and paralogy relationships derived from ∼8.2 million gene family trees-from 13 different source repositories across ∼4000 species with sequenced genomes. MetaPhOrs server is freely available, without registration, at http://orthology.phylomedb.org/.


Assuntos
Família Multigênica , Filogenia , Software , Homologia de Sequência
3.
Nucleic Acids Res ; 47(18): 9542-9556, 2019 10 10.
Artigo em Inglês | MEDLINE | ID: mdl-31372643

RESUMO

Restriction-modification (R-M) systems represent an effective mechanism of defence against invading bacteriophages, and are widely spread among bacteria and archaea. In acquiring a Type II R-M system via horizontal gene transfer, the new hosts become more resistant to phage infection, through the action of a restriction endonuclease (REase), which recognizes and cleaves specific target DNAs. To protect the host cell's DNA, there is also a methyltransferase (MTase), which prevents DNA cleavage by the cognate REase. In some R-M systems, the host also accepts a cis-acting transcription factor (C protein), which regulates the counteracting activities of REase and MTase to avoid host self-restriction. Our study characterized the unexpected phenotype of Escherichia coli cells, which manifested as extensive cell filamentation triggered by acquiring the Csp231I R-M system from Citrobacter sp. Surprisingly, we found that the cell morphology defect was solely dependent on the C regulator. Our transcriptome analysis supported by in vivo and in vitro assays showed that C protein directly silenced the expression of the RacR repressor to affect the Rac prophage-related genes. The rac locus ydaST genes, when derepressed, exerted a toxicity indicated by cell filamentation through an unknown mechanism. These results provide an apparent example of transcription factor cross-talk, which can have significant consequences for the host, and may represent a constraint on lateral gene transfer.


Assuntos
Bacteriófagos/genética , Enzimas de Restrição-Modificação do DNA/genética , Escherichia coli/genética , Interações Hospedeiro-Patógeno/genética , Sequência de Aminoácidos/genética , Bacteriófagos/patogenicidade , Citrobacter/genética , Enzimas de Restrição do DNA/genética , DNA Bacteriano/química , DNA Bacteriano/genética , Desoxirribonucleases de Sítio Específico do Tipo II/genética , Escherichia coli/virologia , Perfilação da Expressão Gênica/métodos , Regulação Bacteriana da Expressão Gênica , Transferência Genética Horizontal/genética , Fenótipo , Fatores de Transcrição/genética , Proteínas Virais/genética
4.
RNA ; 24(4): 609-619, 2018 04.
Artigo em Inglês | MEDLINE | ID: mdl-29358234

RESUMO

RNA molecules play important roles in virtually every cellular process. These functions are often mediated through the adoption of specific structures that enable RNAs to interact with other molecules. Thus, determining the secondary structures of RNAs is central to understanding their function and evolution. In recent years several sequencing-based approaches have been developed that allow probing structural features of thousands of RNA molecules present in a sample. Here, we describe nextPARS, a novel Illumina-based implementation of in vitro parallel probing of RNA structures. Our approach achieves comparable accuracy to previous implementations, while enabling higher throughput and sample multiplexing.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Conformação de Ácido Nucleico , RNA Mensageiro/química , Análise de Sequência de RNA/métodos , Biologia Computacional
5.
Nat Methods ; 13(5): 425-30, 2016 05.
Artigo em Inglês | MEDLINE | ID: mdl-27043882

RESUMO

Achieving high accuracy in orthology inference is essential for many comparative, evolutionary and functional genomic analyses, yet the true evolutionary history of genes is generally unknown and orthologs are used for very different applications across phyla, requiring different precision-recall trade-offs. As a result, it is difficult to assess the performance of orthology inference methods. Here, we present a community effort to establish standards and an automated web-based service to facilitate orthology benchmarking. Using this service, we characterize 15 well-established inference methods and resources on a battery of 20 different benchmarks. Standardized benchmarking provides a way for users to identify the most effective methods for the problem at hand, sets a minimum requirement for new tools and resources, and guides the development of more accurate orthology inference methods.


Assuntos
Biologia Computacional/normas , Genômica/normas , Filogenia , Proteômica/normas , Archaea/classificação , Archaea/genética , Bactérias/classificação , Bactérias/genética , Biologia Computacional/métodos , Bases de Dados Genéticas , Eucariotos/classificação , Eucariotos/genética , Ontologia Genética , Genômica/métodos , Modelos Genéticos , Proteômica/métodos , Análise de Sequência de Proteína , Homologia de Sequência , Especificidade da Espécie
6.
7.
Nucleic Acids Res ; 44(12): e113, 2016 07 08.
Artigo em Inglês | MEDLINE | ID: mdl-27131372

RESUMO

Many genomes display high levels of heterozygosity (i.e. presence of different alleles at the same loci in homologous chromosomes), being those of hybrid organisms an extreme such case. The assembly of highly heterozygous genomes from short sequencing reads is a challenging task because it is difficult to accurately recover the different haplotypes. When confronted with highly heterozygous genomes, the standard assembly process tends to collapse homozygous regions and reports heterozygous regions in alternative contigs. The boundaries between homozygous and heterozygous regions result in multiple assembly paths that are hard to resolve, which leads to highly fragmented assemblies with a total size larger than expected. This, in turn, causes numerous problems in downstream analyses such as fragmented gene models, wrong gene copy number, or broken synteny. To circumvent these caveats we have developed a pipeline that specifically deals with the assembly of heterozygous genomes by introducing a step to recognise and selectively remove alternative heterozygous contigs. We tested our pipeline on simulated and naturally-occurring heterozygous genomes and compared its accuracy to other existing tools. Our method is freely available at https://github.com/Gabaldonlab/redundans.


Assuntos
Genoma , Heterozigoto , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Algoritmos , Genômica/métodos , Homozigoto , Software , Sintenia/genética
8.
PLoS Genet ; 11(10): e1005626, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26517373

RESUMO

Candida metapsilosis is a rarely-isolated, opportunistic pathogen that belongs to a clade of pathogenic yeasts known as the C. parapsilosis sensu lato species complex. To gain insight into the recent evolution of C. metapsilosis and the genetic basis of its virulence, we sequenced the genome of 11 clinical isolates from various locations, which we compared to each other and to the available genomes of the two remaining members of the complex: C. orthopsilosis and C. parapsilosis. Unexpectedly, we found compelling genomic evidence that C. metapsilosis is a highly heterozygous hybrid species, with all sequenced clinical strains resulting from the same past hybridization event involving two parental lineages that were approximately 4.5% divergent in sequence. This result indicates that the parental species are non-pathogenic, but that hybridization between them formed a new opportunistic pathogen, C. metapsilosis, that has achieved a worldwide distribution. We show that these hybrids are diploid and we identified strains carrying loci for both alternative mating types, which supports mating as the initial mechanism for hybrid formation. We trace the aftermath of this hybridization at the genomic level, and reconstruct the evolutionary relationships among the different strains. Recombination and introgression -resulting in loss of heterozygosis- between the two subgenomes have been rampant, and includes the partial overwriting of the MTLa mating locus in all strains. Collectively, our results shed light on the recent genomic evolution within the C. parapsilosis sensu lato complex, and argue for a re-definition of species within this clade, with at least five distinct homozygous lineages, some of which having the ability to form hybrids.


Assuntos
Candida/genética , Evolução Molecular , Infecções Oportunistas/genética , Virulência/genética , Candida/patogenicidade , Genoma , Heterozigoto , Humanos , Hibridização Genética , Infecções Oportunistas/microbiologia
9.
Nucleic Acids Res ; 42(Database issue): D897-902, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24275491

RESUMO

Phylogenetic trees representing the evolutionary relationships of homologous genes are the entry point for many evolutionary analyses. For instance, the use of a phylogenetic tree can aid in the inference of orthology and paralogy relationships, and in the detection of relevant evolutionary events such as gene family expansions and contractions, horizontal gene transfer, recombination or incomplete lineage sorting. Similarly, given the plurality of evolutionary histories among genes encoded in a given genome, there is a need for the combined analysis of genome-wide collections of phylogenetic trees (phylomes). Here, we introduce a new release of PhylomeDB (http://phylomedb.org), a public repository of phylomes. Currently, PhylomeDB hosts 120 public phylomes, comprising >1.5 million maximum likelihood trees and multiple sequence alignments. In the current release, phylogenetic trees are annotated with taxonomic, protein-domain arrangement, functional and evolutionary information. PhylomeDB is also a major source for phylogeny-based predictions of orthology and paralogy, covering >10 million proteins across 1059 sequenced species. Here we describe newly implemented PhylomeDB features, and discuss a benchmark of the orthology predictions provided by the database, the impact of proteome updates and the use of the phylome approach in the analysis of newly sequenced genomes and transcriptomes.


Assuntos
Bases de Dados Genéticas , Genoma , Filogenia , Perfilação da Expressão Gênica , Humanos , Internet , Proteínas/classificação , Proteínas/genética , Proteoma
10.
Appl Environ Microbiol ; 81(24): 8445-56, 2015 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-26431969

RESUMO

Comparative genomics, metagenomics, and single-cell technologies have shown that populations of microbial species encompass assemblages of closely related strains. This raises the question of whether individual bacterial lineages respond to the presence of their close relatives by modifying their gene expression or, instead, whether assemblages simply act as the arithmetic addition of their individual components. Here, we took advantage of transcriptome sequencing to address this question. For this, we analyzed the transcriptomes of two closely related strains of the extremely halophilic bacterium Salinibacter ruber grown axenically and in coculture. These organisms dominate bacterial assemblages in hypersaline environments worldwide. The strains used here cooccurred in the natural environment and are 100% identical in their 16S rRNA genes, and each strain harbors an accessory genome representing 10% of its complete genome. Overall, transcriptomic patterns from pure cultures were very similar for both strains. Expression was detected along practically the whole genome albeit with some genes at low levels. A subset of genes was very highly expressed in both strains, including genes coding for the light-driven proton pump xanthorhodopsin, genes involved in the stress response, and genes coding for transcriptional regulators. Expression differences between pure cultures affected mainly genes involved in environmental sensing. When the strains were grown in coculture, there was a modest but significant change in their individual transcription patterns compared to those in pure culture. Each strain sensed the presence of the other and responded in a specific manner, which points to fine intraspecific transcriptomic modulation.


Assuntos
Bacteroidetes/classificação , Bacteroidetes/genética , Sequenciamento de Nucleotídeos em Larga Escala , Interações Microbianas/genética , Transcriptoma/genética , Proteínas de Bactérias/genética , Bacteroidetes/metabolismo , Sequência de Bases , Técnicas de Cocultura , DNA Bacteriano/genética , Genoma Bacteriano/genética , Metagenômica , RNA Ribossômico 16S/genética , Rodopsinas Microbianas/genética , Análise de Sequência de DNA , Transcrição Gênica
11.
FEMS Yeast Res ; 15(3)2015 May.
Artigo em Inglês | MEDLINE | ID: mdl-25743787

RESUMO

The pathogenic yeast Candida albicans utilizes hydroxyderivatives of benzene via the catechol and hydroxyhydroquinone branches of the 3-oxoadipate pathway. The genetic basis and evolutionary origin of this catabolic pathway in yeasts are unknown. In this study, we identified C. albicans genes encoding the enzymes involved in the degradation of hydroxybenzenes. We found that the genes coding for core components of the 3-oxoadipate pathway are arranged into two metabolic gene clusters. Our results demonstrate that C. albicans cells cultivated in media containing hydroxybenzene substrates highly induce the transcription of these genes as well as the corresponding enzymatic activities. We also found that C. albicans cells assimilating hydroxybenzenes cope with the oxidative stress by upregulation of cellular antioxidant systems such as alternative oxidase and catalase. Moreover, we investigated the evolution of the enzymes encoded by these clusters and found that most of them share a particularly sparse phylogenetic distribution among Saccharomycotina, which is likely to have been caused by extensive gene loss. We exploited this fact to find co-evolving proteins that are suitable candidates for the missing enzymes of the pathway.


Assuntos
Adipatos/metabolismo , Candida albicans/genética , Candida albicans/metabolismo , Enzimas/genética , Redes e Vias Metabólicas/genética , Família Multigênica , Fenol/metabolismo , Antioxidantes/metabolismo , Biotransformação , Enzimas/metabolismo , Regulação Fúngica da Expressão Gênica/efeitos dos fármacos , Ordem dos Genes , Estresse Oxidativo , Filogenia , Sintenia
12.
Nat Biotechnol ; 42(1): 72-86, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-37024678

RESUMO

Transfer RNAs (tRNAs) play a central role in protein translation. Studying them has been difficult in part because a simple method to simultaneously quantify their abundance and chemical modifications is lacking. Here we introduce Nano-tRNAseq, a nanopore-based approach to sequence native tRNA populations that provides quantitative estimates of both tRNA abundances and modification dynamics in a single experiment. We show that default nanopore sequencing settings discard the vast majority of tRNA reads, leading to poor sequencing yields and biased representations of tRNA abundances based on their transcript length. Re-processing of raw nanopore current intensity signals leads to a 12-fold increase in the number of recovered tRNA reads and enables recapitulation of accurate tRNA abundances. We then apply Nano-tRNAseq to Saccharomyces cerevisiae tRNA populations, revealing crosstalks and interdependencies between different tRNA modification types within the same molecule and changes in tRNA populations in response to oxidative stress.


Assuntos
Sequenciamento por Nanoporos , Nanoporos , RNA , RNA de Transferência/química , Análise de Sequência de RNA/métodos
13.
Nucleic Acids Res ; 39(5): e32, 2011 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-21149260

RESUMO

Reliable prediction of orthology is central to comparative genomics. Approaches based on phylogenetic analyses closely resemble the original definition of orthology and paralogy and are known to be highly accurate. However, the large computational cost associated to these analyses is a limiting factor that often prevents its use at genomic scales. Recently, several projects have addressed the reconstruction of large collections of high-quality phylogenetic trees from which orthology and paralogy relationships can be inferred. This provides us with the opportunity to infer the evolutionary relationships of genes from multiple, independent, phylogenetic trees. Using such strategy, we combine phylogenetic information derived from different databases, to predict orthology and paralogy relationships for 4.1 million proteins in 829 fully sequenced genomes. We show that the number of independent sources from which a prediction is made, as well as the level of consistency across predictions, can be used as reliable confidence scores. A webserver has been developed to easily access these data (http://orthology.phylomedb.org), which provides users with a global repository of phylogeny-based orthology and paralogy predictions.


Assuntos
Genômica/métodos , Filogenia , Genoma , Proteínas/classificação , Proteínas/genética , Software
14.
Nucleic Acids Res ; 39(Database issue): D556-60, 2011 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-21075798

RESUMO

The growing availability of complete genomic sequences from diverse species has brought about the need to scale up phylogenomic analyses, including the reconstruction of large collections of phylogenetic trees. Here, we present the third version of PhylomeDB (http://phylomeDB.org), a public database for genome-wide collections of gene phylogenies (phylomes). Currently, PhylomeDB is the largest phylogenetic repository and hosts 17 phylomes, comprising 416,093 trees and 165,840 alignments. It is also a major source for phylogeny-based orthology and paralogy predictions, covering about 5 million proteins in 717 fully-sequenced genomes. For each protein-coding gene in a seed genome, the database provides original and processed alignments, phylogenetic trees derived from various methods and phylogeny-based predictions of orthology and paralogy relationships. The new version of phylomeDB has been extended with novel data access and visualization features, including the possibility of programmatic access. Available seed species include model organisms such as human, yeast, Escherichia coli or Arabidopsis thaliana, but also alternative model species such as the human pathogen Candida albicans, or the pea aphid Acyrtosiphon pisum. Finally, PhylomeDB is currently being used by several genome sequencing projects that couple the genome annotation process with the reconstruction of the corresponding phylome, a strategy that provides relevant evolutionary insights.


Assuntos
Bases de Dados Genéticas , Genômica , Filogenia , Alinhamento de Sequência , Humanos , Anotação de Sequência Molecular
15.
Microorganisms ; 9(9)2021 Aug 24.
Artigo em Inglês | MEDLINE | ID: mdl-34576693

RESUMO

DNA methylation is a common, but not universal, epigenetic modification that plays an important role in multiple cellular processes. While definitely settled for numerous plant, mammalian, and bacterial species, the genome methylation in different fungal species, including widely studied and industrially-relevant yeast species, Yarrowia lipolytica, is still a matter of debate. In this paper, we report a differential DNA methylation level in the genome of Y. lipolytica subjected to sequential subculturing and to heat stress conditions. To this end, we adopted repeated batch bioreactor cultivations of Y. lipolytica subjected to thermal stress in specific time intervals. To analyze the variation in DNA methylation between stressed and control cultures, we (a) quantified the global DNA methylation status using an immuno-assay, and (b) studied DNA methylation patterns through whole-genome sequencing. Primarily, we demonstrated that 5 mC modification can be detected using a commercial immuno-assay, and that the modifications are present in Y. lipolytica's genome at ~0.5% 5 mC frequency. On the other hand, we did not observe any changes in the epigenetic response of Y. lipolytica to heat shock (HS) treatment. Interestingly, we identified a general phenomenon of decreased 5 mC level in Y. lipolytica's genome in the stationary phase of growth, when compared to a late-exponential epigenome. While this study provides an insight into the subculturing stress response and adaptation to the stress at epigenetic level by Y. lipolytica, it also leaves an open question of inability to detect any genomic DNA methylation level (either in CpG context or context-less) through whole-genome sequencing. The results of ONT sequencing, suggesting that 5 mC modification is either rare or non-existent in Y. lipolytica genome, are contradicted with the results of the immunoassay.

16.
DNA Res ; 28(3)2021 Jun 25.
Artigo em Inglês | MEDLINE | ID: mdl-34129020

RESUMO

Candida subhashii belongs to the CUG-Ser clade, a group of phylogenetically closely related yeast species that includes some human opportunistic pathogens, such as Candida albicans. Despite being present in the environment, C. subhashii was initially described as the causative agent of a case of peritonitis. Considering the relevance of whole-genome sequencing and analysis for our understanding of genome evolution and pathogenicity, we sequenced, assembled and annotated the genome of C. subhashii type strain. Our results show that C. subhashii presents a highly heterozygous genome and other signatures that point to a hybrid ancestry. The presence of functional pathways for assimilation of hydroxyaromatic compounds goes in line with the affiliation of this yeast with soil microbial communities involved in lignin decomposition. Furthermore, we observed that different clones of this strain may present circular or linear mitochondrial DNA. Re-sequencing and comparison of strains with differential mitochondrial genome topology revealed five candidate genes potentially associated with this conformational change: MSK1, SSZ1, ALG5, MRPL9 and OYE32.


Assuntos
Candida/genética , Núcleo Celular/genética , Genoma Fúngico , Genoma Mitocondrial , Redes e Vias Metabólicas , Fenóis/metabolismo , Candida/metabolismo , Sequenciamento Completo do Genoma
17.
Nat Biotechnol ; 39(10): 1278-1291, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-33986546

RESUMO

Nanopore RNA sequencing shows promise as a method for discriminating and identifying different RNA modifications in native RNA. Expanding on the ability of nanopore sequencing to detect N6-methyladenosine, we show that other modifications, in particular pseudouridine (Ψ) and 2'-O-methylation (Nm), also result in characteristic base-calling 'error' signatures in the nanopore data. Focusing on Ψ modification sites, we detected known and uncovered previously unreported Ψ sites in mRNAs, non-coding RNAs and rRNAs, including a Pus4-dependent Ψ modification in yeast mitochondrial rRNA. To explore the dynamics of pseudouridylation, we treated yeast cells with oxidative, cold and heat stresses and detected heat-sensitive Ψ-modified sites in small nuclear RNAs, small nucleolar RNAs and mRNAs. Finally, we developed a software, nanoRMS, that estimates per-site modification stoichiometries by identifying single-molecule reads with altered current intensity and trace profiles. This work demonstrates that Nm and Ψ RNA modifications can be detected in cellular RNAs and that their modification stoichiometry can be quantified by nanopore sequencing of native RNA.


Assuntos
Sequenciamento por Nanoporos/métodos , Pseudouridina/metabolismo , RNA/metabolismo , Análise de Sequência de RNA/métodos , Algoritmos , Perfilação da Expressão Gênica , Transferases Intramoleculares/metabolismo , Mitocôndrias/genética , Pseudouridina/genética , RNA/genética , Processamento Pós-Transcricional do RNA/genética , RNA Fúngico/genética , RNA Fúngico/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , RNA Ribossômico/genética , RNA Ribossômico/metabolismo , Saccharomyces cerevisiae/genética , Software , Estresse Fisiológico/genética
18.
Front Genet ; 11: 211, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32256520

RESUMO

The direct RNA sequencing platform offered by Oxford Nanopore Technologies allows for direct measurement of RNA molecules without the need of conversion to complementary DNA, fragmentation or amplification. As such, it is virtually capable of detecting any given RNA modification present in the molecule that is being sequenced, as well as provide polyA tail length estimations at the level of individual RNA molecules. Although this technology has been publicly available since 2017, the complexity of the raw Nanopore data, together with the lack of systematic and reproducible pipelines, have greatly hindered the access of this technology to the general user. Here we address this problem by providing a fully benchmarked workflow for the analysis of direct RNA sequencing reads, termed MasterOfPores. The pipeline starts with a pre-processing module, which converts raw current intensities into multiple types of processed data including FASTQ and BAM, providing metrics of the quality of the run, quality-filtering, demultiplexing, base-calling and mapping. In a second step, the pipeline performs downstream analyses of the mapped reads, including prediction of RNA modifications and estimation of polyA tail lengths. Four direct RNA MinION sequencing runs can be fully processed and analyzed in 10 h on 100 CPUs. The pipeline can also be executed in GPU locally or in the cloud, decreasing the run time fourfold. The software is written using the NextFlow framework for parallelization and portability, and relies on Linux containers such as Docker and Singularity for achieving better reproducibility. The MasterOfPores workflow can be executed on any Unix-compatible OS on a computer, cluster or cloud without the need of installing any additional software or dependencies, and is freely available in Github (https://github.com/biocorecrg/master_of_pores). This workflow simplifies direct RNA sequencing data analyses, facilitating the study of the (epi)transcriptome at single molecule resolution.

19.
Front Genet ; 10: 137, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-30881382

RESUMO

The pre-whole genome duplication (WGD) Zygosaccharomyces clade comprises several allodiploid strain/species with industrially interesting traits. The salt-tolerant yeast ATCC42981 is a sterile and allodiploid strain which contains two subgenomes, one of them resembling the haploid parental species Z. rouxii. Recently, different mating-type-like (MTL) loci repertoires were reported for ATCC42981 and the Japanese strain JCM22060, which are considered two stocks of the same strain. MTL reconstruction by direct sequencing approach is challenging due to gene redundancy, structure complexities, and allodiploid nature of ATCC42981. Here, DBG2OLC and MaSuRCA hybrid de novo assemblies of ONT and Illumina reads were combined with in vitro long PCR to definitively solve these incongruences. ATCC42981 exhibits several chimeric MTL loci resulting from reciprocal translocation between parental haplotypes and retains two MATa/MATα expression loci, in contrast to MATα in JCM22060. Consistently to these reconstructions, JCM22060, but not ATCC42981, undergoes mating and meiosis. To ascertain whether the damage of one allele at the MAT locus regains the complete sexual cycle in ATCC42981, we removed the MATα expressed locus by gene deletion. The resulting MATa/- hemizygous mutants did not show any evidence of sporulation, as well as of self- and out-crossing fertility, probably because incomplete silencing at the chimeric HMLα cassette masks the loss of heterozygosity at the MAT locus. We also found that MATα deletion switched off a2 transcription, an activator of a-specific genes in pre-WGD species. These findings suggest that regulatory scheme of cell identity needs to be further investigated in Z. rouxii protoploid yeast.

20.
Microbiol Resour Announc ; 8(50)2019 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-31831616

RESUMO

Chromosome-scale genome assembly of the yeast Saprochaete ingens CBS 517.90 was determined by a combination of technologies producing short (HiSeq X; Illumina) and long (MinION; Oxford Nanopore Technologies) reads. The 21.2-Mbp genome sequence has a GC content of 36.9% and codes for 6,475 predicted proteins.

SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa