Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
1.
Mol Biol Evol ; 38(10): 4647-4654, 2021 09 27.
Artículo en Inglés | MEDLINE | ID: mdl-34320186

RESUMEN

Methods for evaluating the quality of genomic and metagenomic data are essential to aid genome assembly procedures and to correctly interpret the results of subsequent analyses. BUSCO estimates the completeness and redundancy of processed genomic data based on universal single-copy orthologs. Here, we present new functionalities and major improvements of the BUSCO software, as well as the renewal and expansion of the underlying data sets in sync with the OrthoDB v10 release. Among the major novelties, BUSCO now enables phylogenetic placement of the input sequence to automatically select the most appropriate BUSCO data set for the assessment, allowing the analysis of metagenome-assembled genomes of unknown origin. A newly introduced genome workflow increases the efficiency and runtimes especially on large eukaryotic genomes. BUSCO is the only tool capable of assessing both eukaryotic and prokaryotic species, and can be applied to various data types, from genome assemblies and metagenomic bins, to transcriptomes and gene sets.


Asunto(s)
Eucariontes , Genómica , Eucariontes/genética , Genoma Viral , Genómica/métodos , Filogenia , Flujo de Trabajo
2.
Nucleic Acids Res ; 47(D1): D807-D811, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30395283

RESUMEN

OrthoDB (https://www.orthodb.org) provides evolutionary and functional annotations of orthologs. This update features a major scaling up of the resource coverage, sampling the genomic diversity of 1271 eukaryotes, 6013 prokaryotes and 6488 viruses. These include putative orthologs among 448 metazoan, 117 plant, 549 fungal, 148 protist, 5609 bacterial, and 404 archaeal genomes, picking up the best sequenced and annotated representatives for each species or operational taxonomic unit. OrthoDB relies on a concept of hierarchy of levels-of-orthology to enable more finely resolved gene orthologies for more closely related species. Since orthologs are the most likely candidates to retain functions of their ancestor gene, OrthoDB is aimed at narrowing down hypotheses about gene functions and enabling comparative evolutionary studies. Optional registered-user sessions allow on-line BUSCO assessments of gene set completeness and mapping of the uploaded data to OrthoDB to enable further interactive exploration of related annotations and generation of comparative charts. The accelerating expansion of genomics data continues to add valuable information, and OrthoDB strives to provide orthologs from the broadest coverage of species, as well as to extensively collate available functional annotations and to compute evolutionary annotations. The data can be browsed online, downloaded or assessed via REST API or SPARQL RDF compatible with both UniProt and Ensembl.


Asunto(s)
Bases de Datos Genéticas , Evolución Molecular , Genómica/tendencias , Anotación de Secuencia Molecular , Animales , Eucariontes/genética , Variación Genética , Genoma Bacteriano/genética , Genoma Fúngico/genética , Genoma de Planta/genética , Genoma Viral/genética , Filogenia , Programas Informáticos
3.
BMC Biol ; 18(1): 142, 2020 10 19.
Artículo en Inglés | MEDLINE | ID: mdl-33070780

RESUMEN

BACKGROUND: The western flower thrips, Frankliniella occidentalis (Pergande), is a globally invasive pest and plant virus vector on a wide array of food, fiber, and ornamental crops. The underlying genetic mechanisms of the processes governing thrips pest and vector biology, feeding behaviors, ecology, and insecticide resistance are largely unknown. To address this gap, we present the F. occidentalis draft genome assembly and official gene set. RESULTS: We report on the first genome sequence for any member of the insect order Thysanoptera. Benchmarking Universal Single-Copy Ortholog (BUSCO) assessments of the genome assembly (size = 415.8 Mb, scaffold N50 = 948.9 kb) revealed a relatively complete and well-annotated assembly in comparison to other insect genomes. The genome is unusually GC-rich (50%) compared to other insect genomes to date. The official gene set (OGS v1.0) contains 16,859 genes, of which ~ 10% were manually verified and corrected by our consortium. We focused on manual annotation, phylogenetic, and expression evidence analyses for gene sets centered on primary themes in the life histories and activities of plant-colonizing insects. Highlights include the following: (1) divergent clades and large expansions in genes associated with environmental sensing (chemosensory receptors) and detoxification (CYP4, CYP6, and CCE enzymes) of substances encountered in agricultural environments; (2) a comprehensive set of salivary gland genes supported by enriched expression; (3) apparent absence of members of the IMD innate immune defense pathway; and (4) developmental- and sex-specific expression analyses of genes associated with progression from larvae to adulthood through neometaboly, a distinct form of maturation differing from either incomplete or complete metamorphosis in the Insecta. CONCLUSIONS: Analysis of the F. occidentalis genome offers insights into the polyphagous behavior of this insect pest that finds, colonizes, and survives on a widely diverse array of plants. The genomic resources presented here enable a more complete analysis of insect evolution and biology, providing a missing taxon for contemporary insect genomics-based analyses. Our study also offers a genomic benchmark for molecular and evolutionary investigations of other Thysanoptera species.


Asunto(s)
Genoma de los Insectos , Rasgos de la Historia de Vida , Thysanoptera/fisiología , Transcriptoma , Animales , Productos Agrícolas , Conducta Alimentaria , Cadena Alimentaria , Inmunidad Innata/genética , Percepción , Filogenia , Reproducción/genética , Thysanoptera/genética , Thysanoptera/inmunología
5.
Mol Biol Evol ; 35(3): 543-548, 2018 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-29220515

RESUMEN

Genomics promises comprehensive surveying of genomes and metagenomes, but rapidly changing technologies and expanding data volumes make evaluation of completeness a challenging task. Technical sequencing quality metrics can be complemented by quantifying completeness of genomic data sets in terms of the expected gene content of Benchmarking Universal Single-Copy Orthologs (BUSCO, http://busco.ezlab.org). The latest software release implements a complete refactoring of the code to make it more flexible and extendable to facilitate high-throughput assessments. The original six lineage assessment data sets have been updated with improved species sampling, 34 new subsets have been built for vertebrates, arthropods, fungi, and prokaryotes that greatly enhance resolution, and data sets are now also available for nematodes, protists, and plants. Here, we present BUSCO v3 with example analyses that highlight the wide-ranging utility of BUSCO assessments, which extend beyond quality control of genomics data sets to applications in comparative genomics analyses, gene predictor training, metagenomics, and phylogenomics.

6.
Nucleic Acids Res ; 45(D1): D744-D749, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27899580

RESUMEN

OrthoDB is a comprehensive catalog of orthologs, genes inherited by extant species from a single gene in their last common ancestor. In 2016 OrthoDB reached its 9th release, growing to over 22 million genes from over 5000 species, now adding plants, archaea and viruses. In this update we focused on usability of this fast-growing wealth of data: updating the user and programmatic interfaces to browse and query the data, and further enhancing the already extensive integration of available gene functional annotations. Collating functional annotations from over 100 resources, and enabled us to propose descriptive titles for 87% of ortholog groups. Additionally, OrthoDB continues to provide computed evolutionary annotations and to allow user queries by sequence homology. The OrthoDB resource now enables users to generate publication-quality comparative genomics charts, as well as to upload, analyze and interactively explore their own private data. OrthoDB is available from http://orthodb.org.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Evolución Molecular , Genómica/métodos , Algoritmos , Animales , Archaea/genética , Bacterias/genética , Hongos/genética , Anotación de Secuencia Molecular , Plantas/genética , Programas Informáticos , Interfaz Usuario-Computador , Virus/genética , Navegador Web
7.
Nucleic Acids Res ; 43(Database issue): D250-6, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25428351

RESUMEN

Orthology, refining the concept of homology, is the cornerstone of evolutionary comparative studies. With the ever-increasing availability of genomic data, inference of orthology has become instrumental for generating hypotheses about gene functions crucial to many studies. This update of the OrthoDB hierarchical catalog of orthologs (http://www.orthodb.org) covers 3027 complete genomes, including the most comprehensive set of 87 arthropods, 61 vertebrates, 227 fungi and 2627 bacteria (sampling the most complete and representative genomes from over 11,000 available). In addition to the most extensive integration of functional annotations from UniProt, InterPro, GO, OMIM, model organism phenotypes and COG functional categories, OrthoDB uniquely provides evolutionary annotations including rates of ortholog sequence divergence, copy-number profiles, sibling groups and gene architectures. We re-designed the entirety of the OrthoDB website from the underlying technology to the user interface, enabling the user to specify species of interest and to select the relevant orthology level by the NCBI taxonomy. The text searches allow use of complex logic with various identifiers of genes, proteins, domains, ontologies or annotation keywords and phrases. Gene copy-number profiles can also be queried. This release comes with the freely available underlying ortholog clustering pipeline (http://www.orthodb.org/software).


Asunto(s)
Bases de Datos Genéticas , Homología de Secuencia , Algoritmos , Animales , Curaduría de Datos , Eucariontes/genética , Evolución Molecular , Genoma Microbiano , Humanos , Programas Informáticos
8.
Bioinformatics ; 31(19): 3210-2, 2015 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-26059717

RESUMEN

MOTIVATION: Genomics has revolutionized biological research, but quality assessment of the resulting assembled sequences is complicated and remains mostly limited to technical measures like N50. RESULTS: We propose a measure for quantitative assessment of genome assembly and annotation completeness based on evolutionarily informed expectations of gene content. We implemented the assessment procedure in open-source software, with sets of Benchmarking Universal Single-Copy Orthologs, named BUSCO. AVAILABILITY AND IMPLEMENTATION: Software implemented in Python and datasets available for download from http://busco.ezlab.org. CONTACT: evgeny.zdobnov@unige.ch SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Biología Computacional/métodos , Dosificación de Gen/genética , Genoma , Genómica/métodos , Anotación de Secuencia Molecular/métodos , Programas Informáticos , Animales , Humanos
9.
Braz J Microbiol ; 55(2): 1415-1425, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38619733

RESUMEN

Pseudomonas aeruginosa is the main pathogen associated with pulmonary exacerbation in patients with cystic fibrosis (CF). CF is a multisystemic genetic disease caused by mutations in the cystic fibrosis transmembrane conductance regulator gene, which mainly affects pulmonary function. P. aeruginosa isolated from individuals with CF in Brazil is not commonly associated with multidrug resistance (MDR), especially when compared to global occurrence, where the presence of epidemic clones, capable of expressing resistance to several drugs, is often reported. Due to the recent observations of MDR isolates of P. aeruginosa in our centers, combined with these characteristics, whole-genome sequencing was employed for analyses related to antimicrobial resistance, plasmid identification, search for phages, and characterization of CF clones. All isolates in this study were polymyxin B resistant, exhibiting diverse mutations and reduced susceptibility to carbapenems. Alterations in mexZ can result in the overexpression of the MexXY efflux pump. Mutations in oprD, pmrB, parS, gyrA and parC may confer reduced susceptibility to antimicrobials by affecting permeability, as observed in phenotypic tests. The phage findings led to the assumption of horizontal genetic transfer, implicating dissemination between P. aeruginosa isolates. New sequence types were described, and none of the isolates showed an association with epidemic CF clones. Analysis of the genetic context of P. aeruginosa resistance to polymyxin B allowed us to understand the different mechanisms of resistance to antimicrobials, in addition to subsidizing the understanding of possible relationships with epidemic strains that circulate among individuals with CF observed in other countries.


Asunto(s)
Antibacterianos , Fibrosis Quística , Pruebas de Sensibilidad Microbiana , Polimixina B , Infecciones por Pseudomonas , Pseudomonas aeruginosa , Fibrosis Quística/microbiología , Fibrosis Quística/complicaciones , Humanos , Polimixina B/farmacología , Pseudomonas aeruginosa/genética , Pseudomonas aeruginosa/efectos de los fármacos , Pseudomonas aeruginosa/aislamiento & purificación , Pseudomonas aeruginosa/virología , Infecciones por Pseudomonas/microbiología , Antibacterianos/farmacología , Mutación , Farmacorresistencia Bacteriana/genética , Brasil , Proteínas Bacterianas/genética , Secuenciación Completa del Genoma , Farmacorresistencia Bacteriana Múltiple/genética
10.
Genome Biol Evol ; 12(1): 3534-3549, 2020 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-31778187

RESUMEN

The dipluran two-pronged bristletail Campodea augens is a blind ancestrally wingless hexapod with the remarkable capacity to regenerate lost body appendages such as its long antennae. As sister group to Insecta (sensu stricto), Diplura are key to understanding the early evolution of hexapods and the origin and evolution of insects. Here we report the 1.2-Gb draft genome of C. augens and results from comparative genomic analyses with other arthropods. In C. augens, we uncovered the largest chemosensory gene repertoire of ionotropic receptors in the animal kingdom, a massive expansion that might compensate for the loss of vision. We found a paucity of photoreceptor genes mirroring at the genomic level the secondary loss of an ancestral external photoreceptor organ. Expansions of detoxification and carbohydrate metabolism gene families might reflect adaptations for foraging behavior, and duplicated apoptotic genes might underlie its high regenerative potential. The C. augens genome represents one of the key references for studying the emergence of genomic innovations in insects, the most diverse animal group, and opens up novel opportunities to study the under-explored biology of diplurans.


Asunto(s)
Artrópodos/genética , Evolución Molecular , Genoma , Animales , Apoptosis/genética , Artrópodos/clasificación , Metabolismo de los Hidratos de Carbono/genética , Insectos/genética , Familia de Multigenes , Filogenia , Dominios Proteicos , Virus ARN/genética , Receptores de Superficie Celular/genética , Visión Ocular/genética , Xenobióticos/metabolismo
11.
Methods Mol Biol ; 1858: 59-74, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-30414111

RESUMEN

The increasing affordability of sequencing technologies offers many new and exciting opportunities to address a diverse array of biological questions. This is evidenced in entomological research by numerous genomics and transcriptomics studies that attempt to decipher the often complex relationships among different species or orders and to build "omics" resources to drive advancement of the molecular understanding of insect biology. Being able to gauge the quality of the sequencing data is of critical importance to understanding the potential limitations on the types of questions that these data can be reliably used to address. This chapter details the use of the Benchmarking Universal Single-Copy Orthologue (BUSCO) assessment tool to estimate the completeness of transcriptomes, genome assemblies, and annotated gene sets in terms of their expected gene content.


Asunto(s)
Biología Computacional/métodos , Genoma de los Insectos , Insectos/genética , Anotación de Secuencia Molecular/métodos , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Transcriptoma , Animales , Benchmarking , Dosificación de Gen , Secuenciación de Nucleótidos de Alto Rendimiento/métodos
12.
Genome Biol Evol ; 9(2): 415-430, 2017 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-28137743

RESUMEN

Insects comprise the most diverse and successful animal group with over one million described species that are found in almost every terrestrial and limnic habitat, with many being used as important models in genetics, ecology, and evolutionary research. Genome sequencing projects have greatly expanded the sampling of species from many insect orders, but genomic resources for species of certain insect lineages have remained relatively limited to date. To address this paucity, we sequenced the genome of the banded demoiselle, Calopteryx splendens, a damselfly (Odonata: Zygoptera) belonging to Palaeoptera, the clade containing the first winged insects. The 1.6 Gbp C. splendens draft genome assembly is one of the largest insect genomes sequenced to date and encodes a predicted set of 22,523 protein-coding genes. Comparative genomic analyses with other sequenced insects identified a relatively small repertoire of C. splendens detoxification genes, which could explain its previously noted sensitivity to habitat pollution. Intriguingly, this repertoire includes a cytochrome P450 gene not previously described in any insect genome. The C. splendens immune gene repertoire appears relatively complete and features several genes encoding novel multi-domain peptidoglycan recognition proteins. Analysis of chemosensory genes revealed the presence of both gustatory and ionotropic receptors, as well as the insect odorant receptor coreceptor gene (OrCo) and at least four partner odorant receptors (ORs). This represents the oldest known instance of a complete OrCo/OR system in insects, and provides the molecular underpinning for odonate olfaction. The C. splendens genome improves the sampling of insect lineages that diverged before the radiation of Holometabola and offers new opportunities for molecular-level evolutionary, ecological, and behavioral studies.


Asunto(s)
Genoma de los Insectos , Odonata/genética , Animales , Evolución Molecular , Proteínas de Insectos/genética , Anotación de Secuencia Molecular , Odonata/clasificación , Sistemas de Lectura Abierta , Filogenia
13.
Genome Biol Evol ; 8(6): 1762-75, 2016 06 27.
Artículo en Inglés | MEDLINE | ID: mdl-26951779

RESUMEN

Metaseiulus occidentalis is an eyeless phytoseiid predatory mite employed for the biological control of agricultural pests including spider mites. Despite appearances, these predator and prey mites are separated by some 400 Myr of evolution and radically different lifestyles. We present a 152-Mb draft assembly of the M. occidentalis genome: Larger than that of its favored prey, Tetranychus urticae, but considerably smaller than those of many other chelicerates, enabling an extremely contiguous and complete assembly to be built-the best arachnid to date. Aided by transcriptome data, genome annotation cataloged 18,338 protein-coding genes and identified large numbers of Helitron transposable elements. Comparisons with other arthropods revealed a particularly dynamic and turbulent genomic evolutionary history. Its genes exhibit elevated molecular evolution, with strikingly high numbers of intron gains and losses, in stark contrast to the deer tick Ixodes scapularis Uniquely among examined arthropods, this predatory mite's Hox genes are completely atomized, dispersed across the genome, and it encodes five copies of the normally single-copy RNA processing Dicer-2 gene. Examining gene families linked to characteristic biological traits of this tiny predator provides initial insights into processes of sex determination, development, immune defense, and how it detects, disables, and digests its prey. As the first reference genome for the Phytoseiidae, and for any species with the rare sex determination system of parahaploidy, the genome of the western orchard predatory mite improves genomic sampling of chelicerates and provides invaluable new resources for functional genomic analyses of this family of agriculturally important mites.


Asunto(s)
Ácaros y Garrapatas/genética , Genes Homeobox/genética , Genómica , Animales , Genoma , Secuenciación de Nucleótidos de Alto Rendimiento , Intrones/genética , Control Biológico de Vectores , Tetranychidae/genética , Transcriptoma/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA