Results 1 - 12 of 12
1.
BMC Bioinformatics ; 23(1): 499, 2022 Nov 19.
Article in English | MEDLINE | ID: mdl-36402957

ABSTRACT

BACKGROUND: Genotyping and sequencing technologies produce increasingly large numbers of genetic markers with potentially high rates of missing or erroneous data. The construction of linkage maps is therefore increasingly complex. Moreover, the size of segregating populations remains constrained by cost and is less and less commensurate with the number of SNPs available. Thus, guaranteeing a statistically robust marker order requires that maps include only a carefully selected subset of SNPs. RESULTS: In this context, the SeSAM software allows automatic genetic map construction using seriation and placement approaches, to produce (1) a high-robustness framework map which includes as many markers as possible while keeping the order robustness beyond a given statistical threshold, and (2) a high-density total map including the framework plus almost all polymorphic markers. During this process, care is taken to limit the impact of genotyping errors and of missing data on mapping quality. SeSAM can be used with a wide range of biparental populations, including those from outcrossing species, for which phases are inferred on the fly by maximum likelihood during map elongation. The package also includes functions to simulate data sets, convert data formats, detect putative genotyping errors, visualize data and map quality (including graphical genotypes), and merge several maps into a consensus. SeSAM is also suitable for interactive map construction, as it provides lower-level functions for 2-point and multipoint EM analyses. The software is implemented as an R package that includes functions written in C++. CONCLUSIONS: SeSAM is a fully automatic linkage mapping software designed to (1) produce a framework map as robust as desired by optimizing the selection of a subset of markers, and (2) produce a high-density map including almost all polymorphic markers. The software can be used with a wide range of biparental mapping populations, including those from outcrossing species.
SeSAM is freely available under a GNU GPL v3 license and works on Linux, Windows, and macOS platforms. It can be downloaded together with its user manual and quick-start tutorial from ForgeMIA (SeSAM project) at https://forgemia.inra.fr/gqe-acep/sesam/-/releases.
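
The 2-point analysis mentioned in the abstract above can be illustrated with a minimal sketch for the simplest possible case: a backcross with known phase and no missing data. This is not SeSAM code and does not reflect its API; the function names `two_point_lod` and `haldane_cm` and the example counts are assumptions made for illustration only.

```python
import math


def two_point_lod(n_recombinant, n_total):
    """Return (r_hat, LOD) for a marker pair in a backcross with known phase.

    Illustrative assumption: every individual is scored for both markers.
    """
    if n_total <= 0 or not 0 <= n_recombinant <= n_total:
        raise ValueError("need 0 <= n_recombinant <= n_total and n_total > 0")

    # Maximum-likelihood estimate of the recombination fraction, capped at 0.5
    r_hat = min(n_recombinant / n_total, 0.5)
    n_parental = n_total - n_recombinant

    def log10_likelihood(r):
        # Binomial log-likelihood; a class with zero individuals contributes nothing
        value = 0.0
        if n_recombinant:
            value += n_recombinant * math.log10(r)
        if n_parental:
            value += n_parental * math.log10(1.0 - r)
        return value

    # LOD compares the fitted recombination fraction against free recombination
    lod = log10_likelihood(r_hat) - log10_likelihood(0.5)
    return r_hat, lod


def haldane_cm(r):
    """Convert a recombination fraction below 0.5 to a Haldane distance in cM."""
    return -50.0 * math.log(1.0 - 2.0 * r)


if __name__ == "__main__":
    r, lod = two_point_lod(10, 100)  # hypothetical counts
    print(round(r, 3), round(lod, 2), round(haldane_cm(r), 2))
```

Outcrossing populations, unknown phases, genotyping errors and missing data, which the abstract says SeSAM handles, all require more elaborate likelihoods than this closed-form case.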


Subject(s)
Single Nucleotide Polymorphism , Software , Chromosome Mapping , Genetic Markers , Genotype
2.
BMC Genomics ; 23(1): 618, 2022 Aug 25.
Article in English | MEDLINE | ID: mdl-36008774

ABSTRACT

BACKGROUND: Vagococcus fluvialis is a species of lactic acid bacteria found both free-living in river and seawater and associated with hosts, such as marine sponges. This species has been greatly understudied, and no complete genome assembly, which is essential for characterising the mobilome, has been available to date. RESULTS: We sequenced and assembled de novo the complete genome sequences of five V. fluvialis isolates recovered from marine sponges. Pangenome analysis of the V. fluvialis species (total of 17 genomes) showed a high intraspecific diversity, with 45.5% of orthologous genes found to be strain specific. Despite this diversity, analyses of gene functions clustered all V. fluvialis strains together and separated them from other sequenced Vagococcus species. V. fluvialis strains from different habitats were highly similar in terms of functional diversity, but the sponge-isolated strains were enriched in several functions related to the marine environment. Furthermore, sponge-isolated strains carried a significantly higher number of mobile genetic elements (MGEs) than previously sequenced V. fluvialis strains from other environments. Sponge-isolated strains carried up to 4 circular plasmids each, including a 48-kb conjugative plasmid. Three of the five strains carried an additional circular extrachromosomal sequence, assumed to be an excised prophage as it contained mainly viral genes and lacked plasmid replication genes. Insertion sequences (ISs) were up to five times more abundant in the genomes of sponge-isolated strains than in the others, and included several IS families found exclusively in these genomes. CONCLUSIONS: Our findings highlight the dynamics and plasticity of the V. fluvialis genome. The abundance of mobile genetic elements in the genomes of sponge-isolated V. fluvialis strains suggests that the mobilome might be key to understanding the genomic signatures of symbiosis in bacteria.
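
A figure such as the share of strain-specific orthologous genes reported above is typically derived from a presence/absence table of orthologous groups. The following is a minimal sketch under that assumption; the table layout, the function names and the toy data are hypothetical and are not taken from the study.

```python
def strain_specific_fraction(presence):
    """Fraction of orthologous groups that occur in exactly one genome.

    `presence` maps an orthologous group ID to the set of genomes carrying it
    (an assumed, illustrative data layout).
    """
    if not presence:
        raise ValueError("presence table is empty")
    specific = sum(1 for genomes in presence.values() if len(genomes) == 1)
    return specific / len(presence)


def core_groups(presence, n_genomes):
    """Orthologous groups present in every one of the n_genomes genomes."""
    return sorted(og for og, genomes in presence.items() if len(genomes) == n_genomes)


# Toy table with three hypothetical genomes A, B and C
toy = {
    "OG1": {"A", "B", "C"},  # core
    "OG2": {"A"},            # strain specific
    "OG3": {"B", "C"},       # accessory
    "OG4": {"C"},            # strain specific
}

if __name__ == "__main__":
    print(strain_specific_fraction(toy), core_groups(toy, 3))
```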


Subject(s)
Porifera , Animals , Enterococcaceae/genetics , Interspersed Repetitive Sequences/genetics , Phylogeny , Porifera/genetics , DNA Sequence Analysis
3.
BMC Genomics ; 23(1): 192, 2022 Mar 08.
Article in English | MEDLINE | ID: mdl-35260071

ABSTRACT

BACKGROUND: The hard clam Mercenaria mercenaria is a major marine resource along the Atlantic coasts of North America and has been introduced to other continents for resource restoration or aquaculture activities. Significant mortality events have been reported in the species throughout its native range as a result of diseases (microbial infections, leukemia) and acute environmental stress. In this context, the characterization of the hard clam genome can provide much-needed resources to enable basic (e.g., oncogenesis and cancer transmission, adaptation biology) and applied (clam stock enhancement, genomic selection) sciences. RESULTS: Using a combination of long and short-read sequencing technologies, a 1.86 Gb chromosome-level assembly of the clam genome was generated. The assembly was scaffolded into 19 chromosomes, with an N50 of 83 Mb. Genome annotation yielded 34,728 predicted protein-coding genes, markedly more than the few other members of the Venerida sequenced so far, with coding regions representing only 2% of the assembly. Indeed, more than half of the genome is composed of repeated elements, including transposable elements. Major chromosome rearrangements were detected between this assembly and another recent assembly derived from a genetically segregated clam stock. Comparative analysis of the clam genome allowed the identification of a marked diversification in immune-related proteins, particularly extensive tandem duplications and expansions in tumor necrosis factors (TNFs) and C1q domain-containing proteins, some of which were previously shown to play a role in clam interactions with infectious microbes. The study also generated a comparative repertoire highlighting the diversity and, in some instances, the specificity of LTR-retrotransposon elements, particularly Steamer elements in bivalves. CONCLUSIONS: The diversity of immune molecules in M. mercenaria may allow this species to cope with varying and complex microbial and environmental landscapes. The repertoire of transposable elements identified in this study, particularly Steamer elements, should be a prime target for the investigation of cancer cell development and transmission among bivalve mollusks.
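
The N50 statistic quoted in the abstract above is the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly. A minimal sketch of that definition follows; the function name `n50` and the toy lengths are illustrative and not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold or contig lengths."""
    if not lengths:
        raise ValueError("no sequence lengths given")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down until half the assembly is covered
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


if __name__ == "__main__":
    print(n50([2, 3, 4, 5, 6]))  # hypothetical lengths
```

Because N50 depends only on the length distribution, it measures contiguity and says nothing about whether the sequences are correctly joined.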


Subject(s)
Mercenaria , Animals , Chromosomes , DNA Transposable Elements/genetics , Mercenaria/genetics , North America , Retroelements
4.
BMC Bioinformatics ; 22(1): 303, 2021 Jun 05.
Article in English | MEDLINE | ID: mdl-34090340

ABSTRACT

BACKGROUND: Long-read sequencing is revolutionizing genome assembly: as PacBio and Nanopore technologies become more accessible both technically and financially, long-read assemblers flourish and are starting to deliver chromosome-level assemblies. However, these long reads are usually error-prone, making the generation of a haploid reference out of a diploid genome a difficult enterprise. Failure to properly collapse haplotypes results in fragmented and structurally incorrect assemblies and wreaks havoc on orthology inference pipelines, yet this serious issue is rarely acknowledged and dealt with in genomic projects, and an independent, comparative benchmark of the capacity of assemblers and post-processing tools to properly collapse or purge haplotypes is still lacking. RESULTS: We tested different assembly strategies on the genome of the rotifer Adineta vaga, a non-model organism for which high coverages of both PacBio and Nanopore reads were available. The assemblers we tested (Canu, Flye, NextDenovo, Ra, Raven, Shasta and wtdbg2) exhibited strikingly different behaviors when dealing with highly heterozygous regions, resulting in variable amounts of uncollapsed haplotypes. Filtering reads generally improved haploid assemblies, and we also benchmarked three post-processing tools aimed at detecting and purging uncollapsed haplotypes in long-read assemblies: HaploMerger2, purge_haplotigs and purge_dups. CONCLUSIONS: We provide a thorough evaluation of popular assemblers on a non-model eukaryote genome with variable levels of heterozygosity. Our study highlights several strategies using pre- and post-processing approaches to generate haploid assemblies with high continuity and completeness. This benchmark will help users to improve haploid assemblies of non-model organisms and to evaluate the quality of their own assemblies.
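
One signal commonly used to spot uncollapsed haplotypes is read depth: when both haplotypes of a region are assembled separately, each copy receives roughly half the depth of a properly collapsed region. The sketch below illustrates only that idea and is not the algorithm of any of the tools named above. It assumes that most contigs are correctly collapsed, so that the median depth represents the collapsed level; the function name, the 25% tolerance and the depth values are hypothetical.

```python
from statistics import median


def flag_half_depth_contigs(depth_by_contig, tolerance=0.25):
    """Return contigs whose mean read depth is close to half the median depth.

    Assumption: the median over all contigs reflects correctly collapsed
    regions. `tolerance` is an arbitrary illustrative threshold.
    """
    if not depth_by_contig:
        raise ValueError("no contig depths given")
    half = median(depth_by_contig.values()) / 2.0
    return sorted(
        name
        for name, depth in depth_by_contig.items()
        if abs(depth - half) <= tolerance * half
    )


if __name__ == "__main__":
    # Hypothetical mean depths per contig
    depths = {"c1": 100, "c2": 98, "c3": 52, "c4": 49, "c5": 103}
    print(flag_half_depth_contigs(depths))
```

Depth alone cannot tell a haplotig from any other half-depth sequence, so a flagged contig is only a candidate that still needs to be checked for similarity to another contig before it is purged.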


Subject(s)
Genomics , High-Throughput Nucleotide Sequencing , Genome , Haplotypes , DNA Sequence Analysis
5.
PLoS Comput Biol ; 14(3): e1005992, 2018 Mar.
Article in English | MEDLINE | ID: mdl-29543809

ABSTRACT

We present a new educational initiative called Meet-U that aims to train students for collaborative work in computational biology and to bridge the gap between education and research. Meet-U mimics the setup of collaborative research projects and takes advantage of the most popular tools for collaborative work and of cloud computing. Students are grouped in teams of 4-5 people and have to carry out a project from start to finish that answers a challenging question in biology. Meet-U promotes "coopetition," as the students collaborate within and across the teams and also compete with each other to develop the best final product. Meet-U fosters interactions between different actors of education and research through the organization of a meeting day, open to everyone, where the students present their work to a jury of researchers and jury members give research seminars. This unique combination of education and research is strongly motivating for the students and provides a formidable opportunity for a scientific community to unite and increase its visibility. We report on our experience with Meet-U in two French universities with master's students in bioinformatics and modeling, with protein-protein docking as the subject of the course. Meet-U is easy to implement and can be straightforwardly transferred to other fields and/or universities. All the information and data are available at www.meet-u.org.


Subject(s)
Computational Biology/education , Computational Biology/methods , Research/education , Humans , Research Design , Students , Universities
6.
Front Genet ; 15: 1308527, 2024.
Article in English | MEDLINE | ID: mdl-38384712

ABSTRACT

Compared to genomes obtained with short-read technologies, high-quality genomes obtained using long-read data not only allow a better understanding of heterozygosity levels and repeat content and more accurate gene annotation and prediction, but also make it possible to study haplotype divergence. Advances in long-read sequencing technologies in recent years have made it possible to produce such high-quality assemblies for non-model organisms. This allows us to revisit genomes that were problematic to scaffold to chromosome scale with previous generations of data and assembly software. Nematoda, one of the most diverse and speciose animal phyla within metazoans, remains poorly studied, and many previously assembled genomes are fragmented. Using long reads obtained with Nanopore R10.4.1 and PacBio HiFi, we generated highly contiguous assemblies of a diploid nematode of the Mermithidae family, for which no closely related genomes are available to date, as well as a collapsed assembly and a phased assembly for a triploid nematode from the Panagrolaimidae family. Both genomes had been analysed before, but the fragmented assemblies had scaffold sizes comparable to the length of long reads prior to assembly. Our new assemblies illustrate how long-read technologies allow for a much better representation of species genomes. We are now able to conduct more accurate downstream assays based on more complete gene and transposable element predictions.

7.
R Soc Open Sci ; 10(6): 230423, 2023 Jun.
Article in English | MEDLINE | ID: mdl-37351491

ABSTRACT

Well-annotated and contiguous genomes are an indispensable resource for understanding the evolution, development, and metabolic capacities of organisms. Sponges, an ecologically important non-bilaterian group of primarily filter-feeding sessile aquatic organisms, are underrepresented with respect to available genomic resources. Here we provide a high-quality and well-annotated genome of Aphrocallistes vastus, a glass sponge (Porifera: Hexactinellida) that forms large reef structures off the coast of British Columbia (Canada). We show that its genome is approximately 80 Mb, small compared to most other metazoans, and contains nearly 2500 nested genes, more than other genomes. Hexactinellida is characterized by a unique skeletal architecture made of amorphous silicon dioxide (SiO2), and we identified 419 differentially expressed genes between the osculum, i.e. the vertical growth zone of the sponge, and the main body. Among the upregulated ones, mineralization-related genes such as glassin, as well as collagens and actins, dominate the expression profile during growth. Silicateins, which have been suggested to be involved in silica mineralization, especially in demosponges, were not found at all in the A. vastus genome, which suggests that the underlying mechanisms of SiO2 deposition in the Silicea sensu stricto (Hexactinellida + Demospongiae) may not be homologous.

8.
Sci Rep ; 12(1): 14226, 2022 Aug 20.
Article in English | MEDLINE | ID: mdl-35987814

ABSTRACT

Stylommatophoran pulmonate land slugs and snails successfully completed the water-to-land transition from an aquatic ancestor and flourished on land. Of the 30,000 estimated species, very few genomes have so far been published. Here, we assembled and characterized a chromosome-level genome of the "Spanish" slug, Arion vulgaris Moquin-Tandon, 1855, a notorious pest land slug in Europe. Using this reference genome, we conclude that a whole-genome duplication event occurred approximately 93-109 Mya at the base of Stylommatophora and might have promoted land invasion and adaptive radiation. Comparative genomic analyses reveal that genes related to the development of kidney, blood vessels, muscle, and nervous systems had expanded in the last common ancestor of land pulmonates, likely an evolutionary response to the terrestrial challenges of gravity and water loss. Analyses of A. vulgaris gene families and positively selected genes show that the slug, lacking a protective shell, has evolved a stronger ability to counteract the greater threats of external damage, radiation, and water loss. Furthermore, a recent burst of long interspersed elements in the genome of A. vulgaris might affect gene regulation and contribute to rapid phenotype changes, which might be conducive to its rapid adaptation and invasiveness.


Subject(s)
Gastropoda , Animals , Europe , Gastropoda/genetics , Snails/genetics , Water
9.
Evol Appl ; 15(11): 1730-1748, 2022 Nov.
Article in English | MEDLINE | ID: mdl-36426129

ABSTRACT

The European flat oyster (Ostrea edulis L.) is a native bivalve of the European coasts. Harvest of this species has declined during the last decades because of the appearance of two parasites that have led to the collapse of the stocks and the loss of the natural oyster beds. O. edulis has been the subject of numerous studies in population genetics and on the detection of the parasites Bonamia ostreae and Marteilia refringens. These studies investigated immune responses to these parasites at the molecular and cellular levels. Several genetic improvement programs have been initiated, especially for parasite resistance. A European project (PERLE 2) aims to produce genetic lines of O. edulis with hardiness traits (growth, survival, resistance) for the purpose of repopulating natural oyster beds in Brittany and reviving the culture of this species on the foreshore. Within this framework, obtaining a reference genome becomes essential, as has been done recently for many bivalve species of aquaculture interest. Here, we present a chromosome-level genome assembly and annotation for the European flat oyster, generated by combining PacBio, Illumina, 10X linked, and Hi-C sequencing. The finished assembly is 887.2 Mb with a scaffold-N50 of 97.1 Mb scaffolded on the expected 10 pseudochromosomes. Annotation of the genome revealed the presence of 35,962 protein-coding genes. We analyzed in detail the transposable element (TE) diversity in the flat oyster genome, highlighted some specificities in tRNA and miRNA composition, and provided the first insight into the molecular response of O. edulis to M. refringens. This genome provides a reference for genomic studies on O. edulis to better understand its basic physiology and a useful resource for genetic breeding in support of aquaculture and natural reef restoration.

10.
Sci Adv ; 7(41): eabg4216, 2021 Oct 08.
Article in English | MEDLINE | ID: mdl-34613768

ABSTRACT

Bdelloid rotifers are notorious as a speciose ancient clade comprising only asexual lineages. Thanks to their ability to repair highly fragmented DNA, most bdelloid species also withstand complete desiccation and ionizing radiation. Producing a well-assembled reference genome is a critical step to developing an understanding of the effects of long-term asexuality and DNA breakage on genome evolution. To this end, we present the first high-quality chromosome-level genome assemblies for the bdelloid Adineta vaga, composed of six pairs of homologous (diploid) chromosomes with a footprint of paleotetraploidy. The observed large-scale losses of heterozygosity are signatures of recombination between homologous chromosomes, either during mitotic DNA double-strand break repair or when resolving programmed DNA breaks during a modified meiosis. Dynamic subtelomeric regions harbor more structural diversity (e.g., chromosome rearrangements, transposable elements, and haplotypic divergence). Our results trigger the reappraisal of potential meiotic processes in bdelloid rotifers and help unravel the factors underlying their long-term asexual evolutionary success.

11.
Genome Biol ; 21(1): 148, 2020 Jun 18.
Article in English | MEDLINE | ID: mdl-32552806

ABSTRACT

Hi-C exploits contact frequencies between pairs of loci to bridge and order contigs during genome assembly, resulting in chromosome-level assemblies. Because few robust programs are available for this type of data, we developed instaGRAAL, a complete overhaul of the GRAAL program adapted to allow efficient assembly of large genomes. instaGRAAL features a number of improvements over GRAAL, including a modular correction approach that optionally integrates independent data. We validate the program using data for two brown algae and for human, generating near-complete assemblies with minimal human intervention.


Subject(s)
Chromosomes , Genomics/methods , Seaweed/genetics , Software , Humans
12.
Nat Commun ; 11(1): 5795, 2020 Nov 16.
Article in English | MEDLINE | ID: mdl-33199682

ABSTRACT

Chromosomes of all species studied so far display a variety of higher-order organisational features, such as self-interacting domains or loops. These structures, which are often associated with biological functions, form distinct, visible patterns on genome-wide contact maps generated by chromosome conformation capture approaches such as Hi-C. Here we present Chromosight, an algorithm inspired by computer vision that can detect patterns in contact maps. Chromosight has greater sensitivity than existing methods on synthetic simulated data, while being faster and applicable to any type of genome, including those of bacteria, viruses, yeasts and mammals. Our method does not require any prior training dataset and works well with default parameters on data generated with various protocols.
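
A standard computer-vision way to find a known pattern in a matrix is template matching: slide a small kernel over the matrix and score each window by its correlation with the kernel. The sketch below shows that generic technique on a toy matrix. It is not Chromosight's implementation or API; the function names, the 3x3 kernel and the toy contact map are assumptions made for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def scan(matrix, kernel):
    """Score every kernel-sized window; keys are the window's top-left (row, col)."""
    k = len(kernel)
    flat_kernel = [value for row in kernel for value in row]
    scores = {}
    for i in range(len(matrix) - k + 1):
        for j in range(len(matrix[0]) - k + 1):
            window = [matrix[i + a][j + b] for a in range(k) for b in range(k)]
            scores[(i, j)] = pearson(window, flat_kernel)
    return scores


def best_match(matrix, kernel):
    """Return the top-left position and score of the best-correlated window."""
    scores = scan(matrix, kernel)
    position = max(scores, key=scores.get)
    return position, scores[position]


if __name__ == "__main__":
    # Hypothetical "loop-like" kernel: a bright spot with a fainter halo
    kernel = [[0, 1, 0], [1, 4, 1], [0, 1, 0]]
    contact_map = [[0] * 6 for _ in range(6)]
    for a in range(3):
        for b in range(3):
            contact_map[2 + a][1 + b] = kernel[a][b]  # plant the pattern at (2, 1)
    print(best_match(contact_map, kernel))
```

Real contact maps are large, sparse and dominated by a distance-dependent signal, so a practical detector would normalise the map first and use a faster convolution than these nested loops.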


Subject(s)
Chromosomes/genetics , Computers , Automated Pattern Recognition , Algorithms , Fungal Chromosomes/genetics , Human Chromosomes/genetics , Fungal Genome , Humans , Saccharomyces cerevisiae/genetics , Workflow