Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 25
Filter
1.
Annu Rev Genet ; 48: 561-81, 2014.
Article in English | MEDLINE | ID: mdl-25292358

ABSTRACT

Studies of the population dynamics of transposable elements (TEs) in Drosophila melanogaster indicate that consistent forces are affecting TEs independently of their modes of transposition and regulation. New sequencing technologies enable biologists to sample genomes at an unprecedented scale in order to quantify genome-wide polymorphism for annotated and novel TE insertions. In this review, we first present new insights gleaned from high-throughput data for population genomics studies of D. melanogaster. We then consider the latest population genomics models for TE evolution and present examples of functional evidence revealed by genome-wide studies of TE population dynamics in D. melanogaster. Although most of the TE insertions are deleterious or neutral, some TE insertions increase the fitness of the individual that carries them and play a role in genome adaptation.


Subject(s)
DNA Transposable Elements/genetics , Evolution, Molecular , Metagenomics , Selection, Genetic/genetics , Animals , Drosophila melanogaster/genetics
2.
PLoS Comput Biol ; 17(10): e1009321, 2021 10.
Article in English | MEDLINE | ID: mdl-34662331

ABSTRACT

In 2020, the world faced the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) pandemic that drastically altered people's lives. Since then, many countries have been forced to suspend public gatherings, leading to many conference cancellations, postponements, or reorganizations. Switching from a face-to-face to a remote conference became inevitable and the ultimate solution to sustain scientific exchanges at the national and the international levels. The same year, as a committee, we were in charge of organizing the major French annual conference that covers all computational biology areas: The "Journées Ouvertes en Biologie, Informatique et Mathématiques" (JOBIM). Despite the health crisis, we succeeded in changing the conference format from face to face to remote in a very short amount of time. Here, we propose 10 simple rules based on this experience to modify a conference format in an optimized and cost-effective way. In addition to the suggested rules, we decided to emphasize an unexpected benefit of this situation: a significant reduction in greenhouse gas (GHG) emissions related to travel for scientific conference attendance. We believe that even once the SARS-CoV-2 crisis is over, we collectively will have an opportunity to think about the way we approach such scientific events over the longer term.


Subject(s)
COVID-19 , Computational Biology , Congresses as Topic , Pandemics , SARS-CoV-2 , Videoconferencing , COVID-19/epidemiology , COVID-19/transmission , Computational Biology/organization & administration , Feasibility Studies , France , Greenhouse Gases/analysis , Humans , Interpersonal Relations , Teleworking , Travel
3.
PLoS Genet ; 15(2): e1007900, 2019 02.
Article in English | MEDLINE | ID: mdl-30753202

ABSTRACT

Most of the current knowledge on the genetic basis of adaptive evolution is based on the analysis of single nucleotide polymorphisms (SNPs). Despite increasing evidence for their causal role, the contribution of structural variants to adaptive evolution remains largely unexplored. In this work, we analyzed the population frequencies of 1,615 Transposable Element (TE) insertions annotated in the reference genome of Drosophila melanogaster, in 91 samples from 60 worldwide natural populations. We identified a set of 300 polymorphic TEs that are present at high population frequencies, and located in genomic regions with high recombination rate, where the efficiency of natural selection is high. The age and the length of these 300 TEs are consistent with relatively young and long insertions reaching high frequencies due to the action of positive selection. Besides, we identified a set of 21 fixed TEs also likely to be adaptive. Indeed, we, and others, found evidence of selection for 84 of these reference TE insertions. The analysis of the genes located nearby these 84 candidate adaptive insertions suggested that the functional response to selection is related with the GO categories of response to stimulus, behavior, and development. We further showed that a subset of the candidate adaptive TEs affects expression of nearby genes, and five of them have already been linked to an ecologically relevant phenotypic effect. Our results provide a more complete understanding of the genetic variation and the fitness-related traits relevant for adaptive evolution. Similar studies should help uncover the importance of TE-induced adaptive mutations in other species as well.


Subject(s)
Behavior, Animal/physiology , DNA Transposable Elements/genetics , Drosophila melanogaster/genetics , Gene Expression Regulation, Developmental/genetics , Genome, Insect/genetics , Mutation/genetics , Stress, Physiological/genetics , Animals , Evolution, Molecular , Gene Frequency/genetics , Polymorphism, Single Nucleotide/genetics , Selection, Genetic/genetics
4.
BMC Bioinformatics ; 22(Suppl 6): 396, 2021 Aug 06.
Article in English | MEDLINE | ID: mdl-34362304

ABSTRACT

BACKGROUND: Meiotic recombination is a vital biological process playing an essential role in genome's structural and functional dynamics. Genomes exhibit highly various recombination profiles along chromosomes associated with several chromatin states. However, eu-heterochromatin boundaries are not available nor easily provided for non-model organisms, especially for newly sequenced ones. Hence, we miss accurate local recombination rates necessary to address evolutionary questions. RESULTS: Here, we propose an automated computational tool, based on the Marey maps method, allowing to identify heterochromatin boundaries along chromosomes and estimating local recombination rates. Our method, called BREC (heterochromatin Boundaries and RECombination rate estimates) is non-genome-specific, running even on non-model genomes as long as genetic and physical maps are available. BREC is based on pure statistics and is data-driven, implying that good input data quality remains a strong requirement. Therefore, a data pre-processing module (data quality control and cleaning) is provided. Experiments show that BREC handles different markers' density and distribution issues. CONCLUSIONS: BREC's heterochromatin boundaries have been validated with cytological equivalents experimentally generated on the fruit fly Drosophila melanogaster genome, for which BREC returns congruent corresponding values. Also, BREC's recombination rates have been compared with previously reported estimates. Based on the promising results, we believe our tool has the potential to help bring data science into the service of genome biology and evolution. We introduce BREC within an R-package and a Shiny web-based user-friendly application yielding a fast, easy-to-use, and broadly accessible resource. The BREC R-package is available at the GitHub repository https://github.com/GenomeStructureOrganization .


Subject(s)
Heterochromatin , Mobile Applications , Animals , Chromosome Mapping , Drosophila melanogaster/genetics , Heterochromatin/genetics , Recombination, Genetic
5.
BMC Bioinformatics ; 22(1): 349, 2021 Jun 26.
Article in English | MEDLINE | ID: mdl-34174810

ABSTRACT

BACKGROUND: Plasmids are mobile genetic elements that often carry accessory genes, and are vectors for horizontal transfer between bacterial genomes. Plasmid detection in large genomic datasets is crucial to analyze their spread and quantify their role in bacteria adaptation and particularly in antibiotic resistance propagation. Bioinformatics methods have been developed to detect plasmids. However, they suffer from low sensitivity (i.e., most plasmids remain undetected) or low precision (i.e., these methods identify chromosomes as plasmids), and are overall not adapted to identify plasmids in whole genomes that are not fully assembled (contigs and scaffolds). RESULTS: We developed PlasForest, a homology-based random forest classifier identifying bacterial plasmid sequences in partially assembled genomes. Without knowing the taxonomical origin of the samples, PlasForest identifies contigs as plasmids or chromosomes with a F1 score of 0.950. Notably, it can detect 77.4% of plasmid contigs below 1 kb with 2.8% of false positives and 99.9% of plasmid contigs over 50 kb with 2.2% of false positives. CONCLUSIONS: PlasForest outperforms other currently available tools on genomic datasets by being both sensitive and precise. The performance of PlasForest on metagenomic assemblies are currently well below those of other k-mer-based methods, and we discuss how homology-based approaches could improve plasmid detection in such datasets.


Subject(s)
Genome, Bacterial , Genomics , Computational Biology , Metagenomics , Plasmids
6.
Bioinformatics ; 36(4): 1191-1197, 2020 02 15.
Article in English | MEDLINE | ID: mdl-31580402

ABSTRACT

MOTIVATION: Transposable elements (TEs) constitute a significant proportion of the majority of genomes sequenced to date. TEs are responsible for a considerable fraction of the genetic variation within and among species. Accurate genotyping of TEs in genomes is therefore crucial for a complete identification of the genetic differences among individuals, populations and species. RESULTS: In this work, we present a new version of T-lex, a computational pipeline that accurately genotypes and estimates the population frequencies of reference TE insertions using short-read high-throughput sequencing data. In this new version, we have re-designed the T-lex algorithm to integrate the BWA-MEM short-read aligner, which is one of the most accurate short-read mappers and can be launched on longer short-reads (e.g. reads >150 bp). We have added new filtering steps to increase the accuracy of the genotyping, and new parameters that allow the user to control both the minimum and maximum number of reads, and the minimum number of strains to genotype a TE insertion. We also showed for the first time that T-lex3 provides accurate TE calls in a plant genome. AVAILABILITY AND IMPLEMENTATION: To test the accuracy of T-lex3, we called 1630 individual TE insertions in Drosophila melanogaster, 1600 individual TE insertions in humans, and 3067 individual TE insertions in the rice genome. We showed that this new version of T-lex is a broadly applicable and accurate tool for genotyping and estimating TE frequencies in organisms with different genome sizes and different TE contents. T-lex3 is available at Github: https://github.com/GonzalezLab/T-lex3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
DNA Transposable Elements , Drosophila melanogaster/genetics , Animals , Gene Frequency , Genotype , Humans , Whole Genome Sequencing
7.
Nucleic Acids Res ; 46(18): 9524-9536, 2018 10 12.
Article in English | MEDLINE | ID: mdl-30312469

ABSTRACT

Transposable elements (TEs) are parasitic DNA sequences that threaten genome integrity by replicative transposition in host gonads. The Piwi-interacting RNAs (piRNAs) pathway is assumed to maintain Drosophila genome homeostasis by downregulating transcriptional and post-transcriptional TE expression in the ovary. However, the bursts of transposition that are expected to follow transposome derepression after piRNA pathway impairment have not yet been reported. Here, we show, at a genome-wide level, that piRNA loss in the ovarian somatic cells boosts several families of the endogenous retroviral subclass of TEs, at various steps of their replication cycle, from somatic transcription to germinal genome invasion. For some of these TEs, the derepression caused by the loss of piRNAs is backed up by another small RNA pathway (siRNAs) operating in somatic tissues at the post transcriptional level. Derepressed transposition during 70 successive generations of piRNA loss exponentially increases the genomic copy number by up to 10-fold.


Subject(s)
DNA Transposable Elements/genetics , Drosophila melanogaster/genetics , Germ Cells/metabolism , Ovary/metabolism , RNA, Small Interfering/genetics , Aneuploidy , Animals , Drosophila melanogaster/cytology , Female , Gene Silencing , Genome, Insect/genetics , Germ Cells/cytology , Ovary/cytology , Signal Transduction/genetics
8.
PLoS Biol ; 14(12): e2000618, 2016 Dec.
Article in English | MEDLINE | ID: mdl-27918584

ABSTRACT

Gene copy-number variations are widespread in natural populations, but investigating their phenotypic consequences requires contemporary duplications under selection. Such duplications have been found at the ace-1 locus (encoding the organophosphate and carbamate insecticides' target) in the mosquito Anopheles gambiae (the major malaria vector); recent studies have revealed their intriguing complexity, consistent with the involvement of various numbers and types (susceptible or resistant to insecticide) of copies. We used an integrative approach, from genome to phenotype level, to investigate the influence of duplication architecture and gene-dosage on mosquito fitness. We found that both heterogeneous (i.e., one susceptible and one resistant ace-1 copy) and homogeneous (i.e., identical resistant copies) duplications segregated in field populations. The number of copies in homogeneous duplications was variable and positively correlated with acetylcholinesterase activity and resistance level. Determining the genomic structure of the duplicated region revealed that, in both types of duplication, ace-1 and 11 other genes formed tandem 203kb amplicons. We developed a diagnostic test for duplications, which showed that ace-1 was amplified in all 173 resistant mosquitoes analyzed (field-collected in several African countries), in heterogeneous or homogeneous duplications. Each type was associated with different fitness trade-offs: heterogeneous duplications conferred an intermediate phenotype (lower resistance and fitness costs), whereas homogeneous duplications tended to increase both resistance and fitness cost, in a complex manner. The type of duplication selected seemed thus to depend on the intensity and distribution of selection pressures. This versatility of trade-offs available through gene duplication highlights the importance of large mutation events in adaptation to environmental variation. This impressive adaptability could have a major impact on vector control in Africa.


Subject(s)
Anopheles/genetics , Gene Duplication , Genes, Insect , Animals , Chromosome Mapping , DNA Copy Number Variations
9.
Nucleic Acids Res ; 43(4): e22, 2015 Feb 27.
Article in English | MEDLINE | ID: mdl-25510498

ABSTRACT

Transposable elements (TEs) constitute the most active, diverse and ancient component in a broad range of genomes. Complete understanding of genome function and evolution cannot be achieved without a thorough understanding of TE impact and biology. However, in-depth analysis of TEs still represents a challenge due to the repetitive nature of these genomic entities. In this work, we present a broadly applicable and flexible tool: T-lex2. T-lex2 is the only available software that allows routine, automatic and accurate genotyping of individual TE insertions and estimation of their population frequencies both using individual strain and pooled next-generation sequencing data. Furthermore, T-lex2 also assesses the quality of the calls allowing the identification of miss-annotated TEs and providing the necessary information to re-annotate them. The flexible and customizable design of T-lex2 allows running it in any genome and for any type of TE insertion. Here, we tested the fidelity of T-lex2 using the fly and human genomes. Overall, T-lex2 represents a significant improvement in our ability to analyze the contribution of TEs to genome function and evolution as well as learning about the biology of TEs. T-lex2 is freely available online at http://sourceforge.net/projects/tlex.


Subject(s)
Genotyping Techniques/methods , High-Throughput Nucleotide Sequencing , Interspersed Repetitive Sequences , Molecular Sequence Annotation/methods , Sequence Analysis, DNA , Software , Animals , Drosophila melanogaster/genetics , Genome, Human , Genome, Insect , Humans
10.
Proc Natl Acad Sci U S A ; 108(38): 16116-21, 2011 Sep 20.
Article in English | MEDLINE | ID: mdl-21876149

ABSTRACT

In bacteria, genes with related functions often are grouped together in operons and are cotranscribed as a single polycistronic mRNA. In eukaryotes, functionally related genes generally are scattered across the genome. Notable exceptions include gene clusters for catabolic pathways in yeast, synthesis of secondary metabolites in filamentous fungi, and the major histocompatibility complex in animals. Until quite recently it was thought that gene clusters in plants were restricted to tandem duplicates (for example, arrays of leucine-rich repeat disease-resistance genes). However, operon-like clusters of coregulated nonhomologous genes are an emerging theme in plant biology, where they may be involved in the synthesis of certain defense compounds. These clusters are unlikely to have arisen by horizontal gene transfer, and the mechanisms behind their formation are poorly understood. Previously in thale cress (Arabidopsis thaliana) we identified an operon-like gene cluster that is required for the synthesis and modification of the triterpene thalianol. Here we characterize a second operon-like triterpene cluster (the marneral cluster) from A. thaliana, compare the features of these two clusters, and investigate the evolutionary events that have led to cluster formation. We conclude that common mechanisms are likely to underlie the assembly and control of operon-like gene clusters in plants.


Subject(s)
Arabidopsis/genetics , Arabidopsis/metabolism , Chromosomes, Plant/genetics , Multigene Family , Acyltransferases/genetics , Acyltransferases/metabolism , Arabidopsis Proteins/classification , Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Chromosome Mapping , Cytochrome P-450 Enzyme System/genetics , Cytochrome P-450 Enzyme System/metabolism , Gas Chromatography-Mass Spectrometry , Gene Duplication , Gene Expression Regulation, Plant , Genome, Plant/genetics , Intramolecular Transferases/genetics , Intramolecular Transferases/metabolism , Models, Genetic , Molecular Structure , Mutation , Phylogeny , Plant Leaves/genetics , Plant Leaves/metabolism , Plant Roots/genetics , Plant Roots/metabolism , Plants, Genetically Modified , Triterpenes/analysis , Triterpenes/chemistry , Triterpenes/metabolism
11.
Nucleic Acids Res ; 39(6): e36, 2011 Mar.
Article in English | MEDLINE | ID: mdl-21177644

ABSTRACT

Transposable elements (TEs) are repetitive DNA sequences that are ubiquitous, extremely abundant and dynamic components of practically all genomes. Much effort has gone into annotation of TE copies in reference genomes. The sequencing cost reduction and the newly available next-generation sequencing (NGS) data from multiple strains within a species offer an unprecedented opportunity to study population genomics of TEs in a range of organisms. Here, we present a computational pipeline (T-lex) that uses NGS data to detect the presence/absence of annotated TE copies. T-lex can use data from a large number of strains and returns estimates of population frequencies of individual TE insertions in a reasonable time. We experimentally validated the accuracy of T-lex detecting presence or absence of 768 previously identified TE copies in two resequenced Drosophila melanogaster strains. Approximately 95% of the TE insertions were detected with 100% sensitivity and 97% specificity. We show that even at low levels of coverage T-lex produces accurate results for TE copies that it can identify reliably but that the rate of 'no data' calls increases as the coverage falls below 15×. T-lex is a broadly applicable and flexible tool that can be used in any genome provided the availability of the reference genome, individual TE copy annotation and NGS data.


Subject(s)
DNA Transposable Elements , Sequence Analysis, DNA/methods , Software , Animals , Drosophila melanogaster/genetics , Genomics/methods , High-Throughput Nucleotide Sequencing
12.
Genome Biol ; 24(1): 63, 2023 04 03.
Article in English | MEDLINE | ID: mdl-37013657

ABSTRACT

Transposable Element MOnitoring with LOng-reads (TrEMOLO) is a new software that combines assembly- and mapping-based approaches to robustly detect genetic elements called transposable elements (TEs). Using high- or low-quality genome assemblies, TrEMOLO can detect most TE insertions and deletions and estimate their allele frequency in populations. Benchmarking with simulated data revealed that TrEMOLO outperforms other state-of-the-art computational tools. TE detection and frequency estimation by TrEMOLO were validated using simulated and experimental datasets. Therefore, TrEMOLO is a comprehensive and suitable tool to accurately study TE dynamics. TrEMOLO is available under GNU GPL3.0 at https://github.com/DrosophilaGenomeEvolution/TrEMOLO .


Subject(s)
DNA Transposable Elements , Software , Gene Frequency , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA
13.
Mol Biol Evol ; 28(5): 1633-44, 2011 May.
Article in English | MEDLINE | ID: mdl-21172826

ABSTRACT

Transposable elements (TEs) are the primary contributors to the genome bulk in many organisms and are major players in genome evolution. A clear and thorough understanding of the population dynamics of TEs is therefore essential for full comprehension of the eukaryotic genome evolution and function. Although TEs in Drosophila melanogaster have received much attention, population dynamics of most TE families in this species remains entirely unexplored. It is not clear whether the same population processes can account for the population behaviors of all TEs in Drosophila or whether, as has been suggested previously, different orders behave according to very different rules. In this work, we analyzed population frequencies for a large number of individual TEs (755 TEs) in five North American and one sub-Saharan African D. melanogaster populations (75 strains in total). These TEs have been annotated in the reference D. melanogaster euchromatic genome and have been sampled from all three major orders (non-LTR, LTR, and TIR) and from all families with more than 20 TE copies (55 families in total). We find strong evidence that TEs in Drosophila across all orders and families are subject to purifying selection at the level of ectopic recombination. We showed that strength of this selection varies predictably with recombination rate, length of individual TEs, and copy number and length of other TEs in the same family. Importantly, these rules do not appear to vary across orders. Finally, we built a statistical model that considered only individual TE-level (such as the TE length) and family-level properties (such as the copy number) and were able to explain more than 40% of the variation in TE frequencies in D. melanogaster.


Subject(s)
DNA Transposable Elements/genetics , Drosophila melanogaster/genetics , Genetic Variation , Animals , Gene Expression , Gene Frequency , Metagenomics , Models, Genetic , Recombination, Genetic , Selection, Genetic , Sequence Analysis, DNA , Terminal Repeat Sequences
14.
Methods Mol Biol ; 2443: 327-385, 2022.
Article in English | MEDLINE | ID: mdl-35037215

ABSTRACT

Plant genomes contain a particularly high proportion of repeated structures of various types. This chapter proposes a guided tour of the available software that can help biologists to scan automatically for these repeats in sequence data or check hypothetical models intended to characterize their structures. Since transposable elements (TEs) are a major source of repeats in plants, many methods have been used or developed for this broad class of sequences. They are representative of the range of tools available for other classes of repeats and we have provided two sections on this topic (for the analysis of genomes or directly of sequenced reads), as well as a selection of the main existing software. It may be hard to keep up with the profusion of proposals in this dynamic field and the rest of the chapter is devoted to the foundations of an efficient search for repeats and more complex patterns. We first introduce the key concepts of the art of indexing and mapping or querying sequences. We end the chapter with the more prospective issue of building models of repeat families. We present the Machine Learning approach first, seeking to build predictors automatically for some families of ET, from a set of sequences known to belong to this family. A second approach, the linguistic (or syntactic) approach, allows biologists to describe themselves and check the validity of models of their favorite repeat family.


Subject(s)
Genome, Plant , Software , DNA Transposable Elements/genetics , Plants/genetics , Prospective Studies
15.
J Bioinform Comput Biol ; 19(6): 2140016, 2021 12.
Article in English | MEDLINE | ID: mdl-34923926

ABSTRACT

In this paper, we investigate througth a premilinary study the influence of repeat elements during the assembly process. We analyze the link between the presence and the nature of one type of repeat element, called transposable element (TE) and misassembly events in genome assemblies. We propose to improve assemblies by taking into account the presence of repeat elements, including TEs, during the scaffolding step. We analyze the results and relate the misassemblies to TEs before and after correction.


Subject(s)
DNA Transposable Elements
16.
Nat Neurosci ; 24(2): 186-196, 2021 02.
Article in English | MEDLINE | ID: mdl-33432196

ABSTRACT

Retrotransposons can cause somatic genome variation in the human nervous system, which is hypothesized to have relevance to brain development and neuropsychiatric disease. However, the detection of individual somatic mobile element insertions presents a difficult signal-to-noise problem. Using a machine-learning method (RetroSom) and deep whole-genome sequencing, we analyzed L1 and Alu retrotransposition in sorted neurons and glia from human brains. We characterized two brain-specific L1 insertions in neurons and glia from a donor with schizophrenia. There was anatomical distribution of the L1 insertions in neurons and glia across both hemispheres, indicating retrotransposition occurred during early embryogenesis. Both insertions were within the introns of genes (CNNM2 and FRMD4A) inside genomic loci associated with neuropsychiatric disorders. Proof-of-principle experiments revealed these L1 insertions significantly reduced gene expression. These results demonstrate that RetroSom has broad applications for studies of brain development and may provide insight into the possible pathological effects of somatic retrotransposition.


Subject(s)
Machine Learning , Mutagenesis, Insertional/genetics , Neuroglia , Neurons , Adaptor Proteins, Signal Transducing/genetics , Adult , Cation Transport Proteins/genetics , Embryonic Development/genetics , Female , Genome/genetics , HeLa Cells , High-Throughput Nucleotide Sequencing , Humans , Long Interspersed Nucleotide Elements , Mental Disorders/genetics , Pregnancy , Retroelements , Schizophrenia/genetics
18.
Viruses ; 11(8)2019 08 15.
Article in English | MEDLINE | ID: mdl-31443175

ABSTRACT

Viruses are able to evolve in vitro by mutations after serial passages in cell cultures, which can lead to either a loss, or an increase, of virulence. Cyprinid herpesvirus 3 (CyHV-3), a 295-kb double-stranded DNA virus, is the etiological agent of the koi herpesvirus disease (KHVD). To assess the influence of serial passages, an isolate of CyHV-3 (KHV-T) was passaged 99 times onto common carp brain (CCB) cells, and virus virulence was evaluated during passages through the experimental infections of common carp. After 78 CCB passages, the isolate was much less virulent than the original form. A comparative genomic analysis of these three forms of KHV-T (P0, P78 and P99) revealed a limited number of variations. The largest one was a deletion of 1363 bp in the predicted ORF150, which was detected in P78, but not in P99. This unexpected finding was confirmed by conventional PCR and digital PCR. The results presented here primarily suggest that, CyHV-3 evolves, at least in vitro, through an assemblage of haplotypes that alternatively become dominant or under-represented.


Subject(s)
Fish Diseases/virology , Herpesviridae Infections/veterinary , Herpesviridae/genetics , Animals , Biological Evolution , Carps/virology , Haplotypes , Herpesviridae/classification , Herpesviridae/growth & development , Herpesviridae/pathogenicity , Herpesviridae Infections/virology , Open Reading Frames , Serial Passage , Virulence
20.
Mob DNA ; 6: 13, 2015.
Article in English | MEDLINE | ID: mdl-26244060

ABSTRACT

DNA derived from transposable elements (TEs) constitutes large parts of the genomes of complex eukaryotes, with major impacts not only on genomic research but also on how organisms evolve and function. Although a variety of methods and tools have been developed to detect and annotate TEs, there are as yet no standard benchmarks-that is, no standard way to measure or compare their accuracy. This lack of accuracy assessment calls into question conclusions from a wide range of research that depends explicitly or implicitly on TE annotation. In the absence of standard benchmarks, toolmakers are impeded in improving their tools, annotators cannot properly assess which tools might best suit their needs, and downstream researchers cannot judge how accuracy limitations might impact their studies. We therefore propose that the TE research community create and adopt standard TE annotation benchmarks, and we call for other researchers to join the authors in making this long-overdue effort a success.

SELECTION OF CITATIONS
SEARCH DETAIL