1.
RNA ; 29(12): 1839-1855, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37816550

ABSTRACT

The tremendous rate with which data is generated and analysis methods emerge makes it increasingly difficult to keep track of their domain of applicability, assumptions, limitations, and consequently, of the efficacy and precision with which they solve specific tasks. Therefore, there is an increasing need for benchmarks, and for the provision of infrastructure for continuous method evaluation. APAeval is an international community effort, organized by the RNA Society in 2021, to benchmark tools for the identification and quantification of the usage of alternative polyadenylation (APA) sites from short-read, bulk RNA-sequencing (RNA-seq) data. Here, we reviewed 17 tools and benchmarked eight on their ability to perform APA identification and quantification, using a comprehensive set of RNA-seq experiments comprising real, synthetic, and matched 3'-end sequencing data. To support continuous benchmarking, we have incorporated the results into the OpenEBench online platform, which allows for continuous extension of the set of methods, metrics, and challenges. We envisage that our analyses will assist researchers in selecting the appropriate tools for their studies, while the containers and reproducible workflows could easily be deployed and extended to evaluate new methods or data sets.


Subject(s)
Benchmarking , RNA , RNA/genetics , RNA-Seq , Polyadenylation , Sequence Analysis, RNA/methods
2.
bioRxiv ; 2023 Jun 26.
Article in English | MEDLINE | ID: mdl-37425672

ABSTRACT

The tremendous rate with which data is generated and analysis methods emerge makes it increasingly difficult to keep track of their domain of applicability, assumptions, and limitations and consequently, of the efficacy and precision with which they solve specific tasks. Therefore, there is an increasing need for benchmarks, and for the provision of infrastructure for continuous method evaluation. APAeval is an international community effort, organized by the RNA Society in 2021, to benchmark tools for the identification and quantification of the usage of alternative polyadenylation (APA) sites from short-read, bulk RNA-sequencing (RNA-seq) data. Here, we reviewed 17 tools and benchmarked eight on their ability to perform APA identification and quantification, using a comprehensive set of RNA-seq experiments comprising real, synthetic, and matched 3'-end sequencing data. To support continuous benchmarking, we have incorporated the results into the OpenEBench online platform, which allows for seamless extension of the set of methods, metrics, and challenges. We envisage that our analyses will assist researchers in selecting the appropriate tools for their studies. Furthermore, the containers and reproducible workflows generated in the course of this project can be seamlessly deployed and extended in the future to evaluate new methods or datasets.

3.
Nucleic Acids Res ; 51(D1): D1353-D1359, 2023 Jan 06.
Article in English | MEDLINE | ID: mdl-36399499

ABSTRACT

The Open Targets Platform (https://platform.opentargets.org/) is an open source resource to systematically assist drug target identification and prioritisation using publicly available data. Since our last update, we have reimagined, redesigned, and rebuilt the Platform in order to streamline data integration and harmonisation, expand the ways in which users can explore the data, and improve the user experience. The gene-disease causal evidence has been enhanced and expanded to better capture disease causality across rare, common, and somatic diseases. For target and drug annotations, we have incorporated new features that help assess target safety and tractability, including genetic constraint, PROTACtability assessments, and AlphaFold structure predictions. We have also introduced new machine learning applications for knowledge extraction from the published literature, clinical trial information, and drug labels. The new technologies and frameworks introduced since the last update will ease the introduction of new features and the creation of separate instances of the Platform adapted to user requirements. Our new Community forum, expanded training materials, and outreach programme support our users in a range of use cases.

4.
Nucleic Acids Res ; 49(D1): D1302-D1310, 2021 Jan 08.
Article in English | MEDLINE | ID: mdl-33196847

ABSTRACT

The Open Targets Platform (https://www.targetvalidation.org/) provides users with a queryable knowledgebase and user interface to aid systematic target identification and prioritisation for drug discovery based upon underlying evidence. It is publicly available and the underlying code is open source. Since our last update two years ago, we have had 10 releases to maintain and continuously improve evidence for target-disease relationships from 20 different data sources. In addition, we have integrated new evidence from key datasets, including prioritised targets identified from genome-wide CRISPR knockout screens in 300 cancer models (Project Score), and GWAS/UK BioBank statistical genetic analysis evidence from the Open Targets Genetics Portal. We have evolved our evidence scoring framework to improve target identification. To aid the prioritisation of targets and inform on the potential impact of modulating a given target, we have added evaluation of post-marketing adverse drug reactions and new curated information on target tractability and safety. We have also developed the user interface and backend technologies to improve performance and usability. In this article, we describe the latest enhancements to the Platform, to address the fundamental challenge that developing effective and safe drugs is difficult and expensive.


Subject(s)
Antineoplastic Agents/therapeutic use , Drugs, Investigational/therapeutic use , Knowledge Bases , Molecular Targeted Therapy/methods , Neoplasms/drug therapy , Software , Antineoplastic Agents/chemistry , Databases, Factual , Datasets as Topic , Drug Discovery/methods , Drugs, Investigational/chemistry , Humans , Internet , Neoplasms/classification , Neoplasms/genetics , Neoplasms/pathology
5.
Nucleic Acids Res ; 49(D1): D1311-D1320, 2021 Jan 08.
Article in English | MEDLINE | ID: mdl-33045747

ABSTRACT

Open Targets Genetics (https://genetics.opentargets.org) is an open-access integrative resource that aggregates human GWAS and functional genomics data including gene expression, protein abundance, chromatin interaction and conformation data from a wide range of cell types and tissues to make robust connections between GWAS-associated loci, variants and likely causal genes. This enables systematic identification and prioritisation of likely causal variants and genes across all published trait-associated loci. In this paper, we describe the public resources we aggregate, the technology and analyses we use, and the functionality that the portal offers. Open Targets Genetics can be searched by variant, gene or study/phenotype. It offers tools that enable users to prioritise causal variants and genes at disease-associated loci and access systematic cross-disease and disease-molecular trait colocalization analysis across 92 cell types and tissues including the eQTL Catalogue. Data visualizations such as Manhattan-like plots, regional plots, credible sets overlap between studies and PheWAS plots enable users to explore GWAS signals in depth. The integrated data is made available through the web portal, for bulk download and via a GraphQL API, and the software is open source. Applications of this integrated data include identification of novel targets for drug discovery and drug repurposing.


Subject(s)
Databases, Genetic , Genome, Human , Inflammatory Bowel Diseases/genetics , Molecular Targeted Therapy/methods , Quantitative Trait Loci , Software , Chromatin/chemistry , Chromatin/metabolism , Datasets as Topic , Drug Discovery/methods , Drug Repositioning/methods , Genome-Wide Association Study , Genotype , Humans , Inflammatory Bowel Diseases/drug therapy , Inflammatory Bowel Diseases/metabolism , Inflammatory Bowel Diseases/pathology , Internet , Phenotype , Quantitative Trait, Heritable
6.
Evol Appl ; 13(5): 1009-1025, 2020 May.
Article in English | MEDLINE | ID: mdl-32431749

ABSTRACT

Genetic diversity is the determinant for pest species' success and vector competence. Understanding the ecological and evolutionary processes that determine the genetic diversity is fundamental to help identify the spatial scale at which pest populations are best managed. In the present study, we present the first comprehensive analysis of the genetic diversity and evolution of Rhopalosiphum padi, a major pest of cereals and a main vector of the barley yellow dwarf virus (BYDV), in England. We have used a genotyping-by-sequencing approach to study whether (a) there is any underlying population genetic structure at a national and regional scale in this pest that can disperse long distances; (b) the populations evolve as a response to environmental change and selective pressures; and (c) the populations comprise anholocyclic lineages. Individual R. padi were collected using the Rothamsted Insect Survey's suction-trap network at several sites across England between 2004 and 2016 as part of the RIS long-term nationwide surveillance. Results identified two genetic clusters in England that mostly corresponded to a North-South division, although gene flow is ongoing between the two subpopulations. These genetic clusters do not correspond to different life cycle types, and cyclical parthenogenesis is predominant in England. Results also show that there is dispersal with gene flow across England, although there is a reduction between the northern and southern sites with the south-western population being the most genetically differentiated. There is no evidence for isolation by distance and other factors such as primary host distribution, uncommon in the south and absent in the south-west, could influence the dispersal patterns. Finally, results also show no evidence for the evolution of the R. padi population, and it is demographically stable despite the ongoing environmental change. 
These results are discussed in view of their relevance to pest management and the transmission of BYDV.

7.
BMC Plant Biol ; 20(1): 170, 2020 Apr 16.
Article in English | MEDLINE | ID: mdl-32299364

ABSTRACT

BACKGROUND: High post-anthesis (p.a.) temperatures reduce mature grain weights in wheat and other cereals. However, the causes of this reduction are not entirely known. Control of grain expansion by the maternally derived pericarp of the grain has previously been suggested, although this interaction has not been investigated under high p.a. temperatures. Down-regulation of pericarp-localised genes that regulate cell wall expansion under high p.a. temperatures may limit expansion of the encapsulated endosperm due to a loss of plasticity in the pericarp, reducing mature grain weight. Here the effect of high p.a. temperatures on the transcriptome of the pericarp and endosperm of the wheat grain during early grain-filling was investigated via RNA-Seq and is discussed alongside grain moisture dynamics during early grain development and mature grain weight. RESULTS: High p.a. temperatures applied from 6 days after anthesis (daa) until 18 daa reduced the grain's ability to accumulate water, with total grain moisture and percentage grain moisture content being significantly reduced from 14 daa onwards. Mature grain weight was also significantly reduced by the same high p.a. temperatures applied from 6 daa for 4 days or more, in a separate experiment. Comparison of our RNA-Seq data from whole grains with existing data sets from isolated pericarp and endosperm tissues enabled the identification of subsets of genes whose expression was significantly affected by high p.a. temperature and predominantly expressed in either tissue. Hierarchical clustering and gene ontology analysis resulted in the identification of a number of genes implicated in the regulation of cell wall expansion, predominantly expressed in the pericarp and significantly down-regulated under high p.a. temperatures, including endoglucanase, xyloglucan endotransglycosylases and a β-expansin. An over-representation of genes involved in the 'cuticle development' functional pathway that were expressed in the pericarp and affected by high p.a. temperatures was also observed. CONCLUSIONS: High p.a. temperature induced down-regulation of genes involved in regulating pericarp cell wall expansion. The concurrence of this down-regulation with a reduction in total grain moisture content and grain weight following the same treatment period adds support to the theory that high p.a. temperatures may cause a reduction in mature grain weight as a result of decreased pericarp cell wall expansion.


Subject(s)
Hot Temperature , Plant Proteins/metabolism , Seeds/growth & development , Transcriptome , Triticum/metabolism , Edible Grain/growth & development , Edible Grain/metabolism , Seeds/metabolism , Triticum/growth & development
8.
BMC Genomics ; 20(1): 628, 2019 Aug 01.
Article in English | MEDLINE | ID: mdl-31370780

ABSTRACT

BACKGROUND: Free asparagine is the precursor for acrylamide formation during cooking and processing of grains, tubers, beans and other crop products. In wheat grain, free asparagine, free glutamine and total free amino acids accumulate to high levels in response to sulphur deficiency. In this study, RNA-seq data were acquired for the embryo and endosperm of two genotypes of bread wheat, Spark and SR3, growing under conditions of sulphur sufficiency and deficiency, and sampled at 14 and 21 days post anthesis (dpa). The aim was to provide new knowledge and understanding of the genetic control of asparagine accumulation and breakdown in wheat grain. RESULTS: There were clear differences in gene expression patterns between the genotypes. Sulphur responses were greater at 21 dpa than 14 dpa, and more evident in SR3 than Spark. TaASN2 was the most highly expressed asparagine synthetase gene in the grain, with expression in the embryo much higher than in the endosperm, and higher in Spark than SR3 during early development. There was a trend for genes encoding enzymes of nitrogen assimilation to be more highly expressed in Spark than SR3 when sulphur was supplied. TaASN2 expression in the embryo of SR3 increased in response to sulphur deficiency at 21 dpa, although this was not observed in Spark. This increase in TaASN2 expression was accompanied by an increase in glutamine synthetase gene expression and a decrease in asparaginase gene expression. Asparagine synthetase and asparaginase gene expression in the endosperm responded in the opposite way. Genes encoding regulatory protein kinases, SnRK1 and GCN2, both implicated in regulating asparagine synthetase gene expression, also responded to sulphur deficiency. Genes encoding bZIP transcription factors, including Opaque2/bZIP9, SPA/bZIP25 and BLZ1/OHP1/bZIP63, all of which contain SnRK1 target sites, were also expressed. Homeologues of many genes showed differential expression patterns and responses, including TaASN2. 
CONCLUSIONS: Data on the genetic control of free asparagine accumulation in wheat grain and its response to sulphur supply showed grain asparagine levels to be determined in the embryo, and identified genes encoding signalling and metabolic proteins involved in asparagine metabolism that respond to sulphur availability.


Subject(s)
Asparagine/metabolism , Gene Expression Regulation, Plant/drug effects , Genotype , Sulfur/pharmacology , Triticum/genetics , Triticum/metabolism , Sequence Analysis, RNA , Transcription Factors/genetics , Triticum/drug effects , Triticum/enzymology
9.
Sci Data ; 6(1): 128, 2019 Jul 22.
Article in English | MEDLINE | ID: mdl-31332220

ABSTRACT

The London Planetree (Platanus acerifolia) is present throughout the world. The tree is considered a greening plant and is commonly planted in streets, parks, and courtyards. The Sycamore lace bug (Corythucha ciliata) is a serious pest of this tree. To determine the molecular mechanism behind the interaction between the London Planetree and the Sycamore lace bug, we generated a comprehensive RNA-seq dataset (630,835,762 clean reads) for P. acerifolia by sequencing leaves both infested and not infested by C. ciliata using the Illumina HiSeq 4000 system. We assembled the transcriptomes de novo using Trinity, followed by annotation. In total, 121,136 unigenes were obtained, and 80,559 unigenes were successfully annotated. From the 121,136 unigenes, we identified 3,010,256 SNPs, 39,097 microsatellite loci, and 1,916 transcription factors. The transcriptomic dataset we present is the first report of transcriptome information in Platanus species and will be useful in future studies with P. acerifolia and other Platanus species, especially in the areas of genomics, molecular biology, physiology, and population genetics.


Subject(s)
Hemiptera , Magnoliopsida/genetics , Transcription Factors/genetics , Transcriptome , Animals , Genes, Plant , Genetic Markers , Herbivory , Microsatellite Repeats , Polymorphism, Single Nucleotide , Trees/genetics
10.
BMC Genomics ; 19(1): 624, 2018 Aug 22.
Article in English | MEDLINE | ID: mdl-30134833

ABSTRACT

BACKGROUND: The new genomic technologies have provided novel insights into the genetics of interactions between vectors, viruses and hosts, which are leading to advances in the control of arboviruses of medical importance. However, the development of tools and resources available for vectors of non-zoonotic arboviruses remains neglected. Biting midges of the genus Culicoides transmit some of the most important arboviruses of wildlife and livestock worldwide, with a global impact on economic productivity, health and welfare. The absence of a suitable reference genome has hindered genomic analyses to date in this important genus of vectors. In the present study, the genome of Culicoides sonorensis, a vector of bluetongue virus (BTV) in the USA, has been sequenced to provide the first reference genome for these vectors. In this study, we also report the use of the reference genome to perform initial transcriptomic analyses of vector competence for BTV. RESULTS: Our analyses reveal that the genome is 189 Mb, assembled in 7974 scaffolds. Its annotation using the transcriptomic data generated in this study and in a previous study has identified 15,612 genes. Gene expression analyses of C. sonorensis females infected with BTV performed in this study revealed 165 genes that were differentially expressed between vector competent and refractory females. Two candidate genes, glutathione S-transferase (gst) and the antiviral helicase ski2, previously recognized as involved in vector competence for BTV in C. sonorensis (gst) and repressing dsRNA virus propagation (ski2), were confirmed in this study. CONCLUSIONS: The reference genome of C. sonorensis has enabled preliminary analyses of the gene expression profiles of vector competent and refractory individuals. The genome and transcriptomes generated in this study provide suitable tools for future research on arbovirus transmission. 
These provide a valuable resource for this vector lineage, which diverged from other major Dipteran vector families over 200 million years ago. The genome will be a valuable source of comparative data for other important Dipteran vector families including mosquitoes (Culicidae) and sandflies (Psychodidae), and together with the transcriptomic data can yield potential targets for transgenic modification in vector control and functional studies.


Subject(s)
Bluetongue virus/physiology , Bluetongue/transmission , Ceratopogonidae/genetics , Ceratopogonidae/virology , Genome, Insect , Insect Vectors , Animals , Bluetongue/immunology , Bluetongue/virology , Bluetongue virus/immunology , Ceratopogonidae/immunology , Evolution, Molecular , Gene Expression Profiling , Host-Pathogen Interactions/genetics , Host-Pathogen Interactions/immunology , Immunity, Innate/genetics , Insect Vectors/genetics , Insect Vectors/physiology , Molecular Sequence Annotation , Sequence Analysis, DNA , Transcriptome/genetics