Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26
Filtrar
1.
BMC Bioinformatics ; 25(1): 184, 2024 May 09.
Artigo em Inglês | MEDLINE | ID: mdl-38724907

RESUMO

BACKGROUND: Major advances in sequencing technologies and the sharing of data and metadata in science have resulted in a wealth of publicly available datasets. However, working with and especially curating public omics datasets remains challenging despite these efforts. While a growing number of initiatives aim to re-use previous results, these present limitations that often lead to the need for further in-house curation and processing. RESULTS: Here, we present the Omics Dataset Curation Toolkit (OMD Curation Toolkit), a python3 package designed to accompany and guide the researcher during the curation process of metadata and fastq files of public omics datasets. This workflow provides a standardized framework with multiple capabilities (collection, control check, treatment and integration) to facilitate the arduous task of curating public sequencing data projects. While centered on the European Nucleotide Archive (ENA), the majority of the provided tools are generic and can be used to curate datasets from different sources. CONCLUSIONS: Thus, it offers valuable tools for the in-house curation previously needed to re-use public omics data. Due to its workflow structure and capabilities, it can be easily used and benefit investigators in developing novel omics meta-analyses based on sequencing data.


Assuntos
Curadoria de Dados , Software , Fluxo de Trabalho , Curadoria de Dados/métodos , Metadados , Bases de Dados Genéticas , Genômica/métodos , Biologia Computacional/métodos
2.
Genes (Basel) ; 14(8)2023 08 19.
Artigo em Inglês | MEDLINE | ID: mdl-37628701

RESUMO

Gut microbiomes of fish species consist of thousands of bacterial taxa that interact among each other, their environment, and the host. These complex networks of interactions are regulated by a diverse range of factors, yet little is known about the hierarchy of these interactions. Here, we introduce SAMBA (Structure-Learning of Aquaculture Microbiomes using a Bayesian Approach), a computational tool that uses a unified Bayesian network approach to model the network structure of fish gut microbiomes and their interactions with biotic and abiotic variables associated with typical aquaculture systems. SAMBA accepts input data on microbial abundance from 16S rRNA amplicons as well as continuous and categorical information from distinct farming conditions. From this, SAMBA can create and train a network model scenario that can be used to (i) infer information of how specific farming conditions influence the diversity of the gut microbiome or pan-microbiome, and (ii) predict how the diversity and functional profile of that microbiome would change under other variable conditions. SAMBA also allows the user to visualize, manage, edit, and export the acyclic graph of the modelled network. Our study presents examples and test results of Bayesian network scenarios created by SAMBA using data from a microbial synthetic community, and the pan-microbiome of gilthead sea bream (Sparus aurata) in different feeding trials. It is worth noting that the usage of SAMBA is not limited to aquaculture systems as it can be used for modelling microbiome-host network relationships of any vertebrate organism, including humans, in any system and/or ecosystem.


Assuntos
Microbiota , Dourada , Animais , Humanos , Teorema de Bayes , RNA Ribossômico 16S/genética , Aprendizagem , Microbiota/genética , Aquicultura
3.
Microbiome ; 11(1): 130, 2023 06 13.
Artigo em Inglês | MEDLINE | ID: mdl-37312139

RESUMO

BACKGROUND: Nudibranchs comprise a group of > 6000 marine soft-bodied mollusk species known to use secondary metabolites (natural products) for chemical defense. The full diversity of these metabolites and whether symbiotic microbes are responsible for their synthesis remains unexplored. Another issue in searching for undiscovered natural products is that computational analysis of genomes of uncultured microbes can result in detection of novel biosynthetic gene clusters; however, their in vivo functionality is not guaranteed which limits further exploration of their pharmaceutical or industrial potential. To overcome these challenges, we used a fluorescent pantetheine probe, which produces a fluorescent CoA-analog employed in biosynthesis of secondary metabolites, to label and capture bacterial symbionts actively producing these compounds in the mantle of the nudibranch Doriopsilla fulva. RESULTS: We recovered the genome of Candidatus Doriopsillibacter californiensis from the Ca. Tethybacterales order, an uncultured lineage of sponge symbionts not found in nudibranchs previously. It forms part of the core skin microbiome of D. fulva and is nearly absent in its internal organs. We showed that crude extracts of D. fulva contained secondary metabolites that were consistent with the presence of a beta-lactone encoded in Ca. D. californiensis genome. Beta-lactones represent an underexplored group of secondary metabolites with pharmaceutical potential that have not been reported in nudibranchs previously. CONCLUSIONS: Altogether, this study shows how probe-based, targeted sorting approaches can capture bacterial symbionts producing secondary metabolites in vivo. Video Abstract.


Assuntos
Produtos Biológicos , Gastrópodes , Animais , Bactérias/genética , Corantes Fluorescentes , Lactonas , Preparações Farmacêuticas
4.
Viruses ; 15(5)2023 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-37243281

RESUMO

The environmental impact of uncultured phages is shaped by their preferred life cycle (lytic or lysogenic). However, our ability to predict it is very limited. We aimed to discriminate between lytic and lysogenic phages by comparing the similarity of their genomic signatures to those of their hosts, reflecting their co-evolution. We tested two approaches: (1) similarities of tetramer relative frequencies, (2) alignment-free comparisons based on exact k = 14 oligonucleotide matches. First, we explored 5126 reference bacterial host strains and 284 associated phages and found an approximate threshold for distinguishing lysogenic and lytic phages using both oligonucleotide-based methods. The analysis of 6482 plasmids revealed the potential for horizontal gene transfer between different host genera and, in some cases, distant bacterial taxa. Subsequently, we experimentally analyzed combinations of 138 Klebsiella pneumoniae strains and their 41 phages and found that the phages with the largest number of interactions with these strains in the laboratory had the shortest genomic distances to K. pneumoniae. We then applied our methods to 24 single-cells from a hot spring biofilm containing 41 uncultured phage-host pairs, and the results were compatible with the lysogenic life cycle of phages detected in this environment. In conclusion, oligonucleotide-based genome analysis methods can be used for predictions of (1) life cycles of environmental phages, (2) phages with the broadest host range in culture collections, and (3) potential horizontal gene transfer by plasmids.


Assuntos
Bacteriófagos , Bacteriófagos/genética , Lisogenia , Genômica , Genoma Viral , Bactérias/genética , Oligonucleotídeos
5.
Front Microbiol ; 14: 1154815, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37213502

RESUMO

A major challenge in microbial ecology is to understand the principles and processes by which microbes associate and interact in community assemblages. Microbial communities in mountain glaciers are unique as first colonizers and nutrient enrichment drivers for downstream ecosystems. However, mountain glaciers have been distinctively sensitive to climate perturbations and have suffered a severe retreat over the past 40 years, compelling us to understand glacier ecosystems before their disappearance. This is the first study in an Andean glacier in Ecuador offering insights into the relationship of physicochemical variables and altitude on the diversity and structure of bacterial communities. Our study covered extreme Andean altitudes at the Cayambe Volcanic Complex, from 4,783 to 5,583 masl. Glacier soil and ice samples were used as the source for 16S rRNA gene amplicon libraries. We found (1) effects of altitude on diversity and community structure, (2) the presence of few significantly correlated nutrients to community structure, (3) sharp differences between glacier soil and glacier ice in diversity and community structure, where, as quantified by the Shannon γ-diversity distribution, the meta-community in glacier soil showed more diversity than in glacier ice; this pattern was related to the higher variability of the physicochemical distribution of variables in the former substrate, and (4) significantly abundant genera associated with either high or low altitudes that could serve as biomarkers for studies on climate change. Our results provide the first assessment of these unexplored communities, before their potential disappearance due to glacier retreat and climate change.

6.
Genes (Basel) ; 14(2)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36833195

RESUMO

The GPRO suite is an in-progress bioinformatic project for -omics data analysis. As part of the continued growth of this project, we introduce a client- and server-side solution for comparative transcriptomics and analysis of variants. The client-side consists of two Java applications called "RNASeq" and "VariantSeq" to manage pipelines and workflows based on the most common command line interface tools for RNA-seq and Variant-seq analysis, respectively. As such, "RNASeq" and "VariantSeq" are coupled with a Linux server infrastructure (named GPRO Server-Side) that hosts all dependencies of each application (scripts, databases, and command line interface software). Implementation of the Server-Side requires a Linux operating system, PHP, SQL, Python, bash scripting, and third-party software. The GPRO Server-Side can be installed, via a Docker container, in the user's PC under any operating system or on remote servers, as a cloud solution. "RNASeq" and "VariantSeq" are both available as desktop (RCP compilation) and web (RAP compilation) applications. Each application has two execution modes: a step-by-step mode enables each step of the workflow to be executed independently, and a pipeline mode allows all steps to be run sequentially. "RNASeq" and "VariantSeq" also feature an experimental, online support system called GENIE that consists of a virtual (chatbot) assistant and a pipeline jobs panel coupled with an expert system. The chatbot can troubleshoot issues with the usage of each tool, the pipeline jobs panel provides information about the status of each computational job executed in the GPRO Server-Side, while the expert system provides the user with a potential recommendation to identify or fix failed analyses. Our solution is a ready-to-use topic specific platform that combines the user-friendliness, robustness, and security of desktop software, with the efficiency of cloud/web applications to manage pipelines and workflows based on command line interface software.


Assuntos
Software , Interface Usuário-Computador , Humanos , Fluxo de Trabalho , Biologia Computacional , Bases de Dados Factuais
7.
Biology (Basel) ; 12(2)2023 Feb 16.
Artigo em Inglês | MEDLINE | ID: mdl-36829597

RESUMO

Organisms are unique physical entities in which information is stored and continuously processed. The digital nature of DNA sequences enables the construction of a dynamic information reservoir. However, the distinction between the hardware and software components in the information flow is crucial to identify the mechanisms generating specific genomic signatures. In this work, we perform a bibliometric analysis to identify the different purposes of looking for particular patterns in DNA sequences associated with a given phenotype. This study has enabled us to make a conceptual breakdown of the genomic signature and differentiate the leading applications. On the one hand, it refers to gene expression profiling associated with a biological function, which may be shared across taxa. This signature is the focus of study in precision medicine. On the other hand, it also refers to characteristic patterns in species-specific DNA sequences. This interpretation plays a key role in comparative genomics, identifying evolutionary relationships. Looking at the relevant studies in our bibliographic database, we highlight the main factors causing heterogeneities in genome composition and how they can be quantified. All these findings lead us to reformulate some questions relevant to evolutionary biology.

8.
PLoS One ; 17(9): e0272878, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36048821

RESUMO

Ribosomal DNA (rDNA) is the genetic loci that encodes rRNA in eukaryotes. It is typically arranged as tandem repeats that vary in copy number within the same species. We have recently shown that rDNA repeats copy number in the yeast Saccharomyces cerevisiae is controlled by cell volume via a feedback circuit that senses cell volume by means of the concentration of the free upstream activator factor (UAF). The UAF strongly binds the rDNA gene promoter, but is also able to repress SIR2 deacetylase gene transcription that, in turn, represses rDNA amplification. In this way, the cells with a smaller DNA copy number than what is optimal evolve to increase that copy number until they reach a number that sequestrates free UAF and provokes SIR2 derepression that, in turn, blocks rDNA amplification. Here we propose a mathematical model to show that this evolutionary process can amplify rDNA repeats independently of the selective advantage of yeast cells having bigger or smaller rDNA copy numbers. We test several variants of this process and show that it can explain the observed experimental results independently of natural selection. These results predict that an autoregulated feedback circuit may, in some instances, drive to non Darwinian deterministic evolution for a limited time period.


Assuntos
Proteínas de Saccharomyces cerevisiae , Saccharomyces cerevisiae , Variações do Número de Cópias de DNA , DNA Ribossômico/genética , DNA Ribossômico/metabolismo , Retroalimentação , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Seleção Genética , Fatores de Transcrição/metabolismo
9.
Viruses ; 14(5)2022 05 23.
Artigo em Inglês | MEDLINE | ID: mdl-35632855

RESUMO

The generation of different types of defective viral genomes (DVG) is an unavoidable consequence of the error-prone replication of RNA viruses. In recent years, a particular class of DVGs, those containing long deletions or genome rearrangements, has gain interest due to their potential therapeutic and biotechnological applications. Identifying such DVGs in high-throughput sequencing (HTS) data has become an interesting computational problem. Several algorithms have been proposed to accomplish this goal, though all incur false positives, a problem of practical interest if such DVGs have to be synthetized and tested in the laboratory. We present a metasearch tool, DVGfinder, that wraps the two most commonly used DVG search algorithms in a single workflow for the identification of the DVGs in HTS data. DVGfinder processes the results of ViReMa-a and DI-tector and uses a gradient boosting classifier machine learning algorithm to reduce the number of false-positive events. The program also generates output files in user-friendly HTML format, which can help users to explore the DVGs identified in the sample. We evaluated the performance of DVGfinder compared to the two search algorithms used separately and found that it slightly improves sensitivities for low-coverage synthetic HTS data and DI-tector precision for high-coverage samples. The metasearch program also showed higher sensitivity on a real sample for which a set of copy-backs were previously validated.


Assuntos
Vírus Defeituosos , Vírus de RNA , Vírus Defeituosos/genética , Genoma Viral , Sequenciamento de Nucleotídeos em Larga Escala , Vírus de RNA/genética , RNA-Seq
10.
Comput Struct Biotechnol J ; 20: 421-433, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35035791

RESUMO

The human gut holds a special place in the study of different microbial environments due to growing evidence that the gut microbiota is related to host health. However, despite extensive research, there is still a lack of knowledge about the core taxa forming the gut microbiota and, moreover, available information is biased towards western microbiomes in both genome databases and most core taxa studies. To tackle these limitations, we tested a database enrichment strategy and analyzed public datasets of whole-genome shotgun data, generated from 545 fecal samples, comprising three gradients of westernization. The NT database was selected as a baseline of biological diversity, subsequently being combined with various studies of interest related to the human microbiota. This enrichment strategy made it possible to improve classification capacity, compared to the original unenriched database, regarding the various lifestyles and populations studied. The effects of incomplete-taxonomy metagenome-assembled genomes on genome database enrichment were also examined, revealing that, while they are helpful, they should be used with caution depending on the taxonomic level of interest. Moreover, in terms of high prevalence, the core analysis revealed a conserved set of bacterial taxa in the healthy human gut microbiota worldwide, despite apparent lifestyle differences. Such taxa show a set of traits, metabolic roles, and ancestral status, making them suitable candidates for a hypothetical phylogenetic core of mutualistic microorganisms co-evolving with the human species.

11.
Sci Rep ; 11(1): 6921, 2021 03 25.
Artigo em Inglês | MEDLINE | ID: mdl-33767340

RESUMO

Human blood metagenomics has revealed the presence of different types of viruses in apparently healthy subjects. By far, anelloviruses constitute the viral family that is more frequently found in human blood, although amplification biases and contaminations pose a major challenge in this field. To investigate this further, we subjected pooled plasma samples from 120 healthy donors in Spain to high-speed centrifugation, RNA and DNA extraction, random amplification, and massive parallel sequencing. Our results confirm the extensive presence of anelloviruses in such samples, which represented nearly 97% of the total viral sequence reads obtained. We assembled 114 different viral genomes belonging to this family, revealing remarkable diversity. Phylogenetic analysis of ORF1 suggested 28 potentially novel anellovirus species, 24 of which were validated by Sanger sequencing to discard artifacts. These findings underscore the importance of implementing more efficient purification procedures that enrich the viral fraction as an essential step in virome studies and question the suggested pathological role of anelloviruses.


Assuntos
Anelloviridae/isolamento & purificação , Sangue/virologia , Viroma , Voluntários Saudáveis , Humanos , Metagenômica
12.
Sci Rep ; 10(1): 19073, 2020 11 04.
Artigo em Inglês | MEDLINE | ID: mdl-33149190

RESUMO

Progressive evolution, or the tendency towards increasing complexity, is a controversial issue in biology, which resolution entails a proper measurement of complexity. Genomes are the best entities to address this challenge, as they encode the historical information of a species' biotic and environmental interactions. As a case study, we have measured genome sequence complexity in the ancient phylum Cyanobacteria. To arrive at an appropriate measure of genome sequence complexity, we have chosen metrics that do not decipher biological functionality but that show strong phylogenetic signal. Using a ridge regression of those metrics against root-to-tip distance, we detected positive trends towards higher complexity in three of them. Lastly, we applied three standard tests to detect if progressive evolution is passive or driven-the minimum, ancestor-descendant, and sub-clade tests. These results provide evidence for driven progressive evolution at the genome-level in the phylum Cyanobacteria.


Assuntos
Cianobactérias/genética , Evolução Molecular , Genoma Bacteriano , Cianobactérias/classificação , Filogenia
13.
Fertil Steril ; 113(6): 1261-1274, 2020 06.
Artigo em Inglês | MEDLINE | ID: mdl-32482256

RESUMO

OBJECTIVE: To determine the molecular functions of genes exhibiting altered expression in the endometrium of women with uterine disorders affecting fertility. DESIGN: Retrospective analysis integrating case and control data from multiple cohorts with endometrium gene expression in women with uterine disorders. SETTING: Infertility research department affiliated with a university hospital. PATIENT(S): Two hundred and forty women, 121 of whom were controls, 119 of whom had endometrial adenocarcinoma (ADC), recurrent implantation failure (RIF), recurrent pregnancy loss (RPL), or stage II-IV endometriosis. INTERVENTION(S): None. MAIN OUTCOME MEASURE(S): Genomewide gene expression and altered molecular functions in the endometrium of each uterine disorder. RESULT(S): Using robust analysis methods, we identified statistically significantly altered endometrial functions in all the uterine disorders. Cell cycle alterations were shared among all the pathologies investigated. Endometriosis was characterized by the down-regulation of ciliary processes. Among the endometriosis, ADC, and RIF samples, mitochondrial dysfunction and protein degradation were shared dysregulated processes. In addition, RPL had the most distinct functional profile, and 95% of affected functions were down-regulated. CONCLUSION(S): The most robust functions dysregulated in the endometrium of patients with uterine disorders across sample cohorts implicated an endometrial factor at the gene expression level. This shared endometrial factor affects endometrial receptivity processes.


Assuntos
Endométrio/fisiopatologia , Fertilidade/genética , Infertilidade Feminina/genética , Doenças Uterinas/genética , Aborto Habitual/genética , Aborto Habitual/fisiopatologia , Adenocarcinoma/complicações , Adenocarcinoma/genética , Adenocarcinoma/fisiopatologia , Bases de Dados Genéticas , Implantação do Embrião/genética , Neoplasias do Endométrio/complicações , Neoplasias do Endométrio/genética , Neoplasias do Endométrio/fisiopatologia , Endometriose/complicações , Endometriose/genética , Endometriose/fisiopatologia , Feminino , Regulação da Expressão Gênica , Estudo de Associação Genômica Ampla , Humanos , Infertilidade Feminina/etiologia , Infertilidade Feminina/fisiopatologia , Estudos Retrospectivos , Fatores de Risco , Doenças Uterinas/epidemiologia
14.
BMC Plant Biol ; 20(1): 79, 2020 Feb 17.
Artigo em Inglês | MEDLINE | ID: mdl-32066386

RESUMO

BACKGROUND: The olive tree is of particular economic interest in the Mediterranean basin. Researchers have conducted several studies on one of the most devastating disorders affecting this tree, the Verticillium wilt, which causes substantial economic losses in numerous areas. We analyzed metatranscriptomic samples taken from a previous study conducted on leaves and roots of Olea europaea that were infected with Verticillium dahliae. In addition, we also analyzed mechanically damaged roots. The aim of our approach is to describe the dynamics of the root microbiome after severe perturbations. RESULTS: Our results not only describe the dynamics of the microbial community associated with the disturbance, but also show the high complexity of these systems and explain how this can lead to a conflicting assignment of the various types of parasitism observed in a specific organism. CONCLUSIONS: Our findings indicate that this infection, although led by Verticillium, is driven not by a single species, but by a polymicrobial consortium that also includes natural endophytes of the olive tree. This community contains both biotrophic and necrotrophic organisms that alternate and live together during the infection. In addition, opportunistic organisms appear that take profit not from plant tissues, but from new emerging populations of microorganisms. Therefore, this system can be described as a complex biological system composed of different interacting communities. Notably, our work has important considerations when it comes to classifying the type of parasitism of a given species.


Assuntos
Microbiota , Olea/genética , Doenças das Plantas/genética , Transcriptoma , Verticillium/fisiologia , Olea/metabolismo , Olea/microbiologia , Doenças das Plantas/microbiologia , Folhas de Planta/genética , Folhas de Planta/metabolismo , Folhas de Planta/microbiologia , Raízes de Plantas/genética , Raízes de Plantas/metabolismo , Raízes de Plantas/microbiologia
15.
BMC Bioinformatics ; 18(1): 421, 2017 Sep 20.
Artigo em Inglês | MEDLINE | ID: mdl-28931371

RESUMO

BACKGROUND: The possibility of integrating viral vectors to become a persistent part of the host genome makes them a crucial element of clinical gene therapy. However, viral integration has associated risks, such as the unintentional activation of oncogenes that can result in cancer. Therefore, the analysis of integration sites of retroviral vectors is a crucial step in developing safer vectors for therapeutic use. RESULTS: Here we present VISMapper, a vector integration site analysis web server, to analyze next-generation sequencing data for retroviral vector integration sites. VISMapper can be found at: http://vismapper.babelomics.org . CONCLUSIONS: Because it uses novel mapping algorithms VISMapper is remarkably faster than previous available programs. It also provides a useful graphical interface to analyze the integration sites found in the genomic context.


Assuntos
Terapia Genética/métodos , Interface Usuário-Computador , Integração Viral/genética , Sequência de Bases , Vetores Genéticos , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Internet
16.
BMC Bioinformatics ; 18(1): 142, 2017 Mar 02.
Artigo em Inglês | MEDLINE | ID: mdl-28249561

RESUMO

BACKGROUND: Reactome aims to provide bioinformatics tools for visualisation, interpretation and analysis of pathway knowledge to support basic research, genome analysis, modelling, systems biology and education. Pathway analysis methods have a broad range of applications in physiological and biomedical research; one of the main problems, from the analysis methods performance point of view, is the constantly increasing size of the data samples. RESULTS: Here, we present a new high-performance in-memory implementation of the well-established over-representation analysis method. To achieve the target, the over-representation analysis method is divided in four different steps and, for each of them, specific data structures are used to improve performance and minimise the memory footprint. The first step, finding out whether an identifier in the user's sample corresponds to an entity in Reactome, is addressed using a radix tree as a lookup table. The second step, modelling the proteins, chemicals, their orthologous in other species and their composition in complexes and sets, is addressed with a graph. The third and fourth steps, that aggregate the results and calculate the statistics, are solved with a double-linked tree. CONCLUSION: Through the use of highly optimised, in-memory data structures and algorithms, Reactome has achieved a stable, high performance pathway analysis service, enabling the analysis of genome-wide datasets within seconds, allowing interactive exploration and analysis of high throughput data. The proposed pathway analysis approach is available in the Reactome production web site either via the AnalysisService for programmatic access or the user submission interface integrated into the PathwayBrowser. Reactome is an open data and open source project and all of its source code, including the one described here, is available in the AnalysisTools repository in the Reactome GitHub ( https://github.com/reactome/ ).


Assuntos
Biologia Computacional , Software , Algoritmos , Bases de Dados Factuais , Humanos , Ácidos Nucleicos/metabolismo , Proteínas/metabolismo
17.
BMC Bioinformatics ; 17: 107, 2016 Feb 27.
Artigo em Inglês | MEDLINE | ID: mdl-26921234

RESUMO

BACKGROUND: The use of nanopore technologies is expected to spread in the future because they are portable and can sequence long fragments of DNA molecules without prior amplification. The first nanopore sequencer available, the MinION™ from Oxford Nanopore Technologies, is a USB-connected, portable device that allows real-time DNA analysis. In addition, other new instruments are expected to be released soon, which promise to outperform the current short-read technologies in terms of throughput. Despite the flood of data expected from this technology, the data analysis solutions currently available are only designed to manage small projects and are not scalable. RESULTS: Here we present HPG Pore, a toolkit for exploring and analysing nanopore sequencing data. HPG Pore can run on both individual computers and in the Hadoop distributed computing framework, which allows easy scale-up to manage the large amounts of data expected to result from extensive use of nanopore technologies in the future. CONCLUSIONS: HPG Pore allows for virtually unlimited sequencing data scalability, thus guaranteeing its continued management in near future scenarios. HPG Pore is available in GitHub at http://github.com/opencb/hpg-pore.


Assuntos
DNA/genética , Nanoporos , Análise de Sequência de DNA/instrumentação
18.
Bioinformatics ; 31(19): 3228-30, 2015 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-26040457

RESUMO

UNLABELLED: Genomic technologies allow laboratories to produce large-scale data sets, either through the use of next-generation sequencing or microarray platforms. To explore these data sets and obtain maximum value from the data, researchers view their results alongside all the known features of a given reference genome. To study transcriptional changes that occur under a given condition, researchers search for regions of the genome that are differentially expressed between different experimental conditions. In order to identify these regions several algorithms have been developed over the years, along with some bioinformatic platforms that enable their use. However, currently available applications for comparative microarray analysis exclusively focus on changes in gene expression within known transcribed regions of predicted protein-coding genes, the changes that occur in non-predictable genetic elements, such as non-coding RNAs. Here, we present a web application for the visualization of strand-specific tiling microarray or next-generation sequencing data that allows customized detection of differentially expressed regions all along the genome in an unspecific manner, that allows identification of all RNA sequences, predictable or not. AVAILABILITY AND IMPLEMENTATION: The web application is freely accessible at http://tilingscan.uv.es/. TilingScan is implemented in PHP and JavaScript. CONTACT: vicente.arnau@uv.es SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Gráficos por Computador , DNA/química , Perfilação da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Internet , Análise de Sequência de DNA/métodos , Algoritmos , Genoma Humano , Genômica , Humanos
19.
Bioinformatics ; 30(23): 3396-8, 2014 Dec 01.
Artigo em Inglês | MEDLINE | ID: mdl-25143289

RESUMO

UNLABELLED: HPG Aligner applies suffix arrays for DNA read mapping. This implementation produces a highly sensitive and extremely fast mapping of DNA reads that scales up almost linearly with read length. The approach presented here is faster (over 20× for long reads) and more sensitive (over 98% in a wide range of read lengths) than the current state-of-the-art mappers. HPG Aligner is not only an optimal alternative for current sequencers but also the only solution available to cope with longer reads and growing throughputs produced by forthcoming sequencing technologies. AVAILABILITY AND IMPLEMENTATION: https://github.com/opencb/hpg-aligner.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Alinhamento de Sequência/métodos , Análise de Sequência de DNA/métodos , Algoritmos , Animais , Drosophila/genética , Humanos , Software
20.
Bioinformatics ; 29(6): 813-4, 2013 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-23355291

RESUMO

Eukaryotic gene expression is regulated both at the transcription and the mRNA degradation levels. The implementation of functional genomics methods that allow the simultaneous measurement of transcription (TR) and degradation (DR) rates for thousands of mRNAs is a huge improvement in this field. One of the best established methods for mRNA stability determination is genomic run-on (GRO). It allows the measurement of DR, TR and mRNA levels during cell dynamic responses. Here, we offer a software package that provides improved algorithms for determination of mRNA stability during dynamic GRO experiments.


Assuntos
Estabilidade de RNA , RNA Mensageiro/metabolismo , Software , Algoritmos , Genômica/métodos , Humanos , Internet
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...