Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 60
Filtrar
1.
J Gen Virol ; 105(5)2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38767624

RESUMO

Naturally occurring isolates of baculoviruses, such as the Bombyx mori nucleopolyhedrovirus (BmNPV), usually consist of numerous genetically different haplotypes. Deciphering the different haplotypes of such isolates is hampered by the large size of the dsDNA genome, as well as the short read length of next generation sequencing (NGS) techniques that are widely applied for baculovirus isolate characterization. In this study, we addressed this challenge by combining the accuracy of NGS to determine single nucleotide variants (SNVs) as genetic markers with the long read length of Nanopore sequencing technique. This hybrid approach allowed the comprehensive analysis of genetically homogeneous and heterogeneous isolates of BmNPV. Specifically, this allowed the identification of two putative major haplotypes in the heterogeneous isolate BmNPV-Ja by SNV position linkage. SNV positions, which were determined based on NGS data, were linked by the long Nanopore reads in a Position Weight Matrix. Using a modified Expectation-Maximization algorithm, the Nanopore reads were assigned according to the occurrence of variable SNV positions by machine learning. The cohorts of reads were de novo assembled, which led to the identification of BmNPV haplotypes. The method demonstrated the strength of the combined approach of short- and long-read sequencing techniques to decipher the genetic diversity of baculovirus isolates.


Assuntos
Bombyx , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala , Sequenciamento por Nanoporos , Nucleopoliedrovírus , Polimorfismo de Nucleotídeo Único , Nucleopoliedrovírus/genética , Nucleopoliedrovírus/classificação , Nucleopoliedrovírus/isolamento & purificação , Animais , Sequenciamento por Nanoporos/métodos , Bombyx/virologia , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genoma Viral
2.
Plant Cell Environ ; 47(7): 2675-2692, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38600764

RESUMO

The restriction of plant-symbiont dinitrogen fixation by an insect semiochemical had not been previously described. Here we report on a glycosylated triketide δ-lactone from Nephrotoma cornicina crane flies, cornicinine, that causes chlorosis in the floating-fern symbioses from the genus Azolla. Only the glycosylated trans-A form of chemically synthesized cornicinine was active: 500 nM cornicinine in the growth medium turned all cyanobacterial filaments from Nostoc azollae inside the host leaf-cavities into akinetes typically secreting CTB-bacteriocins. Cornicinine further inhibited akinete germination in Azolla sporelings, precluding re-establishment of the symbiosis during sexual reproduction. It did not impact development of the plant Arabidopsis thaliana or several free-living cyanobacteria from the genera Anabaena or Nostoc but affected the fern host without cyanobiont. Fern-host mRNA sequencing from isolated leaf cavities confirmed high NH4-assimilation and proanthocyanidin biosynthesis in this trichome-rich tissue. After cornicinine treatment, it revealed activation of Cullin-RING ubiquitin-ligase-pathways, known to mediate metabolite signaling and plant elicitation consistent with the chlorosis phenotype, and increased JA-oxidase, sulfate transport and exosome formation. The work begins to uncover molecular mechanisms of cyanobiont differentiation in a seed-free plant symbiosis important for wetland ecology or circular crop-production today, that once caused massive CO2 draw-down during the Eocene geological past.


Assuntos
Gleiquênias , Lactonas , Simbiose , Animais , Lactonas/metabolismo , Gleiquênias/fisiologia , Gleiquênias/microbiologia , Gleiquênias/efeitos dos fármacos , Dípteros/fisiologia , Glicosilação , Cianobactérias/metabolismo , Cianobactérias/fisiologia , Cianobactérias/genética , Nostoc/fisiologia , Nostoc/genética , Nostoc/metabolismo , Folhas de Planta/metabolismo , Folhas de Planta/fisiologia
3.
Plant J ; 112(4): 897-918, 2022 11.
Artigo em Inglês | MEDLINE | ID: mdl-36073999

RESUMO

Breeding has increasingly altered the genetics of crop plants since the domestication of their wild progenitors. It is postulated that the genetic diversity of elite wheat breeding pools is too narrow to cope with future challenges. In contrast, plant genetic resources (PGRs) of wheat stored in genebanks are valuable sources of unexploited genetic diversity. Therefore, to ensure breeding progress in the future, it is of prime importance to identify the useful allelic diversity available in PGRs and to transfer it into elite breeding pools. Here, a diverse collection consisting of modern winter wheat cultivars and genebank accessions was investigated based on reduced-representation genomic sequencing and an iSelect single nucleotide polymorphism (SNP) chip array. Analyses of these datasets provided detailed insights into population structure, levels of genetic diversity, sources of new allelic diversity, and genomic regions affected by breeding activities. We identified 57 regions representing genomic signatures of selection and 827 regions representing private alleles associated exclusively with genebank accessions. The presence of known functional wheat genes, quantitative trait loci, and large chromosomal modifications, i.e., introgressions from wheat wild relatives, provided initial evidence for putative traits associated within these identified regions. These findings were supported by the results of ontology enrichment analyses. The results reported here will stimulate further research and promote breeding in the future by allowing for the targeted introduction of novel allelic diversity into elite wheat breeding pools.


Assuntos
Pão , Triticum , Triticum/genética , Alelos , Melhoramento Vegetal , Genoma de Planta/genética , Polimorfismo de Nucleotídeo Único/genética
4.
Plant J ; 106(1): 8-22, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33577114

RESUMO

Genome editing by RNA-guided nucleases, such as SpCas9, has been used in numerous different plant species. However, to what extent multiple independent loci can be targeted simultaneously by multiplexing has not been well documented. Here, we developed a toolkit, based on a highly intron-optimized zCas9i gene, which allows assembly of nuclease constructs expressing up to 32 single guide RNAs (sgRNAs). We used this toolkit to explore the limits of multiplexing in two major model species, and report on the isolation of transgene-free octuple (8×) Nicotiana benthamiana and duodecuple (12×) Arabidopsis thaliana mutant lines in a single generation (T1 and T2 , respectively). We developed novel counter-selection markers for N. benthamiana, most importantly Sl-FAST2, comparable to the well-established Arabidopsis seed fluorescence marker, and FCY-UPP, based on the production of toxic 5-fluorouracil in the presence of a precursor. Targeting eight genes with an array of nine different sgRNAs and relying on FCY-UPP for selection of non-transgenic T1 , we identified N. benthamiana mutant lines with astonishingly high efficiencies: All analyzed plants carried mutations in all genes (approximately 112/116 target sites edited). Furthermore, we targeted 12 genes by an array of 24 sgRNAs in A. thaliana. Efficiency was significantly lower in A. thaliana, and our results indicate Cas9 availability is the limiting factor in such higher-order multiplexing applications. We identified a duodecuple mutant line by a combination of phenotypic screening and amplicon sequencing. The resources and results presented provide new perspectives for how multiplexing can be used to generate complex genotypes or to functionally interrogate groups of candidate genes.


Assuntos
Arabidopsis/genética , Nicotiana/genética , Plantas Geneticamente Modificadas/genética , Sistemas CRISPR-Cas/genética , Edição de Genes , Genoma de Planta/genética , Mutação/genética
5.
Genomics ; 113(6): 4173-4183, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34774678

RESUMO

Cherries are stone fruits and belong to the economically important plant family of Rosaceae with worldwide cultivation of different species. The ground cherry, Prunus fruticosa Pall., is an ancestor of cultivated sour cherry, an important tetraploid cherry species. Here, we present a long read chromosome-level draft genome assembly and related plastid sequences using the Oxford Nanopore Technology PromethION platform and R10.3 pore type. We generated a final consensus genome sequence of 366 Mb comprising eight chromosomes. The N50 scaffold was ~44 Mb with the longest chromosome being 66.5 Mb. The chloroplast and mitochondrial genomes were 158,217 bp and 383,281 bp long, which is in accordance with previously published plastid sequences. This is the first report of the genome of ground cherry (P. fruticosa) sequenced by long read technology only. The datasets obtained from this study provide a foundation for future breeding, molecular and evolutionary analysis in Prunus studies.


Assuntos
Physalis , Prunus , Cromossomos , Physalis/genética , Melhoramento Vegetal , Prunus/genética , Tetraploidia
6.
BMC Genomics ; 22(1): 188, 2021 Mar 16.
Artigo em Inglês | MEDLINE | ID: mdl-33726677

RESUMO

BACKGROUND: Basenjis are considered an ancient dog breed of central African origins that still live and hunt with tribesmen in the African Congo. Nicknamed the barkless dog, Basenjis possess unique phylogeny, geographical origins and traits, making their genome structure of great interest. The increasing number of available canid reference genomes allows us to examine the impact the choice of reference genome makes with regard to reference genome quality and breed relatedness. RESULTS: Here, we report two high quality de novo Basenji genome assemblies: a female, China (CanFam_Bas), and a male, Wags. We conduct pairwise comparisons and report structural variations between assembled genomes of three dog breeds: Basenji (CanFam_Bas), Boxer (CanFam3.1) and German Shepherd Dog (GSD) (CanFam_GSD). CanFam_Bas is superior to CanFam3.1 in terms of genome contiguity and comparable overall to the high quality CanFam_GSD assembly. By aligning short read data from 58 representative dog breeds to three reference genomes, we demonstrate how the choice of reference genome significantly impacts both read mapping and variant detection. CONCLUSIONS: The growing number of high-quality canid reference genomes means the choice of reference genome is an increasingly critical decision in subsequent canid variant analyses. The basal position of the Basenji makes it suitable for variant analysis for targeted applications of specific dog breeds. However, we believe more comprehensive analyses across the entire family of canids is more suited to a pangenome approach. Collectively this work highlights the importance the choice of reference genome makes in all variation studies.


Assuntos
Lobos , Animais , China , Cromossomos , Cães , Feminino , Genoma , Genômica , Masculino , Lobos/genética
7.
Mol Ecol ; 30(23): 6211-6228, 2021 12.
Artigo em Inglês | MEDLINE | ID: mdl-34324751

RESUMO

Introduced populations of invasive organisms have to cope with novel environmental challenges, while having reduced genetic variation caused by founder effects. The mechanisms associated with this "genetic paradox of invasive species" has received considerable attention, yet few studies have examined the genomic architecture of invasive species. Populations of the heart node ant Cardiocondyla obscurior belong to two distinct lineages, a New World lineage so far only found in Latin America and a more globally distributed Old World lineage. In the present study, we use population genomic approaches to compare populations of the two lineages with apparent divergent invasive potential. We find that the strong genetic differentiation of the two lineages began at least 40,000 generations ago and that activity of transposable elements (TEs) has contributed significantly to the divergence of both lineages, possibly linked to the very unusual genomic distribution of TEs in this species. Furthermore, we show that introgression from the Old World lineage is a dominant source of genetic diversity in the New World lineage, despite the lineages' strong genetic differentiation. Our study uncovers mechanisms underlying novel genetic variation in introduced populations of C. obscurior that could contribute to the species' adaptive potential.


Assuntos
Formigas , Elementos de DNA Transponíveis , Animais , Formigas/genética , Elementos de DNA Transponíveis/genética , Variação Genética , Genômica , Espécies Introduzidas
8.
Bioinformatics ; 35(22): 4812-4814, 2019 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-31225867

RESUMO

SUMMARY: Statistical dependencies are present in a variety of sequence data, but are not discernible from traditional sequence logos. Here, we present the R package DepLogo for visualizing inter-position dependencies in aligned sequence data as dependency logos. Dependency logos make dependency structures, which correspond to regular co-occurrences of symbols at dependent positions, visually perceptible. To this end, sequences are partitioned based on their symbols at highly dependent positions as measured by mutual information, and each partition obtains its own visual representation. We illustrate the utility of the DepLogo package in several use cases generating dependency logos from DNA, RNA and protein sequences. AVAILABILITY AND IMPLEMENTATION: The DepLogo R package is available from CRAN and its source code is available at https://github.com/Jstacs/DepLogo. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , DNA , Matrizes de Pontuação de Posição Específica , Análise de Sequência de DNA
9.
BMC Bioinformatics ; 19(1): 189, 2018 05 30.
Artigo em Inglês | MEDLINE | ID: mdl-29843602

RESUMO

BACKGROUND: Genome annotation is of key importance in many research questions. The identification of protein-coding genes is often based on transcriptome sequencing data, ab-initio or homology-based prediction. Recently, it was demonstrated that intron position conservation improves homology-based gene prediction, and that experimental data improves ab-initio gene prediction. RESULTS: Here, we present an extension of the gene prediction program GeMoMa that utilizes amino acid sequence conservation, intron position conservation and optionally RNA-seq data for homology-based gene prediction. We show on published benchmark data for plants, animals and fungi that GeMoMa performs better than the gene prediction programs BRAKER1, MAKER2, and CodingQuarry, and purely RNA-seq-based pipelines for transcript identification. In addition, we demonstrate that using multiple reference organisms may help to further improve the performance of GeMoMa. Finally, we apply GeMoMa to four nematode species and to the recently published barley reference genome indicating that current annotations of protein-coding genes may be refined using GeMoMa predictions. CONCLUSIONS: GeMoMa might be of great utility for annotating newly sequenced genomes but also for finding homologs of a specific gene or gene family. GeMoMa has been published under GNU GPL3 and is freely available at http://www.jstacs.de/index.php/GeMoMa .


Assuntos
Perfilação da Expressão Gênica , Genes Fúngicos , Genes de Plantas , Análise de Sequência de RNA , Homologia de Sequência de Aminoácidos , Software , Animais , Genômica , Hordeum/genética , Íntrons , Anotação de Sequência Molecular , Nematoides/genética
10.
BMC Genomics ; 19(1): 409, 2018 May 29.
Artigo em Inglês | MEDLINE | ID: mdl-29843596

RESUMO

BACKGROUND: Understanding the genetic basis of frost tolerance (FT) in wheat (Triticum aestivum L.) is essential for preventing yield losses caused by frost due to cellular damage, dehydration and reduced metabolism. FT is a complex trait regulated by a number of genes and several gene families. Availability of the wheat genomic sequence opens new opportunities for exploring candidate genes diversity for FT. Therefore, the objectives of this study were to identity SNPs and insertion-deletion (indels) in genes known to be involved in frost tolerance and to perform association genetics analysis of respective SNPs and indels on FT. RESULTS: Here we report on the sequence analysis of 19 candidate genes for FT in wheat assembled using the Chinese Spring IWGSC RefSeq v1.0. Out of these, the tandem duplicated C-repeat binding factors (CBF), i.e. CBF-A3, CBF-A5, CBF-A10, CBF-A13, CBF-A14, CBF-A15, CBF-A18, the vernalisation response gene VRN-A1, VRN-B3, the photoperiod response genes PPD-B1 and PPD-D1 revealed association to FT in 235 wheat cultivars. Within six genes (CBF-A3, CBF-A15, VRN-A1, VRN-B3, PPD-B1 and PPD-D1) amino acid (AA) substitutions in important protein domains were identified. The amino acid substitution effect in VRN-A1 on FT was confirmed and new AA substitutions in CBF-A3, CBF-A15, VRN-B3, PPD-B1 and PPD-D1 located at highly conserved sites were detected. Since these results rely on phenotypic data obtained at five locations in 2 years, detection of significant associations of FT to AA changes in CBF-A3, CBF-A15, VRN-A1, VRN-B3, PPD-B1 and PPD-D1 may be exploited in marker assisted breeding for frost tolerance in winter wheat. CONCLUSIONS: A set of 65 primer pairs for the genes mentioned above from a previous study was BLASTed against the IWGSC RefSeq resulting in the identification of 39 primer combinations covering the full length of 19 genes. This work demonstrates the usefulness of the IWGSC RefSeq in specific primer development for highly conserved gene families in hexaploid wheat and, that a candidate gene association genetics approach based on the sequence data is an efficient tool to identify new alleles of genes important for the response to abiotic stress in wheat.


Assuntos
Substituição de Aminoácidos , Sequência Conservada , Proteínas de Plantas/genética , Triticum/genética , Temperatura Baixa , Haplótipos , Mutação INDEL , Desequilíbrio de Ligação , Fenótipo , Proteínas de Plantas/química , Polimorfismo de Nucleotídeo Único , Triticum/fisiologia
11.
J Gen Virol ; 99(9): 1307-1320, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-30045782

RESUMO

Kimura two-parameter nucleotide distance comparisons based on polyhedrin/granulin (polh/gran), late expression factor 8 (lef-8) and late expression factor 9 (lef-9) are a widely applied method for species demarcation for lepidopteran-specific baculoviruses. Baculoviruses are considered to belong to the same species when a pairwise distance threshold of 0.015 is not exceeded and are considered as possibly belonging to the same species with a distance of up to 0.050. In the present work this method was revised and extended for 172 entirely sequenced lepidopteran, hymenopteran and dipteran baculovirus genomes by applying the nucleotide sequences of all 38 known baculovirus core genes for pairwise distance calculations. On the basis of this large dataset, the previously established standard thresholds for baculovirus species demarcation were adjusted for pairwise nucleotide distances estimated from the alignments of all 38 core genes. With the newly applied thresholds for the 38 core-gene dataset, a more sophisticated Kimura two-parameter method was established, avoiding the possible influence of the chimerical polh gene of the Autographa californica multiple nucleopolyhedrovirus. Based on the new dataset, the present classification of baculovirus species was confirmed. Thereby the Kimura two-parameter method for baculovirus demarcation was extended to include the information from all 38 Baculoviridae core genes, which represent the established standard information for baculovirus phylogeny to date.


Assuntos
Baculoviridae/genética , Genoma Viral , Baculoviridae/classificação , Sequência de Bases , DNA Viral/genética , Filogenia , Alinhamento de Sequência , Especificidade da Espécie
12.
Nucleic Acids Res ; 44(9): e89, 2016 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-26893356

RESUMO

Annotation of protein-coding genes is very important in bioinformatics and biology and has a decisive influence on many downstream analyses. Homology-based gene prediction programs allow for transferring knowledge about protein-coding genes from an annotated organism to an organism of interest.Here, we present a homology-based gene prediction program called GeMoMa. GeMoMa utilizes the conservation of intron positions within genes to predict related genes in other organisms. We assess the performance of GeMoMa and compare it with state-of-the-art competitors on plant and animal genomes using an extended best reciprocal hit approach. We find that GeMoMa often makes more precise predictions than its competitors yielding a substantially increased number of correct transcripts. Subsequently, we exemplarily validate GeMoMa predictions using Sanger sequencing. Finally, we use RNA-seq data to compare the predictions of homology-based gene prediction programs, and find again that GeMoMa performs well.Hence, we conclude that exploiting intron position conservation improves homology-based gene prediction, and we make GeMoMa freely available as command-line tool and Galaxy integration.


Assuntos
Biologia Computacional/métodos , Modelos Genéticos , Anotação de Sequência Molecular/métodos , RNA Mensageiro/genética , Análise de Sequência de RNA/métodos , Algoritmos , Animais , Arabidopsis/genética , Sequência de Bases , Carica/genética , Galinhas/genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Íntrons/genética , Camundongos , Oryza/genética , Reação em Cadeia da Polimerase , Homologia de Sequência do Ácido Nucleico , Solanum tuberosum/genética , Nicotiana/genética
13.
BMC Plant Biol ; 17(1): 137, 2017 08 10.
Artigo em Inglês | MEDLINE | ID: mdl-28797222

RESUMO

BACKGROUND: Genetic mapping of phenotypic traits generally focuses on a single time point, but biomass accumulates continuously during plant development. Resolution of the temporal dynamics that affect biomass recently became feasible using non-destructive imaging. RESULTS: With the aim to identify key genetic factors for vegetative biomass formation from the seedling stage to flowering, we explored growth over time in a diverse collection of two-rowed spring barley accessions. High heritabilities facilitated the temporal analysis of trait relationships and identification of quantitative trait loci (QTL). Biomass QTL tended to persist only a short period during early growth. More persistent QTL were detected around the booting stage. We identified seven major biomass QTL, which together explain 55% of the genetic variance at the seedling stage, and 43% at the booting stage. Three biomass QTL co-located with genes or QTL involved in phenology. The most important locus for biomass was independent from phenology and is located on chromosome 7HL at 141 cM. This locus explained ~20% of the genetic variance, was significant over a long period of time and co-located with HvDIM, a gene involved in brassinosteroid synthesis. CONCLUSIONS: Biomass is a dynamic trait and is therefore orchestrated by different QTL during early and late growth stages. Marker-assisted selection for high biomass at booting stage is most effective by also including favorable alleles from seedling biomass QTL. Selection for dynamic QTL may enhance genetic gain for complex traits such as biomass or, in the future, even grain yield.


Assuntos
Variação Genética , Hordeum/crescimento & desenvolvimento , Hordeum/genética , Locos de Características Quantitativas , Biomassa , Estações do Ano
14.
Nucleic Acids Res ; 43(18): e119, 2015 Oct 15.
Artigo em Inglês | MEDLINE | ID: mdl-26116565

RESUMO

Binding of transcription factors to DNA is one of the keystones of gene regulation. The existence of statistical dependencies between binding site positions is widely accepted, while their relevance for computational predictions has been debated. Building probabilistic models of binding sites that may capture dependencies is still challenging, since the most successful motif discovery approaches require numerical optimization techniques, which are not suited for selecting dependency structures. To overcome this issue, we propose sparse local inhomogeneous mixture (Slim) models that combine putative dependency structures in a weighted manner allowing for numerical optimization of dependency structure and model parameters simultaneously. We find that Slim models yield a substantially better prediction performance than previous models on genomic context protein binding microarray data sets and on ChIP-seq data sets. To elucidate the reasons for the improved performance, we develop dependency logos, which allow for visual inspection of dependency structures within binding sites. We find that the dependency structures discovered by Slim models are highly diverse and highly transcription factor-specific, which emphasizes the need for flexible dependency models. The observed dependency structures range from broad heterogeneities to sparse dependencies between neighboring and non-neighboring binding site positions.


Assuntos
Modelos Estatísticos , Elementos Reguladores de Transcrição , Análise de Sequência de DNA/métodos , Fatores de Transcrição/metabolismo , Sítios de Ligação , DNA/química , DNA/metabolismo , Humanos , Motivos de Nucleotídeos
15.
Bioinformatics ; 31(15): 2595-7, 2015 Aug 01.
Artigo em Inglês | MEDLINE | ID: mdl-25810428

RESUMO

Precision-recall (PR) and receiver operating characteristic (ROC) curves are valuable measures of classifier performance. Here, we present the R-package PRROC, which allows for computing and visualizing both PR and ROC curves. In contrast to available R-packages, PRROC allows for computing PR and ROC curves and areas under these curves for soft-labeled data using a continuous interpolation between the points of PR curves. In addition, PRROC provides a generic plot function for generating publication-quality graphics of PR and ROC curves.


Assuntos
Gráficos por Computador , Interpretação Estatística de Dados , Computação Matemática , Curva ROC , Software , Área Sob a Curva , Humanos , Reconhecimento Automatizado de Padrão , Interface Usuário-Computador
16.
BMC Bioinformatics ; 16: 387, 2015 Nov 17.
Artigo em Inglês | MEDLINE | ID: mdl-26577052

RESUMO

BACKGROUND: For three decades, sequence logos are the de facto standard for the visualization of sequence motifs in biology and bioinformatics. Reasons for this success story are their simplicity and clarity. The number of inferred and published motifs grows with the number of data sets and motif extraction algorithms. Hence, it becomes more and more important to perceive differences between motifs. However, motif differences are hard to detect from individual sequence logos in case of multiple motifs for one transcription factor, highly similar binding motifs of different transcription factors, or multiple motifs for one protein domain. RESULTS: Here, we present DiffLogo, a freely available, extensible, and user-friendly R package for visualizing motif differences. DiffLogo is capable of showing differences between DNA motifs as well as protein motifs in a pair-wise manner resulting in publication-ready figures. In case of more than two motifs, DiffLogo is capable of visualizing pair-wise differences in a tabular form. Here, the motifs are ordered by similarity, and the difference logos are colored for clarity. We demonstrate the benefit of DiffLogo on CTCF motifs from different human cell lines, on E-box motifs of three basic helix-loop-helix transcription factors as examples for comparison of DNA motifs, and on F-box domains from three different families as example for comparison of protein motifs. CONCLUSIONS: DiffLogo provides an intuitive visualization of motif differences. It enables the illustration and investigation of differences between highly similar motifs such as binding patterns of transcription factors for different cell types, treatments, and algorithmic approaches.


Assuntos
Algoritmos , Motivos de Aminoácidos/genética , Fatores de Transcrição Hélice-Alça-Hélice Básicos/genética , Gráficos por Computador , Motivos de Nucleotídeos/genética , Análise de Sequência de DNA/métodos , Software , Fator de Ligação a CCCTC , Biologia Computacional/métodos , Humanos , Estrutura Terciária de Proteína , Proteínas Repressoras/genética , Células Tumorais Cultivadas
17.
Nucleic Acids Res ; 41(21): e197, 2013 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-24057214

RESUMO

De novo motif discovery has been an important challenge of bioinformatics for the past two decades. Since the emergence of high-throughput techniques like ChIP-seq, ChIP-exo and protein-binding microarrays (PBMs), the focus of de novo motif discovery has shifted to runtime and accuracy on large data sets. For this purpose, specialized algorithms have been designed for discovering motifs in ChIP-seq or PBM data. However, none of the existing approaches work perfectly for all three high-throughput techniques. In this article, we propose Dimont, a general approach for fast and accurate de novo motif discovery from high-throughput data. We demonstrate that Dimont yields a higher number of correct motifs from ChIP-seq data than any of the specialized approaches and achieves a higher accuracy for predicting PBM intensities from probe sequence than any of the approaches specifically designed for that purpose. Dimont also reports the expected motifs for several ChIP-exo data sets. Investigating differences between in vitro and in vivo binding, we find that for most transcription factors, the motifs discovered by Dimont are in good accordance between techniques, but we also find notable exceptions. We also observe that modeling intra-motif dependencies may increase accuracy, which indicates that more complex motif models are a worthwhile field of research.


Assuntos
Imunoprecipitação da Cromatina/métodos , DNA/química , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise Serial de Proteínas/métodos , Análise de Sequência de DNA/métodos , Humanos , Motivos de Nucleotídeos , Software
18.
Nucleic Acids Res ; 41(17): 8045-60, 2013 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-23821666

RESUMO

The mineralocorticoid receptor (MR) is a ligand-induced transcription factor belonging to the steroid receptor family and involved in water-electrolyte homeostasis, blood pressure regulation, inflammation and fibrosis in the renocardiovascular system. The MR shares a common hormone-response-element with the glucocorticoid receptor but nevertheless elicits MR-specific effects including enhanced epidermal growth factor receptor (EGFR) expression via unknown mechanisms. The EGFR is a receptor tyrosine kinase that leads to activation of MAP kinases, but that can also function as a signal transducer for other signaling pathways. In the present study, we mechanistically investigate the interaction between a newly discovered MR- but not glucocorticoid receptor- responsive-element (=MRE1) of the EGFR promoter, specificity protein 1 (SP1) and MR to gain general insights into MR-specificity. Biological relevance of the interaction for EGFR expression and consequently for different signaling pathways in general is demonstrated in human, rat and murine vascular smooth muscle cells and cells of EGFR knockout mice. A genome-wide promoter search for identical binding regions followed by quantitative PCR validation suggests that the identified MR-SP1-MRE1 interaction might be applicable to other genes. Overall, a novel principle of MR-specific gene expression is explored that applies to the pathophysiologically relevant expression of the EGFR and potentially also to other genes.


Assuntos
Receptores ErbB/genética , Receptores de Mineralocorticoides/metabolismo , Elementos de Resposta , Fator de Transcrição Sp1/metabolismo , Transcrição Gênica , Aldosterona/farmacologia , Animais , Sequência de Bases , Sítios de Ligação , Células Cultivadas , Receptores ErbB/metabolismo , Células HEK293 , Humanos , Camundongos , Camundongos Knockout , Dados de Sequência Molecular , Regiões Promotoras Genéticas , Estrutura Terciária de Proteína , Ratos , Receptores de Mineralocorticoides/química , Transdução de Sinais , Fator de Transcrição Sp1/antagonistas & inibidores , Fator de Transcrição Sp3/metabolismo , Regulação para Cima
19.
Nucleic Acids Res ; 40(17): 8240-54, 2012 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-22730287

RESUMO

The plant-specific, B3 domain-containing transcription factor ABSCISIC ACID INSENSITIVE3 (ABI3) is an essential component of the regulatory network controlling the development and maturation of the Arabidopsis thaliana seed. Genome-wide chromatin immunoprecipitation (ChIP-chip), transcriptome analysis, quantitative reverse transcriptase-polymerase chain reaction and a transient promoter activation assay have been combined to identify a set of 98 ABI3 target genes. Most of these presumptive ABI3 targets require the presence of abscisic acid for their activation and are specifically expressed during seed maturation. ABI3 target promoters are enriched for G-box-like and RY-like elements. The general occurrence of these cis motifs in non-ABI3 target promoters suggests the existence of as yet unidentified regulatory signals, some of which may be associated with epigenetic control. Several members of the ABI3 regulon are also regulated by other transcription factors, including the seed-specific, B3 domain-containing FUS3 and LEC2. The data strengthen and extend the notion that ABI3 is essential for the protection of embryonic structures from desiccation and raise pertinent questions regarding the specificity of promoter recognition.


Assuntos
Proteínas de Arabidopsis/metabolismo , Arabidopsis/genética , Regulação da Expressão Gênica de Plantas , Regulon , Fatores de Transcrição/metabolismo , Arabidopsis/embriologia , Arabidopsis/metabolismo , Imunoprecipitação da Cromatina , DNA de Plantas/química , DNA de Plantas/metabolismo , Perfilação da Expressão Gênica , Motivos de Nucleotídeos , Regiões Promotoras Genéticas , Sementes/metabolismo
20.
Sci Rep ; 14(1): 8525, 2024 04 12.
Artigo em Inglês | MEDLINE | ID: mdl-38609404

RESUMO

Rapid and reliable detection of pathogens is crucial to complement the growing industry of mass-reared insects, in order to safeguard the insect colonies from outbreak of diseases, which may cause significant economic loss. Current diagnostic methods are mainly based on conventional PCR and microscopic examination, requiring prior knowledge of disease symptoms and are limited to identifying known pathogens. Here, we present a rapid nanopore-based metagenomics approach for detecting entomopathogens from the European house cricket (Acheta domesticus). In this study, the Acheta domesticus densovirus (AdDV) was detected from diseased individuals using solely Nanopore sequencing. Virus reads and genome assemblies were obtained within twenty-four hours after sequencing. Subsequently, due to the length of the Nanopore reads, it was possible to reconstruct significantly large parts or even the entire AdDV genome to conduct studies for genotype identification. Variant analysis indicated the presence of three AdDV genotypes within the same house cricket population, with association to the vital status of the diseased crickets. This contrast provided compelling evidence for the existence of non-lethal AdDV genotypes. These findings demonstrated nanopore-based metagenomics sequencing as a powerful addition to the diagnostic tool kit for routine pathogen surveillance and diagnosis in the insect rearing industry.


Assuntos
Densovirus , Gryllidae , Sequenciamento por Nanoporos , Humanos , Animais , Densovirus/genética , Genótipo , Surtos de Doenças
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA